Posts

Showing posts from September, 2012

Linux | File_Combiner4Hadoop

File_Combiner4Hadoop

This shell script combines a set of small files into one or more big files. It is very useful when working with Hadoop (at least it was for me). With Hadoop, the overhead involved in processing small files is very high, so it is better to combine the small files into one or more big files and use those for Hadoop processing. At the same time, if a combined file is larger than the Hadoop block size (64MB by default), the file may get split during the Hadoop process (i.e. one part of the file will be processed by one node and another part by another node). If you don't want the files to be split, one of the easiest solutions is to combine the small files into one or more big files and make sure each big file's size does not go above the Hadoop block size (in my case 64MB). This shell script has a parameter "-size" where you can specify the maximum all
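The idea of packing small files into combined files that stay under a size limit can be sketched roughly as follows. This is only an illustration, not the File_Combiner4Hadoop script itself: the function name combine_files, its positional arguments, and the combined_N output naming are all assumptions made for the example, and the limit is passed as a plain byte count rather than through the "-size" parameter.

```shell
#!/usr/bin/env bash
# Hypothetical sketch: pack small files into combined files of at most
# max_bytes each. combine_files and the combined_N naming are illustrative
# and are not taken from the original script.

combine_files() {
    local src_dir=$1 out_dir=$2 max_bytes=$3
    local idx=1 current=0 size f

    mkdir -p "$out_dir"
    for f in "$src_dir"/*; do
        [ -f "$f" ] || continue              # skip directories and non-files
        size=$(wc -c < "$f" | tr -d ' ')     # size of this small file in bytes

        # Start a new combined file when adding this one would exceed the limit.
        # A single file larger than max_bytes still ends up alone in its own
        # combined file, which will then exceed the limit.
        if [ "$current" -gt 0 ] && [ $((current + size)) -gt "$max_bytes" ]; then
            idx=$((idx + 1))
            current=0
        fi

        cat "$f" >> "$out_dir/combined_$idx"
        current=$((current + size))
    done
}
```

For a 64MB limit, this could be called as: combine_files ./small_files ./combined $((64 * 1024 * 1024)). The output directory should sit outside the source directory so that combined files are not picked up as input.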

Ora | dba_source - Extended

dba_source - Extended

This utility extends the default DBA_SOURCE view and gives the following additional details as separate columns:

- A TEXT column for uncommented code.
- The name of the procedure, function, or cursor defined within a package, procedure, or function.
- The sub-procedure or sub-function name, i.e. procedures and functions defined within another procedure or function.

Download the Source Code

TYPE - typ_uc_dba_source_rec.sql
TYPE - typ_uc_dba_source_tab.sql
FUNCTION - uc_dba_source_fun.sql
VIEW - uc_dba_source_v.sql

Installation Steps

1. Download the SQL files from the above links.
2. Run them in the order listed above.

Usage Notes

select * from uc_dba_source_v where owner = '[schema name]' and object_name = '[procedure/function/package name]'

Example

Here is the source code of the package body used for our example:

CREATE OR REPLACE PACKAGE BODY VSUBR.my_test IS /* this package is the test package to show how the DBA_source -extention wor