Perl script: Auto Detect the script file name

CODE: [code language=”perl”] use File::Basename; my $progname = basename($0); $progname =~ m/(.*)(\.)(.*)/; print "Script name along with extension: [$progname]\n"; print "Script name without extension: [$1]\n"; [/code] OUTPUT: [code language=”perl”] C:\>perl get-script-name.pl Script name along with extension: [get-script-name.pl] Script name without extension: [get-script-name] C:\> [/code]

Read more

Web Scraping with Perl scripting

Similar to various powerful modules, Perl comes with various modules for web scraping, that is extracting required information from web HTML pages. Below is a sample script that I authored to extract the movies information from www.justtollywood.com web site pages. [code language=”perl”] #! perl #=============================================================================== # Objective: # ———- # # Perl script to demo the web scraping modules to extract intended information # from web pages. # # For this example, I used www.justtollywood.com pages. # # $Header: $ #=============================================================================== # Include Modules #=============================================================================== use strict; use warnings; use Pod::Usage; use File::Basename; use HTML::TableExtract; use HTML::TreeBuilder 3; use Getopt::Long […]

Read more

Perl script: HtmlAsText.pl to convert HTML content into Text File format

This program is to illustrate how tools like HtmlAsText usually work. [code language=”perl”] #! perl #=============================================================================== # Objective: # ———- # # Script to convert HTML File content into Text File # # # $Header: $ #=============================================================================== # Include Modules #=============================================================================== use strict; use warnings; use Pod::Usage; use LWP::Simple; use File::Basename; use HTML::Table; use Alvis::HTML; use HTML::TableExtract qw(tree); use Getopt::Long qw(:config no_ignore_case bundling); #=============================================================================== # Global Variables Declaration #=============================================================================== use vars qw($DEBUG $SRC_FLDR $DEST_FLDR $HTML_TBL_OBJ $ALVIS_HTML_OBJ); #=============================================================================== # Prototypes Section #=============================================================================== sub DoAction; sub My_Readdir; sub InitGlobals; sub ProcessArgs; sub Info {my ($mesg) = @_; print STDOUT "INFO: $mesg\n";} sub […]

Read more