Pig Design Patterns

By: Pradeep Pasupuleti

About the Reviewers

Aaron Binns spent over five years at the Internet Archive, where he designed and built a petabyte-scale Hadoop cluster supporting full-text search and Big Data analytics, the majority of which was implemented in Pig. He was responsible for the construction and deployment of full-text search of domain-scale web archives of hundreds of millions of archived web pages, as well as the more than two billion web pages indexed for full-text search in the Archive-It service. He also developed custom software, built on Lucene, to provide special functionality required for full-text search of archival web documents.

He currently works at TaskRabbit as a data scientist. He holds a Bachelor of Science degree in Computer Science from Case Western Reserve University.

Shingo Furuyama is a software engineer who has specialized in domain logic implementation to realize the value of software in the financial industry. At weekends, he enjoys cycling, scuba diving, windsurfing, and coding. Currently, he is studying English in the Philippines to expand his career opportunities.

He started his career as a software engineer at Simplex Technology, taking major responsibility in developing interest rate derivatives and a Forex option management system for a Japanese mega bank. Before going to the Philippines, he was working for Nautilus Technologies, a Japanese start-up that specializes in Big Data technologies and cloud-related enterprise solutions.

You can get more information from his blog (http://marblejenka.blogspot.jp/) or LinkedIn (http://jp.linkedin.com/in/shingofuruyama). You can also follow him on Twitter (@marblejenka).

Shashwat Shriparv holds a master's degree in Computer Application from Cochin University of Science and Technology and is currently working as a Senior System Engineer (HPC) with Cognilytics. With a total IT experience of six years, he has spent three and a half years working on core Big Data technologies such as Hadoop, Hive, HBase, Pig, Sqoop, Flume, and Mongo in development and management, and the rest of his time handling projects in technologies such as .NET, Java, web programming languages, and mobile development.

He has worked with companies such as HCL, C-DAC, PointCross, and Genilok. He actively participates in and contributes to online Big Data forums and groups. He has also contributed to Big Data technologies by creating and uploading several free videos on YouTube for Big Data enthusiasts and practitioners.

He likes writing articles, poems, and technology blogs, and also enjoys photography. More information about him can be found at https://github.com/shriparv and http://helpmetocode.blogspot.com. You can connect with him on LinkedIn at http://www.linkedin.com/pub/shashwat-shriparv/19/214/2a9.

Fábio Franco Uechi has a bachelor's degree in Computer Science and is a Senior Software Engineer at CI&T Inc. He has been the architect of enterprise-grade solutions in the software industry for around 11 years and has been using Big Data and cloud technologies over the past four to five years to solve complex business problems.

He is highly interested in machine learning and Big Data technologies such as R, Hadoop, Mahout, Pig, Hive, and related distributed processing platforms for analyzing datasets to gain insights.

Other than programming, he enjoys playing pinball, slacklining, and wakeboarding. You can learn more from his blog (http://fabiouechi.blogspot.com) and GitHub (https://github.com/fabito).