End-to-end Search and Analytics
About This Book
* Solve your data analytics problems with the Elastic Stack
* Improve your user search experience with Elasticsearch and develop your own Elasticsearch plugins
* Design your index, configure it, and distribute it - you'll also learn how it works
Who This Book Is For
This course is for anyone who wants to build efficient search and analytics applications. Some development experience is expected.
What You Will Learn
* Install and configure Elasticsearch, Logstash, and Kibana
* Write CRUDE operations and other search functionalities using the Elasticsearch Python and Java Clients
* Build analytics using aggregations
* Set up and scale Elasticsearch clusters using best practices
* Master document relationships and geospatial data
* Build your own data pipeline using Elastic Stack
* Choose the appropriate amount of shards and replicas for your deployment
* Become familiar with the Elasticsearch APIs
Elasticsearch is a modern, fast, distributed, scalable, fault tolerant, open source search and analytics engine. It provides a new level of control over how you can index and search even huge sets of data. This course will take you from the basics of Elasticsearch to using Elasticsearch in the Elastic Stack and in production.
You'll start with the very basics: Elasticsearch terminology, installation, and configuring Elasticsearch. After this, you'll take a look at analytics and indexing, search, and querying. You'll learn how to create maps and visualizations. You'll also be briefed on cluster scaling, search and bulk operations, backups, and security.
Then you'll be ready to get into Elasticsearch's internal functionalities including caches, Apache Lucene library, and its monitoring capabilities. You'll learn about the practical usage of Elasticsearch configuration parameters and how to use the monitoring API. You'll discover how to improve the user search experience, index distribution, segment statistics, merging, and more.
Once you have mastered this, you'll dive into end-to-end visualize-analyze-log techniques with Elastic Stack (also known as the ELK stack). You'll explore Elasticsearch, Logstash, and Kibana and see how to make them work together to build fresh insights and business metrics out of data. You'll be able to use Elasticsearch with other de facto components in order to get the most out of Elasticsearch. By the end of this course, you'll have developed a full-fledged data pipeline.
This Learning Path combines some of the best that Packt has to offer in one complete, curated package. It includes content from the following Packt products:
* Elasticsearch Essentials
* Mastering Elasticsearch, Second Edition
* Learning ELK Stack
Style and approach
This course aims to create a smooth learning path that will teach you how to effectively use Elasticsearch with other de facto components and get the most out of Elasticsearch. Through this comprehensive course, you'll learn the basics of Elasticsearch and progress to using Elasticsearch in the Elastic stack and in production.
Bharvi Dixit is an IT professional with extensive experience working on search servers (especially Elasticsearch) and NoSQL databases. He is currently working as a technology and search expert with GrownOut, a SAAS-based referral hiring solution provider company. He is the organizer and speaker of Delhi's Elasticsearch Meetup Group, which is one of the fastest growing Elasticsearch communities in India.
He also works as a freelance Elasticsearch consultant and has helped many small to medium size organizations in adapting Elasticsearch for different use cases, such as creating search solutions for big data-automated intelligence platforms in the area of counter-terrorism and risk management as well as in other domains such as recruitment, e-commerce, finance and log monitoring.
He holds a master's degree in computer science from LBSIM - Delhi, India, and has a keen interest in creating scalable backend platforms. His other areas of interest are data analytics, distributed computing, automations, and DevOps. Java and Python are the primary languages in which he loves to write code, and he has already built a proprietary software for consultancy firms.
In his spare time, he loves writing blogs and reading the latest technology books. He can be connected through LinkedIn at: https://in.linkedin.com/in/bharvidixit Rafal Kuc is a born team leader and software developer. Currently, he is working as a consultant and a software engineer at Sematext Group, Inc., where he concentrates on open source technologies, such as Apache Lucene, Solr, Elasticsearch, and the Hadoop stack. He has more than 13 years of experience in various software branches - from banking software to e-commerce products. He is mainly focused on Java but is open to every tool and programming language that will make the achievement of his goal easier and faster.
Rafal is one of the founders of the solr.pl website, where he tries to share his knowledge and help people with their problems related to Solr and Lucene. He is also a speaker at various conferences around the world, such as Lucene Eurocon, Berlin Buzzwords, ApacheCon, Lucene Revolution, and DevOps Days. He began his journey with Lucene in 2002, but it wasn't love at first sight. When he came back to Lucene in late 2003, he revised his thoughts about the framework and saw the potential in search technologies. Then came Solr, and that was it. He started working with Elasticsearch in the middle of 2010. Currently, Lucene, Solr, Elasticsearch, and information retrieval are his main points of interest.
Rafal is the author of Solr 3.1 Cookbook, its update Solr 4.0 Cookbook, and its third release Solr Cookbook, Third Edition. He is also the author of Elasticsearch Server and its second edition, along with the first edition of Mastering Elasticsearch - all published by Packt. Marek Rogozinski is a software architect and consultant with over 10 years of experience. He specializes in solutions based on open source search engines, such as Solr and Elasticsearch, and software stack for Big Data analytics, including Hadoop, Hbase, and Twitter Storm. He is also a cofounder of the solr.pl website, which publishes information and tutorials about Solr and Lucene libraries. He is the coauthor of Mastering ElasticSearch, ElasticSearch Server, and Elasticsearch Server Second Edition - all published by Packt.
Currently, he holds the position of chief technology officer and lead architect at ZenCard, a company processing and analyzing large amounts of payment transactions in real time, allowing automatic and anonymous identification of retail customers on all retailer channels (m-commerce / e-commerce / brick and mortar) and giving retailers a customer retention and loyalty tool. Saurabh Chhajed is a technologist with vast professional experience in building Enterprise applications that span across product and service industries. He has experience building some of the largest recommender engines using big data analytics and machine learning, and also enjoys acting as an evangelist for big data and NoSQL technologies. With his rich technical experience, Saurabh has helped some of the largest financial and industrial companies in USA build their large product suites and distributed applications from scratch. He shares his personal experiences with technology at: http://saurzcode.in
Saurabh has also reviewed books by Packt - Apache Camel Essentials and Java EE 7 Development with NetBeans 8 - in the past.