Estela

Web Scraping on-premise

The first open source scraping orchestration solution in the world

  • Save costs by owning your data pipeline
  • Usability reports of your projects and spiders
  • Elasticity, scalability and best practices built-in
Check out the Repo
Bitmaker brain

Trusted by



bitmaker web scraping

smat

agrometrika

emptor

Why Estela?

On-Premise

Estela is a scraping orchestration platform running on Kubernetes. It provides mechanisms to deploy, run and scale web scraping spiders via a REST API and a web interface.

Elasticity and scalability

Automate the workload generated by running spiders. Optimize your data operations effortlessly, ensuring scalability that grows with your business.

Cost Reduction

Experience substantial savings with Estela as it operates within your infrastructure, eliminating the need for external cloud platforms. Say goodbye to escalating costs and gain control over your budget.

Open-source

Our wholehearted commitment to the open-source community, its methodologies and principles runs in our core DNA and we embrace the open exchange of information, technology, transparency and collaborative development.

Ready to get started?

Bitmaker Cloud is an Estela instance hosted by Bitmaker.
Try it for free

Estela benefits
VS
Proprietary scraping platforms

Autonomy

Operations and processes can be reviewed on your own without depending on the service provider.

Multilingual

Supports many programming languages and frameworks such as Scrapy and python-requests, two most widely used in the industry.

Data sovereignty

Complete data security and privacy by being on your own servers

Control

On-premises infrastructure gives your organization complete control of resources, services, and data.

Technical Estela Features

Fault-Tolerant Architecture

Experience automation in resource distribution, ensuring seamless data extraction even in challenging scenarios.

Modular Architecture

Enjoy the freedom of creating and adapting new functionalities without compromising Estela's stellar performance.

Performance Evaluation

Harness the power of detailed graphs generated by Estela for an in-depth evaluation of resources consumed during execution.

Optimal Spider Execution Scheduling

Our fault-tolerant architecture optimizes spider execution scheduling, preventing undue pressure on your server infrastructure.

Traceability

Instantly identify problems and errors with Estela's traceability feature, providing a real-time view at the organization, project, spider, and job levels.

Elastic Compatibility

Adapt Estela to your needs; it accepts Scrapy and REQUEST, the two most widely used frameworks in the industry, ensuring flexibility and compatibility.

Latest news

2023-07-12

Estela Requests Support

Beta support for the Requests library (ec08db0) has been recently added to Estela and will continue to see improvements.


2023-07-12

Estela Notifications

Along with the record of activities on each project, users are now also notified when an action occurs in a project they are part of (2da9074).


2023-07-11

Live stats visualization

Redis is now used to store the stats of jobs in RUNNING status (5957952), allowing users to visualize the stats and resource consumption of their scraping jobs in real time.


2023-07-11

Estela Activity Menu

A new proposal and implementation for the Activity Menu have been introduced in Estela (5d4c8dc), allowing users to see the history of actions performed in each project.


2023-04-25

Project Stats

The project dashboard has seen a major overhaul. Features here include new charts that allow users to easily see the stats obtained from their scraping jobs, including various statistics views (536eaca, 2fdcd3e, fcd6e1e).

Technical Articles


Estela OSS release

Read More

Introducing requests support in Estela

Read More

Estela's Year-One Transformation

Read More