{% extends "se/base.html" %} {% load static %} {% block css %} .info-block { display: flex; align-items: center; margin-bottom: 1rem; } .info-icon { width: 32px; height: 32px; margin-right: 1rem; } .info-block strong { font-size: 1.2rem; margin-bottom: 0.5rem; } {% endblock %} {% block body %}

Sosse 🦦

Sosse (Selenium Open Source Search Engine) is an open source web archiving tool, crawler, and search engine. It’s designed to handle dynamic web content, and is built for transparency, reproducibility, and long-term usability.

It is ideal for monitoring, archiving, or indexing web pages — including those rendered with JavaScript. Sosse supports scheduled crawling, advanced querying, feed generation, and private search features.

Sosse is written in Python and supports browser-based crawling via Selenium, with Firefox or Chromium, and faster headless crawling using Requests. PostgreSQL is used as the primary database backend.

Resources

Website
Official Website
Visit sosse.org for commercial offers, hosting services, and documentation.
AGPL License
License
Sosse is released under the GNU AGPL v3, requiring all modifications and hosted uses to remain open.
GitLab
Main Repository
GitLab hosts the main codebase, issues, and CI. Use it for merge requests, bug reports, or tracking development.
GitHub
Mirror Repository
GitHub offers a convenient mirror for visibility and community interaction.

Current Version: Sosse {{ settings.SOSSE_VERSION_TAG }} {% if settings.SOSSE_VERSION_TAG != 'dev' %} (commit {{ settings.SOSSE_VERSION_COMMIT }}) {% endif %}

© 2022–2025 Laurent Defert.

{% endblock %}