
Language
Python
Tool Type
Web app
License
The MIT License
Version
1.0.0
Open Knowledge Foundation

The “Querido diario” Project of the Open Knowledge Foundation Brazil uses artificial intelligence to classify, contextualize and expand the information contained in the official Brazilian municipal newspapers, facilitating its access on a platform that allows the data to be viewed in an open and friendly way. Its objective is to bring the official municipal newspapers into the digital age, centralizing the information that is currently available only in dispersed and inaccessible formats. This centralization makes official data more accessible and verifiable, thus strengthening the Access to Information Law in Brazil.
It digitizes and centralizes official information dispersed in inaccessible formats, improving the accessibility and verifiability of data in an open and friendly format. In addition to improving access to information, the initiative promotes collaboration and community development, maintaining its open source base.
Scrapes official government websites for gazettes Standardizes data into a uniform format Provides searchable access to scraped data Maintains an open-source codebase for community contributions Focuses on Brazilian municipalities
Built primarily in Python, it employs the Scrapy framework for efficient data scraping. Adopts an open-source approach, promoting transparency and collaboration. Licensed under the MIT license, it ensures code flexibility and reuse. Encourages community contributions, facilitating collaborative development and continuous improvement of the tool.

Connect with the Development Code team and discover how our carefully curated open source tools can support your institution in Latin America and the Caribbean. Contact us to explore solutions, resolve implementation issues, share reuse successes or present a new tool. Write to [email protected]

OKFBR's "Dear Diary" project uses AI to enhance access to municipal newspaper data in Brazil, including initiatives like "Operation Serenata do Amor" and tools like Rosie and Jarbas.

"Dear Diary" site with search bar to explore official diaries. Stats: 12 diaries on platform, 2,226 ready to collect, 597 ready to scrape. Similar mobile display.

"Dear Diary" flowchart shows scrapers extracting data from the Official Gazette. It uses Elasticsearch for indexing, connects to census data via API, and visualizes on a web platform.
Project that centralizes and facilitates access to official municipal newspapers in Brazil.
Detailed guide on the installation and use of the tool.
Presentation on the impact of the project on access to public information.
