DokuAI – Smart Documentation in Minutes

Inspiration

DokuAI was inspired by a close friend working in the fire safety industry. His daily routine involves capturing hundreds or even thousands of photos and writing notes while on-site. In the evenings, he then spends hours compiling all this material into detailed reports. This repetitive and exhausting process highlighted a clear problem: valuable time is being lost to paperwork rather than actual fieldwork.

The vision behind DokuAI is to eliminate these inefficiencies by automating the reporting process. With modern AI technology and scalable infrastructure, it is possible to transform raw data into professional reports in minutes instead of hours. The purpose of DokuAI is to give professionals more time to focus on the work that matters, while ensuring documentation is consistent, accurate, and delivered quickly.


What it does

DokuAI transforms images and audio recordings into structured reports with minimal effort:

  • Photos and audio can be captured directly in the app or uploaded from the device.
  • Audio recordings are automatically transcribed, and images are analyzed with metadata extraction.
  • Geodata is processed through Google Maps with both automatic and manual location detection options.
  • Content is summarized, structured, and formatted into a professional report.
  • Reports include images, transcribed text, timestamps, and location data.
  • Reports can be generated in multiple languages with automatic translation into seven target languages.
  • An optional feature allows new photos taken in the app to be automatically saved into the device photo library.

On average, the complete process from media upload to finished report takes less than five minutes.


How I built it

Backend

  • Implemented using multiple Quarkus instances written in Java.
  • PostgreSQL database for structured data management.
  • Liquibase for database schema versioning and migrations.
  • Google Maps API integrated for geodata processing, supporting both automatic and manual location handling.

AI Processing

  • OpenAI API used for transcription, summarization, translation, and analysis.
  • Prompt engineering for accuracy and consistency in industry-focused scenarios.

Architecture

  • Asynchronous job handling designed with custom schedulers.
  • UUIDs used for job tracking, ensuring security and traceability.
  • Backend scaled horizontally with multiple Quarkus instances in production.

Frontend

  • Built with Flutter, initially focused on iOS.
  • Designed to provide a responsive and native user experience.

Deployment and CI/CD

  • Source code hosted and maintained on GitHub.
  • GitHub Actions pipelines automate build, test, and deployment processes.
  • Backend deployed to Kubernetes using ArgoCD for GitOps workflows.
  • Flutter app built automatically in GitHub Actions and deployed to App Store Connect for TestFlight and App Store distribution.

Security

  • HTTPS-only communication.
  • Token-based authentication.
  • Detailed tracking through job IDs and database timestamps.

Challenges I ran into

Developing DokuAI alone required solving multiple technical and organizational challenges:

  • Managing efficient and reliable uploads of large photo and audio files.
  • Ensuring accurate and consistent AI outputs in specialized industries such as inspection, construction, and fire safety.
  • Designing a user interface that is simple enough for non-technical users but robust enough for professionals.
  • Balancing AI processing costs with performance and accuracy.
  • Orchestrating a complex deployment setup, including multiple Quarkus services, PostgreSQL, Kubernetes clusters, ArgoCD, and CI/CD pipelines for both backend and frontend.
  • Handling automatic versus manual geodata capture reliably while preserving user control and compliance.
  • Supporting multilingual reporting and ensuring translations remain both accurate and context-aware.

Accomplishments that I am proud of

  • Delivered a complete end-to-end process that automates reporting in less than five minutes.
  • Achieved high accuracy in transcription, summarization, translation, and structured data extraction.
  • Built a scalable backend with multiple Quarkus instances running in Kubernetes.
  • Fully automated CI/CD pipelines: backend deployments via ArgoCD and Flutter app distribution directly to App Store Connect.
  • Added multilingual reporting with automatic translation into seven target languages.
  • Implemented both automatic and manual geodata detection to cover diverse field conditions.
  • Recruited nearly 90 beta testers through X.com, who provided valuable feedback during early development.
  • Achieved the number one ranking in the German App Store for the keyword “Dokumentation.”

What I learned

Building DokuAI alone provided direct experience across all aspects of modern software product development:

  • Prompt engineering is essential for accurate and consistent AI-driven outputs.
  • Asynchronous job handling is critical for responsive applications where heavy processing occurs in the background.
  • A simple and intuitive interface is a major factor for adoption, especially in industries with low digital affinity.
  • Building scalability early prevents costly redesigns in later stages.
  • Early App Store release accelerated the feedback loop and provided insights beyond private testing.
  • Direct engagement with users via X.com enabled fast recruitment of testers and established trust with the community.

What's next for DokuAI

The future development of DokuAI will focus on extending capabilities and deepening adoption:

  • Release of the Android version to reach a wider user base.
  • Industry-specific, customizable report templates (e.g., fire safety, construction, maintenance).
  • Enhanced offline capture functionality with delayed upload for poor connectivity scenarios.
  • Integrations with CRM systems and project management tools to align with enterprise workflows.
  • Team-based features including roles, approvals, shared reports, and comments.
  • Expanded quality assurance with validation rules, mandatory fields, and plausibility checks.
  • Additional multilingual support beyond the seven existing target languages.
  • Advanced cost optimization strategies including batch processing, caching, and dynamic model selection.

DokuAI continues to evolve as a comprehensive solution for structured documentation. It reduces the time required for reporting from hours to minutes, ensuring that professionals can focus on the tasks that matter most while producing consistent, reliable reports.

Built With

Share this project:

Updates