Success Story

Building an AI-enabled Speech Recognition Engine for a Legal-tech Firm

The client is a Nigerian legal technology solutions company that spearheads the development of innovative software products for seamless judgment delivery. The firm has also been providing a diverse catalog of digital solutions that assist 1000s of law students, legal practitioners, lecturers, and judges in their research work.

Services Used:

AI Application Development Conversational AI Solutions

Business Situation

Transcription of court case proceedings are essential for helping legal practitioners and judges in delivering trial outcomes accurately. Manual transcription requires very delicate work and careful attention to every single spoken detail during a particular deposition, which often leads to human errors in the final transcript.

Additionally, there is a considerable shift in accents while moving from region to region within Nigeria, and human transcribers are unlikely to be accustomed to all of these accents. So they would sometimes miss out entire sentences while listening and transcribing.

The legal technology company had the innovative idea of facilitating error-free transcription in Nigerian courts with an audio transcription software. The company chose to leverage Daffodil’s expertise in AI, and in enabling businesses to train machines to process large volumes of natural language data flawlessly.

The Daffodil team was required to ensure that the audio transcription software they would develop satisfies the following conditions:

Enable the capability to transcribe streaming audio and video.
Automate the process of data augmentation into the appropriate format and size that would be consumable for the software.
Train the software to understand variations in accents and intonations in spoken English across regions in Nigeria.
Eliminate issues with noise that could lead to unnecessary aberrations in the transcripts.
Regularly maintain and train the software solution based on new data sets collected from proceedings and judgements.

The Solution

When it comes to speech recognition technologies, the world is still figuring out the best AI strategies to do so with the cleanest possible methods. The Daffodil AI team opted for the transfer learning-based Machine Learning (ML) model to develop an Automated Speech Recognition (ASR) engine for transcribing live court depositions.

The development of the ASR engine and training the machine to listen to audio and transcribe accurately involved a lengthy process. There was plenty of research and innovation that went into the conceptualization of this process as described in the following sections:

Automating Data Preparation

The ASR machine learning model and engine required thousands of hours of data – both audio of depositions and previously transcribed texts. The audio had to be mono-channel, with a 16 kHz sampling rate for the model to be trained to give more accurate outcomes. This preparation of data before being fed to the ASR engine was automated by our team, so that the engine was equipped to decipher several local Nigerian accents and nuances in the vocabulary.

Pre-Training For Data Augmentation

Once the data was fed into the ASR engine, next came the inclusion of optional modules for data augmentation and pre-training wherever required. Pre-training the ASR machine learning model meant fine-tuning its accuracy through transfer learning methodologies. So the audio files were in WAV format of several people with different linguistic nuances saying the same words. Data training was done using a set of 100,000+ words and linguistics samples. The audio had to be converted and the model was monitored for loss and accordingly fixed when needed.

Incremental Readability

The post-processing pipeline of the ASR machine learning model is quite complex in terms of the text produced and may occasionally have readability issues. An Inverse Text Normalization (ITN) mechanism was implemented to take care of delicate changes that improved the overall readability of the transcribed depositions.

Accurate Text Records

Audio of depositions can be transcribed with near-perfect accuracy. The ASR engine has been trained with so much data that the Word Error Rate (WER) and Character Error Rate (CER) of the audio transcription software never rises above 20%. Depositions, conversations, and hearings can be transcribed to the very letter with no redundant elements caused by noise.

The Impact

The legal technology solutions company was able to use the final product to help hundreds of courts across Nigeria to automate the transcription of case proceedings, hearings, and depositions and achieve 80% accuracy of the speech-to-text conversion. This made for greater accuracy in the resultant transcripts and ease in delivering judgments. This also reduced the TAT of the documentation process by 35%. Daffodil’s innovative AI approach in developing the solution and swift turnaround time was extremely appreciated by the company.

100,000+
size of training dataset
80%
accurate text records of audio depositions
35%
decrease in documentation TAT

Read Related Case Studies

mercaDolar ensured fraud-free transactions of American dollars in Venezuela by developing a legal currency exchange platform

Explore Case Study

Developing an AI-based data management solution to remove redundancy from data points

Explore Case Study

Daffodil helps an urban planning firm to leverage AI and detect green space from geo-spatial data.

Explore Case Study

Get in Touch

Sign up for a 30 min no-obligation strategic session with us

Let us understand your business objectives, set up initial milestones, and plan your software project.

Sign up for a 30 min no-obligation strategic session with us

Let us understand your business objectives, set up initial milestones, and plan your software project.

At the end of this 30 min session, walk out with:

Validation of your project idea/ scope of your project
Actionable insights on which technology would suit your requirements
Industry specific best practices that can be applied to your project
Implementation and engagement plan of action
Ballpark estimate and time-frame for development

Software Consulting

Cloud Consulting

Legacy System Modernization

Security & Compliance Advisory

AI Strategy Consulting

UI/UX Design

Discovery & MVP Planning

Product Experience Design

Web Application Development

Mobile App Development

SaaS & Platform Development

API Development & Integrations

Cloud Services

Cloud-Native Applications

DevOps Services

Data & ML Engineering

GenAI & LLM Solutions

Conversational AI Solutions

Agentic AI Solutions

Computer Vision solutions

Automation Testing

VAPT Testing

Performance Testing

API Testing

Software Maintenance and Support

Application Modernzation

Cloud Migration

Performance Marketing

SEO Services

App Store Optimization

Data Enrichment

Hire Dedicated Developers

Offshore Development Center

Healthcare Software Development

Healthcare Apps Development

Telemedicine App Development

EMR/EHR Development

Patient Engagement Solutions

RPM Solutions

Home Care Software Development

AI-Based Healthcare Solutions

SaaS Product Development

Software Modernization

Offshore Development Center

Hire Software Developers

CTO as a Service

POS Solutions

Custom Accounting Solutions

Stock Trading Platforms

Gold Trading Platforms

Insurtech Solutions

eCommerce App Development

D2C Commerce Solutions

Marketplace Development

Retail Software Solutions

Travel Application Development

Fleet & Asset Management Systems

Ticketing & Fare Management

OTT Platform Development

Live Streaming Applications

CMS Solutions

CORE Banking Software

Cloud Banking Software

Loan Management Software

Inventory Management Systems

POS Systems

Restaurant Management Software

Custom CRM Software

About US

Partnerships

Career & Culture

Leadership

Podcasts

Insights Blog

Corporate Social Responsibility

Consulting

Design

Software Development

AI/ML Solutions

Software Testing and QA