Pritom Saha

Dallas, Texas · pritom.saha@utdallas.edu

I'm Pritom, a graduate Computer Science student at the University of Texas at Dallas. I am experienced in research, development and deployment of deep learning and machine learning models and accustomed to data analysis and visualization.

Research Interest: I am deeply passionate about artificial intelligence, with extensive experience in developing deep learning models, particularly in the field of computer vision. I am also keenly interested in natural language processing (NLP) and large language models. Currently, my work focuses on evidential deep learning, aiming to make AI models more trustworthy and safer. I am a strong advocate for the pursuit of Artificial General Intelligence (AGI), and it is my long-term goal to contribute to research in this area.

Experience

Exposure

Teaching Assistant

The University of Texas at Dallas

Served as a teaching assistant for the Programming Language Paradigms course and the Database Design course.
Held weekly office hours and graded assignments.

Aug. 2023 - PRESENT

Software Engineer (AI/ML)

REEA Digital Ltd.

Developed a knowledge mining solution utilizing Azure Cognitive Services and the OpenAI GPT model API.

Apr. 2023 - Jul 2023

Machine Learning Engineer

Apurba Technologies Ltd.

Trained and developed novel Bangla word-level Optical Character Recognition (OCR) and Bangla Optical Handwriting Recognition (OHR) deep learning models as part of the Enhancement of Bangla Language in ICT through Research & Development (EBLICT) project, under the Ministry of ICT, Government of Bangladesh.
Deployed multi-framework models for inference serving on both CPU and GPU servers using NVIDIA Triton Inference Server and Python web frameworks such as FastAPI and Flask.
Conducted Document Denoising efforts to improve the accuracy of word segmentation and recognition processes.
Contributed to the development of a Bangla Text-to-Speech deep learning model, enabling natural-sounding voice output from Bangla text.
Customized a Natural Language Processing (NLP) project to identify similarities between doctor notes in natural language and SNOMED CT database concepts, leveraging state-of-the-art language models.

Sep. 2021 - Mar. 2023

Research Assistant (RA)

Department of ECE, North South University

Explored deep learning techniques using synthetic data to address the scarcity of real-world data.
Developed and studied quantized models to achieve a reduced memory footprint and faster inference on CPU.

Dec. 2020 - Aug. 2021 / 9 Mon.

Undergraduate Teaching Assistant (UGTA)

Department of ECE, North South University

Served as a teaching assistant for the Object-Oriented Programming course and the Microprocessor Interfacing & Embedded Systems course.
Held weekly office hours and graded assignments.

Feb. 2020 - Sep. 2021 / 1 Yr. 8 Mon.

Education

Academics

The University of Texas at Dallas

M.Sc. in Computer Science & Ph.D. in Computer Science

CGPA: 3.83 on a scale of 4.00

Project: Uncertainty Quantification: Conducted research on uncertainty quantification for convolutional neural networks and graph neural networks.

Project: HAYSTAC: Contributed to the HAYSTAC project funded by IARPA, specifically working on a sub-task that involved applying constraint optimization techniques to determine the optimal path for a routing task.

Aug. 2023 - PRESENT

North South University

B.Sc. in Computer Science and Engineering

CGPA: 3.74 on a scale of 4.00 (Graduated magna cum laude.)

Thesis: Speaker verification: Speaker verification system using deep learning model that can verify the identity of a speaker from raw waveform.

Project: Line Follower Robot: An autonomous line‑follower robot that follows a line using an infrared sensor. Also capable of navigating through a cave in the absence of a line. Participated in national line follower robot competitions.

Project: Ashbab e-commerce Application: An e-commerce android application that uses augmented reality to preview furniture before buying.

Project: Sketch Colorization: Coloring black and white sketches using generative adversarial network.

Jan. 2017 - May 2021

Publications

Papers

A Multifaceted Evaluation of Representation of Graphemes for Practically Effective Bangla OCR

International Journal on Document Analysis and Recognition (IJDAR)

Abstract: Bangla Optical Character Recognition (OCR) poses a unique challenge due to the presence of hundreds of diverse conjunct characters formed by the combination of two or more letters. In this paper, we propose two novel grapheme representation methods that improve the recognition of these conjunct characters and the overall performance of OCR in Bangla. We have utilized the popular Convolutional Recurrent Neural Network architecture and implemented our grapheme representation strategies to design the final labels of the model. Due to the absence of a large-scale Bangla word-level printed dataset, we created a synthetically generated Bangla corpus containing 2 million samples that are representative and sufficiently varied in terms of fonts, domain, and vocabulary size to train our Bangla OCR model. To test the various aspects of our model, we have also created 6 test protocols. Finally, to establish the generalizability of our grapheme representation methods, we have performed training and testing on external handwriting datasets. Experimental results proved the effectiveness of our novel approach. Furthermore, our synthetically generated training dataset and the test protocols are made available to serve as benchmarks for future Bangla OCR research.

2023

Rethinking Task-Incremental Learning Baselines

26th International Conference on Pattern Recognition (ICPR)

Abstract: It is common to have continuous streams of new data that need to be introduced in the system in real-world applications. The model needs to learn newly added capabilities (future tasks) while retaining the old knowledge (past tasks). Incremental learning has recently become increasingly appealing for this problem. Task-incremental learning is a kind of incremental learning where task identity of newly included task (a set of classes) remains known during inference. A common goal of task-incremental methods is to design a network that can operate on minimal size, maintaining decent performance. To manage the stability-plasticity dilemma, different methods utilize replay memory of past tasks, specialized hardware, regularization monitoring etc. However, these methods are still less memory efficient in terms of architecture growth or input data costs. In this study, we present a simple yet effective adjustment network (SAN) for task incremental learning that achieves near state-of-the-art performance while using minimal architectural size without using memory instances compared to previous state-of-the-art approaches. We investigate this approach on both 3D point cloud object (ModelNet40) and 2D image (CIFAR10, CIFAR100, MiniImageNet, MNIST, PermutedMNIST, notMNIST, SVHN, and FashionMNIST) recognition tasks and establish a strong baseline result for a fair comparison with existing methods. On both 2D and 3D domains, we also observe that SAN is primarily unaffected by different task orders in a task-incremental setting.

2022

Analysis of Bangla Keyboard Layouts Based on Keystroke Dynamics

Intelligent Systems and Sustainable Computing, Proceedings of ICISSC 2021

Abstract: Keyboards are the primary devices for interaction with computer platforms and have been a central topic in HCI research for decades. The study of their layout design is important in deciding their efficiency, practicality and adoption. Multiple keyboard layouts have been developed for Bangla without any rigorous study for their comparison. In this paper, we take a quantitative data-driven approach to compare their efficiency. Our evaluation strategy is based on the key-pair stroke timing with data collected from standard QWERTY English keyboards. Our experiments conclude that the Bijoy keyboard layout is the most efficient design for Bangla among the four layouts studied. This quantitative approach can lay the groundwork for further study of these layouts based on other criteria.

2022

Skills

Tech Stack

Programming Languages & Tools

Workflow

Development of machine learning based solutions and prototyping.
Deployment of muti‑framework models for inference serving on both CPU and GPU servers.
Data analysis and visualization.
Android development.

Certifications

Licenses

Mathematics for Machine Learning and Data Science Specialization

DeepLearning.AI, Coursera

Courses covered Linear Algebra for Machine Learning and Data Science, Calculus for Machine Learning and Data Science and Probability & Statistics for Machine Learning & Data Science.

Jul. 2023

Machine Learning Specialization

DeepLearning.AI and Stanford University, Coursera

Courses covered Linear Regression, Logistic Regression, Regularization, Gradient Descent, Artificial Neural Network, Decision Trees, Tree Ensembles, Xgboost, Recommender Systems, K-means Algorithm and Reinforcement Learning.

Feb. 2023

Deep Learning Specialization

DeepLearning.AI, Coursera

Courses covered Deep Learning, Optimization Techniques, Convolutional Neural Networks and Sequence Models.

Sep. 2020

Automate the Boring Stuff with Python Programming

Al Sweigart, Udemy

Learned basic task automation with python, word, pdf and excel document parsing, desktop automation, web scrapping and email automation, regular expression etc.

Feb. 2020

Awards

Honors

Finalist - Robi Datathon 2.0, 2022
Champion - Whoosh!! Without Wheels by IEEE NSU SB, 2019
Runner up - IOT For Tomorrow by IEEE NSU SB, 2019
1^st Runner up - Electrathon by IEEE NSU SB, 2018
3^rd Place - Quiz Competition by Transparency International, 2012

Academic Projects

Projects

Speaker Verification

North South University

Speaker verification system using deep learning model that can verify the identity of a speaker from raw waveform.

Sep. 2020 - Apr. 2021

Sketch Colorization

North South University

Coloring black and white sketches using generative adversarial network.

May 2020 - Aug. 2020

Ashbab e-commerce app

North South University

An e-commerce android application that uses augmented reality to preview a product before buying.

2019

Line Follower Robot with Cave Navigation

North South University

An autonomous line-follower robot that follows a line using an infrared sensor. Also capable of navigating through a cave in the absence of a line.

2018

Extracurricular

Volunteering

Societies

Vice Chair

IEEE North South University Robotics and Automation Society Student Branch Chapter

Planned chapter activities and conducted weekly meetings. Took basic robotics workshops as an instructor.

Aug. 2020 - Dec. 2021

Content Writing Coordinator

IEEE North South University Student Branch

Planned events, wrote event proposals and event reports. Participated in all the workshops and seminars organized by the student branch.

Jan. 2019 - Aug. 2020

Committees

Content Writing Coordinator - IEEE Student Transition and Elevation Partnership (STEP) 2019, Bangladesh
Publication Coordinator - IEEE Student Professional Awareness Conference (SPAC) 2019, Bangladesh

Professional Memberships

Interests

Hobbies

Besides my professional and academic activities, I love to read books, watch TV shows and movies, travel to explore nature, and learn about diverse cultures of different nations throughout the world.

Watchlists