01.0 / Experience
AI Engineer
GeeksforGeeks KIIT
Operating inside applied ML systems where model accuracy, inference latency, and deployment constraints directly affected usability and system reliability.
Inference pipeline
−35%
inference latency after TensorRT optimization
Architecture
Input video
30 fps
CNN encoder
7.8 ms
LSTM state
128 MB
Beam decoder
98.2%
Prediction
edge
CNN-LSTM pipeline quantized with TensorRT and deployed to Jetson Nano under a 128 MB memory budget. Cold-start guards prevent partial-state serving on resource-constrained restarts.
Deployment
Change log
