1001 Freelance Projects
Latest Projects from
Freelance Marketplaces
View Project
View this project in detail
(Note: you will be redirected to external marketplace)
Project title:
AI Engineer Needed to Optimize LangChain + AWS Bedrock App
Posted by:
External project from PeoplePerHour
Started:
03-Nov-2025 17:02 GMT
Description:
We have an AI agent application built with Python, LangChain, and AWS Bedrock that currently takes around 40 seconds per LLM response. We need to reduce latency dramatically for investor demos, ideally under 10 seconds. The backend is Flask (Python 3.10) on AWS Lambda with a React frontend and Bedrock Claude models.

You’ll be responsible for targeted performance fixes focused on measurable speed gains. The work includes optimizing Bedrock configuration, implementing real token-by-token streaming, adding Redis caching to replace S3-based message storage, and validating performance improvements with before-and-after latency metrics.

Estimated 6 hours of work.

Tasks

Optimize Bedrock Model Configuration: update bedrock_config.py to disable thinking mode, remove unnecessary budget_tokens, and lower temperature from 1.0 to around 0.2–0.3 for deterministic, faster responses. Confirm that the configuration change reduces token generation delay and verbosity.

Implement Real Token Streaming (Backend): replace agent.invoke with a streaming method using Bedrock ConverseStream or LangChain’s stream API. Ensure partial tokens are sent to the client in real time and test time-to-first-token performance.

Enable Live Streaming Display (Frontend): update the React frontend to handle streamed events progressively so users see text as it generates. Confirm the UI starts displaying output within 2–3 seconds of sending input.

Add Redis Caching for Chat Session Memory: replace S3-based chat history with Redis for in-memory storage. Update the chat_history_manager logic, validate cache persistence, and confirm message load time is near-instant.

Measure and Document Latency Improvements: record baseline timing (total response and time-to-first-token), re-measure after optimizations, and summarize the before/after results. Confirm at least a 4–5× improvement in perceived speed. All optimizations must preserve the exact response content and formatting from the LLM - only response speed may change.

Deliverables
• Updated, tested backend and frontend code (GitHub commit or zip)
• Before/after latency test results (text or JSON summary)
• One short summary of what was changed and verified

Questions - please answer all in proposal

Describe your experience optimizing latency in LangChain or Bedrock-based applications.

Have you implemented real token streaming (not chunked post-processing) before?

What is your preferred setup for Redis caching in a Python/AWS environment?

Are you comfortable modifying both Python backend and React frontend code?

Can you start immediately and complete project within 48 hours of getting contract offer?
Project ID:
3456506
Project category:
Project budget:
View this project in detail
(Note: you will be redirected to external marketplace)
Last Projects / Browse Projects
  Project Started
Conga CLM Developer Needed ASAP
Category: Agile Development, Change Management, Contract Management, Project Management, Salesforce App Development, Salesforce.com, SAP, Technical Documentation
Budget: ₹150000 - ₹250000 INR
13 Mar 2026 11:04 GMT
Site Verification in Dakovo, Croatia
Category: Business Analysis, Inspections, Local Job, Photography, Travel Ready
Budget: $25 - $50 USD
13 Mar 2026 11:04 GMT
Real Estate Reporting UI Revamp
Category: CSS, CSS3, HTML, HTML5, JavaScript, Laravel, Web Design, Web Development
Budget: ₹600 - ₹1500 INR
13 Mar 2026 11:04 GMT
Revamp Crashed WordPress Site
Category: CSS, HTML, PHP, Web Hosting, Web Development, WordPress, WordPress Design, WordPress Plugin
Budget: ₹600 - ₹1500 INR
13 Mar 2026 11:04 GMT
Diseñador Gráfico y Creador de Contenido Remoto
Category: Photoshop, Content Creation, Graphic Design, Illustration, Logo Design, Social Media Marketing, Video Editing
Budget: $8 - $15 USD
13 Mar 2026 11:04 GMT
Automated AI Video Workflow Development
Category: 3D Animation, After Effects, AI Chatbot Development, AI Content Creation, AI Model Development, Animation, API Development, JavaScript, N8n, Video Production
Budget: $10 - $30 USD
13 Mar 2026 11:01 GMT
Incoming Call Message Specialist
Category: Audio Services, Call Center, Customer Service, Customer Support, English (US) Translator, Phone Support, Telemarketing, Telephone Handling, Virtual Assistant, Voice Talent
Budget: €6 - €12 EUR
13 Mar 2026 11:01 GMT
Sales Support (Loom/Video) Assistant Needed
Category: B2B Marketing, Closer, Customer Service, Internet Marketing, Lead Generation, Marketing, Sales, Sales Management, Video Services
Budget: €12 - €18 EUR
13 Mar 2026 10:59 GMT
Nominee Director for UK Ltd Company 13 Mar 2026 10:59 GMT
Support Ticket Data Spreadsheet -- 2
Category: Customer Support, Data Analysis, Data Management, Data Processing, Excel, Google Sheets, PHP
Budget: $2 - $8 CAD
13 Mar 2026 10:59 GMT
Google Ads misrepsentation 13 Mar 2026 10:59 GMT
B2B Google Ads Lead Campaign
Category: Analytics, B2B Marketing, Google Ads, Google Adwords, Internet Marketing, Keyword Research, Marketing, SEO
Budget: ₹600 - ₹1500 INR
13 Mar 2026 10:58 GMT
Convert Avada Site to Static HTML
Category: CSS, Frontend Development, HTML, HTML5, JavaScript, SEO, Web Development, Web Design, Website Optimization, WordPress
Budget: $250 - $750 USD
13 Mar 2026 10:58 GMT
Consultancy Website Development & Branding
Category: Elementor, Graphic Design, Web Design, Web Development, WordPress
Budget: €250 - €750 EUR
13 Mar 2026 10:57 GMT
Custom WooCommerce Store Development
Category: API Integration, HTML, Payment Gateway Integration, PHP, Web Design, Web Development, WooCommerce, WordPress
Budget: €6 - €12 EUR
13 Mar 2026 10:56 GMT
Browse All Projects
Projects by Skills ...
android
ajax
asp
aspnet
cms
cpp
csharp
css
delphi
design
drupal
excel
facebook
flash
html
java
javascript
joomla
iphone
mysql
photoshop
php
python
ruby
seo
sql
sysadm
translate
typing
twitter
vbnet
xml
wordpress
writing
New!
Проекты на русском
(Projects in Russian)

Copyright © 2005-2025
1001 Freelance Projects