Harnham Linkedin · Posted 22d ago

AI Software Engineer

Dallas-Fort Worth Metroplex



Software Engineer – Backend AI / LLM Applications


Location: Dallas, TX or Houston, TX

Work Model: Hybrid

Employment Type: Contract-to-Hire

Pay Rate: Around $60-80/hr

Sponsorship: Not available now or in the future

Relocation: Not available


About the Role

We’re looking for a Software Engineer to help build production AI features powered by large language models. This is a hands-on engineering role focused on backend development, APIs, services, and integrating AI functionality into real applications used across the business.

This is not a research role or a prompt-only position. The team needs someone who can build reliable backend systems, work with LLM APIs, support RAG-based features, and help deploy AI functionality into production environments.

The ideal candidate is a backend-focused engineer with strong experience in Python, Java, or a similar language, plus hands-on exposure to LLMs, RAG, embeddings, vector search, or AI API integrations.


What You’ll Do

  • Build backend services and APIs that integrate LLMs into business applications
  • Develop AI-powered features using retrieval-augmented generation (RAG)
  • Work with structured and unstructured data to support AI features
  • Integrate AI functionality into existing applications and workflows
  • Build scalable services using APIs, async processing, queues, and microservices patterns
  • Help improve AI features around latency, retrieval quality, cost, and reliability
  • Deploy, monitor, troubleshoot, and improve production applications
  • Partner with engineering, data, and product teams to deliver usable AI solutions


What We’re Looking For

  • Backend software engineering experience with Python, Java, or a similar language
  • Experience building APIs, backend services, microservices, or distributed applications
  • Hands-on exposure to LLMs, AI APIs, RAG, embeddings, vector search, or semantic search
  • Experience building and shipping software beyond prototypes or school projects
  • Familiarity with cloud environments such as AWS, Azure, or similar
  • Strong understanding of production reliability, monitoring, testing, and scaling
  • Ability to work hybrid in Dallas or Houston
  • Must be authorized to work in the U.S. without sponsorship now or in the future


Nice to Have

  • Experience with FastAPI, Flask, Django, Spring Boot, or Node.js
  • Experience with LangChain, LlamaIndex, OpenAI, Azure OpenAI, or similar
  • Familiarity with Pinecone, Weaviate, FAISS, Chroma, or other vector databases
  • Experience improving retrieval quality through chunking, ranking, filtering, or evaluation
  • Experience building internal AI tools, copilots, chatbots, automation tools, or document search applications
  • Docker, Kubernetes, CI/CD, or production deployment experience


Why This Role

This is a strong opportunity for a backend engineer who wants to move deeper into applied AI and production LLM development. You’ll be building real AI features, not just prototypes, and working on practical problems around APIs, RAG, scalability, reliability, and enterprise application integration.

The role is contract-to-hire, with the goal of converting into a long-term position based on performance and business need.
