Building a ChatBot from your Documentation Website | DocsGPT

Hi everyone! It’s been a long time, so I thought I should talk about an ongoing project I’m working on. Introduction: what are we trying to do here? Let me start with some context and a disclaimer first: this was part of an interview process at one of the amazing startups, and I wanted to extend it into an end-to-end project (somewhat out of scope for that interview process). I won’t be naming the startup here, to help keep them anonymous and candid for their future candidates. ...

February 26, 2024 · 6 min · Kushashwa Ravi Shrimali

Data Scraping for ChatBot Model in Rust | DocsGPT | Part-2

Alright everyone, we are back. Just FYI, we’ve had an introductory blog on DocsGPT before: https://krshrimali.github.io/posts/2024/02/building-a-chatbot-from-your-documentation-website-docsgpt/. This is a follow-up blog where we’ll discuss data scraping and preprocessing so that we can finetune our model for the ChatBot use-case. Quick recap? The input is going to be a single link to a documentation page (the index page). We need to fetch data for “all the internal pages”. Preprocess (and/or clean) and transform the data to be able to finetune the model. Finetune the model and use it for the ChatBot use-case. In this blog, we’ll be covering the first two above, and the rest will be covered in the next blog(s). ...

February 26, 2024 · 6 min · Kushashwa Ravi Shrimali
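
To make the recap in the Part-2 teaser a bit more concrete, here is a minimal sketch of the "fetch all the internal pages" step: given the HTML of an index page, collect the links that stay on the same site. This is not the code from the post. The function names `extract_hrefs` and `internal_links`, the example domain, and the naive `href="..."` string scan are all assumptions made for illustration; a real scraper would more likely use an HTTP client and a proper HTML parser.

```rust
use std::collections::BTreeSet;

/// Collect the value of every double-quoted `href="..."` attribute in `html`.
/// Hypothetical helper: a plain string scan, not a real HTML parser.
fn extract_hrefs(html: &str) -> Vec<String> {
    let marker = "href=\"";
    let mut hrefs = Vec::new();
    let mut rest = html;
    while let Some(start) = rest.find(marker) {
        let after = &rest[start + marker.len()..];
        match after.find('"') {
            Some(end) => {
                hrefs.push(after[..end].to_string());
                rest = &after[end + 1..];
            }
            // Unterminated attribute: stop scanning.
            None => break,
        }
    }
    hrefs
}

/// Keep only links that point inside the documentation site.
/// Assumption: `base` is the site origin, e.g. "https://docs.example.com".
/// Root-relative links ("/install/") are resolved against it; fragments and
/// trailing slashes are dropped so each page appears once.
fn internal_links(base: &str, html: &str) -> BTreeSet<String> {
    let base = base.trim_end_matches('/');
    let base_prefix = format!("{}/", base);
    let mut links = BTreeSet::new();
    for raw in extract_hrefs(html) {
        // Strip any "#fragment" part.
        let target = raw.split('#').next().unwrap_or("");
        if target.is_empty() {
            continue;
        }
        if target == base || target.starts_with(&base_prefix) {
            // Absolute link on the same site.
            links.insert(target.trim_end_matches('/').to_string());
        } else if target.starts_with('/') && !target.starts_with("//") {
            // Root-relative link: resolve against the origin.
            let full = format!("{}{}", base, target);
            links.insert(full.trim_end_matches('/').to_string());
        }
        // Anything else (other hosts, mailto:, relative paths) is skipped here.
    }
    links
}
```

Calling `internal_links("https://docs.example.com/", page_html)` on the index page's HTML returns a sorted, de-duplicated set of same-site URLs, which could then be fetched one by one and passed on to the preprocessing step. Relative links such as `../guide.html` are ignored in this sketch to keep it short.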