As Google started to limit academic storage, I started looking at Google Takeout and started wondering what I could do with all that data. I downloaded all the posts from the blog, since we use Google's blogger, and ran them through OpenAI's Ada Embedding. The Ada embedding maps text up to 8192 words into a point on the 1536-dimensional unit sphere. You can measure the similarity between two embeddings via a simple dot product, giving you the cosine of the angle between them.
So I created a semantic search for the blog. Go ahead and try it out.
Does the search engine for the blog that you built differ from the one that is already on the blog (left upper corner?)
ReplyDelete