📝 Research :https://ojitha.blogspot.com.au for my lengthy articles.

Lua filters for Pandoc

February 28, 2025

Lua filter used in Pandoc 3.6.3. This blog has solutions for:

  • Creating Glossary for ePub ver 3 book
  • GitHub style alerts
More…

AWS PITR Explained

February 26, 2025

PITR stands for Point-in-Time Recovery, which is a feature offered by several AWS services to provide continuous data protection and the ability to restore data to a specific point in time.

More…

Semantic search with ELSER in Elasticsearch

May 11, 2024

Elastic Learned Sparse EncodeR(ELSER) is a retrieval model trained by Elastic that enables you to perform semantic search to retrieve more relevant search results.

Summary of ELSER process

  1. install ELSER v2: Only once (DevOPs will do for your)
  2. Create source index where you can insert all your documents
  3. Create target index
  4. Create ingestion pipeline
  5. Reindex process to create embeddings
  6. Ready to do semantic search using text expansion queries

I created this blog post on docker to demonstrate Linux-optimised ELSER v2. The Elasticsearch version is 8.11.1.

More…

Maven Proxy handling

April 26, 2024

Here are the common challenges when working with development tools like Maven and VSCode behind a corporate proxy within a WSL 2 environment.

Currently, my computer is behind the corporate proxy. As a Java Programmer, when I used Maven 3 on the WSL 2 Ubuntu 20.4 Linux environment, I had to set the proxy in the settings.xml under the /home/user/.m2 folder. Although I’ve set the proxy, it doesn’t work as expected: mvn compile complains about unresolved Scala dependencies. Another problem is that VSCode doesn’t show the intellisense. I’ve already installed the Microsoft Java extension pack completely.

More…