Skip to content

Commit

Permalink
website update oct 22 - 1
Browse files Browse the repository at this point in the history
  • Loading branch information
vasu-parspec committed Oct 22, 2023
1 parent 152578b commit f359542
Show file tree
Hide file tree
Showing 4 changed files with 45 additions and 4 deletions.
6 changes: 4 additions & 2 deletions index.html
Original file line number Diff line number Diff line change
Expand Up @@ -43,7 +43,9 @@ <h2>Education</h2>
</ul>
<h2>Work Experience</h2>
<ul>
<li><p>Freelancer, Computer Vision and Deep Learning (2020 - Present)</p>
<li><p>AI Lead, Parspec, India (Remote) (2021 - 2023)</p>
</li>
<li><p>Freelancer, Computer Vision and Deep Learning (2020 - 2021)</p>
</li>
<li><p>Senior Engineer, Chipset Architecture Design, Qualcomm, Bangalore, India (2016 - 2018) </p>
</li>
Expand All @@ -56,7 +58,7 @@ <h2>Work Experience</h2>
</ul>
<div id="footer">
<div id="footer-text">
Page generated 2021-08-28 14:33:35 India Standard Time, by <a href="http://jemdoc.jaboc.net/">jemdoc</a>.
Page generated 2023-10-22 22:28:58 India Standard Time, by <a href="http://jemdoc.jaboc.net/">jemdoc</a>.
(<a href="index.jemdoc">source</a>)
</div>
</div>
Expand Down
3 changes: 2 additions & 1 deletion index.jemdoc
Original file line number Diff line number Diff line change
Expand Up @@ -24,7 +24,8 @@ If you would like to connect or hack something cool together, feel free to reach
- B.Tech. Electrical Engineering, Indian Institute of Technology, Bhubaneswar, India (2008 - 2012)

== Work Experience
- Freelancer, Computer Vision and Deep Learning (2020 - Present)
- AI Lead, Parspec, India (Remote) (2021 - 2023)
- Freelancer, Computer Vision and Deep Learning (2020 - 2021)
- Senior Engineer, Chipset Architecture Design, Qualcomm, Bangalore, India (2016 - 2018)
- Application Engineer, Computational Electromagnetics and RF Simulations, CST of America, CA, USA (2015 - 2016)
- Intern, Computational Electromagnetics and RF Simulations, CST of America, CA, USA (Jun 2014 - Aug 2014)
Expand Down
27 changes: 26 additions & 1 deletion projects.html
Original file line number Diff line number Diff line change
Expand Up @@ -20,6 +20,31 @@
<div id="toptitle">
<h1>Projects</h1>
</div>
<h2>Work Projects at Parspec</h2>
<ul>
<li><p><b>Parsing and OCRing of tables in pdf documents</b> - developed custom computer vision algorithms for parsing structure of complex bordered tables ( tables with merged rows and columns ) with &nbsp;98% accuracy on our test set , much better than some of the available solutions we tested. OCRing of text was performed using available open-source and commercial solutions</p>
</li>
</ul>
<ul>
<li><p><b>Detecting header rows and specific columns in tabular data</b> - trained/fine-tuned BERT based models on custom dataset with &gt;95% accuracy on our test set</p>
</li>
</ul>
<ul>
<li><p><b>Extracting family/model names from lighting product datasheets</b> - developed custom annotations tool for faster, more efficient annotations; trained NER models on our custom dataset with &gt;90% accuracy on test set</p>
</li>
</ul>
<ul>
<li><p><b>Submittal Aggregator</b> - developed custom algorithms for matching rows in table of contents in submittal pdf documents to corresponding pages in the document.</p>
</li>
</ul>
<ul>
<li><p><b>Product Recommendation System</b> - developed and implemented a multi-stage pipeline involving word classification, next word prediction and character-token based NER for extracting important technical specs from product datasheets for recommending similar products. Also, evaluated other heuristics and sentence vector-similarity based approaches for product recommendation.</p>
</li>
</ul>
<ul>
<li><p><b>Distributed scalable pipeline for family name extraction and recommendation system</b> - developed and implemented a custom end-to-end pipeline involving document download, parsing and model inference for running on a distributed set of CPUs and GPUs. Motivation was to reduce number of GPUs required ( reduced from &gt;20 with a naive approach to 1 GPU with dynamic scaling option) and allow full customizability and easy debugging.</p>
</li>
</ul>
<h2>Consulting Projects </h2>
<ul>
<li><p><b>Analysed and implemented algorithms for estimation of melanin and hemoglobin distribution in face regions from
Expand Down Expand Up @@ -139,7 +164,7 @@ <h2>Signal Processing and Wireless Communications Projects</h2>
</ul>
<div id="footer">
<div id="footer-text">
Page generated 2021-12-02 13:31:04 India Standard Time, by <a href="http://jemdoc.jaboc.net/">jemdoc</a>.
Page generated 2023-10-22 22:30:09 India Standard Time, by <a href="http://jemdoc.jaboc.net/">jemdoc</a>.
(<a href="projects.jemdoc">source</a>)
</div>
</div>
Expand Down
13 changes: 13 additions & 0 deletions projects.jemdoc
Original file line number Diff line number Diff line change
@@ -1,6 +1,19 @@
# jemdoc: menu{MENU}{index.html}, showsource
= Projects

== Work Projects at Parspec
- *Parsing and OCRing of tables in pdf documents* - developed custom computer vision algorithms for parsing structure of complex bordered tables ( tables with merged rows and columns ) with ~98% accuracy on our test set , much better than some of the available solutions we tested. OCRing of text was performed using available open-source and commercial solutions

- *Detecting header rows and specific columns in tabular data* - trained/fine-tuned BERT based models on custom dataset with >95% accuracy on our test set

- *Extracting family/model names from lighting product datasheets* - developed custom annotations tool for faster, more efficient annotations; trained NER models on our custom dataset with >90% accuracy on test set

- *Submittal Aggregator* - developed custom algorithms for matching rows in table of contents in submittal pdf documents to corresponding pages in the document.

- *Product Recommendation System* - developed and implemented a multi-stage pipeline involving word classification, next word prediction and character-token based NER for extracting important technical specs from product datasheets for recommending similar products. Also, evaluated other heuristics and sentence vector-similarity based approaches for product recommendation.

- *Distributed scalable pipeline for family name extraction and recommendation system* - developed and implemented a custom end-to-end pipeline involving document download, parsing and model inference for running on a distributed set of CPUs and GPUs. Motivation was to reduce number of GPUs required ( reduced from >20 with a naive approach to 1 GPU with dynamic scaling option) and allow full customizability and easy debugging.

== Consulting Projects
- *Analysed and implemented algorithms for estimation of melanin and hemoglobin distribution in face regions from
images captured using mobile selfie camera for a Canada based healthcare startup.* \n
Expand Down

0 comments on commit f359542

Please sign in to comment.