AzureSearch_JFK_Files | repo contains the sample code | Azure library

by microsoft TypeScript Version: Current License: MIT

X-Ray Key Features Code Snippets Community Discussions(1)Vulnerabilities Install Support

kandi X-RAY | AzureSearch_JFK_Files Summary

AzureSearch_JFK_Files is a TypeScript library typically used in Cloud, Azure applications. AzureSearch_JFK_Files has no bugs, it has no vulnerabilities, it has a Permissive License and it has low support. You can download it from GitHub.

This repo contains the sample code of the Azure Search and Cognitive Services used to provide insights and analysis around the JFK Files.

Support

Quality

Security

License

Reuse

Support

AzureSearch_JFK_Files has a low active ecosystem.

It has 345 star(s) with 213 fork(s). There are 44 watchers for this library.

It had no major release in the last 6 months.

There are 9 open issues and 76 have been closed. On average issues are closed in 123 days. There are 4 open pull requests and 0 closed requests.

It has a neutral sentiment in the developer community.

The latest version of AzureSearch_JFK_Files is current.

Quality

AzureSearch_JFK_Files has 0 bugs and 0 code smells.

Security

AzureSearch_JFK_Files has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.

AzureSearch_JFK_Files code analysis shows 0 unresolved vulnerabilities.

There are 0 security hotspots that need review.

License

AzureSearch_JFK_Files is licensed under the MIT License. This license is Permissive.

Permissive licenses have the least restrictions, and you can use them in most projects.

Reuse

AzureSearch_JFK_Files releases are not available. You will need to build from source code and install.

Installation instructions, examples and code snippets are available.

It has 1371 lines of code, 0 functions and 209 files.

It has low code complexity. Code complexity directly impacts maintainability of the code.

Top functions reviewed by kandi - BETA

kandi's functional review helps you automatically verify the functionalities of the libraries and avoid rework.
Currently covering the most popular Java, JavaScript and Python libraries. See a Sample of AzureSearch_JFK_Files

Get all kandi verified functions for this library.

AzureSearch_JFK_Files Key Features

No Key Features are available at this moment for AzureSearch_JFK_Files.

AzureSearch_JFK_Files Examples and Code Snippets

No Code Snippets are available at this moment for AzureSearch_JFK_Files.

Community Discussions

Trending Discussions on AzureSearch_JFK_Files

Azure Search - Highlights - Locating in image

QUESTION

Azure Search - Highlights - Locating in image

Asked 2021-Feb-08 at 17:56

Just looking for guidance or even a general outline on approach here.

I am using azure search to OCR a batch of pdfs. I have turned on hit highlighting and I am successfully getting results back there that I am looping through / displaying in my view for the end user. I was looking on expanding that functionality to show the pdf images with the highlighting on the images themselves like in the JFK azure example. I am not proficient in react and seem to be getting lost there.

I am assuming I need to save off the OCR images to a data store for reference using the normalized_images that are created? I do have pdfs locally I can load but assume the OCR images maybe different. Have turned on GeneratedNormalizedImagesPerPage and turned on cache which creates files in my storage account.

Then I assume I need to pull the associated image, display it, use the highlight results and pull a corresponding bounding box where the phrase was detected? Problem with that approach is that I do not see any association between the highlight hit and the location (bounding box) of the hit nor the associated image file the hit was on.

Probably way off on approach here but any guidance is appreciated.

Edit 1 I did noticed the items on this page in the JFK example: https://github.com/microsoft/AzureSearch_JFK_Files/tree/master/JfkWebApiSkills/JfkWebApiSkills Would trying to replicate the ImageStore (so those are stored in my storage account) and then the HocrGenerator (appears to handle points in a doc) into my skillset for my index be the approach?

...

ANSWER

Answered 2021-Feb-08 at 17:56

There are a few steps here:

you need to save the layoutText from the OCR skill somewhere the UI can access it. The JFK Files demo converts it to a HOCR (to display in the UI) and saves it in index as a field in the index so that it is retrieved in the search results. HOCR isn't necessary and you may find it more efficient to store the layout in blobs using a knowlege store object projection.
save the extracted images into blob storage using a file projection into the knowledge store. Keep in mind that the images may be resized in the process and the coordinates will match the resized image saved to the store. If you want to map the coordinates to the original image see this.
At search time, map the highlight to the the metadata. You will find this code in the nodejs frontend, however it may be simpler to follow in the original demo by following the code here. Essentially you just find the first occurrence of the highlighted word in the metadata, display the associated image, and calculate the bounding region of the word.

Source https://stackoverflow.com/questions/66103937

Community Discussions, Code Snippets contain sources that include Stack Exchange Network

Vulnerabilities

No vulnerabilities reported

Install AzureSearch_JFK_Files

These instructions will help you have your own version of the JFK files demo running in Azure in about 20 minutes, with most of that time being provisioning/deployment time.

Support

For any new features, suggestions and bugs create an issue on GitHub. If you have any questions check and ask questions on community page Stack Overflow .

Find more information at: