Scanning an ID card in a mobile app: how do I customize the local camera?

I recently got a project where the app has to scan an ID card and recognize it. After the picture is taken, the image should be compressed in an asynchronous thread, and then the API is called so that the backend OCR recognizes it and returns the result. The app should also be able to turn on the flash and focus automatically.
What does the front end need to do?
Do I call the local camera? How are the guide rectangle and the buttons embedded in the camera view? Many apps seem to use a customized camera, but I don't know how it is done.

Mar.04,2021

This is usually done with <input type="file">. Tapping it opens the system camera with its own photo button; that screen is provided by the system and cannot be customized.
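
A minimal sketch of the file-input route, assuming markup like <input id="id-card" type="file" accept="image/*" capture="environment"> and a backend that accepts a multipart upload. The endpoint `/api/ocr`, the form field name `image`, the 1600 px size limit and the JPEG quality are all assumptions for illustration, not something from the question. The photo is shrunk on a canvas asynchronously and then posted to the OCR API:

```typescript
// Hypothetical endpoint; replace with the real OCR API of your backend.
const OCR_URL = "/api/ocr";

interface Size { width: number; height: number; }

// Scale (width, height) down so the longer side is at most maxSide.
// Images that are already small enough are left at their original size.
function fitWithin(width: number, height: number, maxSide: number): Size {
  const scale = Math.min(1, maxSide / Math.max(width, height));
  return { width: Math.round(width * scale), height: Math.round(height * scale) };
}

// Browser only: re-encode the photo as a smaller JPEG.
// maxSide and quality are assumed defaults, tune them for your OCR.
async function compressImage(file: Blob, maxSide = 1600, quality = 0.8): Promise<Blob> {
  const bitmap = await createImageBitmap(file);
  const { width, height } = fitWithin(bitmap.width, bitmap.height, maxSide);
  const canvas = document.createElement("canvas");
  canvas.width = width;
  canvas.height = height;
  const ctx = canvas.getContext("2d");
  if (!ctx) throw new Error("2D canvas context unavailable");
  ctx.drawImage(bitmap, 0, 0, width, height);
  bitmap.close();
  return new Promise<Blob>((resolve, reject) =>
    canvas.toBlob(
      (blob) => (blob ? resolve(blob) : reject(new Error("JPEG encoding failed"))),
      "image/jpeg",
      quality,
    ),
  );
}

// Browser only: upload the compressed photo and return the parsed JSON reply.
async function recognize(file: Blob): Promise<unknown> {
  const body = new FormData();
  body.append("image", await compressImage(file), "id-card.jpg"); // field name is assumed
  const res = await fetch(OCR_URL, { method: "POST", body });
  if (!res.ok) throw new Error(`OCR request failed: ${res.status}`);
  return res.json();
}

// Browser only: run recognition whenever the user has taken a photo.
function attach(input: HTMLInputElement, onResult: (result: unknown) => void): void {
  input.addEventListener("change", async () => {
    const file = input.files?.[0];
    if (file) onResult(await recognize(file));
  });
}
```

With this route the camera screen itself stays the system one, so no rectangle or custom buttons can be drawn on it; only the steps after the photo is taken are under the page's control.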
Alternatively, you can open the camera as a video stream and capture a frame from it, which lets you draw your own rectangle and buttons over the preview, but compatibility is probably not very good.
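
A rough sketch of the video-stream route, assuming a page served over HTTPS with a <video> element and a <canvas> stacked on top of it at the same size. The names `guideRect`, `drawGuide`, `setTorch` and `captureCard` are made up for this example, and the card proportions (about 85.6 mm by 54 mm) and the 10% margin are assumptions. The `torch` constraint is not in the TypeScript DOM typings and is only honored by some browsers and devices, so the call is written to fail quietly:

```typescript
interface Rect { x: number; y: number; width: number; height: number; }

// Assumed card proportions (about 85.6 mm x 54 mm).
const CARD_ASPECT = 85.6 / 54;

// Largest centered card-shaped rectangle that leaves `margin`
// (a fraction of each dimension) free on every side.
function guideRect(viewW: number, viewH: number, margin = 0.1): Rect {
  const maxW = viewW * (1 - 2 * margin);
  const maxH = viewH * (1 - 2 * margin);
  const width = Math.min(maxW, maxH * CARD_ASPECT);
  const height = width / CARD_ASPECT;
  return { x: (viewW - width) / 2, y: (viewH - height) / 2, width, height };
}

// Browser only: show the rear camera in the <video> element.
async function startPreview(video: HTMLVideoElement): Promise<MediaStreamTrack> {
  const stream = await navigator.mediaDevices.getUserMedia({
    video: { facingMode: { ideal: "environment" } },
    audio: false,
  });
  video.srcObject = stream;
  video.playsInline = true; // keep iOS Safari from going fullscreen
  await video.play();
  return stream.getVideoTracks()[0];
}

// Browser only: draw the guide rectangle on the canvas laid over the video.
function drawGuide(overlay: HTMLCanvasElement, video: HTMLVideoElement): void {
  overlay.width = video.videoWidth;
  overlay.height = video.videoHeight;
  const ctx = overlay.getContext("2d");
  if (!ctx) return;
  const r = guideRect(overlay.width, overlay.height);
  ctx.strokeStyle = "#ffffff";
  ctx.lineWidth = 4;
  ctx.strokeRect(r.x, r.y, r.width, r.height);
}

// Browser only: try to switch the flash; resolves to false where unsupported.
async function setTorch(track: MediaStreamTrack, on: boolean): Promise<boolean> {
  // `torch` is missing from the DOM typings, hence the cast.
  const constraints = { advanced: [{ torch: on }] } as unknown as MediaTrackConstraints;
  try {
    await track.applyConstraints(constraints);
    return true;
  } catch {
    return false;
  }
}

// Browser only: copy the area inside the guide rectangle of the current frame.
function captureCard(video: HTMLVideoElement): HTMLCanvasElement {
  const r = guideRect(video.videoWidth, video.videoHeight);
  const canvas = document.createElement("canvas");
  canvas.width = Math.round(r.width);
  canvas.height = Math.round(r.height);
  const ctx = canvas.getContext("2d");
  if (!ctx) throw new Error("2D canvas context unavailable");
  ctx.drawImage(video, r.x, r.y, r.width, r.height, 0, 0, canvas.width, canvas.height);
  return canvas;
}
```

The shutter and flash buttons are then ordinary HTML buttons positioned over the preview that call `captureCard` and `setTorch`. The cropped canvas can be turned into a JPEG with `canvas.toBlob` and sent to the OCR API. Focus is left to the device here; whether it can be controlled from the page depends on the browser.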
