Formula Recognition Algorithm#

Introduction#

Formula detection involves recognizing the content of a given input formula image and converting it to LaTeX format.

Model Usage#

With the environment properly configured, you can run the layout detection algorithm script by executing scripts/formula_recognition.py.

$ python scripts/formula_recognition.py --config configs/formula_recognition.yaml

Model Configuration#

inputs: assets/demo/formula_recognition
outputs: outputs/formula_recognition
tasks:
   formula_recognition:
      model: formula_recognition_unimernet
      model_config:
         cfg_path: pdf_extract_kit/configs/unimernet.yaml
         model_path: models/MFR/unimernet_tiny
         visualize: False
  • inputs/outputs: Define the input file path and the directory for LaTeX prediction results, respectively.

  • tasks: Define the task type, currently only containing a formula recognition task.

  • model: Define the specific model type: Currently, only the UniMERNet formula recognition model is provided.

  • model_config: Define the model configuration.

  • cfg_path: Path to the UniMERNet configuration file.

  • model_path: Path to the model weights.

  • visualize: Whether to visualize the model results. Visualized results will be saved in the outputs directory.

Support for Diverse Inputs#

The formula detection script in PDF-Extract-Kit supports single formula images and document images with corresponding formula regions.

Viewing Visualization Results#

When the visualize setting in the config file is set to True, LaTeX prediction results will be saved in the outputs directory.