Mathematical Proofs

17 papers with code • 0 benchmarks • 2 datasets

This task has no description! Would you like to contribute one?

A New Approach Towards Autoformalization

jlab-nlp/arxiv2formal 12 Oct 2023

This is a challenging task, and especially for higher-level mathematics found in research papers.

5
12 Oct 2023

TransERR: Translation-based Knowledge Graph Embedding via Efficient Relation Rotation

dellixx/transerr 26 Jun 2023

This paper presents a translation-based knowledge geraph embedding method via efficient relation rotation (TransERR), a straightforward yet effective alternative to traditional translation-based knowledge graph embedding models.

4
26 Jun 2023

GENTLE: A Genre-Diverse Multilayer Challenge Set for English NLP and Linguistic Evaluation

gucorpling/gentle 3 Jun 2023

We evaluate state-of-the-art NLP systems on GENTLE and find severe degradation for at least some genres in their performance on all tasks, which indicates GENTLE's utility as an evaluation dataset for NLP systems.

4
03 Jun 2023

Sharpness-Aware Minimization Alone can Improve Adversarial Robustness

weizeming/sam_at 9 May 2023

In this paper, we explore SAM in the context of adversarial robustness.

10
09 May 2023

Towards Autoformalization of Mathematics and Code Correctness: Experiments with Elementary Proofs

gc974517/autoformalization 5 Jan 2023

The ever-growing complexity of mathematical proofs makes their manual verification by mathematicians very cognitively demanding.

5
05 Jan 2023

Draft, Sketch, and Prove: Guiding Formal Theorem Provers with Informal Proofs

facebookresearch/minif2f 21 Oct 2022

In this work, we introduce Draft, Sketch, and Prove (DSP), a method that maps informal proofs to formal proof sketches, and uses the sketches to guide an automated prover by directing its search to easier sub-problems.

52
21 Oct 2022

Formal Development of Safe Automated Driving using Differential Dynamic Logic

yuvrajselvam/safe-ad-dl 14 Apr 2022

The challenges in providing convincing arguments for safe and correct behavior of automated driving (AD) systems have so far hindered their widespread commercial deployment.

0
14 Apr 2022

Calibration of P-values for calibration and for deviation of a subpopulation from the full population

facebookresearch/cdeets 31 Jan 2022

The author's recent research papers, "Cumulative deviation of a subpopulation from the full population" and "A graphical method of cumulative differences between two subpopulations" (both published in volume 8 of Springer's open-access "Journal of Big Data" during 2021), propose graphical methods and summary statistics, without extensively calibrating formal significance tests.

4
31 Jan 2022

Theory-guided hard constraint projection (HCP): a knowledge-based data-driven scientific machine learning method

YuntianChen/Hard_constrant_projection_HCP 11 Dec 2020

Machine learning models have been successfully used in many scientific and engineering fields.

52
11 Dec 2020

IsarStep: a Benchmark for High-level Mathematical Reasoning

reactive-systems/ml2 ICLR 2021

In this paper, we present a benchmark for high-level mathematical reasoning and study the reasoning capabilities of neural sequence-to-sequence models.

8
13 Jun 2020