commit cbf9ff87b638096bdf0de227e7694e7eca776deb Author: Tanmai Khanna Date: Sat May 11 12:22:32 2019 +0530 Making it a general repo diff --git a/README.md b/README.md deleted file mode 100644 index 80e63a8..0000000 --- a/README.md +++ /dev/null @@ -1,35 +0,0 @@ -# Apertium Coding Challenge - -This is an attempt at the Apertium Coding Challenge for GSoC 2019. - -Project: Anaphora Resolution - -Author: Tanmai Khanna - -## Executing the code -Use Python 3 to execute the code. - -Apertium Streamparser is required to run the code. -Install from requirements using: - -`pip3 install -r requirements.txt` - -Anaphora Resolution where input is read byte by byte: - -`python3 byte_anaphora.py < input.txt > output_byte.txt` - -Anaphora Resolution where input is read as chunks: - -`python3 chunk_anaphora.py 512 < input.txt > output_chunk_512.txt` - -Here the chunk size you wish to use (in bytes) is given as an argument. - -If you do not specify a chunk size it uses 1024 bytes as default. - -`python3 chunk_anaphora.py < input.txt > output_chunk_default.txt` - -## Speed-up Report - -"Report_Chunk_vs_Byte.txt" shows the time taken by these different methods and compares them. - -We can clearly see that chunks perform much better, and bigger the chunk size, faster the execution time.