
Fastaheader reading, Memory reduction and speedup #7

Open · wants to merge 3 commits into base: master

Conversation

@Imoteph Imoteph commented Jun 3, 2015

Changed part of the code for the first step. This reduced the memory footprint in our case from 300 GB to 2 GB. It should also give a speedup, since no unnecessary copies are produced.

Imoteph added 3 commits June 3, 2015 11:35
Add a new extractMummer function to reduce the memory footprint and speed things up, by doing the necessary steps while reading each line.
Call the new function in alignerRobot.
Take only the first token of each FASTA header, as MUMmer does.
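The pattern these commits describe (filtering and parsing each line as it is read, instead of materializing the whole file and making copies) can be sketched as a generator. The real extractMummer signature does not appear on this page, so the function below, including its `keep_ref` filter parameter, is a hypothetical illustration of the streaming idea only:

```python
def extract_mummer(path, keep_ref):
    """Yield parsed alignment records one at a time while reading the file.

    Hypothetical sketch: filters on the first whitespace-separated field
    line by line, so memory use stays constant regardless of file size.
    """
    with open(path) as f:
        for line in f:
            fields = line.split()
            # Skip blank lines and records for other references.
            if not fields or fields[0] != keep_ref:
                continue
            yield fields
```

Because it yields records lazily, a caller can consume the output with a plain `for` loop without ever holding the full file in memory.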
@@ -18,7 +18,8 @@ def obtainLength(folderName, fileName):
         if tmplen != 0:
             lenDic[tmpName] = tmplen
         tmplen = 0
-        tmpName = tmp[1:]
+        headerList = tmp.split();
+        tmpName = headerList[0][1:]
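Since only a fragment of obtainLength appears in the diff, here is a minimal self-contained sketch of the header-parsing loop it belongs to; the function name, file layout, and surrounding logic are assumptions reconstructed from the hunk:

```python
def obtain_length(path):
    """Map each FASTA record ID (first header token, '>' stripped) to its sequence length."""
    len_dic = {}
    tmp_name = None
    tmp_len = 0
    with open(path) as f:
        for line in f:
            line = line.rstrip("\n")
            if line.startswith(">"):
                # Flush the previous record before starting a new one.
                if tmp_name is not None and tmp_len != 0:
                    len_dic[tmp_name] = tmp_len
                tmp_len = 0
                # Keep only the first whitespace-separated token, as MUMmer does.
                tmp_name = line.split(None, 1)[0][1:]
            else:
                tmp_len += len(line)
    # Flush the final record.
    if tmp_name is not None and tmp_len != 0:
        len_dic[tmp_name] = tmp_len
    return len_dic
```

The key point of the change is the header line: the full header (ID plus description) is no longer used as the dictionary key, only the ID token.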
How about this:

tmpName = tmp.split(None, 1)[0][1:]

There is often more than one space in a FASTA header line, so splitting at most once, at the first whitespace, may well be quicker overall.

Also there is no reason to create the extra temp variable headerList here, is there?
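Both approaches yield the same ID; the `maxsplit=1` form just avoids splitting the (often long) description part. A small demonstration with a hypothetical header line:

```python
# A FASTA header line as read from the file (hypothetical example).
tmp = ">chr1 some description with several words"

# Committed approach: split the whole header, keep the first token, strip '>'.
header_list = tmp.split()
name_full_split = header_list[0][1:]

# Suggested approach: split at most once, skipping work on the description.
name_maxsplit = tmp.split(None, 1)[0][1:]

assert name_full_split == name_maxsplit == "chr1"
```

With `sep=None`, `str.split` splits on any run of whitespace, so both forms also tolerate tabs and multiple spaces in the header.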

Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

:) That indeed looks better; at the time it was just a quick-and-dirty fix.


It is a shame the tool owner has not been active here recently...
