search and replace fails to work in huge files  #567

@myway880

Edit: I found out that the file size is the problem, not the regex.
First, thank you for your great app; it is my go-to regex app.

Unfortunately, I stumbled on this bug:
I have a regex that I use to merge 2 XML files based on an identical id. It succeeds in the Calibre editor (which has a memory leak in undo, so...) and it does work in dnGrep (it finds the match correctly, although it hangs on replace),

but it fails to find a match in grepWin.

I'm unsure why.
It is a single file of 134 MB, in case that is relevant.
I simply put the content of one file on top of the other, and I'm using this regex to push the tags down, so to speak.
Here is the regex:

Find:
^(-?<verse id="([^"]*?)"[^\n]*?$(?:\n[^\n]*?\2[^\n]*?$){0,10})(?:\s\n+)?(\n.*<verse id="\2">[^@]*?$)
Dot matches newline is ticked and treat file as UTF-8 is ticked.
Simplified, it becomes this:
^(<verse id="([^"]*?)"[^\n]*?$)(.*<verse id="\2">)

Replace:
\3\n\t\t\t\t\t\1\n
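
For anyone who wants to check the simplified pattern outside grepWin, here is a minimal sketch using Python's `re` module. It is an assumption that Python's engine behaves like grepWin's for this pattern; `re.DOTALL` stands in for the dot-matches-newline option and `re.MULTILINE` for line-anchored `^` and `$`. The helper name `push_down_once` and the tiny sample are made up for illustration and are not part of the attached test case.

```python
import re

# Simplified Find pattern from this report.
# DOTALL: "." also matches newlines; MULTILINE: ^ and $ anchor at line boundaries.
PATTERN = re.compile(
    r'^(<verse id="([^"]*?)"[^\n]*?$)(.*<verse id="\2">)',
    re.MULTILINE | re.DOTALL,
)

# Replace string from this report: group 3 first, then group 1 indented below it.
REPLACEMENT = r'\3\n\t\t\t\t\t\1\n'


def push_down_once(text):
    """Apply the find/replace a single time; return (new_text, replacements_made)."""
    return PATTERN.subn(REPLACEMENT, text, count=1)


if __name__ == "__main__":
    # Hypothetical two-file concatenation: the first verse line comes from the
    # file placed on top, the last one from the file below it.
    sample = '<verse id="a" x="1"/>\n<mid/>\n<verse id="a">\n'
    merged, count = push_down_once(sample)
    print(count)
    print(merged)
```

On a small input like this the first verse line is moved to just below the matching `<verse id="a">` line. The greedy `.*` has to scan to the end of the input and backtrack for every candidate line, which may be part of why a 134 MB file is a problem, though that is only a guess.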

The preview turns the XML into part of the HTML for the GitHub page and it disappears on my end, so I'm not sure if you will be able to see or extract it...
I attached it in a file as well:

bug test case.txt

text to test
`



  •   			<word class="heb_QR" id="hwid174377" clauseid="34167" word_lexical_Id="70008" part_of_speech="conj" strongs_id="H9000" heb_word="וַ" attached_punctuation="null" space_after="null" gloss="and"></word>
      		</phrase> 
      </clause>
    

@@
`

end text
`



  •   			<word class="heb_QR" id="hwid174377" clauseid="34167" word_lexical_Id="70008" part_of_speech="conj" strongs_id="H9000" heb_word="וַ" attached_punctuation="null" space_after="null" gloss="and"></word>
      		</phrase> 
      </clause>
    

@@
`
