extract text every line and check in regular expression

-2

Hi I have the following 2 documents amongst many in the form of a list.

Document 1:

 contact

 TomGonsalves

 985664774

 6560,m.d avenue.



  harrydsouza

  7676764

  543,gt,road.

Document 2 in the same python list:

  contact

  richard peterson

  4563565

  87,td street,Ny.

Here both the above text documents(1 and 2) are part of the same. In the text, I need to extract the 1st line after contact,i.e the name. However in Document-1 there are 2 names which I need to extract and just the names.

I need to write a regular expression to fetch the needful and the output should look like

     Names from Document 1 =[tomgonsalves,harrydsouza]

     names from document-2=[richard peterson]

Can someone please help me with the reGex script in python 3 for this.There are many docs in the same list which may have different number of names in its contact. I just need to extract the names alone(be it 1 or multiple)

asked Nov 19 '18 at 15:48

sayan_sen

217

1

Honestly, I might prefer just iterating the file line by line and implementing some simple parsing logic.
– Tim Biegeleisen
Nov 19 '18 at 15:51

add a comment |

-2

Hi I have the following 2 documents amongst many in the form of a list.

Document 1:

 contact

 TomGonsalves

 985664774

 6560,m.d avenue.



  harrydsouza

  7676764

  543,gt,road.

Document 2 in the same python list:

  contact

  richard peterson

  4563565

  87,td street,Ny.

I need to write a regular expression to fetch the needful and the output should look like

     Names from Document 1 =[tomgonsalves,harrydsouza]

     names from document-2=[richard peterson]

asked Nov 19 '18 at 15:48

sayan_sen

217

1

Honestly, I might prefer just iterating the file line by line and implementing some simple parsing logic.
– Tim Biegeleisen
Nov 19 '18 at 15:51

add a comment |

-2

Hi I have the following 2 documents amongst many in the form of a list.

Document 1:

 contact

 TomGonsalves

 985664774

 6560,m.d avenue.



  harrydsouza

  7676764

  543,gt,road.

Document 2 in the same python list:

  contact

  richard peterson

  4563565

  87,td street,Ny.

I need to write a regular expression to fetch the needful and the output should look like

     Names from Document 1 =[tomgonsalves,harrydsouza]

     names from document-2=[richard peterson]

asked Nov 19 '18 at 15:48

sayan_sen

217

Hi I have the following 2 documents amongst many in the form of a list.

Document 1:

 contact

 TomGonsalves

 985664774

 6560,m.d avenue.



  harrydsouza

  7676764

  543,gt,road.

Document 2 in the same python list:

  contact

  richard peterson

  4563565

  87,td street,Ny.

I need to write a regular expression to fetch the needful and the output should look like

     Names from Document 1 =[tomgonsalves,harrydsouza]

     names from document-2=[richard peterson]

regex python-3.x

asked Nov 19 '18 at 15:48

sayan_sen

217

asked Nov 19 '18 at 15:48

sayan_sen

217

asked Nov 19 '18 at 15:48

sayan_sen

217

asked Nov 19 '18 at 15:48

sayan_sen

217

asked Nov 19 '18 at 15:48

sayan_sen

217

1

Honestly, I might prefer just iterating the file line by line and implementing some simple parsing logic.
– Tim Biegeleisen
Nov 19 '18 at 15:51

add a comment |

1

Honestly, I might prefer just iterating the file line by line and implementing some simple parsing logic.
– Tim Biegeleisen
Nov 19 '18 at 15:51

Honestly, I might prefer just iterating the file line by line and implementing some simple parsing logic.
– Tim Biegeleisen
Nov 19 '18 at 15:51

add a comment |

2 Answers
2

active

oldest

votes

You can try something like this:

Read the txt file line-by-line. For each line check if it has only alphabets with isalpha() function of string.

In [309]: with open('ff.txt') as f: ## ff.txt is the file you shared

     ...:     for i in f:

     ...:         i = re.sub('n','',i)

     ...:         if i.isalpha():

     ...:             if not i == 'contact':

     ...:                 print(i)



TomGonsalves

harrydsouza

You can do this for each file you have.

edited Nov 20 '18 at 4:12

answered Nov 19 '18 at 16:01

Mayank Porwal

4,4991624

@sayan_sen Let me know if it works for you.
– Mayank Porwal
Nov 20 '18 at 4:11

add a comment |

Although this should be preferred to be attacked by simple text parsing but just in case you want a regex solution, you may use this regex,

(?:^s*[a-zA-Z]+s*|(ns*){2})([w ]+)

Here is a demo

Let me know if this works for you, else please provide more sample input so I can further refine my regex to match your precise input.

answered Nov 19 '18 at 16:13

Pushpesh Kumar Rajwanshi

5,3022827

add a comment |

Your Answer

StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});

}
});

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53378216%2fextract-text-every-line-and-check-in-regular-expression%23new-answer', 'question_page');
}
);

Post as a guest

Name

Required, but never shown

2 Answers
2

active

oldest

votes

2 Answers
2

active

oldest

votes

You can try something like this:

Read the txt file line-by-line. For each line check if it has only alphabets with isalpha() function of string.

In [309]: with open('ff.txt') as f: ## ff.txt is the file you shared

     ...:     for i in f:

     ...:         i = re.sub('n','',i)

     ...:         if i.isalpha():

     ...:             if not i == 'contact':

     ...:                 print(i)



TomGonsalves

harrydsouza

You can do this for each file you have.

edited Nov 20 '18 at 4:12

answered Nov 19 '18 at 16:01

Mayank Porwal

4,4991624

@sayan_sen Let me know if it works for you.
– Mayank Porwal
Nov 20 '18 at 4:11

add a comment |

You can try something like this:

Read the txt file line-by-line. For each line check if it has only alphabets with isalpha() function of string.

In [309]: with open('ff.txt') as f: ## ff.txt is the file you shared

     ...:     for i in f:

     ...:         i = re.sub('n','',i)

     ...:         if i.isalpha():

     ...:             if not i == 'contact':

     ...:                 print(i)



TomGonsalves

harrydsouza

You can do this for each file you have.

edited Nov 20 '18 at 4:12

answered Nov 19 '18 at 16:01

Mayank Porwal

4,4991624

@sayan_sen Let me know if it works for you.
– Mayank Porwal
Nov 20 '18 at 4:11

add a comment |

You can try something like this:

Read the txt file line-by-line. For each line check if it has only alphabets with isalpha() function of string.

In [309]: with open('ff.txt') as f: ## ff.txt is the file you shared

     ...:     for i in f:

     ...:         i = re.sub('n','',i)

     ...:         if i.isalpha():

     ...:             if not i == 'contact':

     ...:                 print(i)



TomGonsalves

harrydsouza

You can do this for each file you have.

edited Nov 20 '18 at 4:12

answered Nov 19 '18 at 16:01

Mayank Porwal

4,4991624

You can try something like this:

Read the txt file line-by-line. For each line check if it has only alphabets with isalpha() function of string.

In [309]: with open('ff.txt') as f: ## ff.txt is the file you shared

     ...:     for i in f:

     ...:         i = re.sub('n','',i)

     ...:         if i.isalpha():

     ...:             if not i == 'contact':

     ...:                 print(i)



TomGonsalves

harrydsouza

You can do this for each file you have.

edited Nov 20 '18 at 4:12

answered Nov 19 '18 at 16:01

Mayank Porwal

4,4991624

edited Nov 20 '18 at 4:12

answered Nov 19 '18 at 16:01

Mayank Porwal

4,4991624

answered Nov 19 '18 at 16:01

Mayank Porwal

4,4991624

answered Nov 19 '18 at 16:01

Mayank Porwal

4,4991624

@sayan_sen Let me know if it works for you.
– Mayank Porwal
Nov 20 '18 at 4:11

add a comment |

@sayan_sen Let me know if it works for you.
– Mayank Porwal
Nov 20 '18 at 4:11

@sayan_sen Let me know if it works for you.
– Mayank Porwal
Nov 20 '18 at 4:11

add a comment |

Although this should be preferred to be attacked by simple text parsing but just in case you want a regex solution, you may use this regex,

(?:^s*[a-zA-Z]+s*|(ns*){2})([w ]+)

Here is a demo

Let me know if this works for you, else please provide more sample input so I can further refine my regex to match your precise input.

answered Nov 19 '18 at 16:13

Pushpesh Kumar Rajwanshi

5,3022827

add a comment |

Although this should be preferred to be attacked by simple text parsing but just in case you want a regex solution, you may use this regex,

(?:^s*[a-zA-Z]+s*|(ns*){2})([w ]+)

Here is a demo

Let me know if this works for you, else please provide more sample input so I can further refine my regex to match your precise input.

answered Nov 19 '18 at 16:13

Pushpesh Kumar Rajwanshi

5,3022827

add a comment |

Although this should be preferred to be attacked by simple text parsing but just in case you want a regex solution, you may use this regex,

(?:^s*[a-zA-Z]+s*|(ns*){2})([w ]+)

Here is a demo

Let me know if this works for you, else please provide more sample input so I can further refine my regex to match your precise input.

answered Nov 19 '18 at 16:13

Pushpesh Kumar Rajwanshi

5,3022827

Although this should be preferred to be attacked by simple text parsing but just in case you want a regex solution, you may use this regex,

(?:^s*[a-zA-Z]+s*|(ns*){2})([w ]+)

Here is a demo

Let me know if this works for you, else please provide more sample input so I can further refine my regex to match your precise input.

answered Nov 19 '18 at 16:13

Pushpesh Kumar Rajwanshi

5,3022827

answered Nov 19 '18 at 16:13

Pushpesh Kumar Rajwanshi

5,3022827

answered Nov 19 '18 at 16:13

Pushpesh Kumar Rajwanshi

5,3022827

answered Nov 19 '18 at 16:13

Pushpesh Kumar Rajwanshi

5,3022827

add a comment |

draft saved

draft discarded

Thanks for contributing an answer to Stack Overflow!

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.

Some of your past answers have not been well-received, and you're in danger of being blocked from answering.

Please pay close attention to the following guidance:

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Name

Required, but never shown

Name

Required, but never shown

This page is only for reference, If you need detailed information, please check here

Search This Blog

Ufyukyu