Skip to content

Error parsing .*\.\w\..* #131

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
alec-hoyland opened this issue Feb 25, 2025 · 0 comments
Open

Error parsing .*\.\w\..* #131

alec-hoyland opened this issue Feb 25, 2025 · 0 comments

Comments

@alec-hoyland
Copy link

alec-hoyland commented Feb 25, 2025

Describe the bug
A clear and concise description of what the bug is.

Error parsing the pattern .*\.\w\..* such as in the email:

alec.j.hoyland@gmail.com

To Reproduce
Steps to reproduce the behavior:

❯ uv run python
Python 3.12.8 (main, Feb 14 2025, 14:32:03) [GCC 11.4.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import pysbd
>>> pysbd.__version__
'0.3.4'
>>> seg = pysbd.Segmenter(language="en", clean=False)
>>> seg.segment("alec.j.hoyland@gmail.com")
['alec.j.', 'hoyland@gmail.com']
>>> 

Expected behavior
A clear and concise description of what you expected to happen.
Expected output - list of expected sentences

["alec.j.hoyland@gmail.com"]

Additional context
Add any other context about the problem here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant