develooper Front page | perl.beginners | Postings from September 2021

Re: Regex to detect natural language fragment

Thread Previous
From:
Julius Hamilton
Date:
September 14, 2021 08:41
Subject:
Re: Regex to detect natural language fragment
Message ID:
CAEsMKX1KcSJa9M5-95ym1gTitL4xU0f2snabqoP6qmn9vobUhA@mail.gmail.com
 Thanks very much.

@Chankey Pathak, which of those libraries does you recommend for this task?

Best regards,
Julius

On Tue, Sep 14, 2021 at 2:33 AM Ken Peng <kenpeng@pubbox.net> wrote:

> Or use GPT-3 who has a free online API.
> https://openai.com/blog/openai-api/
>
> regards
>
> On Mon, Sep 13, 2021 at 11:42 PM Chankey Pathak <chankey007@gmail.com>
> wrote:
>
>> You can look into NLP https://metacpan.org/search?q=nlp
>>
>> On Mon, 13 Sept 2021 at 21:04, Julius Hamilton <
>> juliushamilton100@gmail.com> wrote:
>>
>>> Hey,
>>>
>>> I'm not sure if this is possible, and if it's not, I'll explore a better
>>> way to do this.
>>>
>>> I would like to write a script which analyzes if a line of text is
>>> (likely) a broken natural language sentence, i.e., it is probably part of a
>>> sentence, even if the start or end is not present, rather than it being a
>>> fully "complete" linguistic entity, for example, a header of a section,
>>> which does not have a period at the end and is not really a sentence, yet
>>> is in a complete and unbroken form.
>>>
>>> I'm pretty sure in principle this will require some kind of syntax
>>> parsing. I think I read somewhere regular expressions for some mathematical
>>> reason cannot parse tree / nested structures, for example HTML.
>>>
>>> Does anyone know what some next most ubiquitous, standard tool is for
>>> analyzing nested linguistic structures? Is that an XML parser?
>>>
>>> Thanks very much,
>>> Julius
>>>
>>

Thread Previous


nntp.perl.org: Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at ask@perl.org | Group listing | About