Lazy evaluation on slow boolean functions (optimization)

Question

0.00/5 (No votes)

See more:

Short-circuit evaluation is a straight forward subject for lazy evaluation on Boolean expressions, I noted that two factors could be useful for ordering Boolean functions to gain run-time. But this is applicable in a particular hypothesis:

Knowing a percentage of positive (hence negative) result of a function. the following is a hypothesis followed be an example.
Knowing execution run-time of the function on data (knowing size of data can be a hint)
My case is easy to understand, here is a description:

Let's say F is a Boolean function that scores sentiment and returns -1 or 1. I've been doing some improvements to predict sentiment scores on big arrays of text, and I can calculate how is likely a function will return 1 (probability withing a tolerated error range) lets call it P(f) = Pf (probability of being positive)

We have also an estimation of execution time since data to be analysed is measurable. lets call it T(f) = Tf we define positiveness speed as follows:

Python

PS (f) =  Pf  / Tf

Let Lazy be a function which takes a Boolean expression, and gives the best ordering of functions for run-time.

Python

Lazy (and, f1, f2, …, fn) = and_Order (f1, f2, …, fn)

Where and_Order is ascending of positiveness speed.

Python

Lazy (or, f1, f2, …, fn) = or_Order(f1, f2, …, fn)

Where or_Order is descending if positiveness speed.

Obviously this is a high level optimization and not a compiler task. Is this an established study? How is this practical in cases other than sentiment analysis? Is the PS(F) even correctly representing what it should be representing?

Edit in response to ppolymorphe answer:

Quote:

Quote:
First of all, Boolean values are 0 for False and 1 for True. In computers, the usage to use 0 for False and non 0 for True.

There is nothing that holds usage of Boolean type for representing two values types. Even if it doesn't hold truth meaning. In my case It does. Here is a scenario:
After an event called "Innovation week", We wants to see if visitors were satisfied at many levels: "organization", "subjects", "proposed solutions"…

Let’s say that "organization", "subjects", "proposed solutions" are three topics that the most represent people's reaction (sentiment) on "Innovation week event". "Organization" and "subjects" are on the same level and one is sufficient to satisfy the other (logical OR), while both grouped are in the same level at "proposed solutions" (logical AND).

Possible modelization:

Python

Sent ("Innovation week") = (Sent("organization") || Sent("subjects")) && Sent("proposed solutions")

Quote:
Short-circuit evaluation is a straight forward subject for lazy evaluation on Boolean expressions,

Where Sent is polar sentiment function (Boolean)

Quote:
No, it is the efficient say to Boolean expression evaluation.

That's True. I didn't give attention to.

Quote:
your reordering must be faster, unfortunately, it is impossible.
Reordering at runtime imply evaluating all your slow functions before reordering, there is no gain there.

That's True.
But, I will be evaluating a sample (say 1/10 of the input big array), therfore I will accept error range. That's why

Quote:
Where and_Order is ascending on positiveness speed.

Where positiveness speed is a probability (based on sampling).

Thank you.

What I have tried:

This is an optimization I thought about, but I'm not sure about it's feasibility neither it's correctness

Posted 5-Sep-17 4:42am

Abderrahim Ben

Updated 5-Sep-17 21:02pm

v2

Add a Solution

Comments

Mehdi Gholam 5-Sep-17 12:06pm

Don't waste your time on optimizing the order, put the effort in optimizing your functions instead.

1 solution

Add a Solution

Add your solution here

Treat my content as plain text, not as HTML

Preview 0

…

Existing Members

Sign in to your account

...or Join us

Download, Vote, Comment, Publish.

Your Email
Password
Forgot your password?

Your Email
This email is in use. Do you need your password?
Optional Password

I have read and agree to the Terms of Service and Privacy Policy
Please subscribe me to the CodeProject newsletters

When answering a question please:

Read the question carefully.
Understand that English isn't everyone's first language so be lenient of bad spelling and grammar.
If a question is poorly phrased then either ask for clarification, ignore it, or edit the question and fix the problem. Insults are not welcome.
Don't tell someone to read the manual. Chances are they have and don't get it. Provide an answer or move on to the next question.

Let's work to help developers, not make them feel stupid.

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

Patrice T · Answer 1 · 2017-09-05T09:14:00

Quote:
Let's say F is a Boolean function that scores sentiment and returns -1 or 1.

First of all, Boolean values are 0 for False and 1 for True. In computers, the usage to use 0 for False and non 0 for True.

Quote:
Short-circuit evaluation is a straight forward subject for lazy evaluation on Boolean expressions,

No, it is the efficient say to Boolean expression evaluation.

Quote:
Let Lazy be a function which takes a Boolean expression, and gives the best ordering of functions for run-time.

Evaluating a logical operation like and and or between 2 Boolean values takes 1 CPU clock cycle. To be of interest, your reordering must be faster, unfortunately, it is impossible.
Reordering at runtime imply evaluating all your slow functions before reordering, there is no gain there.
The only possible thing to do is ordering your Boolean expressions before hand in source code.

Lazy evaluation on slow boolean functions (optimization)

1 solution

Solution 1

Add your solution here

Preview 0

Existing Members

...or Join us