L19 Programming Problem Championship: Round 2 (Strings)

Post by **Solomon** » Tue May 09, 2017 3:31 am

peti29 wrote: Solomon: thank you for the detailed explanation! I almost get it now. Only one thing remains:
How is it guaranteed that we'll use a piece only once?
Say our test case is: '(', '(', ')', ')' then our positive memo table will be [0, 1, 2] and our negative memo table the same,
so when we compare memo table entries we will add 4 for i==2 which is correct, but then we'll add 2 for i==1 resulting in a sum of 6 which is incorrect.

Post by **peti29** » Tue May 09, 2017 6:19 am

Solomon: Thank you! I think I understand now.

Post by **Bill Spight** » Tue May 09, 2017 7:07 am

One virtue, which I did not mention, of reducing the maximum n in the Power Strings problem from the length of the string to a smaller n(max) in the manner used is that you only have to find the divisors of n(max) instead of the divisors of the string length.

Let me illustrate with a string length of 60, which has several divisors. In the worst case we might have to try substring lengths of 1, 2, 3, 4, 5, 6, 10, 12, 15, 20, and 30. But suppose that we are able to reduce n(max) to 15. Then our possible n's are 15, 5, 3, and 1, which means that we might only have to try substring lengths of 4, 12, and 20 (that is, 60/n for n = 15, 5, and 3), i.e., divisors of 60 which are multiples of 4. Of course, a maximum n of 15 is a bit lucky; 12 is perhaps more likely. Then we might only have to try substring lengths of 5, 10, 15, 20, and 30.
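
The arithmetic in this example can be checked with a minimal sketch. The helper names `divisors` and `candidate_lengths` are hypothetical, and the sketch assumes n(max) is already known and divides the string length; how n(max) is obtained is not shown here.

```python
def divisors(m):
    """Return all positive divisors of m in increasing order."""
    return [d for d in range(1, m + 1) if m % d == 0]


def candidate_lengths(length, n_max):
    """Substring lengths worth trying when the repeat count n must divide n_max.

    Assumes n_max divides length. n = 1 (the whole string) is skipped,
    since it always works and needs no test.
    """
    return sorted(length // n for n in divisors(n_max) if n > 1)


print(candidate_lengths(60, 60))  # [1, 2, 3, 4, 5, 6, 10, 12, 15, 20, 30]
print(candidate_lengths(60, 15))  # [4, 12, 20]
print(candidate_lengths(60, 12))  # [5, 10, 15, 20, 30]
```

The smaller n(max) is, the fewer substring lengths remain to be tested.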

Post by **bernds** » Tue May 09, 2017 9:31 am

Solomon wrote:
Indeed, but when I tried something far simpler like:
Code:
while True:
    s = input()
    if s == '.':
        break
    print(len(s) // (s + s).find(s, 1))
It would pass the first 3 test cases, before hitting TLE on the fourth (I wrote the C++ equivalent, and found it would hit TLE even sooner, at the second test case).

Post by **Solomon** » Tue May 09, 2017 9:33 am

bernds wrote:
Solomon wrote:
Indeed, but when I tried something far simpler like:
Code:
while True:
    s = input()
    if s == '.':
        break
    print(len(s) // (s + s).find(s, 1))
It would pass the first 3 test cases, before hitting TLE on the fourth (I wrote the C++ equivalent, and found it would hit TLE even sooner, at the second test case).
Well, you're not in control of whether the library uses a clever algorithm. I had a look at libstdc++ and it looks like it doesn't (there's a mention of KMP only for a parallel version of string search; the default seems to be the naive loop).
I've built a KMP version with a slightly different approach. I think you can just compute the solution while building the KMP prefix table: whenever the length of the prefix is equal to the length of the original string, you've found S within S+S. The first time this occurs, you can stop searching. As an additional optimization you can avoid the string copy by just noticing when your index goes past the length and wrapping it. I implemented this without reference to your program, just looking at general KMP algorithm descriptions elsewhere, and I suspect that this version would be good for spot 2 on the performance leaderboard (assuming it really works). Would you object if I submitted it?
I suspect Bill's method could very slightly improve runtimes further in some cases, but the solution is already linear and that obviously won't change, so I imagine the improvement would be very limited.
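
One way to read the approach described above is sketched below. This is not the submitted program, only a minimal illustration under assumptions: the function name `power_kmp` is hypothetical, the prefix table is the standard KMP failure table, and S+S is never built because the index is wrapped with `i % n`.

```python
def power_kmp(s):
    """Return the largest n such that s is some substring repeated n times.

    Builds the KMP prefix (failure) table for s while scanning the virtual
    string s + s from index 1, and stops the first time the matched prefix
    length reaches len(s).
    """
    n = len(s)
    if n == 0:
        return 0
    fail = [0] * n          # fail[i] = length of longest proper border of s[:i + 1]
    k = 0                   # length of the currently matched prefix of s
    for i in range(1, 2 * n):
        c = s[i % n]        # wrap the index instead of copying the string
        while k and c != s[k]:
            k = fail[k - 1]
        if c == s[k]:
            k += 1
        if i < n:
            fail[i] = k     # still inside the first copy: record the table entry
        elif k == n:
            # s occurs in s + s starting at index i - n + 1, which is the period
            return n // (i - n + 1)
    return 1                # not reached for non-empty s


print(power_kmp("abcd"))    # 1
print(power_kmp("aaaa"))    # 4
print(power_kmp("ababab"))  # 3
```

The loop visits at most 2n characters and the `while` fallback is amortized constant per step, so the sketch runs in linear time regardless of how the library implements `find`.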

Post by **Bill Spight** » Tue May 09, 2017 11:03 am

Solomon wrote:
bernds wrote:
Solomon wrote:
Indeed, but when I tried something far simpler like:
Code:
while True:
    s = input()
    if s == '.':
        break
    print(len(s) // (s + s).find(s, 1))
It would pass the first 3 test cases, before hitting TLE on the fourth (I wrote the C++ equivalent, and found it would hit TLE even sooner, at the second test case).
Well, you're not in control of whether the library uses a clever algorithm. I had a look at libstdc++ and it looks like it doesn't (there's a mention of KMP only for a parallel version of string search; the default seems to be the naive loop).
I've built a KMP version with a slightly different approach. I think you can just compute the solution while building the KMP prefix table: whenever the length of the prefix is equal to the length of the original string, you've found S within S+S. The first time this occurs, you can stop searching. As an additional optimization you can avoid the string copy by just noticing when your index goes past the length and wrapping it. I implemented this without reference to your program, just looking at general KMP algorithm descriptions elsewhere, and I suspect that this version would be good for spot 2 on the performance leaderboard (assuming it really works). Would you object if I submitted it?
I suspect Bill's method could very slightly improve runtimes further in some cases, but the solution is already linear and that obviously won't change, so I imagine the improvement would be very limited.
Go for it! I'm curious.

2d or 3d edit:

In fact, I would be willing to submit a solution that calculates n just from the single character counts, with no string matching at all. Not even to check the result.

I may well do that. As Kirby says, not perfect, but quick.

Post by **bernds** » Tue May 09, 2017 1:06 pm

Bill Spight wrote: In fact, I would be willing to submit a solution that calculates n just from the single character counts, with no string matching at all. Not even to check the result. I may well do that. As Kirby says, not perfect, but quick.

Not going to work, since you can't distinguish abcdabcd from abcddcba.
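
A quick check of this counterexample, as a minimal sketch: `power` is a hypothetical helper that reuses the `(s + s).find(s, 1)` trick quoted earlier in the thread, and `collections.Counter` stands in for "single character counts".

```python
from collections import Counter


def power(s):
    """Repeat count of s, using the (s + s).find(s, 1) trick from the thread."""
    return len(s) // (s + s).find(s, 1)


a, b = "abcdabcd", "abcddcba"

# Both strings have identical character counts (two each of a, b, c, d)...
print(Counter(a) == Counter(b))  # True

# ...yet their answers differ, so counts alone cannot decide n.
print(power(a), power(b))        # 2 1
```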

My new attempt also got WA on one of the hidden testcases, so I'll have to do a bit of debugging.

Post by **Bill Spight** » Tue May 09, 2017 1:12 pm

bernds wrote:
Bill Spight wrote: In fact, I would be willing to submit a solution that calculates n just from the single character counts, with no string matching at all. Not even to check the result. I may well do that. As Kirby says, not perfect, but quick.
Not going to work, since you can't distinguish abcdabcd from abcddcba.

My new attempt also got WA on one of the hidden testcases, so I'll have to do a bit of debugging.

Yeah, well, that depends upon the psychology of the testers.

But considering the expectation that you inform them if you yourself find a bug, then knowingly submitting a buggy program probably violates the spirit of the competition.

Post by **bernds** » Tue May 09, 2017 2:05 pm

Solomon wrote:
Go for it! I'm curious.
