ex_KMP--Theme Section

题目网址: http://acm.hust.edu.cn/vjudge/contest/view.action?cid=110060#problem/B

Description

It's time for music! A lot of popular musicians are invited to join us in the music festival. Each of them will play one of their representative songs. To make the programs more interesting and challenging, the hosts are going to add some constraints to the rhythm of the songs, i.e., each song is required to have a 'theme section'. The theme section shall be played at the beginning, the middle, and the end of each song. More specifically, given a theme section E, the song will be in the format of 'EAEBE', where section A and section B could have arbitrary number of notes. Note that there are 26 types of notes, denoted by lower case letters 'a' - 'z'. 

To get well prepared for the festival, the hosts want to know the maximum possible length of the theme section of each song. Can you help us? 
 

Input

The integer N in the first line denotes the total number of songs in the festival. Each of the following N lines consists of one string, indicating the notes of the i-th (1 <= i <= N) song. The length of the string will not exceed 10^6.
 

Output

There will be N lines in the output, where the i-th line denotes the maximum possible length of the theme section of the i-th song. 
 

Sample Input

5 xy abc aaa aaaaba aaxoaaaaa
 

Sample Output

0 0 1 1 2
 
题意: 输入一个字符串,求这个字符串中的一个子串,该子串为此字符串的前缀和后缀,并在字符串中间也有这个子串,但但这三个子串不能重叠,求是否存在这样的子串,输出子串长度的最大值,否则输出0;
 
思路: 使用扩展KMP算法,扩展KMP的next[i]表示从s[i]开始的与s前缀最大的匹配长度。在一个位置i中,如果要满足要求,子串的最大长度不能超过
min(i,  next[i], (len - i) / 2) ,所有的最大长度的最大值就是可能存在的最大长度。先找利用next[]找中间与前缀匹配的子串,然后判断后缀与子串匹配的最大长度,并要注意子串不能重叠。
 
代码:
#include <iostream>
#include <algorithm>
#include <cstring>
#include <cstdio>
using namespace std;
char s[1000005];
int nex[1000005];
int pre[1000005];

void ex_next(int length)
{
    ///nex[i]: 以第i位置开始的子串与T的前缀的最大长度;
    int i,j;
    nex[0]=length;
    for(i=0; i<length-1&&s[i]==s[i+1];i++);///前缀都是同一个字母的时候;
    nex[1]=i;
    int a=1;///a为使匹配到最远的地方时的起始匹配地点;
    for(int k=2;k<length;k++)
    {
        int p=a+nex[a]-1,L=nex[k-a];
        if( (k-1)+L>=p )
        {
            int j=(p-k+1)>0?(p-k+1):0;
            while(k+j<length&&s[k+j]==s[j]) j++;
            /// 枚举(p+1,length) 与(p-k+1,length) 区间比较;
            nex[k]=j,a=k;
        }
        else nex[k]=L;
    }
}

int main()
{
    int T,M,n;
    scanf("%d",&T);
    while(T--)
    {
        M=0;
        scanf("%s",s);
        int len=strlen(s);
        ex_next(len);
        for(int i=1;i<len;i++)
        {
            if(nex[i])
            {
                ///此时nex[i]为中间以s[i]开始的子串与前缀匹配的最大长度;
                ///接下来判断后缀与前缀及中间子串的最大匹配长度;
                n=min(min (i,nex[i]),(len-i)/2);
                for(int j=len-n;j<len;j++)
                  if(nex[j]==len-j)
                {
                    if(M<nex[j]) M=nex[j];
                    else         break;
                }
            }
        }
        printf("%d
",M);
    }
    return 0;
 }
 
原文地址:https://www.cnblogs.com/chen9510/p/5313034.html