poj 2945 Find the Clones

February 27, 2013
POJ Data Structure

Find the Clones Time Limit: 5000MS Memory Limit: 65536K Total Submissions: 6265 Accepted: 2328

Description Doubleville, a small town in Texas, was attacked by the aliens. They have abducted some of the residents and taken them to the a spaceship orbiting around earth. After some (quite unpleasant) human experiments, the aliens cloned the victims, and released multiple copies of them back in Doubleville. So now it might happen that there are 6 identical person named Hugh F. Bumblebee: the original person and its 5 copies. The Federal Bureau of Unauthorized Cloning (FBUC) charged you with the task of determining how many copies were made from each person. To help you in your task, FBUC have collected a DNA sample from each person. All copies of the same person have the same DNA sequence, and different people have different sequences (we know that there are no identical twins in the town, this is not an issue).

Input The input contains several blocks of test cases. Each case begins with a line containing two integers: the number 1 ≤ n ≤ 20000 people, and the length 1 ≤ m ≤ 20 of the DNA sequences. The next n lines contain the DNA sequences: each line contains a sequence of m characters, where each character is either A',C’, G' orT’.

The input is terminated by a block with n = m = 0 .

Output For each test case, you have to output n lines, each line containing a single integer. The first line contains the number of different people that were not copied. The second line contains the number of people that were copied only once (i.e., there are two identical copies for each such person.) The third line contains the number of people that are present in three identical copies, and so on: the i -th line contains the number of persons that are present in i identical copies. For example, if there are 11 samples, one of them is from John Smith, and all the others are from copies of Joe Foobar, then you have to print 1' in the first andthe tenth lines, and0’ in all the other lines.

Sample Input

9 6 AAAAAA ACACAC GTTTTG ACACAC GTTTTG ACACAC ACACAC TCCCCC TCCCCC 0 0

Sample Output

1 2 0 1 0 0 0 0 0

Hint Huge input file, ‘scanf’ recommended to avoid TLE.

用字典树做的,方法比较笨。

#include <iostream>
#include <cstdio>
#include <cstdlib>
#include <cstring>

using namespace std;
const int sonnum = 26, base = 'A';
char a[20000+10][25];
int b[20000+10];
struct Trie
{
  int num; bool terminal; int endsum; 
  Trie *son[sonnum];
};
Trie *NewTrie()
{
  Trie *temp = new Trie;
  temp->num = 1; temp->terminal = false; temp->endsum = 0;
  for (int i = 0; i < sonnum; ++i) temp->son[i] = NULL;
  return temp;
}
void Insert(Trie *pnt, char *s, int len)
{
  Trie *temp = pnt;
  for (int i = 0; i < len; ++i)
  {
    if (temp->son[s[i]-base] == NULL) temp->son[s[i]-base] = NewTrie();
    else temp->son[s[i]-base]->num++;
    temp = temp->son[s[i]-base];
  }
  temp->endsum++; temp->terminal = true;
}
void Find(Trie *pnt, char *s, int len)
{
  Trie *temp = pnt;
  for (int i = 0; i < len; ++i)
    temp = temp->son[s[i]-base];
  if (temp->endsum == 1) b[0]++;
  else 
  {
    if (temp->terminal == true)
    {
      b[temp->endsum-1]++;
      temp->terminal = false;
    }
  }
}
void Delete(Trie *pnt)
{
  if (pnt != NULL)
  {
    for (int i = 0; i < sonnum; ++i) if (pnt->son[i] != NULL) Delete(pnt->son[i]);
    delete pnt;
    pnt = NULL;
  }
}
int main(void)
{
  int n, m;
#ifndef ONLINE_JUDGE
  freopen("poj2956.in", "r", stdin);
#endif
  while (1)
  {
    Trie *pnt = NewTrie();
    memset(b, 0, sizeof(b));
    scanf("%d%d", &n, &m);
    {
      for (int i = 0; i < n; ++i) 
      {
        scanf("%s", a[i]);
        Insert(pnt, a[i], m);
      }
    }
    for (int i = 0; i < n; ++i) Find(pnt, a[i], m);
    for (int i = 0; i < n; ++i) printf("%d\n", b[i]);
    Delete(pnt);
    if (!n) break;
    }

  return 0;
}

没有优化……应该速度可以更快一点儿。

comments powered by Disqus