32
Ravello 19-21 settembre 2003 Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point of view Marcella ANSELMO Maria MADONIA

Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Embed Size (px)

Citation preview

Page 1: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

1

Covering problemsfrom

a formal language point of view

Marcella ANSELMOMaria MADONIA

Page 2: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

2

Covering a word

Covering a word w with words in a set X

w

Covering = concatenations +overlaps

Example: X = ab+ba w = abababa

a b a b a b a

X X X X X

Page 3: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

3

Why study covering ?

• Molecular biology: manipulating DNA molecules (e.g. fragment assembly)

• Data compression

• Computer-assisted music analysis

Page 4: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

4

Literature

• Apostolico, Ehrenfeucht (1993)

• Brodal, Pedersen (2000)w is ‘quasiperiodic’

• Moore, Smyth (1995) x is a ‘cover’ of w

• Iliopulos, Moore, Park (1993) x ‘covers’ w

• Iliopulos, Smyth (1998) ‘set of k-covers’ of w

• Sim, Iliopulos, Park, Smyth (2001) p ‘approximated (complete references) period’ of w

All algorithmic problems!!!(given w find ‘optimal’ X)

Page 5: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

5

Formal language point of view

alsoXcov = (X, A*), set of z-decompositions over (X, A*)

Here: Coverings not simple generalizations of z-decompositions!

If X A*,X cov = set of words ‘covered’ by words in X

Formal language point of view is needed!Madonia, Salemi, Sportelli (1999) [MSS99]:

Page 6: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

6

Formal Definition

Def. A covering (over X) of w in A* is =(w1, …, wn) s.t.1. n is odd;

for any odd i, wi X for any even i, wi

2. red(w1… wn) = w3. for any i, red(w1…wi) is prefix of w

*A

.if...*

11 AAa...awaaw nn .| ,If * XxxXAX

) , :(Ex abababred(w) aba ba ba b ab w

aaaaA

A

s.t.copydisjoint,

alphabet,

red(w) = canonical representative of the class of w in the free group

,If*

AAw

Page 7: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

7

=(ab, , ba, , ba, , ab) is a covering of w over X b a

a b a b a b :

Example: X = ab+ba w = ababab.

1. n is odd; for any odd i, wi X; for any even i, wi *

2. red(ab ba ba ab) = ababab3. for any i, red(w1…wi) is prefix of w

A

b a

Page 8: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

8

Concatenation, zig-zag, covering

Xcovcov-submonoid

Xz-submonoid

X*submonoid

Covering

Zig-zag

Concatenation

cov-submonoid z-submonoid submonoid

Page 9: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

9

Example: X= ab+ba, w=#ababaab$ L(S)

Splicing systems for Xcov

a b a b a $b a a b $

x = baa b a b a a b$

X, finiteS, splicing system s.t. L(S) = Xcov $

Start with: x $, xX or COV2(X) Rules: (, x, $), xX

(, x, x3$), x=x1x2, x2x3 X

COV2(X) = XxxxxAxxxxxx 3221321321 ,*,,,|

a b a b $a b a $

x = ab

Page 10: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

10

Coding problems [MSS99]

How many coverings has a word?

Example: X=ab + ba, w = ababab X cov

• w has many different coverings over X :

4 =(ab, , ab, , ba, , ab) b a

5 =(ab, , ba, , ab, , ba, ,ab) b b a a

3 =(ab, , ba, , ab, , ab) b a

1 =(ab, , ab, , ab)

2 =(ab, , ba, , ba , , ba, , ab) b a

Page 11: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

11

Covering codes [MSS 99]

Example: X = ab + ba is not a covering code (remember δ1, δ2)

Example: X = aabab + abb is a covering code

Example: X= ab+a + a is a covering code

X A* is a covering code if any word in A* has at most one minimal covering (over X).

Page 12: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

12

Cov - freeness

Let M A*, cov-submonoid. cov-G(M) is the minimal X A* such that M= Xcov.

M is cov-free if cov-G(M) is a covering code.

Fact: M free M stable (well-known)

M z-free M z-stable (known)

We want ‘cov-stability’ = global notion equivalent to cov-freeness.

Question: M cov-free M ‘cov-stable’?

Page 13: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

13

w, vw M , uv, u Z-p-s(uvw) implies v Z-p-s(uvw)

Toward a cov-stability definition (I)

cov-stable?

z-stable

stable u,w,uv,vw M implies w M

w, vw, uvx, uy M, for x <w and y <vw, implies vx M ?

Not always!

Example:

X = abcd+bcde+cdef+defg

Page 14: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

14

Toward a cov-stability definition (II)

Main observation in the classical proof of (stable implies free):

• x minimal word with 2 different factorizations: the last step in a factorization from the last step in the other factorization

New situation with covering:

So we have to study the case v = .

Example: X = abc + bcd + cde

u w

Page 15: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

15

Cov – stability

Def. M is cov-stable if w, vw, uvx, uy M, for x w and y vw

Remark: cov-stable implies stable

1. If v , then vz M, for some z w

Moreover vx M if y v

2. If v = , u and x y then t M,

for some t proper suffix of ux

Page 16: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

16

Cov-stable iff cov-free

Proof: many cases and sub-cases (as in definition!)

Theorem: M covering submonoid. M is cov-stable M is cov-free

Page 17: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

17

Some consequences

Fact 1: (cov-free cov-free) cov-free

Fact 5: cov –free z-free

free

Fact 2: cov-free implies free (not viceversa)

Fact 3: cov-free implies very pure (not viceversa)

Fact 4: M covering submonoid, X= cov-G(M). M cov-free implies X* free.

Remark: Covering not simple generalization of z-decomposition!

Page 18: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

18

Cov - maximality and cov-completeness

Fact: X cov-complete X cov-maximal

Let X A*, covering code. X is cov-complete if Fact(Xcov).X is cov-maximal if X X1, covering code X=X1

Remark [MSS99]: X cov-complete X infinite (unless X=A)

Example: X=ab+a +a

Remark complete cov-complete (not viceversa) maximal cov-maximal (not viceversa)

Page 19: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

19

B, 2FA recognizing Xcov

A, 1DFA recognizing X

Counting minimal coverings

X A*, regular language

covX : w number of minimal coverings of w

Remark: B counts all coverings of w Xcov

X1A

X

Page 20: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

20

Remark on minimal coverings

Remark: In minimal coverings, no 2 steps to the left under the same occurrence of a letter

Crossing sequences in B for minimal coverings of w:

w

1 1 1 1 1

1

1 11

Page 21: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

21

A 1NFA automaton for covX

CS3 = crossing sequences of length 3 and no twice state 1

(cs,a) =cs’ if cs matches cs’ on a

C = (CS3, (1), , (1) )

1

2

3

4a

a

b

bExample: X = ab + ba, A :

C :1

3

2 31

12

13

21

a

a

a

aa

b

bb

b

b

Page 22: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

22

Some remarks

• Language recognized by C = X cov

• X regular implies X cov regular

• Behaviour of C is covX

• X regular implies covX rational

• X covering code iff C unambiguous (decidable) (different proof in [MSS99])

Page 23: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

23

Conclusions and future works

• Formal language point of view is needed

• Covering not generalization of zig-zag (or z-decomposition): many new problems and results

covering codes: measurespecial cases: |X| =1, X Ak

suggestions …

•Further problems:

Page 24: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

24

w

x xx x

w

x xx x

w

X XX X XX Ak

w is‘quasiperiodic’x is a ‘cover’ of w

‘set of k-covers’ of w

x ‘covers’ w

Page 25: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

25

Example: X = ab+ba

a b a b a b

a b a b a b

Xcov = (ab + ba+ aba + bab)*

w = ababab Xcov

w = ababab (X, A*)

Page 26: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

26

a b a b a b 1:

a b a b a b 2:

All the steps to the right are needed for covering w: δ1, δ2 are minimal coverings!

Page 27: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

27

a b a b a b

a b a b a b4:

a b a b a b 5:

3:

All blue steps are useless for covering w :δ3, δ4, δ5 are not minimal.We count only minimal coverings.

Page 28: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

28

Toward a cov-stability definition (I)

stable u,w,uv,vw M v M

u v w

u v w

z-stable w, vw M , uv, u Z-prefix-strict(uvw)

v Z -prefix-strict(uvw)

Page 29: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

29

a b c d e f g

Set u=ab, v=c, w=defg, x=de, y=cd.

•Note vz=cdef M, z w .

Therefore w, vw, uvx, uy M

vx

Example: X= abcd+bcde+cdef+defg M=Xcov

but vx =cde M.

Page 30: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

30

a b c d e

Set u=ab, v= , w=cde, x=cd, y=c.

Therefore w, vw, uvx, uy M

• Note bcd M, bcd proper suffix of ux.

Example: X = abc + bcd + cde M=Xcov

u x

w

but vz M for no z w.

Page 31: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

31

Case 1.

u v w

yx

vz M

z w

u v w

xy

v

y v

v

y v vx M

Page 32: Ravello 19-21 settembre 2003Covering Problems from a Formal Language Point of View M. Anselmo - M. Madonia 1 Covering problems from a formal language point

Ravello 19-21 settembre 2003

Covering Problems from a Formal Language Point of ViewM. Anselmo - M. Madonia

32

Case 2.

v

x y

u

t M, t proper suffix of ux

u w

yx