; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G12095 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G12095
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionTy3/gypsy retrotransposon protein
Genome locationClcChr01:20483782..20487084
RNA-Seq ExpressionClc01G12095
SyntenyClc01G12095
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025132.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]9.8e-21852.68Show/hide
Query:  LVSTLHLPIVKTSNYGVILGSRSAVKGKGICNAVKLIFGE-----------------------LIGIRRSGCDFGYVVVAYFRRNGGEKNIVLKGDPSLT
        LV  L LP+ +T NYGVILGS +++KGKG+C+ V+L  G                        L  I  +  D+  +++ +F        + +KGDPSLT
Subjt:  LVSTLHLPIVKTSNYGVILGSRSAVKGKGICNAVKLIFGE-----------------------LIGIRRSGCDFGYVVVAYFRRNGGEKNIVLKGDPSLT

Query:  KSQVSLKSMIKSWAAWNQGFLIVCRATEGEMSLAKWYRIDEVYIRVES--IPAVIDKFKGVFDWLVTLPPRRSIEHHIHLKKGTDPVNVRRYRYAYHQKE
        K+Q+SLK MIKSW   +QG+L+ CR  E E    +   +D      E   + A+++KF+ VF+    LPP+R IEHHI+LK+G DPVNVR YRYA+ QKE
Subjt:  KSQVSLKSMIKSWAAWNQGFLIVCRATEGEMSLAKWYRIDEVYIRVES--IPAVIDKFKGVFDWLVTLPPRRSIEHHIHLKKGTDPVNVRRYRYAYHQKE

Query:  EMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDYRALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVF----
        EME+LVDEML+ GII PS SPYSS VLLV+K+DG W+FCVDYRALNNVT+P K PI VVEELFDELNGAD+FSKIDLK+ YHQIRM+  DIEKT F    
Subjt:  EMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDYRALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVF----

Query:  --------------------SLMNAIFKPFLRRFVLIFFDDILVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEK
                            +LMN +FKPFLRRFVL+FFDDIL+YSK ME+H +HLE V  +LRE+EL+AN +KC FA+ R+ YLG+ I  KG+EVD EK
Subjt:  --------------------SLMNAIFKPFLRRFVLIFFDDILVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEK

Query:  VGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG----------RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSHHK
        + A  EWP P+N++E+RGFLGLTGYYRRFVQ+YG          + GA +WTE+ + AFE+LK+AMMTLPVLA P+F+ PFE+E DASG+G+GA+L   K
Subjt:  VGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG----------RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSHHK

Query:  RPIA--------------------------------------FVVKTDPQALKFLLEQRVIQPQHQKWIAKLLGYDFQVVYRPGLENKAADVWSRIPPIV
        +P+A                                      F VKTD ++LKFLLEQRVIQPQ+Q+WIAKLLGY F+V+Y+PGLENKAAD  SRI P  
Subjt:  RPIA--------------------------------------FVVKTDPQALKFLLEQRVIQPQHQKWIAKLLGYDFQVVYRPGLENKAADVWSRIPPIV

Query:  HLAHLLAPTIVDVNVIKVEVEADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPT-----------GTFKILRTYKSLTRELYWEGMK
        HL  L AP ++DV VI+ EV  DP L+E++  ++     +  ++  QG+LK+KGRLVLSK+S L+PT           G    LRTYK +  ELYW+GMK
Subjt:  HLAHLLAPTIVDVNVIKVEVEADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPT-----------GTFKILRTYKSLTRELYWEGMK

Query:  GDVKRYVEECLVCQRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT
         DV++Y EEC++CQ+NKS ALSP GLL+PLEIPD IWSDI+M FIEGLPK+ G +VILVVVDRL+
Subjt:  GDVKRYVEECLVCQRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT

KAA0049776.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]5.7e-21852.81Show/hide
Query:  LVSTLHLPIVKTSNYGVILGSRSAVKGKGICNAVKLIFGE-----------------------LIGIRRSGCDFGYVVVAYFRRNGGEKNIVLKGDPSLT
        LV  L LP+ +T NYGVILGS +++KGKG+C+ V+L  G                        L  I  +  D+  +++ +F        + +KGDPSLT
Subjt:  LVSTLHLPIVKTSNYGVILGSRSAVKGKGICNAVKLIFGE-----------------------LIGIRRSGCDFGYVVVAYFRRNGGEKNIVLKGDPSLT

Query:  KSQVSLKSMIKSWAAWNQGFLIVCRATEGEMSLAKWYRIDEVYIRVES--IPAVIDKFKGVFDWLVTLPPRRSIEHHIHLKKGTDPVNVRRYRYAYHQKE
        K+Q+SLK MIKSW   +QG+L+ CR  E E    +   +D      E   + A+++KF+ VF+    LPP+R IEHHI+LK+G DPVNVR YRYA+ QKE
Subjt:  KSQVSLKSMIKSWAAWNQGFLIVCRATEGEMSLAKWYRIDEVYIRVES--IPAVIDKFKGVFDWLVTLPPRRSIEHHIHLKKGTDPVNVRRYRYAYHQKE

Query:  EMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDYRALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVF----
        EME+LVDEML+ GII PS SPYSS VLLV+KKDG W+FCVDYRALNNVT+P K PI VVEELFDELNGAD+FSKIDLK+ YHQIRM+  DIEKT F    
Subjt:  EMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDYRALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVF----

Query:  --------------------SLMNAIFKPFLRRFVLIFFDDILVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEK
                            +LMN +FKPFLRRFVL+FFDDIL+YSK ME+H +HLE V  +LRE+EL+AN +KC FA+ R+ YLG+ I  KG+EVD EK
Subjt:  --------------------SLMNAIFKPFLRRFVLIFFDDILVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEK

Query:  VGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG----------RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSHHK
        + A  EWP P+N++E+RGFLGLTGYYRRFVQ+YG          + GA +WTE+ + AFE+LK+AMMTLPVLA P+F+ PFE+E DASG+G+GA+L   K
Subjt:  VGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG----------RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSHHK

Query:  RPIA--------------------------------------FVVKTDPQALKFLLEQRVIQPQHQKWIAKLLGYDFQVVYRPGLENKAADVWSRIPPIV
        +P+A                                      F VKTD ++LKFLLEQRVIQPQ+Q+WIAKLLGY F+V+Y+PGLENKAAD  SRI P  
Subjt:  RPIA--------------------------------------FVVKTDPQALKFLLEQRVIQPQHQKWIAKLLGYDFQVVYRPGLENKAADVWSRIPPIV

Query:  HLAHLLAPTIVDVNVIKVEVEADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPT-----------GTFKILRTYKSLTRELYWEGMK
        HL  L AP ++DV VI+ EV  DP L+E++  ++     +  ++  QG+LK+KGRLVLSK+S L+PT           G    LRTYK +  ELYW+GMK
Subjt:  HLAHLLAPTIVDVNVIKVEVEADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPT-----------GTFKILRTYKSLTRELYWEGMK

Query:  GDVKRYVEECLVCQRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT
         DV++Y EEC++CQ+NKS ALSP GLL+PLEIPD IWSDI+M FIEGLPK+ G +VILVVVDRL+
Subjt:  GDVKRYVEECLVCQRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT

KAA0066118.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.4e-21852.95Show/hide
Query:  LVSTLHLPIVKTSNYGVILGSRSAVKGKGICNAVKLIFGE-----------------------LIGIRRSGCDFGYVVVAYFRRNGGEKNIVLKGDPSLT
        LV  L LP+ +T NYGVILGS +++KGKG+C+ V+L  G                        L  I  +  D+  +++ +F +      + +KGDPSLT
Subjt:  LVSTLHLPIVKTSNYGVILGSRSAVKGKGICNAVKLIFGE-----------------------LIGIRRSGCDFGYVVVAYFRRNGGEKNIVLKGDPSLT

Query:  KSQVSLKSMIKSWAAWNQGFLIVCRATEGEMSLAKWYRIDEVYIRVESIPAVIDKFKGVFDWLVTLPPRRSIEHHIHLKKGTDPVNVRRYRYAYHQKEEM
        K+Q+SLK MIK+W    QG+L+ CRA E E    +             + A+I+KF  VF+    LPP+R IEHHI+LK+G DPVNVR YRYA+ QKEEM
Subjt:  KSQVSLKSMIKSWAAWNQGFLIVCRATEGEMSLAKWYRIDEVYIRVESIPAVIDKFKGVFDWLVTLPPRRSIEHHIHLKKGTDPVNVRRYRYAYHQKEEM

Query:  EKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDYRALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVF------
        E+LVDEMLT GII PS SPYSS VLLV+KKDG W+FCVDYRALNNVT+P K PI VVEELFDELNGAD+FSKIDLK+ YHQIRM+  DIEKT F      
Subjt:  EKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDYRALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVF------

Query:  ------------------SLMNAIFKPFLRRFVLIFFDDILVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEKVG
                          +LMN +FKPFLRRFVL+FFDDIL+YSK ME+H +HLE V  +LRE+EL+AN +KC FA+ R+ YLG+ I  KG+EVD EK+ 
Subjt:  ------------------SLMNAIFKPFLRRFVLIFFDDILVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEKVG

Query:  ARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG----------RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSHHKRP
        A  EWP PSN++E+RGFLGLTGYYRRFVQ+YG          + GA +WTE+ + AFE+LK+AMMTLPVLA P+F+ PFE+E DASG+G+GA+L   KRP
Subjt:  ARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG----------RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSHHKRP

Query:  IA--------------------------------------FVVKTDPQALKFLLEQRVIQPQHQKWIAKLLGYDFQVVYRPGLENKAADVWSRIPPIVHL
        +A                                      F VKTD ++LKFLLEQRVIQPQ+Q+WIAKLLGY F+V+Y+PGL+NKAAD  SR+ P  HL
Subjt:  IA--------------------------------------FVVKTDPQALKFLLEQRVIQPQHQKWIAKLLGYDFQVVYRPGLENKAADVWSRIPPIVHL

Query:  AHLLAPTIVDVNVIKVEVEADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPT-----------GTFKILRTYKSLTRELYWEGMKGD
          L AP ++DV VI+ EV+ DP L+E+I  ++    ++  ++  QG+LK+KGRLVLSK+S L+PT           G    LRTYK +  ELYW+GMK D
Subjt:  AHLLAPTIVDVNVIKVEVEADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPT-----------GTFKILRTYKSLTRELYWEGMKGD

Query:  VKRYVEECLVCQRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT
        V++Y +EC++CQ+NKS ALSP GLL+PLEIPD IWSDI+M FIEGLPK+ G +VILVVVDRL+
Subjt:  VKRYVEECLVCQRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT

TYK15990.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]5.7e-21852.81Show/hide
Query:  LVSTLHLPIVKTSNYGVILGSRSAVKGKGICNAVKLIFGE-----------------------LIGIRRSGCDFGYVVVAYFRRNGGEKNIVLKGDPSLT
        LV  L LP+ +T NYGVILGS +++KGKG+C+ V+L  G                        L  I  +  D+  +++ +F        + +KGDPSLT
Subjt:  LVSTLHLPIVKTSNYGVILGSRSAVKGKGICNAVKLIFGE-----------------------LIGIRRSGCDFGYVVVAYFRRNGGEKNIVLKGDPSLT

Query:  KSQVSLKSMIKSWAAWNQGFLIVCRATEGEMSLAKWYRIDEVYIRVES--IPAVIDKFKGVFDWLVTLPPRRSIEHHIHLKKGTDPVNVRRYRYAYHQKE
        K+Q+SLK MIKSW   +QG+L+ CR  E E    +   +D      E   + A+++KF+ VF+    LPP+R IEHHI+LK+G DPVNVR YRYA+ QKE
Subjt:  KSQVSLKSMIKSWAAWNQGFLIVCRATEGEMSLAKWYRIDEVYIRVES--IPAVIDKFKGVFDWLVTLPPRRSIEHHIHLKKGTDPVNVRRYRYAYHQKE

Query:  EMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDYRALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVF----
        EME+LVDEML+ GII PS SPYSS VLLV+KKDG W+FCVDYRALNNVT+P K PI VVEELFDELNGAD+FSKIDLK+ YHQIRM+  DIEKT F    
Subjt:  EMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDYRALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVF----

Query:  --------------------SLMNAIFKPFLRRFVLIFFDDILVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEK
                            +LMN +FKPFLRRFVL+FFDDIL+YSK ME+H +HLE V  +LRE+EL+AN +KC FA+ R+ YLG+ I  KG+EVD EK
Subjt:  --------------------SLMNAIFKPFLRRFVLIFFDDILVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEK

Query:  VGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG----------RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSHHK
        + A  EWP P+N++E+RGFLGLTGYYRRFVQ+YG          + GA +WTE+ + AFE+LK+AMMTLPVLA P+F+ PFE+E DASG+G+GA+L   K
Subjt:  VGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG----------RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSHHK

Query:  RPIA--------------------------------------FVVKTDPQALKFLLEQRVIQPQHQKWIAKLLGYDFQVVYRPGLENKAADVWSRIPPIV
        +P+A                                      F VKTD ++LKFLLEQRVIQPQ+Q+WIAKLLGY F+V+Y+PGLENKAAD  SRI P  
Subjt:  RPIA--------------------------------------FVVKTDPQALKFLLEQRVIQPQHQKWIAKLLGYDFQVVYRPGLENKAADVWSRIPPIV

Query:  HLAHLLAPTIVDVNVIKVEVEADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPT-----------GTFKILRTYKSLTRELYWEGMK
        HL  L AP ++DV VI+ EV  DP L+E++  ++     +  ++  QG+LK+KGRLVLSK+S L+PT           G    LRTYK +  ELYW+GMK
Subjt:  HLAHLLAPTIVDVNVIKVEVEADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPT-----------GTFKILRTYKSLTRELYWEGMK

Query:  GDVKRYVEECLVCQRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT
         DV++Y EEC++CQ+NKS ALSP GLL+PLEIPD IWSDI+M FIEGLPK+ G +VILVVVDRL+
Subjt:  GDVKRYVEECLVCQRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT

TYK23090.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]5.7e-21852.81Show/hide
Query:  LVSTLHLPIVKTSNYGVILGSRSAVKGKGICNAVKLIFGE-----------------------LIGIRRSGCDFGYVVVAYFRRNGGEKNIVLKGDPSLT
        LV  L LP+ +T NYGVILGS +++KGKG+C+ V+L  G                        L  I  +  D+  +++ +F        + +KGDPSLT
Subjt:  LVSTLHLPIVKTSNYGVILGSRSAVKGKGICNAVKLIFGE-----------------------LIGIRRSGCDFGYVVVAYFRRNGGEKNIVLKGDPSLT

Query:  KSQVSLKSMIKSWAAWNQGFLIVCRATEGEMSLAKWYRIDEVYIRVES--IPAVIDKFKGVFDWLVTLPPRRSIEHHIHLKKGTDPVNVRRYRYAYHQKE
        K+Q+SLK MIKSW   +QG+L+ CR  E E    +   +D      E   + A+++KF+ VF+    LPP+R IEHHI+LK+G DPVNVR YRYA+ QKE
Subjt:  KSQVSLKSMIKSWAAWNQGFLIVCRATEGEMSLAKWYRIDEVYIRVES--IPAVIDKFKGVFDWLVTLPPRRSIEHHIHLKKGTDPVNVRRYRYAYHQKE

Query:  EMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDYRALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVF----
        EME+LVDEML+ GII PS SPYSS VLLV+KKDG W+FCVDYRALNNVT+P K PI VVEELFDELNGAD+FSKIDLK+ YHQIRM+  DIEKT F    
Subjt:  EMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDYRALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVF----

Query:  --------------------SLMNAIFKPFLRRFVLIFFDDILVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEK
                            +LMN +FKPFLRRFVL+FFDDIL+YSK ME+H +HLE V  +LRE+EL+AN +KC FA+ R+ YLG+ I  KG+EVD EK
Subjt:  --------------------SLMNAIFKPFLRRFVLIFFDDILVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEK

Query:  VGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG----------RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSHHK
        + A  EWP P+N++E+RGFLGLTGYYRRFVQ+YG          + GA +WTE+ + AFE+LK+AMMTLPVLA P+F+ PFE+E DASG+G+GA+L   K
Subjt:  VGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG----------RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSHHK

Query:  RPIA--------------------------------------FVVKTDPQALKFLLEQRVIQPQHQKWIAKLLGYDFQVVYRPGLENKAADVWSRIPPIV
        +P+A                                      F VKTD ++LKFLLEQRVIQPQ+Q+WIAKLLGY F+V+Y+PGLENKAAD  SRI P  
Subjt:  RPIA--------------------------------------FVVKTDPQALKFLLEQRVIQPQHQKWIAKLLGYDFQVVYRPGLENKAADVWSRIPPIV

Query:  HLAHLLAPTIVDVNVIKVEVEADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPT-----------GTFKILRTYKSLTRELYWEGMK
        HL  L AP ++DV VI+ EV  DP L+E++  ++     +  ++  QG+LK+KGRLVLSK+S L+PT           G    LRTYK +  ELYW+GMK
Subjt:  HLAHLLAPTIVDVNVIKVEVEADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPT-----------GTFKILRTYKSLTRELYWEGMK

Query:  GDVKRYVEECLVCQRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT
         DV++Y EEC++CQ+NKS ALSP GLL+PLEIPD IWSDI+M FIEGLPK+ G +VILVVVDRL+
Subjt:  GDVKRYVEECLVCQRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT

TrEMBL top hitse value%identityAlignment
A0A5A7SIV7 Ty3/gypsy retrotransposon protein4.7e-21852.68Show/hide
Query:  LVSTLHLPIVKTSNYGVILGSRSAVKGKGICNAVKLIFGE-----------------------LIGIRRSGCDFGYVVVAYFRRNGGEKNIVLKGDPSLT
        LV  L LP+ +T NYGVILGS +++KGKG+C+ V+L  G                        L  I  +  D+  +++ +F        + +KGDPSLT
Subjt:  LVSTLHLPIVKTSNYGVILGSRSAVKGKGICNAVKLIFGE-----------------------LIGIRRSGCDFGYVVVAYFRRNGGEKNIVLKGDPSLT

Query:  KSQVSLKSMIKSWAAWNQGFLIVCRATEGEMSLAKWYRIDEVYIRVES--IPAVIDKFKGVFDWLVTLPPRRSIEHHIHLKKGTDPVNVRRYRYAYHQKE
        K+Q+SLK MIKSW   +QG+L+ CR  E E    +   +D      E   + A+++KF+ VF+    LPP+R IEHHI+LK+G DPVNVR YRYA+ QKE
Subjt:  KSQVSLKSMIKSWAAWNQGFLIVCRATEGEMSLAKWYRIDEVYIRVES--IPAVIDKFKGVFDWLVTLPPRRSIEHHIHLKKGTDPVNVRRYRYAYHQKE

Query:  EMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDYRALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVF----
        EME+LVDEML+ GII PS SPYSS VLLV+K+DG W+FCVDYRALNNVT+P K PI VVEELFDELNGAD+FSKIDLK+ YHQIRM+  DIEKT F    
Subjt:  EMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDYRALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVF----

Query:  --------------------SLMNAIFKPFLRRFVLIFFDDILVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEK
                            +LMN +FKPFLRRFVL+FFDDIL+YSK ME+H +HLE V  +LRE+EL+AN +KC FA+ R+ YLG+ I  KG+EVD EK
Subjt:  --------------------SLMNAIFKPFLRRFVLIFFDDILVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEK

Query:  VGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG----------RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSHHK
        + A  EWP P+N++E+RGFLGLTGYYRRFVQ+YG          + GA +WTE+ + AFE+LK+AMMTLPVLA P+F+ PFE+E DASG+G+GA+L   K
Subjt:  VGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG----------RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSHHK

Query:  RPIA--------------------------------------FVVKTDPQALKFLLEQRVIQPQHQKWIAKLLGYDFQVVYRPGLENKAADVWSRIPPIV
        +P+A                                      F VKTD ++LKFLLEQRVIQPQ+Q+WIAKLLGY F+V+Y+PGLENKAAD  SRI P  
Subjt:  RPIA--------------------------------------FVVKTDPQALKFLLEQRVIQPQHQKWIAKLLGYDFQVVYRPGLENKAADVWSRIPPIV

Query:  HLAHLLAPTIVDVNVIKVEVEADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPT-----------GTFKILRTYKSLTRELYWEGMK
        HL  L AP ++DV VI+ EV  DP L+E++  ++     +  ++  QG+LK+KGRLVLSK+S L+PT           G    LRTYK +  ELYW+GMK
Subjt:  HLAHLLAPTIVDVNVIKVEVEADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPT-----------GTFKILRTYKSLTRELYWEGMK

Query:  GDVKRYVEECLVCQRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT
         DV++Y EEC++CQ+NKS ALSP GLL+PLEIPD IWSDI+M FIEGLPK+ G +VILVVVDRL+
Subjt:  GDVKRYVEECLVCQRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT

A0A5A7U6J3 Ty3/gypsy retrotransposon protein2.8e-21852.81Show/hide
Query:  LVSTLHLPIVKTSNYGVILGSRSAVKGKGICNAVKLIFGE-----------------------LIGIRRSGCDFGYVVVAYFRRNGGEKNIVLKGDPSLT
        LV  L LP+ +T NYGVILGS +++KGKG+C+ V+L  G                        L  I  +  D+  +++ +F        + +KGDPSLT
Subjt:  LVSTLHLPIVKTSNYGVILGSRSAVKGKGICNAVKLIFGE-----------------------LIGIRRSGCDFGYVVVAYFRRNGGEKNIVLKGDPSLT

Query:  KSQVSLKSMIKSWAAWNQGFLIVCRATEGEMSLAKWYRIDEVYIRVES--IPAVIDKFKGVFDWLVTLPPRRSIEHHIHLKKGTDPVNVRRYRYAYHQKE
        K+Q+SLK MIKSW   +QG+L+ CR  E E    +   +D      E   + A+++KF+ VF+    LPP+R IEHHI+LK+G DPVNVR YRYA+ QKE
Subjt:  KSQVSLKSMIKSWAAWNQGFLIVCRATEGEMSLAKWYRIDEVYIRVES--IPAVIDKFKGVFDWLVTLPPRRSIEHHIHLKKGTDPVNVRRYRYAYHQKE

Query:  EMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDYRALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVF----
        EME+LVDEML+ GII PS SPYSS VLLV+KKDG W+FCVDYRALNNVT+P K PI VVEELFDELNGAD+FSKIDLK+ YHQIRM+  DIEKT F    
Subjt:  EMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDYRALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVF----

Query:  --------------------SLMNAIFKPFLRRFVLIFFDDILVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEK
                            +LMN +FKPFLRRFVL+FFDDIL+YSK ME+H +HLE V  +LRE+EL+AN +KC FA+ R+ YLG+ I  KG+EVD EK
Subjt:  --------------------SLMNAIFKPFLRRFVLIFFDDILVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEK

Query:  VGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG----------RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSHHK
        + A  EWP P+N++E+RGFLGLTGYYRRFVQ+YG          + GA +WTE+ + AFE+LK+AMMTLPVLA P+F+ PFE+E DASG+G+GA+L   K
Subjt:  VGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG----------RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSHHK

Query:  RPIA--------------------------------------FVVKTDPQALKFLLEQRVIQPQHQKWIAKLLGYDFQVVYRPGLENKAADVWSRIPPIV
        +P+A                                      F VKTD ++LKFLLEQRVIQPQ+Q+WIAKLLGY F+V+Y+PGLENKAAD  SRI P  
Subjt:  RPIA--------------------------------------FVVKTDPQALKFLLEQRVIQPQHQKWIAKLLGYDFQVVYRPGLENKAADVWSRIPPIV

Query:  HLAHLLAPTIVDVNVIKVEVEADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPT-----------GTFKILRTYKSLTRELYWEGMK
        HL  L AP ++DV VI+ EV  DP L+E++  ++     +  ++  QG+LK+KGRLVLSK+S L+PT           G    LRTYK +  ELYW+GMK
Subjt:  HLAHLLAPTIVDVNVIKVEVEADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPT-----------GTFKILRTYKSLTRELYWEGMK

Query:  GDVKRYVEECLVCQRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT
         DV++Y EEC++CQ+NKS ALSP GLL+PLEIPD IWSDI+M FIEGLPK+ G +VILVVVDRL+
Subjt:  GDVKRYVEECLVCQRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT

A0A5A7VEI2 Ty3/gypsy retrotransposon protein2.1e-21852.95Show/hide
Query:  LVSTLHLPIVKTSNYGVILGSRSAVKGKGICNAVKLIFGE-----------------------LIGIRRSGCDFGYVVVAYFRRNGGEKNIVLKGDPSLT
        LV  L LP+ +T NYGVILGS +++KGKG+C+ V+L  G                        L  I  +  D+  +++ +F +      + +KGDPSLT
Subjt:  LVSTLHLPIVKTSNYGVILGSRSAVKGKGICNAVKLIFGE-----------------------LIGIRRSGCDFGYVVVAYFRRNGGEKNIVLKGDPSLT

Query:  KSQVSLKSMIKSWAAWNQGFLIVCRATEGEMSLAKWYRIDEVYIRVESIPAVIDKFKGVFDWLVTLPPRRSIEHHIHLKKGTDPVNVRRYRYAYHQKEEM
        K+Q+SLK MIK+W    QG+L+ CRA E E    +             + A+I+KF  VF+    LPP+R IEHHI+LK+G DPVNVR YRYA+ QKEEM
Subjt:  KSQVSLKSMIKSWAAWNQGFLIVCRATEGEMSLAKWYRIDEVYIRVESIPAVIDKFKGVFDWLVTLPPRRSIEHHIHLKKGTDPVNVRRYRYAYHQKEEM

Query:  EKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDYRALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVF------
        E+LVDEMLT GII PS SPYSS VLLV+KKDG W+FCVDYRALNNVT+P K PI VVEELFDELNGAD+FSKIDLK+ YHQIRM+  DIEKT F      
Subjt:  EKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDYRALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVF------

Query:  ------------------SLMNAIFKPFLRRFVLIFFDDILVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEKVG
                          +LMN +FKPFLRRFVL+FFDDIL+YSK ME+H +HLE V  +LRE+EL+AN +KC FA+ R+ YLG+ I  KG+EVD EK+ 
Subjt:  ------------------SLMNAIFKPFLRRFVLIFFDDILVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEKVG

Query:  ARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG----------RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSHHKRP
        A  EWP PSN++E+RGFLGLTGYYRRFVQ+YG          + GA +WTE+ + AFE+LK+AMMTLPVLA P+F+ PFE+E DASG+G+GA+L   KRP
Subjt:  ARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG----------RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSHHKRP

Query:  IA--------------------------------------FVVKTDPQALKFLLEQRVIQPQHQKWIAKLLGYDFQVVYRPGLENKAADVWSRIPPIVHL
        +A                                      F VKTD ++LKFLLEQRVIQPQ+Q+WIAKLLGY F+V+Y+PGL+NKAAD  SR+ P  HL
Subjt:  IA--------------------------------------FVVKTDPQALKFLLEQRVIQPQHQKWIAKLLGYDFQVVYRPGLENKAADVWSRIPPIVHL

Query:  AHLLAPTIVDVNVIKVEVEADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPT-----------GTFKILRTYKSLTRELYWEGMKGD
          L AP ++DV VI+ EV+ DP L+E+I  ++    ++  ++  QG+LK+KGRLVLSK+S L+PT           G    LRTYK +  ELYW+GMK D
Subjt:  AHLLAPTIVDVNVIKVEVEADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPT-----------GTFKILRTYKSLTRELYWEGMKGD

Query:  VKRYVEECLVCQRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT
        V++Y +EC++CQ+NKS ALSP GLL+PLEIPD IWSDI+M FIEGLPK+ G +VILVVVDRL+
Subjt:  VKRYVEECLVCQRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT

A0A5D3CXB1 Ty3/gypsy retrotransposon protein2.8e-21852.81Show/hide
Query:  LVSTLHLPIVKTSNYGVILGSRSAVKGKGICNAVKLIFGE-----------------------LIGIRRSGCDFGYVVVAYFRRNGGEKNIVLKGDPSLT
        LV  L LP+ +T NYGVILGS +++KGKG+C+ V+L  G                        L  I  +  D+  +++ +F        + +KGDPSLT
Subjt:  LVSTLHLPIVKTSNYGVILGSRSAVKGKGICNAVKLIFGE-----------------------LIGIRRSGCDFGYVVVAYFRRNGGEKNIVLKGDPSLT

Query:  KSQVSLKSMIKSWAAWNQGFLIVCRATEGEMSLAKWYRIDEVYIRVES--IPAVIDKFKGVFDWLVTLPPRRSIEHHIHLKKGTDPVNVRRYRYAYHQKE
        K+Q+SLK MIKSW   +QG+L+ CR  E E    +   +D      E   + A+++KF+ VF+    LPP+R IEHHI+LK+G DPVNVR YRYA+ QKE
Subjt:  KSQVSLKSMIKSWAAWNQGFLIVCRATEGEMSLAKWYRIDEVYIRVES--IPAVIDKFKGVFDWLVTLPPRRSIEHHIHLKKGTDPVNVRRYRYAYHQKE

Query:  EMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDYRALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVF----
        EME+LVDEML+ GII PS SPYSS VLLV+KKDG W+FCVDYRALNNVT+P K PI VVEELFDELNGAD+FSKIDLK+ YHQIRM+  DIEKT F    
Subjt:  EMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDYRALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVF----

Query:  --------------------SLMNAIFKPFLRRFVLIFFDDILVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEK
                            +LMN +FKPFLRRFVL+FFDDIL+YSK ME+H +HLE V  +LRE+EL+AN +KC FA+ R+ YLG+ I  KG+EVD EK
Subjt:  --------------------SLMNAIFKPFLRRFVLIFFDDILVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEK

Query:  VGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG----------RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSHHK
        + A  EWP P+N++E+RGFLGLTGYYRRFVQ+YG          + GA +WTE+ + AFE+LK+AMMTLPVLA P+F+ PFE+E DASG+G+GA+L   K
Subjt:  VGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG----------RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSHHK

Query:  RPIA--------------------------------------FVVKTDPQALKFLLEQRVIQPQHQKWIAKLLGYDFQVVYRPGLENKAADVWSRIPPIV
        +P+A                                      F VKTD ++LKFLLEQRVIQPQ+Q+WIAKLLGY F+V+Y+PGLENKAAD  SRI P  
Subjt:  RPIA--------------------------------------FVVKTDPQALKFLLEQRVIQPQHQKWIAKLLGYDFQVVYRPGLENKAADVWSRIPPIV

Query:  HLAHLLAPTIVDVNVIKVEVEADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPT-----------GTFKILRTYKSLTRELYWEGMK
        HL  L AP ++DV VI+ EV  DP L+E++  ++     +  ++  QG+LK+KGRLVLSK+S L+PT           G    LRTYK +  ELYW+GMK
Subjt:  HLAHLLAPTIVDVNVIKVEVEADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPT-----------GTFKILRTYKSLTRELYWEGMK

Query:  GDVKRYVEECLVCQRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT
         DV++Y EEC++CQ+NKS ALSP GLL+PLEIPD IWSDI+M FIEGLPK+ G +VILVVVDRL+
Subjt:  GDVKRYVEECLVCQRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT

A0A5D3DI73 Ty3/gypsy retrotransposon protein2.8e-21852.81Show/hide
Query:  LVSTLHLPIVKTSNYGVILGSRSAVKGKGICNAVKLIFGE-----------------------LIGIRRSGCDFGYVVVAYFRRNGGEKNIVLKGDPSLT
        LV  L LP+ +T NYGVILGS +++KGKG+C+ V+L  G                        L  I  +  D+  +++ +F        + +KGDPSLT
Subjt:  LVSTLHLPIVKTSNYGVILGSRSAVKGKGICNAVKLIFGE-----------------------LIGIRRSGCDFGYVVVAYFRRNGGEKNIVLKGDPSLT

Query:  KSQVSLKSMIKSWAAWNQGFLIVCRATEGEMSLAKWYRIDEVYIRVES--IPAVIDKFKGVFDWLVTLPPRRSIEHHIHLKKGTDPVNVRRYRYAYHQKE
        K+Q+SLK MIKSW   +QG+L+ CR  E E    +   +D      E   + A+++KF+ VF+    LPP+R IEHHI+LK+G DPVNVR YRYA+ QKE
Subjt:  KSQVSLKSMIKSWAAWNQGFLIVCRATEGEMSLAKWYRIDEVYIRVES--IPAVIDKFKGVFDWLVTLPPRRSIEHHIHLKKGTDPVNVRRYRYAYHQKE

Query:  EMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDYRALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVF----
        EME+LVDEML+ GII PS SPYSS VLLV+KKDG W+FCVDYRALNNVT+P K PI VVEELFDELNGAD+FSKIDLK+ YHQIRM+  DIEKT F    
Subjt:  EMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDYRALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVF----

Query:  --------------------SLMNAIFKPFLRRFVLIFFDDILVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEK
                            +LMN +FKPFLRRFVL+FFDDIL+YSK ME+H +HLE V  +LRE+EL+AN +KC FA+ R+ YLG+ I  KG+EVD EK
Subjt:  --------------------SLMNAIFKPFLRRFVLIFFDDILVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEK

Query:  VGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG----------RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSHHK
        + A  EWP P+N++E+RGFLGLTGYYRRFVQ+YG          + GA +WTE+ + AFE+LK+AMMTLPVLA P+F+ PFE+E DASG+G+GA+L   K
Subjt:  VGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG----------RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSHHK

Query:  RPIA--------------------------------------FVVKTDPQALKFLLEQRVIQPQHQKWIAKLLGYDFQVVYRPGLENKAADVWSRIPPIV
        +P+A                                      F VKTD ++LKFLLEQRVIQPQ+Q+WIAKLLGY F+V+Y+PGLENKAAD  SRI P  
Subjt:  RPIA--------------------------------------FVVKTDPQALKFLLEQRVIQPQHQKWIAKLLGYDFQVVYRPGLENKAADVWSRIPPIV

Query:  HLAHLLAPTIVDVNVIKVEVEADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPT-----------GTFKILRTYKSLTRELYWEGMK
        HL  L AP ++DV VI+ EV  DP L+E++  ++     +  ++  QG+LK+KGRLVLSK+S L+PT           G    LRTYK +  ELYW+GMK
Subjt:  HLAHLLAPTIVDVNVIKVEVEADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPT-----------GTFKILRTYKSLTRELYWEGMK

Query:  GDVKRYVEECLVCQRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT
         DV++Y EEC++CQ+NKS ALSP GLL+PLEIPD IWSDI+M FIEGLPK+ G +VILVVVDRL+
Subjt:  GDVKRYVEECLVCQRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein5.8e-5625.46Show/hide
Query:  IRVESIPAVIDKFKGVFDWLVT--LP-PRRSIEHHIHLKKGTDPVNVRRYRYAYHQKEEMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDY
        ++   +P +  +FK +     T  LP P + +E  + L +    + +R Y     + + M   +++ L  GII  S +  +  V+ V KK+G  +  VDY
Subjt:  IRVESIPAVIDKFKGVFDWLVT--LP-PRRSIEHHIHLKKGTDPVNVRRYRYAYHQKEEMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDY

Query:  RALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVFSLMNAIFKPFLRRF------------------------VLIFFDDI
        + LN    P   P+ ++E+L  ++ G+ +F+K+DLKS YH IR+  GD  K  F     +F+  +  +                        V+ + DDI
Subjt:  RALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVFSLMNAIFKPFLRRF------------------------VLIFFDDI

Query:  LVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEKVGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG--------
        L++SKS  +HVKH++ V   L+   L  N+ KC+F + +++++G  I  KG     E +   L+W  P N KE+R FLG   Y R+F+            
Subjt:  LVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEKVGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG--------

Query:  ---RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSH---------------------------HKRPIAFV--------
           +    +WT    +A E +K+ +++ PVL   +F +   +E DAS   +GA+LS                             K  +A +        
Subjt:  ---RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSH---------------------------HKRPIAFV--------

Query:  -VKTDPQALKFLLEQRVI--------QPQHQ---KWIAKLLGYDFQVVYRPGLENKAADVWSRI----PPI--------VHLAHLLAPTIVDVNVIKVEV
         +++  +  K L + R +        +P+++   +W   L  ++F++ YRPG  N  AD  SRI     PI        ++  + ++ T    N +  E 
Subjt:  -VKTDPQALKFLLEQRVI--------QPQHQ---KWIAKLLGYDFQVVYRPGLENKAADVWSRI----PPI--------VHLAHLLAPTIVDVNVIKVEV

Query:  EADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPTG---TFKILRTYKS---------------LTRELYWEGMKGDVKRYVEECLVC
          D KL  ++      V        ++ +    G L+ SK  +LLP     T  I++ Y                 + R   W+G++  ++ YV+ C  C
Subjt:  EADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPTG---TFKILRTYKS---------------LTRELYWEGMKGDVKRYVEECLVC

Query:  QRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT
        Q NKS    P G L P+   +  W  ++M FI  LP++ G++ + VVVDR +
Subjt:  QRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT

P0CT35 Transposon Tf2-2 polyprotein5.8e-5625.46Show/hide
Query:  IRVESIPAVIDKFKGVFDWLVT--LP-PRRSIEHHIHLKKGTDPVNVRRYRYAYHQKEEMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDY
        ++   +P +  +FK +     T  LP P + +E  + L +    + +R Y     + + M   +++ L  GII  S +  +  V+ V KK+G  +  VDY
Subjt:  IRVESIPAVIDKFKGVFDWLVT--LP-PRRSIEHHIHLKKGTDPVNVRRYRYAYHQKEEMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDY

Query:  RALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVFSLMNAIFKPFLRRF------------------------VLIFFDDI
        + LN    P   P+ ++E+L  ++ G+ +F+K+DLKS YH IR+  GD  K  F     +F+  +  +                        V+ + DDI
Subjt:  RALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVFSLMNAIFKPFLRRF------------------------VLIFFDDI

Query:  LVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEKVGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG--------
        L++SKS  +HVKH++ V   L+   L  N+ KC+F + +++++G  I  KG     E +   L+W  P N KE+R FLG   Y R+F+            
Subjt:  LVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEKVGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG--------

Query:  ---RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSH---------------------------HKRPIAFV--------
           +    +WT    +A E +K+ +++ PVL   +F +   +E DAS   +GA+LS                             K  +A +        
Subjt:  ---RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSH---------------------------HKRPIAFV--------

Query:  -VKTDPQALKFLLEQRVI--------QPQHQ---KWIAKLLGYDFQVVYRPGLENKAADVWSRI----PPI--------VHLAHLLAPTIVDVNVIKVEV
         +++  +  K L + R +        +P+++   +W   L  ++F++ YRPG  N  AD  SRI     PI        ++  + ++ T    N +  E 
Subjt:  -VKTDPQALKFLLEQRVI--------QPQHQ---KWIAKLLGYDFQVVYRPGLENKAADVWSRI----PPI--------VHLAHLLAPTIVDVNVIKVEV

Query:  EADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPTG---TFKILRTYKS---------------LTRELYWEGMKGDVKRYVEECLVC
          D KL  ++      V        ++ +    G L+ SK  +LLP     T  I++ Y                 + R   W+G++  ++ YV+ C  C
Subjt:  EADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPTG---TFKILRTYKS---------------LTRELYWEGMKGDVKRYVEECLVC

Query:  QRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT
        Q NKS    P G L P+   +  W  ++M FI  LP++ G++ + VVVDR +
Subjt:  QRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT

P0CT36 Transposon Tf2-3 polyprotein5.8e-5625.46Show/hide
Query:  IRVESIPAVIDKFKGVFDWLVT--LP-PRRSIEHHIHLKKGTDPVNVRRYRYAYHQKEEMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDY
        ++   +P +  +FK +     T  LP P + +E  + L +    + +R Y     + + M   +++ L  GII  S +  +  V+ V KK+G  +  VDY
Subjt:  IRVESIPAVIDKFKGVFDWLVT--LP-PRRSIEHHIHLKKGTDPVNVRRYRYAYHQKEEMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDY

Query:  RALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVFSLMNAIFKPFLRRF------------------------VLIFFDDI
        + LN    P   P+ ++E+L  ++ G+ +F+K+DLKS YH IR+  GD  K  F     +F+  +  +                        V+ + DDI
Subjt:  RALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVFSLMNAIFKPFLRRF------------------------VLIFFDDI

Query:  LVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEKVGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG--------
        L++SKS  +HVKH++ V   L+   L  N+ KC+F + +++++G  I  KG     E +   L+W  P N KE+R FLG   Y R+F+            
Subjt:  LVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEKVGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG--------

Query:  ---RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSH---------------------------HKRPIAFV--------
           +    +WT    +A E +K+ +++ PVL   +F +   +E DAS   +GA+LS                             K  +A +        
Subjt:  ---RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSH---------------------------HKRPIAFV--------

Query:  -VKTDPQALKFLLEQRVI--------QPQHQ---KWIAKLLGYDFQVVYRPGLENKAADVWSRI----PPI--------VHLAHLLAPTIVDVNVIKVEV
         +++  +  K L + R +        +P+++   +W   L  ++F++ YRPG  N  AD  SRI     PI        ++  + ++ T    N +  E 
Subjt:  -VKTDPQALKFLLEQRVI--------QPQHQ---KWIAKLLGYDFQVVYRPGLENKAADVWSRI----PPI--------VHLAHLLAPTIVDVNVIKVEV

Query:  EADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPTG---TFKILRTYKS---------------LTRELYWEGMKGDVKRYVEECLVC
          D KL  ++      V        ++ +    G L+ SK  +LLP     T  I++ Y                 + R   W+G++  ++ YV+ C  C
Subjt:  EADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPTG---TFKILRTYKS---------------LTRELYWEGMKGDVKRYVEECLVC

Query:  QRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT
        Q NKS    P G L P+   +  W  ++M FI  LP++ G++ + VVVDR +
Subjt:  QRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT

P0CT37 Transposon Tf2-4 polyprotein5.8e-5625.46Show/hide
Query:  IRVESIPAVIDKFKGVFDWLVT--LP-PRRSIEHHIHLKKGTDPVNVRRYRYAYHQKEEMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDY
        ++   +P +  +FK +     T  LP P + +E  + L +    + +R Y     + + M   +++ L  GII  S +  +  V+ V KK+G  +  VDY
Subjt:  IRVESIPAVIDKFKGVFDWLVT--LP-PRRSIEHHIHLKKGTDPVNVRRYRYAYHQKEEMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDY

Query:  RALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVFSLMNAIFKPFLRRF------------------------VLIFFDDI
        + LN    P   P+ ++E+L  ++ G+ +F+K+DLKS YH IR+  GD  K  F     +F+  +  +                        V+ + DDI
Subjt:  RALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVFSLMNAIFKPFLRRF------------------------VLIFFDDI

Query:  LVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEKVGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG--------
        L++SKS  +HVKH++ V   L+   L  N+ KC+F + +++++G  I  KG     E +   L+W  P N KE+R FLG   Y R+F+            
Subjt:  LVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEKVGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG--------

Query:  ---RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSH---------------------------HKRPIAFV--------
           +    +WT    +A E +K+ +++ PVL   +F +   +E DAS   +GA+LS                             K  +A +        
Subjt:  ---RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSH---------------------------HKRPIAFV--------

Query:  -VKTDPQALKFLLEQRVI--------QPQHQ---KWIAKLLGYDFQVVYRPGLENKAADVWSRI----PPI--------VHLAHLLAPTIVDVNVIKVEV
         +++  +  K L + R +        +P+++   +W   L  ++F++ YRPG  N  AD  SRI     PI        ++  + ++ T    N +  E 
Subjt:  -VKTDPQALKFLLEQRVI--------QPQHQ---KWIAKLLGYDFQVVYRPGLENKAADVWSRI----PPI--------VHLAHLLAPTIVDVNVIKVEV

Query:  EADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPTG---TFKILRTYKS---------------LTRELYWEGMKGDVKRYVEECLVC
          D KL  ++      V        ++ +    G L+ SK  +LLP     T  I++ Y                 + R   W+G++  ++ YV+ C  C
Subjt:  EADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPTG---TFKILRTYKS---------------LTRELYWEGMKGDVKRYVEECLVC

Query:  QRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT
        Q NKS    P G L P+   +  W  ++M FI  LP++ G++ + VVVDR +
Subjt:  QRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT

P0CT41 Transposon Tf2-12 polyprotein5.8e-5625.46Show/hide
Query:  IRVESIPAVIDKFKGVFDWLVT--LP-PRRSIEHHIHLKKGTDPVNVRRYRYAYHQKEEMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDY
        ++   +P +  +FK +     T  LP P + +E  + L +    + +R Y     + + M   +++ L  GII  S +  +  V+ V KK+G  +  VDY
Subjt:  IRVESIPAVIDKFKGVFDWLVT--LP-PRRSIEHHIHLKKGTDPVNVRRYRYAYHQKEEMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDY

Query:  RALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVFSLMNAIFKPFLRRF------------------------VLIFFDDI
        + LN    P   P+ ++E+L  ++ G+ +F+K+DLKS YH IR+  GD  K  F     +F+  +  +                        V+ + DDI
Subjt:  RALNNVTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVFSLMNAIFKPFLRRF------------------------VLIFFDDI

Query:  LVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEKVGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG--------
        L++SKS  +HVKH++ V   L+   L  N+ KC+F + +++++G  I  KG     E +   L+W  P N KE+R FLG   Y R+F+            
Subjt:  LVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARERLEYLGNLIFGKGVEVDLEKVGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG--------

Query:  ---RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSH---------------------------HKRPIAFV--------
           +    +WT    +A E +K+ +++ PVL   +F +   +E DAS   +GA+LS                             K  +A +        
Subjt:  ---RGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSH---------------------------HKRPIAFV--------

Query:  -VKTDPQALKFLLEQRVI--------QPQHQ---KWIAKLLGYDFQVVYRPGLENKAADVWSRI----PPI--------VHLAHLLAPTIVDVNVIKVEV
         +++  +  K L + R +        +P+++   +W   L  ++F++ YRPG  N  AD  SRI     PI        ++  + ++ T    N +  E 
Subjt:  -VKTDPQALKFLLEQRVI--------QPQHQ---KWIAKLLGYDFQVVYRPGLENKAADVWSRI----PPI--------VHLAHLLAPTIVDVNVIKVEV

Query:  EADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPTG---TFKILRTYKS---------------LTRELYWEGMKGDVKRYVEECLVC
          D KL  ++      V        ++ +    G L+ SK  +LLP     T  I++ Y                 + R   W+G++  ++ YV+ C  C
Subjt:  EADPKLKEVIEKLKANVDSVLGFSWQQGMLKYKGRLVLSKSSMLLPTG---TFKILRTYKS---------------LTRELYWEGMKGDVKRYVEECLVC

Query:  QRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT
        Q NKS    P G L P+   +  W  ++M FI  LP++ G++ + VVVDR +
Subjt:  QRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLT

Arabidopsis top hitse value%identityAlignment
ATMG00850.1 DNA/RNA polymerases superfamily protein2.2e-0558.97Show/hide
Query:  QKEEMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGW
        ++  ++  + EML   II PSISPYSS VLLV+KKDGGW
Subjt:  QKEEMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGW

ATMG00860.1 DNA/RNA polymerases superfamily protein3.2e-2542.96Show/hide
Query:  VKHLESVFSMLRETELFANKNKCQFARERLEYLG--NLIFGKGVEVDLEKVGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG----------RGGALE
        + HL  V  +  + + +AN+ KC F + ++ YLG  ++I G+GV  D  K+ A + WP P N  E+RGFLGLTGYYRRFV++YG          +  +L+
Subjt:  VKHLESVFSMLRETELFANKNKCQFARERLEYLG--NLIFGKGVEVDLEKVGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYG----------RGGALE

Query:  WTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEI
        WTE A  AF+ LK A+ TLPVLA P+   PF   +
Subjt:  WTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAAGCGATTTGAGGAACAAAAGGATGCTATGGACCAAGAGGTTTCTAAAATTCTGGTGAAGATTCAACGGTTGCCGAAAATTGAAGAGACTTTGGCATCCTTGTC
GAAGAGTATCGAGCGATTGGAGGTACAAGCGGAGAAGCAATACAACGATTGCACGAGACCAACAAGACTCACAATAGCAATGAGACCAGTGAGACTGTCGATCAAGGCTG
ACCGGTACTTCAATATTCATAAGTTGTCGAAACTAGAGAAAATGATCGTGGTCATGATTAGTTTTAATGGTACAATGTTGGATTGGTATCTGGGATTTGATTGTTTGGTT
TCTACTTTGCATTTACCCATTGTGAAGACATCCAATTATGGAGTCATCTTAGGTTCTAGATCAGCAGTTAAAGGGAAAGGGATTTGTAATGCCGTTAAACTTATATTTGG
TGAGTTGATTGGAATTAGAAGAAGTGGATGTGATTTTGGGTATGTAGTGGTTGCATACTTTAGGCGTAACGGAGGAGAAAAGAATATTGTGTTGAAAGGTGATCCGAGCT
TGACAAAATCGCAAGTAAGTTTAAAGAGTATGATCAAGTCATGGGCAGCATGGAATCAAGGGTTCTTGATAGTATGTCGAGCTACGGAAGGGGAAATGTCATTGGCAAAA
TGGTATAGGATAGATGAAGTGTATATCAGGGTTGAATCAATACCAGCAGTCATTGACAAGTTTAAGGGTGTGTTTGATTGGTTAGTGACATTACCACCAAGGAGAAGCAT
TGAACATCATATTCATTTGAAGAAGGGGACTGATCCGGTCAATGTAAGACGTTATAGATATGCATATCACCAAAAAGAAGAAATGGAAAAATTGGTGGATGAAATGTTGA
CTTTAGGGATTATCTGCCCTAGCATCAGTCCATATTCAAGCCTTGTGTTGTTAGTTAAGAAGAAGGATGGGGGGTGGCAATTTTGTGTGGATTATAGGGCGTTGAACAAT
GTTACTATACCATACAAGTGTCCAATTCTAGTAGTTGAGGAGTTGTTTGATGAATTGAATGGAGCAGATCTCTTTTCAAAGATTGATCTGAAGTCAAGGTACCACCAGAT
ACGAATGTATTCGGGTGATATTGAGAAAACTGTATTTTCACTTATGAATGCCATTTTTAAACCTTTTTTGAGGAGGTTCGTATTGATTTTCTTTGATGATATTCTTGTAT
ATAGTAAATCTATGGAGGATCATGTGAAGCATCTTGAATCCGTATTTTCTATGCTTCGAGAGACTGAGTTGTTTGCTAATAAGAATAAGTGTCAATTTGCTAGAGAACGA
TTGGAATATTTGGGTAATTTAATTTTTGGAAAAGGTGTTGAGGTAGATCTAGAGAAGGTTGGAGCTAGGTTAGAGTGGCCTATTCCCTCTAATATTAAGGAGGTTCGTGG
ATTTTTGGGATTGACTGGTTATTATAGACGATTTGTTCAACATTATGGAAGAGGGGGTGCTCTTGAGTGGACTGAAGATGCACAGGAGGCTTTTGAACGTTTGAAACGAG
CTATGATGACTTTACCGGTGTTGGCCTTTCCTAATTTTGATCAACCTTTTGAAGTCGAAATTGATGCTTCTGGGTACGGTATTGGAGCTATTTTGTCTCACCATAAAAGG
CCAATCGCGTTTGTAGTCAAGACTGATCCGCAGGCTCTGAAGTTCTTGTTGGAACAACGAGTGATTCAACCTCAGCATCAAAAGTGGATTGCTAAATTGTTGGGGTATGA
TTTTCAGGTGGTTTATCGACCTGGGTTAGAGAACAAAGCAGCTGATGTGTGGTCTCGCATTCCTCCTATAGTGCATTTGGCTCATTTGTTGGCTCCTACTATCGTTGATG
TCAATGTTATTAAAGTGGAAGTTGAAGCGGATCCTAAGTTGAAGGAAGTTATTGAGAAGTTAAAAGCCAATGTTGATAGTGTTCTAGGGTTTTCGTGGCAGCAAGGTATG
TTGAAGTATAAAGGCCGACTAGTGTTGTCTAAATCTTCCATGTTACTTCCCACGGGAACATTCAAGATTTTACGTACTTACAAGAGTTTGACAAGAGAACTTTATTGGGA
AGGCATGAAAGGAGATGTGAAAAGATATGTGGAAGAATGTCTAGTTTGTCAACGGAACAAGTCATTGGCCTTGTCTCCTACAGGATTACTCATGCCCCTAGAGATTCCTG
ATGTTATTTGGAGTGATATTACCATGTATTTCATTGAAGGATTACCTAAGGCGGGGGGTCATGATGTCATATTGGTAGTGGTTGATCGATTGACAACCCCTACAGTTCAA
CGATCCAAATCTTCCCCCTACGACAACAATGACAACGCCCCGGGGAGAACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTAAGCGATTTGAGGAACAAAAGGATGCTATGGACCAAGAGGTTTCTAAAATTCTGGTGAAGATTCAACGGTTGCCGAAAATTGAAGAGACTTTGGCATCCTTGTC
GAAGAGTATCGAGCGATTGGAGGTACAAGCGGAGAAGCAATACAACGATTGCACGAGACCAACAAGACTCACAATAGCAATGAGACCAGTGAGACTGTCGATCAAGGCTG
ACCGGTACTTCAATATTCATAAGTTGTCGAAACTAGAGAAAATGATCGTGGTCATGATTAGTTTTAATGGTACAATGTTGGATTGGTATCTGGGATTTGATTGTTTGGTT
TCTACTTTGCATTTACCCATTGTGAAGACATCCAATTATGGAGTCATCTTAGGTTCTAGATCAGCAGTTAAAGGGAAAGGGATTTGTAATGCCGTTAAACTTATATTTGG
TGAGTTGATTGGAATTAGAAGAAGTGGATGTGATTTTGGGTATGTAGTGGTTGCATACTTTAGGCGTAACGGAGGAGAAAAGAATATTGTGTTGAAAGGTGATCCGAGCT
TGACAAAATCGCAAGTAAGTTTAAAGAGTATGATCAAGTCATGGGCAGCATGGAATCAAGGGTTCTTGATAGTATGTCGAGCTACGGAAGGGGAAATGTCATTGGCAAAA
TGGTATAGGATAGATGAAGTGTATATCAGGGTTGAATCAATACCAGCAGTCATTGACAAGTTTAAGGGTGTGTTTGATTGGTTAGTGACATTACCACCAAGGAGAAGCAT
TGAACATCATATTCATTTGAAGAAGGGGACTGATCCGGTCAATGTAAGACGTTATAGATATGCATATCACCAAAAAGAAGAAATGGAAAAATTGGTGGATGAAATGTTGA
CTTTAGGGATTATCTGCCCTAGCATCAGTCCATATTCAAGCCTTGTGTTGTTAGTTAAGAAGAAGGATGGGGGGTGGCAATTTTGTGTGGATTATAGGGCGTTGAACAAT
GTTACTATACCATACAAGTGTCCAATTCTAGTAGTTGAGGAGTTGTTTGATGAATTGAATGGAGCAGATCTCTTTTCAAAGATTGATCTGAAGTCAAGGTACCACCAGAT
ACGAATGTATTCGGGTGATATTGAGAAAACTGTATTTTCACTTATGAATGCCATTTTTAAACCTTTTTTGAGGAGGTTCGTATTGATTTTCTTTGATGATATTCTTGTAT
ATAGTAAATCTATGGAGGATCATGTGAAGCATCTTGAATCCGTATTTTCTATGCTTCGAGAGACTGAGTTGTTTGCTAATAAGAATAAGTGTCAATTTGCTAGAGAACGA
TTGGAATATTTGGGTAATTTAATTTTTGGAAAAGGTGTTGAGGTAGATCTAGAGAAGGTTGGAGCTAGGTTAGAGTGGCCTATTCCCTCTAATATTAAGGAGGTTCGTGG
ATTTTTGGGATTGACTGGTTATTATAGACGATTTGTTCAACATTATGGAAGAGGGGGTGCTCTTGAGTGGACTGAAGATGCACAGGAGGCTTTTGAACGTTTGAAACGAG
CTATGATGACTTTACCGGTGTTGGCCTTTCCTAATTTTGATCAACCTTTTGAAGTCGAAATTGATGCTTCTGGGTACGGTATTGGAGCTATTTTGTCTCACCATAAAAGG
CCAATCGCGTTTGTAGTCAAGACTGATCCGCAGGCTCTGAAGTTCTTGTTGGAACAACGAGTGATTCAACCTCAGCATCAAAAGTGGATTGCTAAATTGTTGGGGTATGA
TTTTCAGGTGGTTTATCGACCTGGGTTAGAGAACAAAGCAGCTGATGTGTGGTCTCGCATTCCTCCTATAGTGCATTTGGCTCATTTGTTGGCTCCTACTATCGTTGATG
TCAATGTTATTAAAGTGGAAGTTGAAGCGGATCCTAAGTTGAAGGAAGTTATTGAGAAGTTAAAAGCCAATGTTGATAGTGTTCTAGGGTTTTCGTGGCAGCAAGGTATG
TTGAAGTATAAAGGCCGACTAGTGTTGTCTAAATCTTCCATGTTACTTCCCACGGGAACATTCAAGATTTTACGTACTTACAAGAGTTTGACAAGAGAACTTTATTGGGA
AGGCATGAAAGGAGATGTGAAAAGATATGTGGAAGAATGTCTAGTTTGTCAACGGAACAAGTCATTGGCCTTGTCTCCTACAGGATTACTCATGCCCCTAGAGATTCCTG
ATGTTATTTGGAGTGATATTACCATGTATTTCATTGAAGGATTACCTAAGGCGGGGGGTCATGATGTCATATTGGTAGTGGTTGATCGATTGACAACCCCTACAGTTCAA
CGATCCAAATCTTCCCCCTACGACAACAATGACAACGCCCCGGGGAGAACTTAA
Protein sequenceShow/hide protein sequence
MAKRFEEQKDAMDQEVSKILVKIQRLPKIEETLASLSKSIERLEVQAEKQYNDCTRPTRLTIAMRPVRLSIKADRYFNIHKLSKLEKMIVVMISFNGTMLDWYLGFDCLV
STLHLPIVKTSNYGVILGSRSAVKGKGICNAVKLIFGELIGIRRSGCDFGYVVVAYFRRNGGEKNIVLKGDPSLTKSQVSLKSMIKSWAAWNQGFLIVCRATEGEMSLAK
WYRIDEVYIRVESIPAVIDKFKGVFDWLVTLPPRRSIEHHIHLKKGTDPVNVRRYRYAYHQKEEMEKLVDEMLTLGIICPSISPYSSLVLLVKKKDGGWQFCVDYRALNN
VTIPYKCPILVVEELFDELNGADLFSKIDLKSRYHQIRMYSGDIEKTVFSLMNAIFKPFLRRFVLIFFDDILVYSKSMEDHVKHLESVFSMLRETELFANKNKCQFARER
LEYLGNLIFGKGVEVDLEKVGARLEWPIPSNIKEVRGFLGLTGYYRRFVQHYGRGGALEWTEDAQEAFERLKRAMMTLPVLAFPNFDQPFEVEIDASGYGIGAILSHHKR
PIAFVVKTDPQALKFLLEQRVIQPQHQKWIAKLLGYDFQVVYRPGLENKAADVWSRIPPIVHLAHLLAPTIVDVNVIKVEVEADPKLKEVIEKLKANVDSVLGFSWQQGM
LKYKGRLVLSKSSMLLPTGTFKILRTYKSLTRELYWEGMKGDVKRYVEECLVCQRNKSLALSPTGLLMPLEIPDVIWSDITMYFIEGLPKAGGHDVILVVVDRLTTPTVQ
RSKSSPYDNNDNAPGRT