; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS018199 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS018199
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold56:470495..473724
RNA-Seq ExpressionMS018199
SyntenyMS018199
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039309.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.1e-14833.85Show/hide
Query:  GSYKSFSMSLLLDTITWLHGSFRTLIAIPRTNRFFQERRYSDYCIWIEKTSNRRACIAEIYRVDEKGHKCCIIVPEGYDKSGWKDFHNMITFN-DVK---
        G++K+FS+ +    + W+  + ++LI  P +NRFF E R  ++CIWI KT N + C AEI+RVD K  K CI+VPEG +KS W  F +MIT   +VK   
Subjt:  GSYKSFSMSLLLDTITWLHGSFRTLIAIPRTNRFFQERRYSDYCIWIEKTSNRRACIAEIYRVDEKGHKCCIIVPEGYDKSGWKDFHNMITFN-DVK---

Query:  ---------PKQIMQPP----KKPQSKAPSSSSVSDTSSRDSQSTLGNSHSQRSVINQGEEDTLISWENTVVLTCWCFHYDWTKIMEALQEINESVATFN
                 P+  + PP    K+  +KA S    S  SS  S S   +  SQ S  +  +    +  ENTVVL    FH DW KI++ L++  E   T+N
Subjt:  ---------PKQIMQPP----KKPQSKAPSSSSVSDTSSRDSQSTLGNSHSQRSVINQGEEDTLISWENTVVLTCWCFHYDWTKIMEALQEINESVATFN

Query:  PFQADKAILHVWDPEQARLMCMNKGWVTVGPFYVKFEHWVSEQHSTPKFIPSYGGWLRFRGIPLHLWNLKTSKHIGDACGGFELAGINTLRKIDIMEAAI
         F A+K ++H      A L+C NKGW TVG + V+FE W    H++PK IPSYGGW  FRGIPLHLWN+ T + IG ACGG       T    +++EA +
Subjt:  PFQADKAILHVWDPEQARLMCMNKGWVTVGPFYVKFEHWVSEQHSTPKFIPSYGGWLRFRGIPLHLWNLKTSKHIGDACGGFELAGINTLRKIDIMEAAI

Query:  KVKDNFCGFIPASIEIKDATGKIFIVNTVASAKKKWLIIRHPQIHGTFTREAAAQFDEFDPSSETYYFQGNSVIPPPI--------AAISNEMQISNTSV
        K++ N+ GF+PA ++I D  G  F+V  V  ++ KWL+ R+ ++HGTF R+AAA FD+F+P SE + F G   I P +         +IS E   +  SV
Subjt:  KVKDNFCGFIPASIEIKDATGKIFIVNTVASAKKKWLIIRHPQIHGTFTREAAAQFDEFDPSSETYYFQGNSVIPPPI--------AAISNEMQISNTSV

Query:  FISSSHIGNSSI---HPVQSEQTQIRTKGKEICKERKASHPNPILPKSK-KVIFPSQKNAPII-------------SILTQLLDHTKPPPPNSPLQYKPA
         I  +    S       V ++ +   T  K   K       +  L K K KV  PSQ  +  I             S  T   +    P  +SP + K  
Subjt:  FISSSHIGNSSI---HPVQSEQTQIRTKGKEICKERKASHPNPILPKSK-KVIFPSQKNAPII-------------SILTQLLDHTKPPPPNSPLQYKPA

Query:  ---LSLKKRSPQIKLLKPTNQ-KTTFITQNKPTVTKAHHPTPIQTTNFRPITIHQTVNHYLDFYLTVDLGPLSPISNTVVSSPNSYQ--HSPPNSNLNQK
            S+KK+S  I+     NQ K   ITQ            P+Q      +  H          LTVDLG L      V+    S++  HS  N+ +   
Subjt:  ---LSLKKRSPQIKLLKPTNQ-KTTFITQNKPTVTKAHHPTPIQTTNFRPITIHQTVNHYLDFYLTVDLGPLSPISNTVVSSPNSYQ--HSPPNSNLNQK

Query:  NHTNLI--------TTEEILDGSLLTLFNEA----------QEDKNPQNDTGKESFKKQLNEWLIENAFCAIKDTISTICPDIVILSKSKASCISSRFIK
         +T ++        T  E  + S    + +           ++ ++ + DT  E+FK QL  WL EN      DT S+                      
Subjt:  NHTNLI--------TTEEILDGSLLTLFNEA----------QEDKNPQNDTGKESFKKQLNEWLIENAFCAIKDTISTICPDIVILSKSKASCISSRFIK

Query:  SIWSSINIAWASLDASGSSGGIIILWDRLSIEAEEVICGTYTISVLFKLADNTHWWLTGVYGPVRRRKRKLLWQVLYALNGLCSPWWLLGGDFNIYRWSH
           +S N  ++ L    S+GGI+ILWD           G +++S  F   +N+ WWLTG+YGPV+RR+R  +W+ L+ L+ L S  W++GGD N+ R   
Subjt:  SIWSSINIAWASLDASGSSGGIIILWDRLSIEAEEVICGTYTISVLFKLADNTHWWLTGVYGPVRRRKRKLLWQVLYALNGLCSPWWLLGGDFNIYRWSH

Query:  ETSSANPHRAGMRHFNKFTYFVGLIDPSLENGPYTWSNLRHRLVLSRIDRFLFSQSWSEKPSFHNTRHLPRICSDHYPLLLESPTH--KWRPCPFRLENL
        E+++           N F     LIDP L N  YTWSNLR+    SR+DRFL++  W    + H TR LPR  SDH+PL+ E  T   +W P PFRL ++
Subjt:  ETSSANPHRAGMRHFNKFTYFVGLIDPSLENGPYTWSNLRHRLVLSRIDRFLFSQSWSEKPSFHNTRHLPRICSDHYPLLLESPTH--KWRPCPFRLENL

Query:  WIQDKNFQQLISTQWSSIPSHGYPGHAIIKKLYILAQNIKRWKKTR-PKFPYRKGTHHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLALVETRLWS
         + D  F++ +   W     +G+PG   I++L  LA  IK W+K +       K    + +  I++ E    +   + + R +LK +L  L+L E++ W 
Subjt:  WIQDKNFQQLISTQWSSIPSHGYPGHAIIKKLYILAQNIKRWKKTR-PKFPYRKGTHHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLALVETRLWS

Query:  QKYKKLWLLEGDENSVFFHKICTARRRRSFIL-------SQLGSDSSIEAEAIRHFKGIYHVNNDVGS-FITNLEWAPITLDQSNRLVQPFTD
        Q+ KKLWL EGDENS FFH+IC++R++R+ I        S   ++++I    + HF  IY  +      FI NLEW PI     + L  PF++
Subjt:  QKYKKLWLLEGDENSVFFHKICTARRRRSFIL-------SQLGSDSSIEAEAIRHFKGIYHVNNDVGS-FITNLEWAPITLDQSNRLVQPFTD

KAA0056838.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]8.1e-15635.14Show/hide
Query:  YKSFSMSLLLDTITWLHGSFRTLIAIPRTNRFFQERRYSDYCIWIEKTSN--RRACIAEIYRVDEKGHKCCIIVPEGYDKSGWKDFHNMITFNDVKP---
        +KSFSM +  DT+ W+   F+ L+    T  FF ERR  D C+W+ KT N  + +  AEI+R+D KG KC I+VPEG D  GWK F  +ITF    P   
Subjt:  YKSFSMSLLLDTITWLHGSFRTLIAIPRTNRFFQERRYSDYCIWIEKTSN--RRACIAEIYRVDEKGHKCCIIVPEGYDKSGWKDFHNMITFNDVKP---

Query:  --KQIMQPPKKPQSKAPSSSS----------VSDTSSRDS----QSTLGNSHSQRSVINQGEEDTLI--SWENTVVLTCWCFHYDWTKIMEALQEINESV
           +I + P    S + SS S          +SD+S  D+    ++T  +S S+RS     +  TL   S+E TV++T  CFH DW +IM +L++ +E  
Subjt:  --KQIMQPPKKPQSKAPSSSS----------VSDTSSRDS----QSTLGNSHSQRSVINQGEEDTLI--SWENTVVLTCWCFHYDWTKIMEALQEINESV

Query:  ATFNPFQADKAILHVWDPEQARLMCMNK---GWVTVGPFYVKFEHWVSEQHSTPKFIPSYGGWLRFRGIPLHLWNLKTSKHIGDACGGFELAGINTLRKI
         ++ PFQADKAIL + + + A+L+C NK   GW TVG + VKFE W S  HS    IPSYGGWLRFRGIPLHLWN  T +HIG ACGGF      T++  
Subjt:  ATFNPFQADKAILHVWDPEQARLMCMNK---GWVTVGPFYVKFEHWVSEQHSTPKFIPSYGGWLRFRGIPLHLWNLKTSKHIGDACGGFELAGINTLRKI

Query:  DIMEAAIKVKDNFCGFIPASIEIKDATGKIFIVNTVASAKKKWLIIRHPQIHGTFTREAAAQFDEFDPSSETYYFQGNSVIPPPIAAISNEMQISNTSVF
         +++A IKV+ N+ GF+PASI I D  G+ FIV TV  A+ +WL+ R+ ++HG+F  +AA +FD+ +  +ETY + G   IPP       +  I N+   
Subjt:  DIMEAAIKVKDNFCGFIPASIEIKDATGKIFIVNTVASAKKKWLIIRHPQIHGTFTREAAAQFDEFDPSSETYYFQGNSVIPPPIAAISNEMQISNTSVF

Query:  ISSSHIGNSSIHPVQSEQTQIRTKGKEICKERKASHPNPILPKSKKVIFPSQKNAPIISILTQLLDHTK----PPPPNSPLQYKPALSLKKRSPQIKLLK
          S H      +  +SE        +++   RK         K K ++  + +N    S  ++ + + K     P           ++ K +S +I  + 
Subjt:  ISSSHIGNSSIHPVQSEQTQIRTKGKEICKERKASHPNPILPKSKKVIFPSQKNAPIISILTQLLDHTK----PPPPNSPLQYKPALSLKKRSPQIKLLK

Query:  PTNQKTTFITQ-NKPTVTKAHHPTPIQTTNFRPITIHQTVNHYLDFYLTVDLGPLSPISNTVVSSPNSYQHSPPNSNLNQKNHTNLITTEEILDGSLLTL
           +K     Q  K  +T      P ++T    +++ +T        L+VD+GP+SP+ + + S  N   H     N NQ    N  +T+     +L   
Subjt:  PTNQKTTFITQ-NKPTVTKAHHPTPIQTTNFRPITIHQTVNHYLDFYLTVDLGPLSPISNTVVSSPNSYQHSPPNSNLNQKNHTNLITTEEILDGSLLTL

Query:  FNE-AQEDKNPQNDTGK---------------ESFKKQLNEWLIENAFCAIKDTISTICPDIVILSKSKASCISSRFIKSIWSSINIAWASLDASGSSGG
          E A ++K+    T +                +FK++L  WL EN              ++ +  K      SS     I S  N+  A     G  GG
Subjt:  FNE-AQEDKNPQNDTGK---------------ESFKKQLNEWLIENAFCAIKDTISTICPDIVILSKSKASCISSRFIKSIWSSINIAWASLDASGSSGG

Query:  IIILWDRLSIEAEEVICGTYTISVLFKLADNTHWWLTGVYGPVRRRKRKLLWQVLYALNGLCSPWWLLGGDFNIYRWSHETSSANPHRAGMRHFNKFTYF
        I++LWD    +  ++  G Y+IS L  L  N +WWLT VYGP +   R  LW  L  L  LC P WL+ GDFNI RW  ET++ +  +  M +FN F   
Subjt:  IIILWDRLSIEAEEVICGTYTISVLFKLADNTHWWLTGVYGPVRRRKRKLLWQVLYALNGLCSPWWLLGGDFNIYRWSHETSSANPHRAGMRHFNKFTYF

Query:  VGLIDPSLENGPYTWSNLRHRLVLSRIDRFLFSQSWSEKPSFHNTRHLPRICSDHYPLLLESPTHKWRPCPFRLENLWIQDKNFQQLISTQWSSIPSHGY
          LIDP L N  +TWSNLR     SR+DRFL S+ W      H +R L R  SDH+P+LLESP  KW PCPFRL N  ++DK+FQ+     W++    G+
Subjt:  VGLIDPSLENGPYTWSNLRHRLVLSRIDRFLFSQSWSEKPSFHNTRHLPRICSDHYPLLLESPTHKWRPCPFRLENLWIQDKNFQQLISTQWSSIPSHGY

Query:  PGHAIIKKLYILAQNIKRWKKTRPK-FPYRKGTHHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLALVETRLWSQKYKKLWLLEGDENSVFFHKICT
        PG+A I+ L  L++ IK W+  +   +   K    K I +I++ E   ++       R SLK+DL S+   + ++W Q+ ++ W L GDEN+ +FH+ICT
Subjt:  PGHAIIKKLYILAQNIKRWKKTRPK-FPYRKGTHHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLALVETRLWSQKYKKLWLLEGDENSVFFHKICT

Query:  ARRRRSFILS-------QLGSDSSIEAEAIRHFKGIYHVNNDVGSFITNL
          +R++ I S        L S   I    I HF+ IY   N     I NL
Subjt:  ARRRRSFILS-------QLGSDSSIEAEAIRHFKGIYHVNNDVGSFITNL

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.1e-16034.84Show/hide
Query:  YKSFSMSLLLDTITWLHGSFRTLIAIPRTNRFFQERRYSDYCIWIEKTSN--RRACIAEIYRVDEKGHKCCIIVPEGYDKSGWKDFHNMITFNDVKP---
        +KSFSM +  DT+ W+   F+ L+    T  FF ERR  D C+W+ KT N  + +  AEI+R+D KG KC I+VPEG D  GWK F  +ITF    P   
Subjt:  YKSFSMSLLLDTITWLHGSFRTLIAIPRTNRFFQERRYSDYCIWIEKTSN--RRACIAEIYRVDEKGHKCCIIVPEGYDKSGWKDFHNMITFNDVKP---

Query:  --KQIMQPPKKPQSKAPSSSS----------VSDTSSRDS----QSTLGNSHSQRSVINQGEEDTLI--SWENTVVLTCWCFHYDWTKIMEALQEINESV
           +I + P    S + SS S          +SD+S  D+    ++T  +S S+RS     +  TL   S+E TV++T  CFH DW +IM +L++ +E  
Subjt:  --KQIMQPPKKPQSKAPSSSS----------VSDTSSRDS----QSTLGNSHSQRSVINQGEEDTLI--SWENTVVLTCWCFHYDWTKIMEALQEINESV

Query:  ATFNPFQADKAILHVWDPEQARLMCMNK---GWVTVGPFYVKFEHWVSEQHSTPKFIPSYGGWLRFRGIPLHLWNLKTSKHIGDACGGFELAGINTLRKI
         ++ PFQADKAIL + +P+ A+L+C NK   GW TVG + VKFE W S  HS    IPSYGGWLRFRGIPLHLWN  T +HIG ACGGF      T++  
Subjt:  ATFNPFQADKAILHVWDPEQARLMCMNK---GWVTVGPFYVKFEHWVSEQHSTPKFIPSYGGWLRFRGIPLHLWNLKTSKHIGDACGGFELAGINTLRKI

Query:  DIMEAAIKVKDNFCGFIPASIEIKDATGKIFIVNTVASAKKKWLIIRHPQIHGTFTREAAAQFDEFDPSSETYYFQGNSVIPPPIAAISNEMQISNTSVF
         +++A IKV+ N+ GF+PASI I D  G+ FIV TV  A+ +WL+ R+ ++HG+F  +AA +FD+ +  +ETY + G   IPP       +  I N+   
Subjt:  DIMEAAIKVKDNFCGFIPASIEIKDATGKIFIVNTVASAKKKWLIIRHPQIHGTFTREAAAQFDEFDPSSETYYFQGNSVIPPPIAAISNEMQISNTSVF

Query:  ISSSHIGNSSIHPVQSEQTQIRTKGKEICKERKASHPNPILPKSKKVIFPSQKNAPIISILTQLLDHTK----PPPPNSPLQYKPALSLKKRSPQIKLLK
          S H      +  +SE        +++   RK         K K ++  + ++    S  ++ + + K     P           ++ K +S +I  + 
Subjt:  ISSSHIGNSSIHPVQSEQTQIRTKGKEICKERKASHPNPILPKSKKVIFPSQKNAPIISILTQLLDHTK----PPPPNSPLQYKPALSLKKRSPQIKLLK

Query:  PTNQKTTFITQ-NKPTVTKAHHPTPIQTTNFRPITIHQTVNHYLDFYLTVDLGPLSPISNTVVSSPN----SYQHSPPNSNLNQKNHT---NLITTEEIL
           +K     Q  K  +T      P ++T    +++ +T        L+VD+GP+SP+ + + S  N    ++ +  P+ N    +     NL  + +  
Subjt:  PTNQKTTFITQ-NKPTVTKAHHPTPIQTTNFRPITIHQTVNHYLDFYLTVDLGPLSPISNTVVSSPN----SYQHSPPNSNLNQKNHT---NLITTEEIL

Query:  DGSLLTLFNEAQEDKNPQNDTGKE-----SFKKQLNEWLIENAFCAIKDTISTICPDIVILSKSKASCISSRFIKSIWSSINIAWASLDASGSSGGIIIL
             +      E  +    TG E     +FK++L  WL EN              ++ +  K      SS +   I S  N+  A     G  GGI++L
Subjt:  DGSLLTLFNEAQEDKNPQNDTGKE-----SFKKQLNEWLIENAFCAIKDTISTICPDIVILSKSKASCISSRFIKSIWSSINIAWASLDASGSSGGIIIL

Query:  WDRLSIEAEEVICGTYTISVLFKLADNTHWWLTGVYGPVRRRKRKLLWQVLYALNGLCSPWWLLGGDFNIYRWSHETSSANPHRAGMRHFNKFTYFVGLI
        WD  + +  ++  G Y+IS L  L  N +WWLT VYGP +   R  LW  L  L  LC P WL+ GDFNI RW  ET++ +  +  M +FN F     LI
Subjt:  WDRLSIEAEEVICGTYTISVLFKLADNTHWWLTGVYGPVRRRKRKLLWQVLYALNGLCSPWWLLGGDFNIYRWSHETSSANPHRAGMRHFNKFTYFVGLI

Query:  DPSLENGPYTWSNLRHRLVLSRIDRFLFSQSWSEKPSFHNTRHLPRICSDHYPLLLESPTHKWRPCPFRLENLWIQDKNFQQLISTQWSSIPSHGYPGHA
        DP   N  +TWSNLR     SR+DRFL S+ W      H +R L R  SDH+P+LLESP  KW PCPFRL N  ++DK FQ+     W+S    G+PG+A
Subjt:  DPSLENGPYTWSNLRHRLVLSRIDRFLFSQSWSEKPSFHNTRHLPRICSDHYPLLLESPTHKWRPCPFRLENLWIQDKNFQQLISTQWSSIPSHGYPGHA

Query:  IIKKLYILAQNIKRWKKTRPK-FPYRKGTHHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLALVETRLWSQKYKKLWLLEGDENSVFFHKICTARRR
         I+ L  L++ IK W+  +   +   K    K I +I++ E   ++       R SLK+DL S+   + ++W Q+ ++ W L GDEN+ +FH+ICT  +R
Subjt:  IIKKLYILAQNIKRWKKTRPK-FPYRKGTHHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLALVETRLWSQKYKKLWLLEGDENSVFFHKICTARRR

Query:  RSFILS-------QLGSDSSIEAEAIRHFKGIYHVNNDVGSFITNLEWAPITLDQSNRLVQPFTD
        ++ I S        L S   I    I HF+ IY   +     I NL W PI+    + L +PF +
Subjt:  RSFILS-------QLGSDSSIEAEAIRHFKGIYHVNNDVGSFITNLEWAPITLDQSNRLVQPFTD

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.2e-16835.36Show/hide
Query:  GSYKSFSMSLLLDTITWLHGSFRTLIAIPRTNRFFQERRYSDYCIWIEKTSNRRACIAEIYRVDEKGHKCCIIVPEGYDKSGWKDFHNMIT---------
        G++K+FS+ +    + W+  + ++LIA P TNRFF E R S+  IWI KT N + C AEI+RVD+K  K CI+VPEG DKSGW  F +MIT         
Subjt:  GSYKSFSMSLLLDTITWLHGSFRTLIAIPRTNRFFQERRYSDYCIWIEKTSNRRACIAEIYRVDEKGHKCCIIVPEGYDKSGWKDFHNMIT---------

Query:  ----FNDVKPKQIMQPP----KKPQSKAPSSSSVSDTS-SRDSQSTLGNSHSQRSVINQGEEDTLISWENTVVLTCWCFHYDWTKIMEALQEINESVATF
                 P   + PP    K+  +KA +      TS S DS  +  +SHS  +         L+  ENTVV+    FH DW KI++ L++  E   T+
Subjt:  ----FNDVKPKQIMQPP----KKPQSKAPSSSSVSDTS-SRDSQSTLGNSHSQRSVINQGEEDTLISWENTVVLTCWCFHYDWTKIMEALQEINESVATF

Query:  NPFQADKAILHVWDPEQARLMCMNKGWVTVGPFYVKFEHWVSEQHSTPKFIPSYGGWLRFRGIPLHLWNLKTSKHIGDACGGFELAGINTLRKIDIMEAA
        N F A+KA++H      A L+C NKGW TVG + V+FE W    H+TPK IPSYGGW  FRGIPLHLWN+ T + IG AC G       T    +++EA 
Subjt:  NPFQADKAILHVWDPEQARLMCMNKGWVTVGPFYVKFEHWVSEQHSTPKFIPSYGGWLRFRGIPLHLWNLKTSKHIGDACGGFELAGINTLRKIDIMEAA

Query:  IKVKDNFCGFIPASIEIKDATGKIFIVNTVASAKKKWLIIRHPQIHGTFTREAAAQFDEFDPSSETYYFQGNSVIPPPIAAISNEMQISNT---------
        IKV+ N+ GF+PA++ I D  G  F V  V   + KWLI R+ ++HGTF R+AAA FD+F+P SE ++F+G+  I P   + S++ + S+T         
Subjt:  IKVKDNFCGFIPASIEIKDATGKIFIVNTVASAKKKWLIIRHPQIHGTFTREAAAQFDEFDPSSETYYFQGNSVIPPPIAAISNEMQISNT---------

Query:  -----------SVFISSSHIGNSSIHPVQSE----------QTQIRTKGKEICKERKASHPNPILPKSK-KVIFPSQKNAPIISILTQLLDHTKPPPPNS
                     F++   + +S++H   ++             +  KGK+    +   +    L KSK KV F S  N       T + +     P ++
Subjt:  -----------SVFISSSHIGNSSIHPVQSE----------QTQIRTKGKEICKERKASHPNPILPKSK-KVIFPSQKNAPIISILTQLLDHTKPPPPNS

Query:  PLQYKPALSLKKRSPQIKLLKPTNQKTTFITQNKPTVTKAHHPTPIQTTNFRPITIHQTVNHYLDFYLTVDLGPLS--------------------PISN
        P  + P+L+  ++  ++   +   +K++    N    +KA+    +  T    I  H          LTVDLG L                      I+N
Subjt:  PLQYKPALSLKKRSPQIKLLKPTNQKTTFITQNKPTVTKAHHPTPIQTTNFRPITIHQTVNHYLDFYLTVDLGPLS--------------------PISN

Query:  TVVSSPNSYQHSPPNSNLNQKNHTNLITTEEILDGSLLTLFNEAQEDKNPQNDTGKESFKKQLNEWLIENAFCAIKDTIS---TICPDIVILSKSKASCI
        T V         P N N N  +  N    + +        +   +E+K  + D   E+FKKQL  WL +N      DT S   T   ++++   +    I
Subjt:  TVVSSPNSYQHSPPNSNLNQKNHTNLITTEEILDGSLLTLFNEAQEDKNPQNDTGKESFKKQLNEWLIENAFCAIKDTIS---TICPDIVILSKSKASCI

Query:  SS-RFIKSIWSSINIAWASLDASGSSGGIIILWDRLSIEAEEVICGTYTISVLFKLADNTHWWLTGVYGPVRRRKRKLLWQVLYALNGLCSPWWLLGGDF
        ++ R IKS+W S +I W + +ASGSSGGI+ILWD  +        G +++S  F L +N+ WWLTG+YGPV+RR+R   W  L+ L  L S  W+LGGD 
Subjt:  SS-RFIKSIWSSINIAWASLDASGSSGGIIILWDRLSIEAEEVICGTYTISVLFKLADNTHWWLTGVYGPVRRRKRKLLWQVLYALNGLCSPWWLLGGDF

Query:  NIYRWSHETSSANPHRAGMRHFNKFTYFVGLIDPSLENGPYTWSNLRHRLVLSRIDRFLFSQSWSEKPSFHNTRHLPRICSDHYPLLLE--SPTHKWRPC
        N+ R   E++S        R  N F     LIDP L N  +TWSNLR+    SRIDRFL++ SW    S H TR LPR  SDH+PL+ E  +P   W P 
Subjt:  NIYRWSHETSSANPHRAGMRHFNKFTYFVGLIDPSLENGPYTWSNLRHRLVLSRIDRFLFSQSWSEKPSFHNTRHLPRICSDHYPLLLE--SPTHKWRPC

Query:  PFRLENLWIQDKNFQQLISTQWSSIPSHGYPGHAIIKKLYILAQNIKRWKKTR-PKFPYRKGTHHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLAL
        PFRL ++ + D  F++ +   W +    GYPG + I++L  LA  IK W+K +     Y K    + +  I++KE    + + + + R +LK DL  L+L
Subjt:  PFRLENLWIQDKNFQQLISTQWSSIPSHGYPGHAIIKKLYILAQNIKRWKKTR-PKFPYRKGTHHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLAL

Query:  VETRLWSQKYKKLWLLEGDENSVFFHKICTARRRRSFIL-------SQLGSDSSIEAEAIRHFKGIYHVNNDVGS-FITNLEWAPITLDQSNRLVQPFTD
         E++ W Q+ KKLWL EGDENS FFH+IC++R++RSFI        S   +++SI    I+ F  IY  +      FI NL+W PI   + + L  PF +
Subjt:  VETRLWSQKYKKLWLLEGDENSVFFHKICTARRRRSFIL-------SQLGSDSSIEAEAIRHFKGIYHVNNDVGS-FITNLEWAPITLDQSNRLVQPFTD

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.1e-16034.84Show/hide
Query:  YKSFSMSLLLDTITWLHGSFRTLIAIPRTNRFFQERRYSDYCIWIEKTSN--RRACIAEIYRVDEKGHKCCIIVPEGYDKSGWKDFHNMITFNDVKP---
        +KSFSM +  DT+ W+   F+ L+    T  FF ERR  D C+W+ KT N  + +  AEI+R+D KG KC I+VPEG D  GWK F  +ITF    P   
Subjt:  YKSFSMSLLLDTITWLHGSFRTLIAIPRTNRFFQERRYSDYCIWIEKTSN--RRACIAEIYRVDEKGHKCCIIVPEGYDKSGWKDFHNMITFNDVKP---

Query:  --KQIMQPPKKPQSKAPSSSS----------VSDTSSRDS----QSTLGNSHSQRSVINQGEEDTLI--SWENTVVLTCWCFHYDWTKIMEALQEINESV
           +I + P    S + SS S          +SD+S  D+    ++T  +S S+RS     +  TL   S+E TV++T  CFH DW +IM +L++ +E  
Subjt:  --KQIMQPPKKPQSKAPSSSS----------VSDTSSRDS----QSTLGNSHSQRSVINQGEEDTLI--SWENTVVLTCWCFHYDWTKIMEALQEINESV

Query:  ATFNPFQADKAILHVWDPEQARLMCMNK---GWVTVGPFYVKFEHWVSEQHSTPKFIPSYGGWLRFRGIPLHLWNLKTSKHIGDACGGFELAGINTLRKI
         ++ PFQADKAIL + +P+ A+L+C NK   GW TVG + VKFE W S  HS    IPSYGGWLRFRGIPLHLWN  T +HIG ACGGF      T++  
Subjt:  ATFNPFQADKAILHVWDPEQARLMCMNK---GWVTVGPFYVKFEHWVSEQHSTPKFIPSYGGWLRFRGIPLHLWNLKTSKHIGDACGGFELAGINTLRKI

Query:  DIMEAAIKVKDNFCGFIPASIEIKDATGKIFIVNTVASAKKKWLIIRHPQIHGTFTREAAAQFDEFDPSSETYYFQGNSVIPPPIAAISNEMQISNTSVF
         +++A IKV+ N+ GF+PASI I D  G+ FIV TV  A+ +WL+ R+ ++HG+F  +AA +FD+ +  +ETY + G   IPP       +  I N+   
Subjt:  DIMEAAIKVKDNFCGFIPASIEIKDATGKIFIVNTVASAKKKWLIIRHPQIHGTFTREAAAQFDEFDPSSETYYFQGNSVIPPPIAAISNEMQISNTSVF

Query:  ISSSHIGNSSIHPVQSEQTQIRTKGKEICKERKASHPNPILPKSKKVIFPSQKNAPIISILTQLLDHTK----PPPPNSPLQYKPALSLKKRSPQIKLLK
          S H      +  +SE        +++   RK         K K ++  + ++    S  ++ + + K     P           ++ K +S +I  + 
Subjt:  ISSSHIGNSSIHPVQSEQTQIRTKGKEICKERKASHPNPILPKSKKVIFPSQKNAPIISILTQLLDHTK----PPPPNSPLQYKPALSLKKRSPQIKLLK

Query:  PTNQKTTFITQ-NKPTVTKAHHPTPIQTTNFRPITIHQTVNHYLDFYLTVDLGPLSPISNTVVSSPN----SYQHSPPNSNLNQKNHT---NLITTEEIL
           +K     Q  K  +T      P ++T    +++ +T        L+VD+GP+SP+ + + S  N    ++ +  P+ N    +     NL  + +  
Subjt:  PTNQKTTFITQ-NKPTVTKAHHPTPIQTTNFRPITIHQTVNHYLDFYLTVDLGPLSPISNTVVSSPN----SYQHSPPNSNLNQKNHT---NLITTEEIL

Query:  DGSLLTLFNEAQEDKNPQNDTGKE-----SFKKQLNEWLIENAFCAIKDTISTICPDIVILSKSKASCISSRFIKSIWSSINIAWASLDASGSSGGIIIL
             +      E  +    TG E     +FK++L  WL EN              ++ +  K      SS +   I S  N+  A     G  GGI++L
Subjt:  DGSLLTLFNEAQEDKNPQNDTGKE-----SFKKQLNEWLIENAFCAIKDTISTICPDIVILSKSKASCISSRFIKSIWSSINIAWASLDASGSSGGIIIL

Query:  WDRLSIEAEEVICGTYTISVLFKLADNTHWWLTGVYGPVRRRKRKLLWQVLYALNGLCSPWWLLGGDFNIYRWSHETSSANPHRAGMRHFNKFTYFVGLI
        WD  + +  ++  G Y+IS L  L  N +WWLT VYGP +   R  LW  L  L  LC P WL+ GDFNI RW  ET++ +  +  M +FN F     LI
Subjt:  WDRLSIEAEEVICGTYTISVLFKLADNTHWWLTGVYGPVRRRKRKLLWQVLYALNGLCSPWWLLGGDFNIYRWSHETSSANPHRAGMRHFNKFTYFVGLI

Query:  DPSLENGPYTWSNLRHRLVLSRIDRFLFSQSWSEKPSFHNTRHLPRICSDHYPLLLESPTHKWRPCPFRLENLWIQDKNFQQLISTQWSSIPSHGYPGHA
        DP   N  +TWSNLR     SR+DRFL S+ W      H +R L R  SDH+P+LLESP  KW PCPFRL N  ++DK FQ+     W+S    G+PG+A
Subjt:  DPSLENGPYTWSNLRHRLVLSRIDRFLFSQSWSEKPSFHNTRHLPRICSDHYPLLLESPTHKWRPCPFRLENLWIQDKNFQQLISTQWSSIPSHGYPGHA

Query:  IIKKLYILAQNIKRWKKTRPK-FPYRKGTHHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLALVETRLWSQKYKKLWLLEGDENSVFFHKICTARRR
         I+ L  L++ IK W+  +   +   K    K I +I++ E   ++       R SLK+DL S+   + ++W Q+ ++ W L GDEN+ +FH+ICT  +R
Subjt:  IIKKLYILAQNIKRWKKTRPK-FPYRKGTHHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLALVETRLWSQKYKKLWLLEGDENSVFFHKICTARRR

Query:  RSFILS-------QLGSDSSIEAEAIRHFKGIYHVNNDVGSFITNLEWAPITLDQSNRLVQPFTD
        ++ I S        L S   I    I HF+ IY   +     I NL W PI+    + L +PF +
Subjt:  RSFILS-------QLGSDSSIEAEAIRHFKGIYHVNNDVGSFITNLEWAPITLDQSNRLVQPFTD

TrEMBL top hitse value%identityAlignment
A0A5A7TDG1 LINE-1 retrotransposable element ORF2 protein1.0e-14833.85Show/hide
Query:  GSYKSFSMSLLLDTITWLHGSFRTLIAIPRTNRFFQERRYSDYCIWIEKTSNRRACIAEIYRVDEKGHKCCIIVPEGYDKSGWKDFHNMITFN-DVK---
        G++K+FS+ +    + W+  + ++LI  P +NRFF E R  ++CIWI KT N + C AEI+RVD K  K CI+VPEG +KS W  F +MIT   +VK   
Subjt:  GSYKSFSMSLLLDTITWLHGSFRTLIAIPRTNRFFQERRYSDYCIWIEKTSNRRACIAEIYRVDEKGHKCCIIVPEGYDKSGWKDFHNMITFN-DVK---

Query:  ---------PKQIMQPP----KKPQSKAPSSSSVSDTSSRDSQSTLGNSHSQRSVINQGEEDTLISWENTVVLTCWCFHYDWTKIMEALQEINESVATFN
                 P+  + PP    K+  +KA S    S  SS  S S   +  SQ S  +  +    +  ENTVVL    FH DW KI++ L++  E   T+N
Subjt:  ---------PKQIMQPP----KKPQSKAPSSSSVSDTSSRDSQSTLGNSHSQRSVINQGEEDTLISWENTVVLTCWCFHYDWTKIMEALQEINESVATFN

Query:  PFQADKAILHVWDPEQARLMCMNKGWVTVGPFYVKFEHWVSEQHSTPKFIPSYGGWLRFRGIPLHLWNLKTSKHIGDACGGFELAGINTLRKIDIMEAAI
         F A+K ++H      A L+C NKGW TVG + V+FE W    H++PK IPSYGGW  FRGIPLHLWN+ T + IG ACGG       T    +++EA +
Subjt:  PFQADKAILHVWDPEQARLMCMNKGWVTVGPFYVKFEHWVSEQHSTPKFIPSYGGWLRFRGIPLHLWNLKTSKHIGDACGGFELAGINTLRKIDIMEAAI

Query:  KVKDNFCGFIPASIEIKDATGKIFIVNTVASAKKKWLIIRHPQIHGTFTREAAAQFDEFDPSSETYYFQGNSVIPPPI--------AAISNEMQISNTSV
        K++ N+ GF+PA ++I D  G  F+V  V  ++ KWL+ R+ ++HGTF R+AAA FD+F+P SE + F G   I P +         +IS E   +  SV
Subjt:  KVKDNFCGFIPASIEIKDATGKIFIVNTVASAKKKWLIIRHPQIHGTFTREAAAQFDEFDPSSETYYFQGNSVIPPPI--------AAISNEMQISNTSV

Query:  FISSSHIGNSSI---HPVQSEQTQIRTKGKEICKERKASHPNPILPKSK-KVIFPSQKNAPII-------------SILTQLLDHTKPPPPNSPLQYKPA
         I  +    S       V ++ +   T  K   K       +  L K K KV  PSQ  +  I             S  T   +    P  +SP + K  
Subjt:  FISSSHIGNSSI---HPVQSEQTQIRTKGKEICKERKASHPNPILPKSK-KVIFPSQKNAPII-------------SILTQLLDHTKPPPPNSPLQYKPA

Query:  ---LSLKKRSPQIKLLKPTNQ-KTTFITQNKPTVTKAHHPTPIQTTNFRPITIHQTVNHYLDFYLTVDLGPLSPISNTVVSSPNSYQ--HSPPNSNLNQK
            S+KK+S  I+     NQ K   ITQ            P+Q      +  H          LTVDLG L      V+    S++  HS  N+ +   
Subjt:  ---LSLKKRSPQIKLLKPTNQ-KTTFITQNKPTVTKAHHPTPIQTTNFRPITIHQTVNHYLDFYLTVDLGPLSPISNTVVSSPNSYQ--HSPPNSNLNQK

Query:  NHTNLI--------TTEEILDGSLLTLFNEA----------QEDKNPQNDTGKESFKKQLNEWLIENAFCAIKDTISTICPDIVILSKSKASCISSRFIK
         +T ++        T  E  + S    + +           ++ ++ + DT  E+FK QL  WL EN      DT S+                      
Subjt:  NHTNLI--------TTEEILDGSLLTLFNEA----------QEDKNPQNDTGKESFKKQLNEWLIENAFCAIKDTISTICPDIVILSKSKASCISSRFIK

Query:  SIWSSINIAWASLDASGSSGGIIILWDRLSIEAEEVICGTYTISVLFKLADNTHWWLTGVYGPVRRRKRKLLWQVLYALNGLCSPWWLLGGDFNIYRWSH
           +S N  ++ L    S+GGI+ILWD           G +++S  F   +N+ WWLTG+YGPV+RR+R  +W+ L+ L+ L S  W++GGD N+ R   
Subjt:  SIWSSINIAWASLDASGSSGGIIILWDRLSIEAEEVICGTYTISVLFKLADNTHWWLTGVYGPVRRRKRKLLWQVLYALNGLCSPWWLLGGDFNIYRWSH

Query:  ETSSANPHRAGMRHFNKFTYFVGLIDPSLENGPYTWSNLRHRLVLSRIDRFLFSQSWSEKPSFHNTRHLPRICSDHYPLLLESPTH--KWRPCPFRLENL
        E+++           N F     LIDP L N  YTWSNLR+    SR+DRFL++  W    + H TR LPR  SDH+PL+ E  T   +W P PFRL ++
Subjt:  ETSSANPHRAGMRHFNKFTYFVGLIDPSLENGPYTWSNLRHRLVLSRIDRFLFSQSWSEKPSFHNTRHLPRICSDHYPLLLESPTH--KWRPCPFRLENL

Query:  WIQDKNFQQLISTQWSSIPSHGYPGHAIIKKLYILAQNIKRWKKTR-PKFPYRKGTHHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLALVETRLWS
         + D  F++ +   W     +G+PG   I++L  LA  IK W+K +       K    + +  I++ E    +   + + R +LK +L  L+L E++ W 
Subjt:  WIQDKNFQQLISTQWSSIPSHGYPGHAIIKKLYILAQNIKRWKKTR-PKFPYRKGTHHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLALVETRLWS

Query:  QKYKKLWLLEGDENSVFFHKICTARRRRSFIL-------SQLGSDSSIEAEAIRHFKGIYHVNNDVGS-FITNLEWAPITLDQSNRLVQPFTD
        Q+ KKLWL EGDENS FFH+IC++R++R+ I        S   ++++I    + HF  IY  +      FI NLEW PI     + L  PF++
Subjt:  QKYKKLWLLEGDENSVFFHKICTARRRRSFIL-------SQLGSDSSIEAEAIRHFKGIYHVNNDVGS-FITNLEWAPITLDQSNRLVQPFTD

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein5.3e-16134.84Show/hide
Query:  YKSFSMSLLLDTITWLHGSFRTLIAIPRTNRFFQERRYSDYCIWIEKTSN--RRACIAEIYRVDEKGHKCCIIVPEGYDKSGWKDFHNMITFNDVKP---
        +KSFSM +  DT+ W+   F+ L+    T  FF ERR  D C+W+ KT N  + +  AEI+R+D KG KC I+VPEG D  GWK F  +ITF    P   
Subjt:  YKSFSMSLLLDTITWLHGSFRTLIAIPRTNRFFQERRYSDYCIWIEKTSN--RRACIAEIYRVDEKGHKCCIIVPEGYDKSGWKDFHNMITFNDVKP---

Query:  --KQIMQPPKKPQSKAPSSSS----------VSDTSSRDS----QSTLGNSHSQRSVINQGEEDTLI--SWENTVVLTCWCFHYDWTKIMEALQEINESV
           +I + P    S + SS S          +SD+S  D+    ++T  +S S+RS     +  TL   S+E TV++T  CFH DW +IM +L++ +E  
Subjt:  --KQIMQPPKKPQSKAPSSSS----------VSDTSSRDS----QSTLGNSHSQRSVINQGEEDTLI--SWENTVVLTCWCFHYDWTKIMEALQEINESV

Query:  ATFNPFQADKAILHVWDPEQARLMCMNK---GWVTVGPFYVKFEHWVSEQHSTPKFIPSYGGWLRFRGIPLHLWNLKTSKHIGDACGGFELAGINTLRKI
         ++ PFQADKAIL + +P+ A+L+C NK   GW TVG + VKFE W S  HS    IPSYGGWLRFRGIPLHLWN  T +HIG ACGGF      T++  
Subjt:  ATFNPFQADKAILHVWDPEQARLMCMNK---GWVTVGPFYVKFEHWVSEQHSTPKFIPSYGGWLRFRGIPLHLWNLKTSKHIGDACGGFELAGINTLRKI

Query:  DIMEAAIKVKDNFCGFIPASIEIKDATGKIFIVNTVASAKKKWLIIRHPQIHGTFTREAAAQFDEFDPSSETYYFQGNSVIPPPIAAISNEMQISNTSVF
         +++A IKV+ N+ GF+PASI I D  G+ FIV TV  A+ +WL+ R+ ++HG+F  +AA +FD+ +  +ETY + G   IPP       +  I N+   
Subjt:  DIMEAAIKVKDNFCGFIPASIEIKDATGKIFIVNTVASAKKKWLIIRHPQIHGTFTREAAAQFDEFDPSSETYYFQGNSVIPPPIAAISNEMQISNTSVF

Query:  ISSSHIGNSSIHPVQSEQTQIRTKGKEICKERKASHPNPILPKSKKVIFPSQKNAPIISILTQLLDHTK----PPPPNSPLQYKPALSLKKRSPQIKLLK
          S H      +  +SE        +++   RK         K K ++  + ++    S  ++ + + K     P           ++ K +S +I  + 
Subjt:  ISSSHIGNSSIHPVQSEQTQIRTKGKEICKERKASHPNPILPKSKKVIFPSQKNAPIISILTQLLDHTK----PPPPNSPLQYKPALSLKKRSPQIKLLK

Query:  PTNQKTTFITQ-NKPTVTKAHHPTPIQTTNFRPITIHQTVNHYLDFYLTVDLGPLSPISNTVVSSPN----SYQHSPPNSNLNQKNHT---NLITTEEIL
           +K     Q  K  +T      P ++T    +++ +T        L+VD+GP+SP+ + + S  N    ++ +  P+ N    +     NL  + +  
Subjt:  PTNQKTTFITQ-NKPTVTKAHHPTPIQTTNFRPITIHQTVNHYLDFYLTVDLGPLSPISNTVVSSPN----SYQHSPPNSNLNQKNHT---NLITTEEIL

Query:  DGSLLTLFNEAQEDKNPQNDTGKE-----SFKKQLNEWLIENAFCAIKDTISTICPDIVILSKSKASCISSRFIKSIWSSINIAWASLDASGSSGGIIIL
             +      E  +    TG E     +FK++L  WL EN              ++ +  K      SS +   I S  N+  A     G  GGI++L
Subjt:  DGSLLTLFNEAQEDKNPQNDTGKE-----SFKKQLNEWLIENAFCAIKDTISTICPDIVILSKSKASCISSRFIKSIWSSINIAWASLDASGSSGGIIIL

Query:  WDRLSIEAEEVICGTYTISVLFKLADNTHWWLTGVYGPVRRRKRKLLWQVLYALNGLCSPWWLLGGDFNIYRWSHETSSANPHRAGMRHFNKFTYFVGLI
        WD  + +  ++  G Y+IS L  L  N +WWLT VYGP +   R  LW  L  L  LC P WL+ GDFNI RW  ET++ +  +  M +FN F     LI
Subjt:  WDRLSIEAEEVICGTYTISVLFKLADNTHWWLTGVYGPVRRRKRKLLWQVLYALNGLCSPWWLLGGDFNIYRWSHETSSANPHRAGMRHFNKFTYFVGLI

Query:  DPSLENGPYTWSNLRHRLVLSRIDRFLFSQSWSEKPSFHNTRHLPRICSDHYPLLLESPTHKWRPCPFRLENLWIQDKNFQQLISTQWSSIPSHGYPGHA
        DP   N  +TWSNLR     SR+DRFL S+ W      H +R L R  SDH+P+LLESP  KW PCPFRL N  ++DK FQ+     W+S    G+PG+A
Subjt:  DPSLENGPYTWSNLRHRLVLSRIDRFLFSQSWSEKPSFHNTRHLPRICSDHYPLLLESPTHKWRPCPFRLENLWIQDKNFQQLISTQWSSIPSHGYPGHA

Query:  IIKKLYILAQNIKRWKKTRPK-FPYRKGTHHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLALVETRLWSQKYKKLWLLEGDENSVFFHKICTARRR
         I+ L  L++ IK W+  +   +   K    K I +I++ E   ++       R SLK+DL S+   + ++W Q+ ++ W L GDEN+ +FH+ICT  +R
Subjt:  IIKKLYILAQNIKRWKKTRPK-FPYRKGTHHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLALVETRLWSQKYKKLWLLEGDENSVFFHKICTARRR

Query:  RSFILS-------QLGSDSSIEAEAIRHFKGIYHVNNDVGSFITNLEWAPITLDQSNRLVQPFTD
        ++ I S        L S   I    I HF+ IY   +     I NL W PI+    + L +PF +
Subjt:  RSFILS-------QLGSDSSIEAEAIRHFKGIYHVNNDVGSFITNLEWAPITLDQSNRLVQPFTD

A0A5D3BKT8 LINE-1 retrotransposable element ORF2 protein3.9e-15635.14Show/hide
Query:  YKSFSMSLLLDTITWLHGSFRTLIAIPRTNRFFQERRYSDYCIWIEKTSN--RRACIAEIYRVDEKGHKCCIIVPEGYDKSGWKDFHNMITFNDVKP---
        +KSFSM +  DT+ W+   F+ L+    T  FF ERR  D C+W+ KT N  + +  AEI+R+D KG KC I+VPEG D  GWK F  +ITF    P   
Subjt:  YKSFSMSLLLDTITWLHGSFRTLIAIPRTNRFFQERRYSDYCIWIEKTSN--RRACIAEIYRVDEKGHKCCIIVPEGYDKSGWKDFHNMITFNDVKP---

Query:  --KQIMQPPKKPQSKAPSSSS----------VSDTSSRDS----QSTLGNSHSQRSVINQGEEDTLI--SWENTVVLTCWCFHYDWTKIMEALQEINESV
           +I + P    S + SS S          +SD+S  D+    ++T  +S S+RS     +  TL   S+E TV++T  CFH DW +IM +L++ +E  
Subjt:  --KQIMQPPKKPQSKAPSSSS----------VSDTSSRDS----QSTLGNSHSQRSVINQGEEDTLI--SWENTVVLTCWCFHYDWTKIMEALQEINESV

Query:  ATFNPFQADKAILHVWDPEQARLMCMNK---GWVTVGPFYVKFEHWVSEQHSTPKFIPSYGGWLRFRGIPLHLWNLKTSKHIGDACGGFELAGINTLRKI
         ++ PFQADKAIL + + + A+L+C NK   GW TVG + VKFE W S  HS    IPSYGGWLRFRGIPLHLWN  T +HIG ACGGF      T++  
Subjt:  ATFNPFQADKAILHVWDPEQARLMCMNK---GWVTVGPFYVKFEHWVSEQHSTPKFIPSYGGWLRFRGIPLHLWNLKTSKHIGDACGGFELAGINTLRKI

Query:  DIMEAAIKVKDNFCGFIPASIEIKDATGKIFIVNTVASAKKKWLIIRHPQIHGTFTREAAAQFDEFDPSSETYYFQGNSVIPPPIAAISNEMQISNTSVF
         +++A IKV+ N+ GF+PASI I D  G+ FIV TV  A+ +WL+ R+ ++HG+F  +AA +FD+ +  +ETY + G   IPP       +  I N+   
Subjt:  DIMEAAIKVKDNFCGFIPASIEIKDATGKIFIVNTVASAKKKWLIIRHPQIHGTFTREAAAQFDEFDPSSETYYFQGNSVIPPPIAAISNEMQISNTSVF

Query:  ISSSHIGNSSIHPVQSEQTQIRTKGKEICKERKASHPNPILPKSKKVIFPSQKNAPIISILTQLLDHTK----PPPPNSPLQYKPALSLKKRSPQIKLLK
          S H      +  +SE        +++   RK         K K ++  + +N    S  ++ + + K     P           ++ K +S +I  + 
Subjt:  ISSSHIGNSSIHPVQSEQTQIRTKGKEICKERKASHPNPILPKSKKVIFPSQKNAPIISILTQLLDHTK----PPPPNSPLQYKPALSLKKRSPQIKLLK

Query:  PTNQKTTFITQ-NKPTVTKAHHPTPIQTTNFRPITIHQTVNHYLDFYLTVDLGPLSPISNTVVSSPNSYQHSPPNSNLNQKNHTNLITTEEILDGSLLTL
           +K     Q  K  +T      P ++T    +++ +T        L+VD+GP+SP+ + + S  N   H     N NQ    N  +T+     +L   
Subjt:  PTNQKTTFITQ-NKPTVTKAHHPTPIQTTNFRPITIHQTVNHYLDFYLTVDLGPLSPISNTVVSSPNSYQHSPPNSNLNQKNHTNLITTEEILDGSLLTL

Query:  FNE-AQEDKNPQNDTGK---------------ESFKKQLNEWLIENAFCAIKDTISTICPDIVILSKSKASCISSRFIKSIWSSINIAWASLDASGSSGG
          E A ++K+    T +                +FK++L  WL EN              ++ +  K      SS     I S  N+  A     G  GG
Subjt:  FNE-AQEDKNPQNDTGK---------------ESFKKQLNEWLIENAFCAIKDTISTICPDIVILSKSKASCISSRFIKSIWSSINIAWASLDASGSSGG

Query:  IIILWDRLSIEAEEVICGTYTISVLFKLADNTHWWLTGVYGPVRRRKRKLLWQVLYALNGLCSPWWLLGGDFNIYRWSHETSSANPHRAGMRHFNKFTYF
        I++LWD    +  ++  G Y+IS L  L  N +WWLT VYGP +   R  LW  L  L  LC P WL+ GDFNI RW  ET++ +  +  M +FN F   
Subjt:  IIILWDRLSIEAEEVICGTYTISVLFKLADNTHWWLTGVYGPVRRRKRKLLWQVLYALNGLCSPWWLLGGDFNIYRWSHETSSANPHRAGMRHFNKFTYF

Query:  VGLIDPSLENGPYTWSNLRHRLVLSRIDRFLFSQSWSEKPSFHNTRHLPRICSDHYPLLLESPTHKWRPCPFRLENLWIQDKNFQQLISTQWSSIPSHGY
          LIDP L N  +TWSNLR     SR+DRFL S+ W      H +R L R  SDH+P+LLESP  KW PCPFRL N  ++DK+FQ+     W++    G+
Subjt:  VGLIDPSLENGPYTWSNLRHRLVLSRIDRFLFSQSWSEKPSFHNTRHLPRICSDHYPLLLESPTHKWRPCPFRLENLWIQDKNFQQLISTQWSSIPSHGY

Query:  PGHAIIKKLYILAQNIKRWKKTRPK-FPYRKGTHHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLALVETRLWSQKYKKLWLLEGDENSVFFHKICT
        PG+A I+ L  L++ IK W+  +   +   K    K I +I++ E   ++       R SLK+DL S+   + ++W Q+ ++ W L GDEN+ +FH+ICT
Subjt:  PGHAIIKKLYILAQNIKRWKKTRPK-FPYRKGTHHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLALVETRLWSQKYKKLWLLEGDENSVFFHKICT

Query:  ARRRRSFILS-------QLGSDSSIEAEAIRHFKGIYHVNNDVGSFITNL
          +R++ I S        L S   I    I HF+ IY   N     I NL
Subjt:  ARRRRSFILS-------QLGSDSSIEAEAIRHFKGIYHVNNDVGSFITNL

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein1.5e-16835.36Show/hide
Query:  GSYKSFSMSLLLDTITWLHGSFRTLIAIPRTNRFFQERRYSDYCIWIEKTSNRRACIAEIYRVDEKGHKCCIIVPEGYDKSGWKDFHNMIT---------
        G++K+FS+ +    + W+  + ++LIA P TNRFF E R S+  IWI KT N + C AEI+RVD+K  K CI+VPEG DKSGW  F +MIT         
Subjt:  GSYKSFSMSLLLDTITWLHGSFRTLIAIPRTNRFFQERRYSDYCIWIEKTSNRRACIAEIYRVDEKGHKCCIIVPEGYDKSGWKDFHNMIT---------

Query:  ----FNDVKPKQIMQPP----KKPQSKAPSSSSVSDTS-SRDSQSTLGNSHSQRSVINQGEEDTLISWENTVVLTCWCFHYDWTKIMEALQEINESVATF
                 P   + PP    K+  +KA +      TS S DS  +  +SHS  +         L+  ENTVV+    FH DW KI++ L++  E   T+
Subjt:  ----FNDVKPKQIMQPP----KKPQSKAPSSSSVSDTS-SRDSQSTLGNSHSQRSVINQGEEDTLISWENTVVLTCWCFHYDWTKIMEALQEINESVATF

Query:  NPFQADKAILHVWDPEQARLMCMNKGWVTVGPFYVKFEHWVSEQHSTPKFIPSYGGWLRFRGIPLHLWNLKTSKHIGDACGGFELAGINTLRKIDIMEAA
        N F A+KA++H      A L+C NKGW TVG + V+FE W    H+TPK IPSYGGW  FRGIPLHLWN+ T + IG AC G       T    +++EA 
Subjt:  NPFQADKAILHVWDPEQARLMCMNKGWVTVGPFYVKFEHWVSEQHSTPKFIPSYGGWLRFRGIPLHLWNLKTSKHIGDACGGFELAGINTLRKIDIMEAA

Query:  IKVKDNFCGFIPASIEIKDATGKIFIVNTVASAKKKWLIIRHPQIHGTFTREAAAQFDEFDPSSETYYFQGNSVIPPPIAAISNEMQISNT---------
        IKV+ N+ GF+PA++ I D  G  F V  V   + KWLI R+ ++HGTF R+AAA FD+F+P SE ++F+G+  I P   + S++ + S+T         
Subjt:  IKVKDNFCGFIPASIEIKDATGKIFIVNTVASAKKKWLIIRHPQIHGTFTREAAAQFDEFDPSSETYYFQGNSVIPPPIAAISNEMQISNT---------

Query:  -----------SVFISSSHIGNSSIHPVQSE----------QTQIRTKGKEICKERKASHPNPILPKSK-KVIFPSQKNAPIISILTQLLDHTKPPPPNS
                     F++   + +S++H   ++             +  KGK+    +   +    L KSK KV F S  N       T + +     P ++
Subjt:  -----------SVFISSSHIGNSSIHPVQSE----------QTQIRTKGKEICKERKASHPNPILPKSK-KVIFPSQKNAPIISILTQLLDHTKPPPPNS

Query:  PLQYKPALSLKKRSPQIKLLKPTNQKTTFITQNKPTVTKAHHPTPIQTTNFRPITIHQTVNHYLDFYLTVDLGPLS--------------------PISN
        P  + P+L+  ++  ++   +   +K++    N    +KA+    +  T    I  H          LTVDLG L                      I+N
Subjt:  PLQYKPALSLKKRSPQIKLLKPTNQKTTFITQNKPTVTKAHHPTPIQTTNFRPITIHQTVNHYLDFYLTVDLGPLS--------------------PISN

Query:  TVVSSPNSYQHSPPNSNLNQKNHTNLITTEEILDGSLLTLFNEAQEDKNPQNDTGKESFKKQLNEWLIENAFCAIKDTIS---TICPDIVILSKSKASCI
        T V         P N N N  +  N    + +        +   +E+K  + D   E+FKKQL  WL +N      DT S   T   ++++   +    I
Subjt:  TVVSSPNSYQHSPPNSNLNQKNHTNLITTEEILDGSLLTLFNEAQEDKNPQNDTGKESFKKQLNEWLIENAFCAIKDTIS---TICPDIVILSKSKASCI

Query:  SS-RFIKSIWSSINIAWASLDASGSSGGIIILWDRLSIEAEEVICGTYTISVLFKLADNTHWWLTGVYGPVRRRKRKLLWQVLYALNGLCSPWWLLGGDF
        ++ R IKS+W S +I W + +ASGSSGGI+ILWD  +        G +++S  F L +N+ WWLTG+YGPV+RR+R   W  L+ L  L S  W+LGGD 
Subjt:  SS-RFIKSIWSSINIAWASLDASGSSGGIIILWDRLSIEAEEVICGTYTISVLFKLADNTHWWLTGVYGPVRRRKRKLLWQVLYALNGLCSPWWLLGGDF

Query:  NIYRWSHETSSANPHRAGMRHFNKFTYFVGLIDPSLENGPYTWSNLRHRLVLSRIDRFLFSQSWSEKPSFHNTRHLPRICSDHYPLLLE--SPTHKWRPC
        N+ R   E++S        R  N F     LIDP L N  +TWSNLR+    SRIDRFL++ SW    S H TR LPR  SDH+PL+ E  +P   W P 
Subjt:  NIYRWSHETSSANPHRAGMRHFNKFTYFVGLIDPSLENGPYTWSNLRHRLVLSRIDRFLFSQSWSEKPSFHNTRHLPRICSDHYPLLLE--SPTHKWRPC

Query:  PFRLENLWIQDKNFQQLISTQWSSIPSHGYPGHAIIKKLYILAQNIKRWKKTR-PKFPYRKGTHHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLAL
        PFRL ++ + D  F++ +   W +    GYPG + I++L  LA  IK W+K +     Y K    + +  I++KE    + + + + R +LK DL  L+L
Subjt:  PFRLENLWIQDKNFQQLISTQWSSIPSHGYPGHAIIKKLYILAQNIKRWKKTR-PKFPYRKGTHHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLAL

Query:  VETRLWSQKYKKLWLLEGDENSVFFHKICTARRRRSFIL-------SQLGSDSSIEAEAIRHFKGIYHVNNDVGS-FITNLEWAPITLDQSNRLVQPFTD
         E++ W Q+ KKLWL EGDENS FFH+IC++R++RSFI        S   +++SI    I+ F  IY  +      FI NL+W PI   + + L  PF +
Subjt:  VETRLWSQKYKKLWLLEGDENSVFFHKICTARRRRSFIL-------SQLGSDSSIEAEAIRHFKGIYHVNNDVGS-FITNLEWAPITLDQSNRLVQPFTD

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein5.3e-16134.84Show/hide
Query:  YKSFSMSLLLDTITWLHGSFRTLIAIPRTNRFFQERRYSDYCIWIEKTSN--RRACIAEIYRVDEKGHKCCIIVPEGYDKSGWKDFHNMITFNDVKP---
        +KSFSM +  DT+ W+   F+ L+    T  FF ERR  D C+W+ KT N  + +  AEI+R+D KG KC I+VPEG D  GWK F  +ITF    P   
Subjt:  YKSFSMSLLLDTITWLHGSFRTLIAIPRTNRFFQERRYSDYCIWIEKTSN--RRACIAEIYRVDEKGHKCCIIVPEGYDKSGWKDFHNMITFNDVKP---

Query:  --KQIMQPPKKPQSKAPSSSS----------VSDTSSRDS----QSTLGNSHSQRSVINQGEEDTLI--SWENTVVLTCWCFHYDWTKIMEALQEINESV
           +I + P    S + SS S          +SD+S  D+    ++T  +S S+RS     +  TL   S+E TV++T  CFH DW +IM +L++ +E  
Subjt:  --KQIMQPPKKPQSKAPSSSS----------VSDTSSRDS----QSTLGNSHSQRSVINQGEEDTLI--SWENTVVLTCWCFHYDWTKIMEALQEINESV

Query:  ATFNPFQADKAILHVWDPEQARLMCMNK---GWVTVGPFYVKFEHWVSEQHSTPKFIPSYGGWLRFRGIPLHLWNLKTSKHIGDACGGFELAGINTLRKI
         ++ PFQADKAIL + +P+ A+L+C NK   GW TVG + VKFE W S  HS    IPSYGGWLRFRGIPLHLWN  T +HIG ACGGF      T++  
Subjt:  ATFNPFQADKAILHVWDPEQARLMCMNK---GWVTVGPFYVKFEHWVSEQHSTPKFIPSYGGWLRFRGIPLHLWNLKTSKHIGDACGGFELAGINTLRKI

Query:  DIMEAAIKVKDNFCGFIPASIEIKDATGKIFIVNTVASAKKKWLIIRHPQIHGTFTREAAAQFDEFDPSSETYYFQGNSVIPPPIAAISNEMQISNTSVF
         +++A IKV+ N+ GF+PASI I D  G+ FIV TV  A+ +WL+ R+ ++HG+F  +AA +FD+ +  +ETY + G   IPP       +  I N+   
Subjt:  DIMEAAIKVKDNFCGFIPASIEIKDATGKIFIVNTVASAKKKWLIIRHPQIHGTFTREAAAQFDEFDPSSETYYFQGNSVIPPPIAAISNEMQISNTSVF

Query:  ISSSHIGNSSIHPVQSEQTQIRTKGKEICKERKASHPNPILPKSKKVIFPSQKNAPIISILTQLLDHTK----PPPPNSPLQYKPALSLKKRSPQIKLLK
          S H      +  +SE        +++   RK         K K ++  + ++    S  ++ + + K     P           ++ K +S +I  + 
Subjt:  ISSSHIGNSSIHPVQSEQTQIRTKGKEICKERKASHPNPILPKSKKVIFPSQKNAPIISILTQLLDHTK----PPPPNSPLQYKPALSLKKRSPQIKLLK

Query:  PTNQKTTFITQ-NKPTVTKAHHPTPIQTTNFRPITIHQTVNHYLDFYLTVDLGPLSPISNTVVSSPN----SYQHSPPNSNLNQKNHT---NLITTEEIL
           +K     Q  K  +T      P ++T    +++ +T        L+VD+GP+SP+ + + S  N    ++ +  P+ N    +     NL  + +  
Subjt:  PTNQKTTFITQ-NKPTVTKAHHPTPIQTTNFRPITIHQTVNHYLDFYLTVDLGPLSPISNTVVSSPN----SYQHSPPNSNLNQKNHT---NLITTEEIL

Query:  DGSLLTLFNEAQEDKNPQNDTGKE-----SFKKQLNEWLIENAFCAIKDTISTICPDIVILSKSKASCISSRFIKSIWSSINIAWASLDASGSSGGIIIL
             +      E  +    TG E     +FK++L  WL EN              ++ +  K      SS +   I S  N+  A     G  GGI++L
Subjt:  DGSLLTLFNEAQEDKNPQNDTGKE-----SFKKQLNEWLIENAFCAIKDTISTICPDIVILSKSKASCISSRFIKSIWSSINIAWASLDASGSSGGIIIL

Query:  WDRLSIEAEEVICGTYTISVLFKLADNTHWWLTGVYGPVRRRKRKLLWQVLYALNGLCSPWWLLGGDFNIYRWSHETSSANPHRAGMRHFNKFTYFVGLI
        WD  + +  ++  G Y+IS L  L  N +WWLT VYGP +   R  LW  L  L  LC P WL+ GDFNI RW  ET++ +  +  M +FN F     LI
Subjt:  WDRLSIEAEEVICGTYTISVLFKLADNTHWWLTGVYGPVRRRKRKLLWQVLYALNGLCSPWWLLGGDFNIYRWSHETSSANPHRAGMRHFNKFTYFVGLI

Query:  DPSLENGPYTWSNLRHRLVLSRIDRFLFSQSWSEKPSFHNTRHLPRICSDHYPLLLESPTHKWRPCPFRLENLWIQDKNFQQLISTQWSSIPSHGYPGHA
        DP   N  +TWSNLR     SR+DRFL S+ W      H +R L R  SDH+P+LLESP  KW PCPFRL N  ++DK FQ+     W+S    G+PG+A
Subjt:  DPSLENGPYTWSNLRHRLVLSRIDRFLFSQSWSEKPSFHNTRHLPRICSDHYPLLLESPTHKWRPCPFRLENLWIQDKNFQQLISTQWSSIPSHGYPGHA

Query:  IIKKLYILAQNIKRWKKTRPK-FPYRKGTHHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLALVETRLWSQKYKKLWLLEGDENSVFFHKICTARRR
         I+ L  L++ IK W+  +   +   K    K I +I++ E   ++       R SLK+DL S+   + ++W Q+ ++ W L GDEN+ +FH+ICT  +R
Subjt:  IIKKLYILAQNIKRWKKTRPK-FPYRKGTHHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLALVETRLWSQKYKKLWLLEGDENSVFFHKICTARRR

Query:  RSFILS-------QLGSDSSIEAEAIRHFKGIYHVNNDVGSFITNLEWAPITLDQSNRLVQPFTD
        ++ I S        L S   I    I HF+ IY   +     I NL W PI+    + L +PF +
Subjt:  RSFILS-------QLGSDSSIEAEAIRHFKGIYHVNNDVGSFITNLEWAPITLDQSNRLVQPFTD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.9e-0623.83Show/hide
Query:  GMRHFNKFTYFVGLIDPSLENGPYTWSNLR-HRLVLSRIDRFLFSQSW-SEKPSFHNTRHLPRICSDHYP--LLLESPTHKWRPCPFRLENLWIQDKNFQ
        G+  F        L+D       YTWSN +    ++ ++DR + +  W S  PS      L  + SDH P  ++LE+   + + C FR  +       F 
Subjt:  GMRHFNKFTYFVGLIDPSLENGPYTWSNLR-HRLVLSRIDRFLFSQSW-SEKPSFHNTRHLPRICSDHYP--LLLESPTHKWRPCPFRLENLWIQDKNFQ

Query:  QLISTQW-SSIPSHGYPGHAIIKKLYILAQNIKRWKKTRPKFPYRKG---THHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLALVETRLWS-----
          ++  W   IP        +   ++ L +++K  KK   K   R+G     HK       KE  + ++     L T+  + L  +  V  + W+     
Subjt:  QLISTQW-SSIPSHGYPGHAIIKKLYILAQNIKRWKKTRPKFPYRKG---THHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLALVETRLWS-----

Query:  ------QKYKKLWLLEGDENSVFFHKICTARRRRSFILSQLGSDSSIEAEAIRHFK
              QK +  WL +GD N+ FFHK+  A + ++ ++  L  D  +  E +   K
Subjt:  ------QKYKKLWLLEGDENSVFFHKICTARRRRSFILSQLGSDSSIEAEAIRHFK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATCGGATCCTACAAATCTTTTTCCATGTCGCTGCTACTGGACACCATAACCTGGCTTCATGGATCTTTCCGAACACTCATAGCTATCCCGAGAACAAACCGCTTCTTCCA
AGAGAGGAGATACAGTGACTACTGTATATGGATTGAAAAAACGTCGAATCGAAGAGCTTGCATAGCAGAAATTTATCGAGTAGACGAAAAAGGCCACAAATGCTGTATTA
TAGTTCCAGAAGGTTACGACAAAAGTGGCTGGAAAGATTTTCATAATATGATTACTTTTAACGATGTGAAACCAAAACAGATCATGCAACCCCCCAAAAAGCCACAAAGC
AAGGCCCCATCAAGCTCCTCGGTGAGTGACACATCATCCCGCGACTCTCAATCCACCTTGGGTAACTCTCATTCTCAACGTTCTGTGATTAATCAAGGAGAAGAAGACAC
CCTCATATCATGGGAAAACACGGTAGTTCTCACCTGTTGGTGTTTTCACTATGACTGGACTAAAATAATGGAAGCTTTGCAGGAGATAAATGAATCTGTGGCAACTTTCA
ACCCATTCCAAGCTGACAAAGCTATACTCCATGTTTGGGACCCAGAACAAGCAAGACTAATGTGCATGAATAAGGGATGGGTAACAGTAGGTCCCTTCTATGTCAAATTC
GAACACTGGGTTTCAGAACAACACTCAACCCCTAAATTTATTCCAAGCTATGGTGGATGGTTGAGATTTAGAGGTATCCCCCTTCATTTATGGAACCTAAAAACTTCGAA
ACATATTGGGGATGCATGTGGGGGATTTGAGCTTGCCGGAATAAACACCCTGAGAAAAATAGATATCATGGAAGCTGCCATTAAAGTCAAAGATAATTTTTGTGGTTTCA
TCCCAGCCTCCATTGAAATCAAAGATGCCACCGGGAAAATTTTCATCGTGAATACTGTAGCTTCAGCGAAGAAGAAATGGCTAATTATCAGACACCCACAAATTCATGGA
ACTTTCACGAGAGAGGCAGCTGCTCAATTCGATGAATTCGATCCTTCGTCAGAAACTTACTATTTTCAAGGTAACTCTGTCATCCCTCCTCCCATAGCTGCTATATCAAA
TGAAATGCAGATCTCGAATACTTCCGTTTTTATTTCATCTTCTCACATTGGAAACAGCAGCATCCATCCAGTCCAATCTGAACAAACCCAAATAAGAACCAAAGGGAAGG
AAATCTGCAAAGAAAGGAAAGCGTCTCATCCGAACCCAATTTTACCTAAATCAAAGAAAGTCATTTTCCCATCCCAAAAAAATGCACCTATCATTTCAATCCTGACACAA
CTCCTCGACCATACAAAACCACCGCCCCCCAATAGTCCACTCCAATACAAGCCAGCCCTATCACTCAAAAAGAGGTCACCACAAATAAAGTTGTTGAAGCCCACAAACCA
AAAGACCACCTTCATCACCCAAAACAAGCCCACAGTTACCAAAGCCCACCATCCCACACCCATCCAAACCACAAACTTTAGACCCATAACCATACACCAAACAGTTAACC
ATTATCTAGATTTCTATCTCACGGTGGACCTTGGCCCATTATCGCCCATCTCTAATACCGTTGTGTCTAGCCCGAATTCATATCAACATAGCCCACCAAACTCAAACCTC
AACCAGAAAAACCATACAAACCTCATTACCACAGAAGAAATCTTGGACGGCAGCCTTCTAACATTATTTAATGAAGCACAGGAAGACAAAAATCCTCAAAATGACACAGG
GAAAGAGAGTTTCAAAAAACAGTTAAACGAATGGCTGATTGAAAATGCCTTCTGCGCCATAAAGGATACTATTTCTACTATCTGCCCCGACATTGTTATCCTATCGAAAT
CGAAAGCTTCCTGTATCTCTAGTCGCTTTATTAAATCTATATGGAGCTCCATAAACATAGCTTGGGCCTCTCTTGATGCATCGGGTTCATCTGGTGGCATTATAATCCTT
TGGGATAGACTCTCTATTGAGGCTGAGGAAGTCATTTGTGGCACATACACGATTTCTGTCCTTTTCAAACTTGCTGACAACACACATTGGTGGTTAACAGGGGTCTACGG
GCCTGTTCGACGAAGGAAACGAAAACTTTTATGGCAAGTGCTTTATGCTCTAAATGGGCTATGTAGTCCTTGGTGGTTATTAGGGGGCGATTTCAATATCTATAGATGGA
GCCATGAAACATCCTCGGCAAACCCCCATCGAGCTGGTATGAGACATTTTAACAAGTTTACTTATTTTGTAGGGCTTATCGATCCTAGTCTCGAAAATGGGCCTTATACA
TGGTCAAATCTCCGCCATAGACTAGTTCTCTCGCGCATTGACCGTTTTCTCTTCTCACAAAGCTGGTCTGAAAAGCCCTCCTTCCATAACACCAGACACCTCCCACGAAT
TTGTTCGGATCACTACCCTCTTCTCCTTGAATCCCCGACTCATAAATGGAGACCTTGCCCATTTCGATTAGAAAATCTATGGATTCAAGATAAAAATTTCCAGCAGCTCA
TATCTACTCAATGGAGTTCAATCCCCTCCCATGGATATCCTGGTCATGCCATTATTAAAAAGCTTTATATTTTGGCTCAGAACATAAAAAGGTGGAAAAAGACTCGGCCC
AAATTTCCCTATAGAAAAGGCACGCATCACAAAAGAATTTCTCTAATTAATCAAAAAGAGGAATCCAATCAAATTGATGAAGCTGACTGCCATCTACGAACATCCTTAAA
AAATGATTTGCAGTCGTTAGCTCTAGTCGAGACAAGACTATGGAGCCAAAAATACAAGAAGCTATGGTTACTCGAGGGGGACGAAAACTCAGTTTTTTTCCACAAAATTT
GCACTGCTCGAAGGAGGCGTAGTTTTATATTGAGTCAGCTTGGTTCTGACTCTTCTATTGAGGCTGAAGCAATTAGACACTTCAAAGGAATTTATCACGTGAATAATGAT
GTGGGTTCTTTTATCACAAATCTTGAATGGGCGCCTATAACTTTGGATCAATCTAACAGGCTAGTGCAACCATTCACAGAT
mRNA sequenceShow/hide mRNA sequence
ATCGGATCCTACAAATCTTTTTCCATGTCGCTGCTACTGGACACCATAACCTGGCTTCATGGATCTTTCCGAACACTCATAGCTATCCCGAGAACAAACCGCTTCTTCCA
AGAGAGGAGATACAGTGACTACTGTATATGGATTGAAAAAACGTCGAATCGAAGAGCTTGCATAGCAGAAATTTATCGAGTAGACGAAAAAGGCCACAAATGCTGTATTA
TAGTTCCAGAAGGTTACGACAAAAGTGGCTGGAAAGATTTTCATAATATGATTACTTTTAACGATGTGAAACCAAAACAGATCATGCAACCCCCCAAAAAGCCACAAAGC
AAGGCCCCATCAAGCTCCTCGGTGAGTGACACATCATCCCGCGACTCTCAATCCACCTTGGGTAACTCTCATTCTCAACGTTCTGTGATTAATCAAGGAGAAGAAGACAC
CCTCATATCATGGGAAAACACGGTAGTTCTCACCTGTTGGTGTTTTCACTATGACTGGACTAAAATAATGGAAGCTTTGCAGGAGATAAATGAATCTGTGGCAACTTTCA
ACCCATTCCAAGCTGACAAAGCTATACTCCATGTTTGGGACCCAGAACAAGCAAGACTAATGTGCATGAATAAGGGATGGGTAACAGTAGGTCCCTTCTATGTCAAATTC
GAACACTGGGTTTCAGAACAACACTCAACCCCTAAATTTATTCCAAGCTATGGTGGATGGTTGAGATTTAGAGGTATCCCCCTTCATTTATGGAACCTAAAAACTTCGAA
ACATATTGGGGATGCATGTGGGGGATTTGAGCTTGCCGGAATAAACACCCTGAGAAAAATAGATATCATGGAAGCTGCCATTAAAGTCAAAGATAATTTTTGTGGTTTCA
TCCCAGCCTCCATTGAAATCAAAGATGCCACCGGGAAAATTTTCATCGTGAATACTGTAGCTTCAGCGAAGAAGAAATGGCTAATTATCAGACACCCACAAATTCATGGA
ACTTTCACGAGAGAGGCAGCTGCTCAATTCGATGAATTCGATCCTTCGTCAGAAACTTACTATTTTCAAGGTAACTCTGTCATCCCTCCTCCCATAGCTGCTATATCAAA
TGAAATGCAGATCTCGAATACTTCCGTTTTTATTTCATCTTCTCACATTGGAAACAGCAGCATCCATCCAGTCCAATCTGAACAAACCCAAATAAGAACCAAAGGGAAGG
AAATCTGCAAAGAAAGGAAAGCGTCTCATCCGAACCCAATTTTACCTAAATCAAAGAAAGTCATTTTCCCATCCCAAAAAAATGCACCTATCATTTCAATCCTGACACAA
CTCCTCGACCATACAAAACCACCGCCCCCCAATAGTCCACTCCAATACAAGCCAGCCCTATCACTCAAAAAGAGGTCACCACAAATAAAGTTGTTGAAGCCCACAAACCA
AAAGACCACCTTCATCACCCAAAACAAGCCCACAGTTACCAAAGCCCACCATCCCACACCCATCCAAACCACAAACTTTAGACCCATAACCATACACCAAACAGTTAACC
ATTATCTAGATTTCTATCTCACGGTGGACCTTGGCCCATTATCGCCCATCTCTAATACCGTTGTGTCTAGCCCGAATTCATATCAACATAGCCCACCAAACTCAAACCTC
AACCAGAAAAACCATACAAACCTCATTACCACAGAAGAAATCTTGGACGGCAGCCTTCTAACATTATTTAATGAAGCACAGGAAGACAAAAATCCTCAAAATGACACAGG
GAAAGAGAGTTTCAAAAAACAGTTAAACGAATGGCTGATTGAAAATGCCTTCTGCGCCATAAAGGATACTATTTCTACTATCTGCCCCGACATTGTTATCCTATCGAAAT
CGAAAGCTTCCTGTATCTCTAGTCGCTTTATTAAATCTATATGGAGCTCCATAAACATAGCTTGGGCCTCTCTTGATGCATCGGGTTCATCTGGTGGCATTATAATCCTT
TGGGATAGACTCTCTATTGAGGCTGAGGAAGTCATTTGTGGCACATACACGATTTCTGTCCTTTTCAAACTTGCTGACAACACACATTGGTGGTTAACAGGGGTCTACGG
GCCTGTTCGACGAAGGAAACGAAAACTTTTATGGCAAGTGCTTTATGCTCTAAATGGGCTATGTAGTCCTTGGTGGTTATTAGGGGGCGATTTCAATATCTATAGATGGA
GCCATGAAACATCCTCGGCAAACCCCCATCGAGCTGGTATGAGACATTTTAACAAGTTTACTTATTTTGTAGGGCTTATCGATCCTAGTCTCGAAAATGGGCCTTATACA
TGGTCAAATCTCCGCCATAGACTAGTTCTCTCGCGCATTGACCGTTTTCTCTTCTCACAAAGCTGGTCTGAAAAGCCCTCCTTCCATAACACCAGACACCTCCCACGAAT
TTGTTCGGATCACTACCCTCTTCTCCTTGAATCCCCGACTCATAAATGGAGACCTTGCCCATTTCGATTAGAAAATCTATGGATTCAAGATAAAAATTTCCAGCAGCTCA
TATCTACTCAATGGAGTTCAATCCCCTCCCATGGATATCCTGGTCATGCCATTATTAAAAAGCTTTATATTTTGGCTCAGAACATAAAAAGGTGGAAAAAGACTCGGCCC
AAATTTCCCTATAGAAAAGGCACGCATCACAAAAGAATTTCTCTAATTAATCAAAAAGAGGAATCCAATCAAATTGATGAAGCTGACTGCCATCTACGAACATCCTTAAA
AAATGATTTGCAGTCGTTAGCTCTAGTCGAGACAAGACTATGGAGCCAAAAATACAAGAAGCTATGGTTACTCGAGGGGGACGAAAACTCAGTTTTTTTCCACAAAATTT
GCACTGCTCGAAGGAGGCGTAGTTTTATATTGAGTCAGCTTGGTTCTGACTCTTCTATTGAGGCTGAAGCAATTAGACACTTCAAAGGAATTTATCACGTGAATAATGAT
GTGGGTTCTTTTATCACAAATCTTGAATGGGCGCCTATAACTTTGGATCAATCTAACAGGCTAGTGCAACCATTCACAGAT
Protein sequenceShow/hide protein sequence
IGSYKSFSMSLLLDTITWLHGSFRTLIAIPRTNRFFQERRYSDYCIWIEKTSNRRACIAEIYRVDEKGHKCCIIVPEGYDKSGWKDFHNMITFNDVKPKQIMQPPKKPQS
KAPSSSSVSDTSSRDSQSTLGNSHSQRSVINQGEEDTLISWENTVVLTCWCFHYDWTKIMEALQEINESVATFNPFQADKAILHVWDPEQARLMCMNKGWVTVGPFYVKF
EHWVSEQHSTPKFIPSYGGWLRFRGIPLHLWNLKTSKHIGDACGGFELAGINTLRKIDIMEAAIKVKDNFCGFIPASIEIKDATGKIFIVNTVASAKKKWLIIRHPQIHG
TFTREAAAQFDEFDPSSETYYFQGNSVIPPPIAAISNEMQISNTSVFISSSHIGNSSIHPVQSEQTQIRTKGKEICKERKASHPNPILPKSKKVIFPSQKNAPIISILTQ
LLDHTKPPPPNSPLQYKPALSLKKRSPQIKLLKPTNQKTTFITQNKPTVTKAHHPTPIQTTNFRPITIHQTVNHYLDFYLTVDLGPLSPISNTVVSSPNSYQHSPPNSNL
NQKNHTNLITTEEILDGSLLTLFNEAQEDKNPQNDTGKESFKKQLNEWLIENAFCAIKDTISTICPDIVILSKSKASCISSRFIKSIWSSINIAWASLDASGSSGGIIIL
WDRLSIEAEEVICGTYTISVLFKLADNTHWWLTGVYGPVRRRKRKLLWQVLYALNGLCSPWWLLGGDFNIYRWSHETSSANPHRAGMRHFNKFTYFVGLIDPSLENGPYT
WSNLRHRLVLSRIDRFLFSQSWSEKPSFHNTRHLPRICSDHYPLLLESPTHKWRPCPFRLENLWIQDKNFQQLISTQWSSIPSHGYPGHAIIKKLYILAQNIKRWKKTRP
KFPYRKGTHHKRISLINQKEESNQIDEADCHLRTSLKNDLQSLALVETRLWSQKYKKLWLLEGDENSVFFHKICTARRRRSFILSQLGSDSSIEAEAIRHFKGIYHVNND
VGSFITNLEWAPITLDQSNRLVQPFTD