; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg029895 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg029895
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold6:8803085..8806183
RNA-Seq ExpressionSpg029895
SyntenySpg029895
Gene Ontology termsNA
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044449.1 hypothetical protein E6C27_scaffold46G001820 [Cucumis melo var. makuwa]1.1e-13536.32Show/hide
Query:  PESVEWLKVTFQSLLSNPRNNRFFQEKRYEDYCLWVDKTSNRRGNIAEVFRIDDKGRKCCVMVPEGYDRNGWNSFLSMFIFKEAKAKYVRPEAKNSRPYR
        P+S+EWLK+TF++LL+ PR  RFF EKRY D+CLWV    NRRG IAE++R+DD+GRKCC++VPEG D+ GW  F  M   K+   K   P    +R Y 
Subjt:  PESVEWLKVTFQSLLSNPRNNRFFQEKRYEDYCLWVDKTSNRRGNIAEVFRIDDKGRKCCVMVPEGYDRNGWNSFLSMFIFKEAKAKYVRPEAKNSRPYR

Query:  SQD------------STSSDNSRRSYVEVLSSKSHSNTETIPSVQEKDFNLSRSRTIAVRIEEDEFSWNNIVVLSRRCFHDDWVKIFNKLKELNDYAAS-
        +QD            ST S++ R++Y E +SS S S      S  E D + ++S +                 L RRCFHDDW KI ++L++  D   S 
Subjt:  SQD------------STSSDNSRRSYVEVLSSKSHSNTETIPSVQEKDFNLSRSRTIAVRIEEDEFSWNNIVVLSRRCFHDDWVKIFNKLKELNDYAAS-

Query:  --YNPFHANKALLFMRNEEQV----------------------------------------------MHV-------------GGFVEAAKHTWRKLDLI
          Y PFHA+KALLF++++E                                                +H+             GGF++AA  +  KL+L 
Subjt:  --YNPFHANKALLFMRNEEQV----------------------------------------------MHV-------------GGFVEAAKHTWRKLDLI

Query:  EATIKIKDNFTGFIPASVRIVDEKEITFIVHTVAPMAGKWLVCRRPKIHGSFTREAARDFDEYDVKSEAYTFRENSAC----TVDQSSASIIQQSRIQTK
        EA IK+K+N+TGF+PA ++I DE+   FI+ TV    GKWL  R P IHGSFT+ AA +F+E++  +E YTFR N A      + +SS    +Q     K
Subjt:  EATIKIKDNFTGFIPASVRIVDEKEITFIVHTVAPMAGKWLVCRRPKIHGSFTREAARDFDEYDVKSEAYTFRENSAC----TVDQSSASIIQQSRIQTK

Query:  -----------------------------PKDLFDSSDH-PPLQSKEKGKKIYTSDEENLI-----KKTVSFTSPKNQTVFYNPSYAPAKPL---TPLG-
                                       +     DH   +  K+KGK+I   +    I     K+ VSF+SPKN+T  ++   AP K L   +P+  
Subjt:  -----------------------------PKDLFDSSDH-PPLQSKEKGKKIYTSDEENLI-----KKTVSFTSPKNQTVFYNPSYAPAKPL---TPLG-

Query:  ---QNPTQRPNQQQKNTGKEVYQPKRYTGIRIQEPIPHMGQTQIFKEN-----EEFTLSVDLGPLSPISDTLISSPEKTPSPLEKRPSESPSDIMHDSYK
           +  ++RP Q  K   K+VY+ K  +  R         QT   K+       EF L VDLG +S +SDT  S PE +PS +    S + SDI+ DS  
Subjt:  ---QNPTQRPNQQQKNTGKEVYQPKRYTGIRIQEPIPHMGQTQIFKEN-----EEFTLSVDLGPLSPISDTLISSPEKTPSPLEKRPSESPSDIMHDSYK

Query:  FLIP-QDESEEEGDDNNEGEDAGNTINDPKQTGNASQEESFKRQLNDWLTENNFCLIPIKSIPNCNSRKSVEDDNIATSRRASGNNNFVILSETKLASIS
         ++    E  E+    N  E+                E SFKR+L DWL ENN  L                         A  N+ F  ++  ++ SI 
Subjt:  FLIP-QDESEEEGDDNNEGEDAGNTINDPKQTGNASQEESFKRQLNDWLTENNFCLIPIKSIPNCNSRKSVEDDNIATSRRASGNNNFVILSETKLASIS

Query:  KRLIKSMWSSIRINWMALESTGSSGGILNMWDDLNFSVSDFVKGQFSLSIKISLADGFSWWLSGIYGPASRKYRGLLWRELYDLYGLCNDCWLLGGDFNV
                           +   + G+ N+ +D+       + G FS+SI++   +G SWWLS IYGPA RK R L W EL +L  +C   W+LGGDFNV
Subjt:  KRLIKSMWSSIRINWMALESTGSSGGILNMWDDLNFSVSDFVKGQFSLSIKISLADGFSWWLSGIYGPASRKYRGLLWRELYDLYGLCNDCWLLGGDFNV

Query:  FRWSSETSSINPAKKSMRKFNNFISSTGLIDPPLLNGSYTRSNLRNSPIRSRLDRFLYSYNWALKFGDHTSKRLSRITSDHFPILLDSPSVSWGPCPFRF
         RW  ETS+ NPA  SM++FN FIS+  LIDPPL N  +T SNLR     SRLDRFL+S +W   F  HTSK L+R TSDHFPI+L+S S+SWGP PFRF
Subjt:  FRWSSETSSINPAKKSMRKFNNFISSTGLIDPPLLNGSYTRSNLRNSPIRSRLDRFLYSYNWALKFGDHTSKRLSRITSDHFPILLDSPSVSWGPCPFRF

Query:  DNYLLKNKSFLDKIEHWWISSSQDGHPGYSFIRHLRQLA
         N  LK+  +   IE WW ++SQ G+ GYSF+  L+QLA
Subjt:  DNYLLKNKSFLDKIEHWWISSSQDGHPGYSFIRHLRQLA

KAA0056838.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.3e-13932.65Show/hide
Query:  PESVEWLKVTFQSLLSNPRNNRFFQEKRYEDYCLWVDKTSNRRGN--IAEVFRIDDKGRKCCVMVPEGYDRNGWNSFLSMFIFK-EAKAKYVRPEAKNSR
        P+++ W++  F+ LL       FF E+R ED C+WV KT N+      AE+FRID+KGRKC ++VPEG D  GW  FL++  F+  A  K +R E +   
Subjt:  PESVEWLKVTFQSLLSNPRNNRFFQEKRYEDYCLWVDKTSNRRGN--IAEVFRIDDKGRKCCVMVPEGYDRNGWNSFLSMFIFK-EAKAKYVRPEAKNSR

Query:  PYRSQDSTS--SDNSRRSYVEVLSSKSHSNTETIPSVQEKDFNLSRSRTIAVR-IEEDEFSWNNIVVLSRRCFHDDWVKIFNKLKELNDYAASYNPFHAN
             DS S  SD+SR+SY +VLS  S  + +        D +  RS +I  +       S+   V+++RRCFHDDW +I   L++ ++ A SY PF A+
Subjt:  PYRSQDSTS--SDNSRRSYVEVLSSKSHSNTETIPSVQEKDFNLSRSRTIAVR-IEEDEFSWNNIVVLSRRCFHDDWVKIFNKLKELNDYAASYNPFHAN

Query:  KALLFMR---------------------------------------------------------NEEQVMHV----GGFVEAAKHTWRKLDLIEATIKIK
        KA+LF+                                                          N     H+    GGF++ AK T +   LI+A IK++
Subjt:  KALLFMR---------------------------------------------------------NEEQVMHV----GGFVEAAKHTWRKLDLIEATIKIK

Query:  DNFTGFIPASVRIVDEKEITFIVHTVAPMAGKWLVCRRPKIHGSFTREAARDFDEYDVKSEAYTFR--------------ENSACTVDQSSASIIQQSRI
         N+TGF+PAS+ I D +   FIV TV P   +WLV R  ++HGSF  +AA +FD+++  +E YT+               + S    D+ S S   Q++ 
Subjt:  DNFTGFIPASVRIVDEKEITFIVHTVAPMAGKWLVCRRPKIHGSFTREAARDFDEYDVKSEAYTFR--------------ENSACTVDQSSASIIQQSRI

Query:  QTKPKDLFDSSDHP-PLQSKEKGKKI----------YTSDEENLIKKTVSFTSP----KNQTVFYNPSYAPAKPLTPLGQNPTQRPNQQQKNTGKEVYQP
            +  +D  D     + KEKGK I          Y+   + +  + VSF SP     N +     +   +  ++ +     +R + +QK   K  Y+ 
Subjt:  QTKPKDLFDSSDHP-PLQSKEKGKKI----------YTSDEENLIKKTVSFTSP----KNQTVFYNPSYAPAKPLTPLGQNPTQRPNQQQKNTGKEVYQP

Query:  KRYTGIRIQEPIPHMGQTQIFKENEEFTLSVDLGPLSPISDTLISSPEKTPSPLEKRPSE--SPSDIMHDSYKFLIPQDESEEEGDDNNEGEDAGNTIND
        K+      ++   ++   +  + +++  LSVD+GP+SP+   + S        L  +  +  S S    ++    +   E  ++    +     GN+  D
Subjt:  KRYTGIRIQEPIPHMGQTQIFKENEEFTLSVDLGPLSPISDTLISSPEKTPSPLEKRPSE--SPSDIMHDSYKFLIPQDESEEEGDDNNEGEDAGNTIND

Query:  PKQTGNASQEESFKRQLNDWLTENNFCLIP--IKSIPNCNSRKSVEDDNIATSRRASGNNNFVILSETKLASISKRLIKSMWSSIRINWMALESTGSSGG
         K       + +FK +L  WL EN   L P     +P+ +S   +  D            N  I     L                         G  GG
Subjt:  PKQTGNASQEESFKRQLNDWLTENNFCLIP--IKSIPNCNSRKSVEDDNIATSRRASGNNNFVILSETKLASISKRLIKSMWSSIRINWMALESTGSSGG

Query:  ILNMWDDLNFSVSDFVKGQFSLSIKISLADGFSWWLSGIYGPASRKYRGLLWRELYDLYGLCNDCWLLGGDFNVFRWSSETSSINPAKKSMRKFNNFISS
        IL +WDD  F V+D   G +S+S+ I   +G +WWL+ +YGP     R  LW EL  L  LC   WL+ GDFN+ RW  ET++ +  K++M  FNNFIS 
Subjt:  ILNMWDDLNFSVSDFVKGQFSLSIKISLADGFSWWLSGIYGPASRKYRGLLWRELYDLYGLCNDCWLLGGDFNVFRWSSETSSINPAKKSMRKFNNFISS

Query:  TGLIDPPLLNGSYTRSNLRNSPIRSRLDRFLYSYNWALKFGDHTSKRLSRITSDHFPILLDSPSVSWGPCPFRFDNYLLKNKSFLDKIEHWWISSSQDGH
          LIDPPLLN ++T SNLR +P  SRLDRFL S  W   FG HTS+ L R  SDHFPILL+SP + WGPCPFR +N  L++K F     +WW +S Q G 
Subjt:  TGLIDPPLLNGSYTRSNLRNSPIRSRLDRFLYSYNWALKFGDHTSKRLSRITSDHFPILLDSPSVSWGPCPFRFDNYLLKNKSFLDKIEHWWISSSQDGH

Query:  PGYSFIRHLRQLAKSVKAWKKTNMENLKSRKKSIADEIDRIDSLEALGPIDDTLRQQRVALKTDILEVALQEARYWSQRCKKIWLSKGDENTSFFHKMCT
        PGY+FI+ L  L+K +K W+   +    + KK++  EID ID LE  G +  T  Q+R++LK+D+L +   +A+ W QR ++ W   GDEN S+FH++CT
Subjt:  PGYSFIRHLRQLAKSVKAWKKTNMENLKSRKKSIADEIDRIDSLEALGPIDDTLRQQRVALKTDILEVALQEARYWSQRCKKIWLSKGDENTSFFHKMCT

Query:  ARRRRNHITELMSKDGSSIASDD
          +R+N I  +    G+S+ S D
Subjt:  ARRRRNHITELMSKDGSSIASDD

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.4e-13832.65Show/hide
Query:  PESVEWLKVTFQSLLSNPRNNRFFQEKRYEDYCLWVDKTSNRRGN--IAEVFRIDDKGRKCCVMVPEGYDRNGWNSFLSMFIFK-EAKAKYVRPEAKNSR
        P+++ W++  F+ LL       FF E+R ED C+WV KT N+      AE+FRID+KGRKC ++VPEG D  GW SFL++  F+  A  K +R E +   
Subjt:  PESVEWLKVTFQSLLSNPRNNRFFQEKRYEDYCLWVDKTSNRRGN--IAEVFRIDDKGRKCCVMVPEGYDRNGWNSFLSMFIFK-EAKAKYVRPEAKNSR

Query:  PYRSQDSTS--SDNSRRSYVEVLSSKSHSNTETIPSVQEKDFNLSRSRTIAVR-IEEDEFSWNNIVVLSRRCFHDDWVKIFNKLKELNDYAASYNPFHAN
             DS S  SD+SR+SY +VLS  S  + +        D +  RS +I  +       S+   V+++RRCFHDDW +I   L++ ++ A SY PF A+
Subjt:  PYRSQDSTS--SDNSRRSYVEVLSSKSHSNTETIPSVQEKDFNLSRSRTIAVR-IEEDEFSWNNIVVLSRRCFHDDWVKIFNKLKELNDYAASYNPFHAN

Query:  KALLFMR---------------------------------------------------------NEEQVMHV----GGFVEAAKHTWRKLDLIEATIKIK
        KA+LF+                                                          N     H+    GGF++ AK T +   LI+A IK++
Subjt:  KALLFMR---------------------------------------------------------NEEQVMHV----GGFVEAAKHTWRKLDLIEATIKIK

Query:  DNFTGFIPASVRIVDEKEITFIVHTVAPMAGKWLVCRRPKIHGSFTREAARDFDEYDVKSEAYTFR--------------ENSACTVDQSSASIIQQSRI
         N+ GF+PAS+ I D +   FIV TV P   +WLV R  ++HGSF  +AA +FD+++  +E YT+               + S    D+ S S   Q++ 
Subjt:  DNFTGFIPASVRIVDEKEITFIVHTVAPMAGKWLVCRRPKIHGSFTREAARDFDEYDVKSEAYTFR--------------ENSACTVDQSSASIIQQSRI

Query:  QTKPKDLFDSSDHP-PLQSKEKGKKI----------YTSDEENLIKKTVSFTSP----KNQTVFYNPSYAPAKPLTPLGQNPTQRPNQQQKNTGKEVYQP
            +  +D  D     + KEKGK I          Y+   + +  + VSF SP     N +     +   +  ++ +     +R + +QK   K  Y+ 
Subjt:  QTKPKDLFDSSDHP-PLQSKEKGKKI----------YTSDEENLIKKTVSFTSP----KNQTVFYNPSYAPAKPLTPLGQNPTQRPNQQQKNTGKEVYQP

Query:  KRYTGIRIQEPIPHMGQTQIFKENEEFTLSVDLGPLSPISDTLISSPEKTPSPLEKRPSE--SPSDIMHDSYKFLIPQDESEEEGDDNNEGEDAGNTIND
        K+      ++    + +T   + +++  LSVD+GP+SP+   + S           +  +  S S    ++    +   E  ++    +     GN+  D
Subjt:  KRYTGIRIQEPIPHMGQTQIFKENEEFTLSVDLGPLSPISDTLISSPEKTPSPLEKRPSE--SPSDIMHDSYKFLIPQDESEEEGDDNNEGEDAGNTIND

Query:  PKQTGNASQEESFKRQLNDWLTENNFCLIP--IKSIPNCNSRKSVEDDNIATSRRASGNNNFVILSETKLASISKRLIKSMWSSIRINWMALESTGSSGG
         K       + +FK +L  WL EN   L P     +P+ +    +  D            N  I     L                         G  GG
Subjt:  PKQTGNASQEESFKRQLNDWLTENNFCLIP--IKSIPNCNSRKSVEDDNIATSRRASGNNNFVILSETKLASISKRLIKSMWSSIRINWMALESTGSSGG

Query:  ILNMWDDLNFSVSDFVKGQFSLSIKISLADGFSWWLSGIYGPASRKYRGLLWRELYDLYGLCNDCWLLGGDFNVFRWSSETSSINPAKKSMRKFNNFISS
        IL +WDD NF V+D   G +S+S+ I   +G +WWL+ +YGP     R  LW EL  L  LC   WL+ GDFN+ RW  ET++ +  K++M  FNNFIS 
Subjt:  ILNMWDDLNFSVSDFVKGQFSLSIKISLADGFSWWLSGIYGPASRKYRGLLWRELYDLYGLCNDCWLLGGDFNVFRWSSETSSINPAKKSMRKFNNFISS

Query:  TGLIDPPLLNGSYTRSNLRNSPIRSRLDRFLYSYNWALKFGDHTSKRLSRITSDHFPILLDSPSVSWGPCPFRFDNYLLKNKSFLDKIEHWWISSSQDGH
          LIDPP LN ++T SNLR +P  SRLDRFL S  W   FG HTS+ L R  SDHFPILL+SP + WGPCPFR +N  L++K F     +WW SS Q G 
Subjt:  TGLIDPPLLNGSYTRSNLRNSPIRSRLDRFLYSYNWALKFGDHTSKRLSRITSDHFPILLDSPSVSWGPCPFRFDNYLLKNKSFLDKIEHWWISSSQDGH

Query:  PGYSFIRHLRQLAKSVKAWKKTNMENLKSRKKSIADEIDRIDSLEALGPIDDTLRQQRVALKTDILEVALQEARYWSQRCKKIWLSKGDENTSFFHKMCT
        PGY+FI+ L  L+K +K W+   +    + KK++  EID ID LE  G +  T  Q+R++LK+D+L +   +A+ W QR ++ W   GDEN S+FH++CT
Subjt:  PGYSFIRHLRQLAKSVKAWKKTNMENLKSRKKSIADEIDRIDSLEALGPIDDTLRQQRVALKTDILEVALQEARYWSQRCKKIWLSKGDENTSFFHKMCT

Query:  ARRRRNHITELMSKDGSSIASDD
          +R+N I  +    G+S+ S D
Subjt:  ARRRRNHITELMSKDGSSIASDD

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]4.9e-13932.89Show/hide
Query:  PESVEWLKVTFQSLLSNPRNNRFFQEKRYEDYCLWVDKTSNRRGNIAEVFRIDDKGRKCCVMVPEGYDRNGWNSFLSMFIFK-EAKAKYVRPEAKNSRPY
        P  ++W++ T +SL++ P  NRFF E R  +  +W+ KT N +G  AE+FR+D K RK C++VPEG D++GW SFLSM   K E KAK  RP        
Subjt:  PESVEWLKVTFQSLLSNPRNNRFFQEKRYEDYCLWVDKTSNRRGNIAEVFRIDDKGRKCCVMVPEGYDRNGWNSFLSMFIFK-EAKAKYVRPEAKNSRPY

Query:  RSQDSTSSDNSRRSYVE-VLSSKSHSNTETIPSVQEKDFNLSRSRTIAVRIEEDEFSWNNIVVLSRRCFHDDWVKIFNKLKELNDYAASYNPFHANKAL-
          + S   D  +RSY + V   +  + +++  S    D + S S +       D     N VV+ RR FHDDW KI   L++  + + +YN FHA KAL 
Subjt:  RSQDSTSSDNSRRSYVE-VLSSKSHSNTETIPSVQEKDFNLSRSRTIAVRIEEDEFSWNNIVVLSRRCFHDDWVKIFNKLKELNDYAASYNPFHANKAL-

Query:  ------------------------------------------------------LFMRNEEQVMHVG----GFVEAAKHTWRKLDLIEATIKIKDNFTGF
                                                              L + N      +G    G ++ A+ T    +LIEA IK++ N++GF
Subjt:  ------------------------------------------------------LFMRNEEQVMHVG----GFVEAAKHTWRKLDLIEATIKIKDNFTGF

Query:  IPASVRIVDEKEITFIVHTVAPMAGKWLVCRRPKIHGSFTREAARDFDEYDVKSEAYTFRENSA---------------CTVDQSSA---SIIQQSRIQT
        +PA+VRI D +   F V  V    GKWL+ R  ++HG+F R+AA  FD+++ +SE + F  + A                T DQ SA    II+  R  T
Subjt:  IPASVRIVDEKEITFIVHTVAPMAGKWLVCRRPKIHGSFTREAARDFDEYDVKSEAYTFRENSA---------------CTVDQSSA---SIIQQSRIQT

Query:  KPKDL-----FDSSDHPPLQSK--------------EKGK-----KIYTSDEENL--IKKTVSFTSPKNQTVFYNPSYAPAKPLTPLGQNPTQRPNQQQK
         P  L      DS+ H                    +KGK     ++  +   NL   K+ VSF SP N+T  +NP  APA     L     ++   +++
Subjt:  KPKDL-----FDSSDHPPLQSK--------------EKGK-----KIYTSDEENL--IKKTVSFTSPKNQTVFYNPSYAPAKPLTPLGQNPTQRPNQQQK

Query:  NTGKE--VYQP----KRYTGIRIQEPIPHMGQTQIFKENEEFTLSVDLGPLSPISDTLISSPEKTPSPLEKRPSESPSDIMHDSYKFLIPQDESEEEGDD
        +  K+    QP     +  G+ I +PI  +   +     +  +L+VDLG L  +       P K+   LE   +   ++++  +   ++P+    +   +
Subjt:  NTGKE--VYQP----KRYTGIRIQEPIPHMGQTQIFKENEEFTLSVDLGPLSPISDTLISSPEKTPSPLEKRPSESPSDIMHDSYKFLIPQDESEEEGDD

Query:  NNEGEDAGNTINDPKQTGNAS-------------QEESFKRQLNDWLTENNFCLIPIKSIPNCNSRKSVEDDNIATSRRASGNNNFVILSETKLASISKR
         N    +      PK                     E+FK+QL  WL +N      +K   + +S  +    N+  ++  SG           L   +KR
Subjt:  NNEGEDAGNTINDPKQTGNAS-------------QEESFKRQLNDWLTENNFCLIPIKSIPNCNSRKSVEDDNIATSRRASGNNNFVILSETKLASISKR

Query:  LIKSMWSSIRINWMALESTGSSGGILNMWDDLNFSVSDFVKGQFSLSIKISLADGFSWWLSGIYGPASRKYRGLLWRELYDLYGLCNDCWLLGGDFNVFR
        +IKS+W S  INW+A  ++GSSGGIL +WD  N S+    +G FSLS    L +  SWWL+G+YGP  R+ R   W EL++L  L +  W+LGGD NV R
Subjt:  LIKSMWSSIRINWMALESTGSSGGILNMWDDLNFSVSDFVKGQFSLSIKISLADGFSWWLSGIYGPASRKYRGLLWRELYDLYGLCNDCWLLGGDFNVFR

Query:  WSSETSSINPAKKSMRKFNNFISSTGLIDPPLLNGSYTRSNLRNSPIRSRLDRFLYSYNWALKFGDHTSKRLSRITSDHFPILLD--SPSVSWGPCPFRF
           E++S+  +  + R  NNFIS+  LIDPPL N  +T SNLRN P  SR+DRFLY+ +W   F  HT++ L R TSDHFP++ +  +P +SWGP PFR 
Subjt:  WSSETSSINPAKKSMRKFNNFISSTGLIDPPLLNGSYTRSNLRNSPIRSRLDRFLYSYNWALKFGDHTSKRLSRITSDHFPILLD--SPSVSWGPCPFRF

Query:  DNYLLKNKSFLDKIEHWWISSSQDGHPGYSFIRHLRQLAKSVKAWKKTNMENLKSRKKSIADEIDRIDSLEALGPIDDTLRQQRVALKTDILEVALQEAR
        ++  L +  F   +  WW +S Q G+PG+SFI+ L+ LA  +K W+K  + +L   K++I  E+D ID  E   P+      +R+ALK D+ E++L+E++
Subjt:  DNYLLKNKSFLDKIEHWWISSSQDGHPGYSFIRHLRQLAKSVKAWKKTNMENLKSRKKSIADEIDRIDSLEALGPIDDTLRQQRVALKTDILEVALQEAR

Query:  YWSQRCKKIWLSKGDENTSFFHKMCTARRRRNHITELMSKDGS
        +W QR KK+WL +GDEN+SFFH++C++R++R+ I E+  ++GS
Subjt:  YWSQRCKKIWLSKGDENTSFFHKMCTARRRRNHITELMSKDGS

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.4e-13832.65Show/hide
Query:  PESVEWLKVTFQSLLSNPRNNRFFQEKRYEDYCLWVDKTSNRRGN--IAEVFRIDDKGRKCCVMVPEGYDRNGWNSFLSMFIFK-EAKAKYVRPEAKNSR
        P+++ W++  F+ LL       FF E+R ED C+WV KT N+      AE+FRID+KGRKC ++VPEG D  GW SFL++  F+  A  K +R E +   
Subjt:  PESVEWLKVTFQSLLSNPRNNRFFQEKRYEDYCLWVDKTSNRRGN--IAEVFRIDDKGRKCCVMVPEGYDRNGWNSFLSMFIFK-EAKAKYVRPEAKNSR

Query:  PYRSQDSTS--SDNSRRSYVEVLSSKSHSNTETIPSVQEKDFNLSRSRTIAVR-IEEDEFSWNNIVVLSRRCFHDDWVKIFNKLKELNDYAASYNPFHAN
             DS S  SD+SR+SY +VLS  S  + +        D +  RS +I  +       S+   V+++RRCFHDDW +I   L++ ++ A SY PF A+
Subjt:  PYRSQDSTS--SDNSRRSYVEVLSSKSHSNTETIPSVQEKDFNLSRSRTIAVR-IEEDEFSWNNIVVLSRRCFHDDWVKIFNKLKELNDYAASYNPFHAN

Query:  KALLFMR---------------------------------------------------------NEEQVMHV----GGFVEAAKHTWRKLDLIEATIKIK
        KA+LF+                                                          N     H+    GGF++ AK T +   LI+A IK++
Subjt:  KALLFMR---------------------------------------------------------NEEQVMHV----GGFVEAAKHTWRKLDLIEATIKIK

Query:  DNFTGFIPASVRIVDEKEITFIVHTVAPMAGKWLVCRRPKIHGSFTREAARDFDEYDVKSEAYTFR--------------ENSACTVDQSSASIIQQSRI
         N+ GF+PAS+ I D +   FIV TV P   +WLV R  ++HGSF  +AA +FD+++  +E YT+               + S    D+ S S   Q++ 
Subjt:  DNFTGFIPASVRIVDEKEITFIVHTVAPMAGKWLVCRRPKIHGSFTREAARDFDEYDVKSEAYTFR--------------ENSACTVDQSSASIIQQSRI

Query:  QTKPKDLFDSSDHP-PLQSKEKGKKI----------YTSDEENLIKKTVSFTSP----KNQTVFYNPSYAPAKPLTPLGQNPTQRPNQQQKNTGKEVYQP
            +  +D  D     + KEKGK I          Y+   + +  + VSF SP     N +     +   +  ++ +     +R + +QK   K  Y+ 
Subjt:  QTKPKDLFDSSDHP-PLQSKEKGKKI----------YTSDEENLIKKTVSFTSP----KNQTVFYNPSYAPAKPLTPLGQNPTQRPNQQQKNTGKEVYQP

Query:  KRYTGIRIQEPIPHMGQTQIFKENEEFTLSVDLGPLSPISDTLISSPEKTPSPLEKRPSE--SPSDIMHDSYKFLIPQDESEEEGDDNNEGEDAGNTIND
        K+      ++    + +T   + +++  LSVD+GP+SP+   + S           +  +  S S    ++    +   E  ++    +     GN+  D
Subjt:  KRYTGIRIQEPIPHMGQTQIFKENEEFTLSVDLGPLSPISDTLISSPEKTPSPLEKRPSE--SPSDIMHDSYKFLIPQDESEEEGDDNNEGEDAGNTIND

Query:  PKQTGNASQEESFKRQLNDWLTENNFCLIP--IKSIPNCNSRKSVEDDNIATSRRASGNNNFVILSETKLASISKRLIKSMWSSIRINWMALESTGSSGG
         K       + +FK +L  WL EN   L P     +P+ +    +  D            N  I     L                         G  GG
Subjt:  PKQTGNASQEESFKRQLNDWLTENNFCLIP--IKSIPNCNSRKSVEDDNIATSRRASGNNNFVILSETKLASISKRLIKSMWSSIRINWMALESTGSSGG

Query:  ILNMWDDLNFSVSDFVKGQFSLSIKISLADGFSWWLSGIYGPASRKYRGLLWRELYDLYGLCNDCWLLGGDFNVFRWSSETSSINPAKKSMRKFNNFISS
        IL +WDD NF V+D   G +S+S+ I   +G +WWL+ +YGP     R  LW EL  L  LC   WL+ GDFN+ RW  ET++ +  K++M  FNNFIS 
Subjt:  ILNMWDDLNFSVSDFVKGQFSLSIKISLADGFSWWLSGIYGPASRKYRGLLWRELYDLYGLCNDCWLLGGDFNVFRWSSETSSINPAKKSMRKFNNFISS

Query:  TGLIDPPLLNGSYTRSNLRNSPIRSRLDRFLYSYNWALKFGDHTSKRLSRITSDHFPILLDSPSVSWGPCPFRFDNYLLKNKSFLDKIEHWWISSSQDGH
          LIDPP LN ++T SNLR +P  SRLDRFL S  W   FG HTS+ L R  SDHFPILL+SP + WGPCPFR +N  L++K F     +WW SS Q G 
Subjt:  TGLIDPPLLNGSYTRSNLRNSPIRSRLDRFLYSYNWALKFGDHTSKRLSRITSDHFPILLDSPSVSWGPCPFRFDNYLLKNKSFLDKIEHWWISSSQDGH

Query:  PGYSFIRHLRQLAKSVKAWKKTNMENLKSRKKSIADEIDRIDSLEALGPIDDTLRQQRVALKTDILEVALQEARYWSQRCKKIWLSKGDENTSFFHKMCT
        PGY+FI+ L  L+K +K W+   +    + KK++  EID ID LE  G +  T  Q+R++LK+D+L +   +A+ W QR ++ W   GDEN S+FH++CT
Subjt:  PGYSFIRHLRQLAKSVKAWKKTNMENLKSRKKSIADEIDRIDSLEALGPIDDTLRQQRVALKTDILEVALQEARYWSQRCKKIWLSKGDENTSFFHKMCT

Query:  ARRRRNHITELMSKDGSSIASDD
          +R+N I  +    G+S+ S D
Subjt:  ARRRRNHITELMSKDGSSIASDD

TrEMBL top hitse value%identityAlignment
A0A5A7TTA1 DUF4283 domain-containing protein5.4e-13636.32Show/hide
Query:  PESVEWLKVTFQSLLSNPRNNRFFQEKRYEDYCLWVDKTSNRRGNIAEVFRIDDKGRKCCVMVPEGYDRNGWNSFLSMFIFKEAKAKYVRPEAKNSRPYR
        P+S+EWLK+TF++LL+ PR  RFF EKRY D+CLWV    NRRG IAE++R+DD+GRKCC++VPEG D+ GW  F  M   K+   K   P    +R Y 
Subjt:  PESVEWLKVTFQSLLSNPRNNRFFQEKRYEDYCLWVDKTSNRRGNIAEVFRIDDKGRKCCVMVPEGYDRNGWNSFLSMFIFKEAKAKYVRPEAKNSRPYR

Query:  SQD------------STSSDNSRRSYVEVLSSKSHSNTETIPSVQEKDFNLSRSRTIAVRIEEDEFSWNNIVVLSRRCFHDDWVKIFNKLKELNDYAAS-
        +QD            ST S++ R++Y E +SS S S      S  E D + ++S +                 L RRCFHDDW KI ++L++  D   S 
Subjt:  SQD------------STSSDNSRRSYVEVLSSKSHSNTETIPSVQEKDFNLSRSRTIAVRIEEDEFSWNNIVVLSRRCFHDDWVKIFNKLKELNDYAAS-

Query:  --YNPFHANKALLFMRNEEQV----------------------------------------------MHV-------------GGFVEAAKHTWRKLDLI
          Y PFHA+KALLF++++E                                                +H+             GGF++AA  +  KL+L 
Subjt:  --YNPFHANKALLFMRNEEQV----------------------------------------------MHV-------------GGFVEAAKHTWRKLDLI

Query:  EATIKIKDNFTGFIPASVRIVDEKEITFIVHTVAPMAGKWLVCRRPKIHGSFTREAARDFDEYDVKSEAYTFRENSAC----TVDQSSASIIQQSRIQTK
        EA IK+K+N+TGF+PA ++I DE+   FI+ TV    GKWL  R P IHGSFT+ AA +F+E++  +E YTFR N A      + +SS    +Q     K
Subjt:  EATIKIKDNFTGFIPASVRIVDEKEITFIVHTVAPMAGKWLVCRRPKIHGSFTREAARDFDEYDVKSEAYTFRENSAC----TVDQSSASIIQQSRIQTK

Query:  -----------------------------PKDLFDSSDH-PPLQSKEKGKKIYTSDEENLI-----KKTVSFTSPKNQTVFYNPSYAPAKPL---TPLG-
                                       +     DH   +  K+KGK+I   +    I     K+ VSF+SPKN+T  ++   AP K L   +P+  
Subjt:  -----------------------------PKDLFDSSDH-PPLQSKEKGKKIYTSDEENLI-----KKTVSFTSPKNQTVFYNPSYAPAKPL---TPLG-

Query:  ---QNPTQRPNQQQKNTGKEVYQPKRYTGIRIQEPIPHMGQTQIFKEN-----EEFTLSVDLGPLSPISDTLISSPEKTPSPLEKRPSESPSDIMHDSYK
           +  ++RP Q  K   K+VY+ K  +  R         QT   K+       EF L VDLG +S +SDT  S PE +PS +    S + SDI+ DS  
Subjt:  ---QNPTQRPNQQQKNTGKEVYQPKRYTGIRIQEPIPHMGQTQIFKEN-----EEFTLSVDLGPLSPISDTLISSPEKTPSPLEKRPSESPSDIMHDSYK

Query:  FLIP-QDESEEEGDDNNEGEDAGNTINDPKQTGNASQEESFKRQLNDWLTENNFCLIPIKSIPNCNSRKSVEDDNIATSRRASGNNNFVILSETKLASIS
         ++    E  E+    N  E+                E SFKR+L DWL ENN  L                         A  N+ F  ++  ++ SI 
Subjt:  FLIP-QDESEEEGDDNNEGEDAGNTINDPKQTGNASQEESFKRQLNDWLTENNFCLIPIKSIPNCNSRKSVEDDNIATSRRASGNNNFVILSETKLASIS

Query:  KRLIKSMWSSIRINWMALESTGSSGGILNMWDDLNFSVSDFVKGQFSLSIKISLADGFSWWLSGIYGPASRKYRGLLWRELYDLYGLCNDCWLLGGDFNV
                           +   + G+ N+ +D+       + G FS+SI++   +G SWWLS IYGPA RK R L W EL +L  +C   W+LGGDFNV
Subjt:  KRLIKSMWSSIRINWMALESTGSSGGILNMWDDLNFSVSDFVKGQFSLSIKISLADGFSWWLSGIYGPASRKYRGLLWRELYDLYGLCNDCWLLGGDFNV

Query:  FRWSSETSSINPAKKSMRKFNNFISSTGLIDPPLLNGSYTRSNLRNSPIRSRLDRFLYSYNWALKFGDHTSKRLSRITSDHFPILLDSPSVSWGPCPFRF
         RW  ETS+ NPA  SM++FN FIS+  LIDPPL N  +T SNLR     SRLDRFL+S +W   F  HTSK L+R TSDHFPI+L+S S+SWGP PFRF
Subjt:  FRWSSETSSINPAKKSMRKFNNFISSTGLIDPPLLNGSYTRSNLRNSPIRSRLDRFLYSYNWALKFGDHTSKRLSRITSDHFPILLDSPSVSWGPCPFRF

Query:  DNYLLKNKSFLDKIEHWWISSSQDGHPGYSFIRHLRQLA
         N  LK+  +   IE WW ++SQ G+ GYSF+  L+QLA
Subjt:  DNYLLKNKSFLDKIEHWWISSSQDGHPGYSFIRHLRQLA

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein6.8e-13932.65Show/hide
Query:  PESVEWLKVTFQSLLSNPRNNRFFQEKRYEDYCLWVDKTSNRRGN--IAEVFRIDDKGRKCCVMVPEGYDRNGWNSFLSMFIFK-EAKAKYVRPEAKNSR
        P+++ W++  F+ LL       FF E+R ED C+WV KT N+      AE+FRID+KGRKC ++VPEG D  GW SFL++  F+  A  K +R E +   
Subjt:  PESVEWLKVTFQSLLSNPRNNRFFQEKRYEDYCLWVDKTSNRRGN--IAEVFRIDDKGRKCCVMVPEGYDRNGWNSFLSMFIFK-EAKAKYVRPEAKNSR

Query:  PYRSQDSTS--SDNSRRSYVEVLSSKSHSNTETIPSVQEKDFNLSRSRTIAVR-IEEDEFSWNNIVVLSRRCFHDDWVKIFNKLKELNDYAASYNPFHAN
             DS S  SD+SR+SY +VLS  S  + +        D +  RS +I  +       S+   V+++RRCFHDDW +I   L++ ++ A SY PF A+
Subjt:  PYRSQDSTS--SDNSRRSYVEVLSSKSHSNTETIPSVQEKDFNLSRSRTIAVR-IEEDEFSWNNIVVLSRRCFHDDWVKIFNKLKELNDYAASYNPFHAN

Query:  KALLFMR---------------------------------------------------------NEEQVMHV----GGFVEAAKHTWRKLDLIEATIKIK
        KA+LF+                                                          N     H+    GGF++ AK T +   LI+A IK++
Subjt:  KALLFMR---------------------------------------------------------NEEQVMHV----GGFVEAAKHTWRKLDLIEATIKIK

Query:  DNFTGFIPASVRIVDEKEITFIVHTVAPMAGKWLVCRRPKIHGSFTREAARDFDEYDVKSEAYTFR--------------ENSACTVDQSSASIIQQSRI
         N+ GF+PAS+ I D +   FIV TV P   +WLV R  ++HGSF  +AA +FD+++  +E YT+               + S    D+ S S   Q++ 
Subjt:  DNFTGFIPASVRIVDEKEITFIVHTVAPMAGKWLVCRRPKIHGSFTREAARDFDEYDVKSEAYTFR--------------ENSACTVDQSSASIIQQSRI

Query:  QTKPKDLFDSSDHP-PLQSKEKGKKI----------YTSDEENLIKKTVSFTSP----KNQTVFYNPSYAPAKPLTPLGQNPTQRPNQQQKNTGKEVYQP
            +  +D  D     + KEKGK I          Y+   + +  + VSF SP     N +     +   +  ++ +     +R + +QK   K  Y+ 
Subjt:  QTKPKDLFDSSDHP-PLQSKEKGKKI----------YTSDEENLIKKTVSFTSP----KNQTVFYNPSYAPAKPLTPLGQNPTQRPNQQQKNTGKEVYQP

Query:  KRYTGIRIQEPIPHMGQTQIFKENEEFTLSVDLGPLSPISDTLISSPEKTPSPLEKRPSE--SPSDIMHDSYKFLIPQDESEEEGDDNNEGEDAGNTIND
        K+      ++    + +T   + +++  LSVD+GP+SP+   + S           +  +  S S    ++    +   E  ++    +     GN+  D
Subjt:  KRYTGIRIQEPIPHMGQTQIFKENEEFTLSVDLGPLSPISDTLISSPEKTPSPLEKRPSE--SPSDIMHDSYKFLIPQDESEEEGDDNNEGEDAGNTIND

Query:  PKQTGNASQEESFKRQLNDWLTENNFCLIP--IKSIPNCNSRKSVEDDNIATSRRASGNNNFVILSETKLASISKRLIKSMWSSIRINWMALESTGSSGG
         K       + +FK +L  WL EN   L P     +P+ +    +  D            N  I     L                         G  GG
Subjt:  PKQTGNASQEESFKRQLNDWLTENNFCLIP--IKSIPNCNSRKSVEDDNIATSRRASGNNNFVILSETKLASISKRLIKSMWSSIRINWMALESTGSSGG

Query:  ILNMWDDLNFSVSDFVKGQFSLSIKISLADGFSWWLSGIYGPASRKYRGLLWRELYDLYGLCNDCWLLGGDFNVFRWSSETSSINPAKKSMRKFNNFISS
        IL +WDD NF V+D   G +S+S+ I   +G +WWL+ +YGP     R  LW EL  L  LC   WL+ GDFN+ RW  ET++ +  K++M  FNNFIS 
Subjt:  ILNMWDDLNFSVSDFVKGQFSLSIKISLADGFSWWLSGIYGPASRKYRGLLWRELYDLYGLCNDCWLLGGDFNVFRWSSETSSINPAKKSMRKFNNFISS

Query:  TGLIDPPLLNGSYTRSNLRNSPIRSRLDRFLYSYNWALKFGDHTSKRLSRITSDHFPILLDSPSVSWGPCPFRFDNYLLKNKSFLDKIEHWWISSSQDGH
          LIDPP LN ++T SNLR +P  SRLDRFL S  W   FG HTS+ L R  SDHFPILL+SP + WGPCPFR +N  L++K F     +WW SS Q G 
Subjt:  TGLIDPPLLNGSYTRSNLRNSPIRSRLDRFLYSYNWALKFGDHTSKRLSRITSDHFPILLDSPSVSWGPCPFRFDNYLLKNKSFLDKIEHWWISSSQDGH

Query:  PGYSFIRHLRQLAKSVKAWKKTNMENLKSRKKSIADEIDRIDSLEALGPIDDTLRQQRVALKTDILEVALQEARYWSQRCKKIWLSKGDENTSFFHKMCT
        PGY+FI+ L  L+K +K W+   +    + KK++  EID ID LE  G +  T  Q+R++LK+D+L +   +A+ W QR ++ W   GDEN S+FH++CT
Subjt:  PGYSFIRHLRQLAKSVKAWKKTNMENLKSRKKSIADEIDRIDSLEALGPIDDTLRQQRVALKTDILEVALQEARYWSQRCKKIWLSKGDENTSFFHKMCT

Query:  ARRRRNHITELMSKDGSSIASDD
          +R+N I  +    G+S+ S D
Subjt:  ARRRRNHITELMSKDGSSIASDD

A0A5D3BKT8 LINE-1 retrotransposable element ORF2 protein6.2e-14032.65Show/hide
Query:  PESVEWLKVTFQSLLSNPRNNRFFQEKRYEDYCLWVDKTSNRRGN--IAEVFRIDDKGRKCCVMVPEGYDRNGWNSFLSMFIFK-EAKAKYVRPEAKNSR
        P+++ W++  F+ LL       FF E+R ED C+WV KT N+      AE+FRID+KGRKC ++VPEG D  GW  FL++  F+  A  K +R E +   
Subjt:  PESVEWLKVTFQSLLSNPRNNRFFQEKRYEDYCLWVDKTSNRRGN--IAEVFRIDDKGRKCCVMVPEGYDRNGWNSFLSMFIFK-EAKAKYVRPEAKNSR

Query:  PYRSQDSTS--SDNSRRSYVEVLSSKSHSNTETIPSVQEKDFNLSRSRTIAVR-IEEDEFSWNNIVVLSRRCFHDDWVKIFNKLKELNDYAASYNPFHAN
             DS S  SD+SR+SY +VLS  S  + +        D +  RS +I  +       S+   V+++RRCFHDDW +I   L++ ++ A SY PF A+
Subjt:  PYRSQDSTS--SDNSRRSYVEVLSSKSHSNTETIPSVQEKDFNLSRSRTIAVR-IEEDEFSWNNIVVLSRRCFHDDWVKIFNKLKELNDYAASYNPFHAN

Query:  KALLFMR---------------------------------------------------------NEEQVMHV----GGFVEAAKHTWRKLDLIEATIKIK
        KA+LF+                                                          N     H+    GGF++ AK T +   LI+A IK++
Subjt:  KALLFMR---------------------------------------------------------NEEQVMHV----GGFVEAAKHTWRKLDLIEATIKIK

Query:  DNFTGFIPASVRIVDEKEITFIVHTVAPMAGKWLVCRRPKIHGSFTREAARDFDEYDVKSEAYTFR--------------ENSACTVDQSSASIIQQSRI
         N+TGF+PAS+ I D +   FIV TV P   +WLV R  ++HGSF  +AA +FD+++  +E YT+               + S    D+ S S   Q++ 
Subjt:  DNFTGFIPASVRIVDEKEITFIVHTVAPMAGKWLVCRRPKIHGSFTREAARDFDEYDVKSEAYTFR--------------ENSACTVDQSSASIIQQSRI

Query:  QTKPKDLFDSSDHP-PLQSKEKGKKI----------YTSDEENLIKKTVSFTSP----KNQTVFYNPSYAPAKPLTPLGQNPTQRPNQQQKNTGKEVYQP
            +  +D  D     + KEKGK I          Y+   + +  + VSF SP     N +     +   +  ++ +     +R + +QK   K  Y+ 
Subjt:  QTKPKDLFDSSDHP-PLQSKEKGKKI----------YTSDEENLIKKTVSFTSP----KNQTVFYNPSYAPAKPLTPLGQNPTQRPNQQQKNTGKEVYQP

Query:  KRYTGIRIQEPIPHMGQTQIFKENEEFTLSVDLGPLSPISDTLISSPEKTPSPLEKRPSE--SPSDIMHDSYKFLIPQDESEEEGDDNNEGEDAGNTIND
        K+      ++   ++   +  + +++  LSVD+GP+SP+   + S        L  +  +  S S    ++    +   E  ++    +     GN+  D
Subjt:  KRYTGIRIQEPIPHMGQTQIFKENEEFTLSVDLGPLSPISDTLISSPEKTPSPLEKRPSE--SPSDIMHDSYKFLIPQDESEEEGDDNNEGEDAGNTIND

Query:  PKQTGNASQEESFKRQLNDWLTENNFCLIP--IKSIPNCNSRKSVEDDNIATSRRASGNNNFVILSETKLASISKRLIKSMWSSIRINWMALESTGSSGG
         K       + +FK +L  WL EN   L P     +P+ +S   +  D            N  I     L                         G  GG
Subjt:  PKQTGNASQEESFKRQLNDWLTENNFCLIP--IKSIPNCNSRKSVEDDNIATSRRASGNNNFVILSETKLASISKRLIKSMWSSIRINWMALESTGSSGG

Query:  ILNMWDDLNFSVSDFVKGQFSLSIKISLADGFSWWLSGIYGPASRKYRGLLWRELYDLYGLCNDCWLLGGDFNVFRWSSETSSINPAKKSMRKFNNFISS
        IL +WDD  F V+D   G +S+S+ I   +G +WWL+ +YGP     R  LW EL  L  LC   WL+ GDFN+ RW  ET++ +  K++M  FNNFIS 
Subjt:  ILNMWDDLNFSVSDFVKGQFSLSIKISLADGFSWWLSGIYGPASRKYRGLLWRELYDLYGLCNDCWLLGGDFNVFRWSSETSSINPAKKSMRKFNNFISS

Query:  TGLIDPPLLNGSYTRSNLRNSPIRSRLDRFLYSYNWALKFGDHTSKRLSRITSDHFPILLDSPSVSWGPCPFRFDNYLLKNKSFLDKIEHWWISSSQDGH
          LIDPPLLN ++T SNLR +P  SRLDRFL S  W   FG HTS+ L R  SDHFPILL+SP + WGPCPFR +N  L++K F     +WW +S Q G 
Subjt:  TGLIDPPLLNGSYTRSNLRNSPIRSRLDRFLYSYNWALKFGDHTSKRLSRITSDHFPILLDSPSVSWGPCPFRFDNYLLKNKSFLDKIEHWWISSSQDGH

Query:  PGYSFIRHLRQLAKSVKAWKKTNMENLKSRKKSIADEIDRIDSLEALGPIDDTLRQQRVALKTDILEVALQEARYWSQRCKKIWLSKGDENTSFFHKMCT
        PGY+FI+ L  L+K +K W+   +    + KK++  EID ID LE  G +  T  Q+R++LK+D+L +   +A+ W QR ++ W   GDEN S+FH++CT
Subjt:  PGYSFIRHLRQLAKSVKAWKKTNMENLKSRKKSIADEIDRIDSLEALGPIDDTLRQQRVALKTDILEVALQEARYWSQRCKKIWLSKGDENTSFFHKMCT

Query:  ARRRRNHITELMSKDGSSIASDD
          +R+N I  +    G+S+ S D
Subjt:  ARRRRNHITELMSKDGSSIASDD

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein2.4e-13932.89Show/hide
Query:  PESVEWLKVTFQSLLSNPRNNRFFQEKRYEDYCLWVDKTSNRRGNIAEVFRIDDKGRKCCVMVPEGYDRNGWNSFLSMFIFK-EAKAKYVRPEAKNSRPY
        P  ++W++ T +SL++ P  NRFF E R  +  +W+ KT N +G  AE+FR+D K RK C++VPEG D++GW SFLSM   K E KAK  RP        
Subjt:  PESVEWLKVTFQSLLSNPRNNRFFQEKRYEDYCLWVDKTSNRRGNIAEVFRIDDKGRKCCVMVPEGYDRNGWNSFLSMFIFK-EAKAKYVRPEAKNSRPY

Query:  RSQDSTSSDNSRRSYVE-VLSSKSHSNTETIPSVQEKDFNLSRSRTIAVRIEEDEFSWNNIVVLSRRCFHDDWVKIFNKLKELNDYAASYNPFHANKAL-
          + S   D  +RSY + V   +  + +++  S    D + S S +       D     N VV+ RR FHDDW KI   L++  + + +YN FHA KAL 
Subjt:  RSQDSTSSDNSRRSYVE-VLSSKSHSNTETIPSVQEKDFNLSRSRTIAVRIEEDEFSWNNIVVLSRRCFHDDWVKIFNKLKELNDYAASYNPFHANKAL-

Query:  ------------------------------------------------------LFMRNEEQVMHVG----GFVEAAKHTWRKLDLIEATIKIKDNFTGF
                                                              L + N      +G    G ++ A+ T    +LIEA IK++ N++GF
Subjt:  ------------------------------------------------------LFMRNEEQVMHVG----GFVEAAKHTWRKLDLIEATIKIKDNFTGF

Query:  IPASVRIVDEKEITFIVHTVAPMAGKWLVCRRPKIHGSFTREAARDFDEYDVKSEAYTFRENSA---------------CTVDQSSA---SIIQQSRIQT
        +PA+VRI D +   F V  V    GKWL+ R  ++HG+F R+AA  FD+++ +SE + F  + A                T DQ SA    II+  R  T
Subjt:  IPASVRIVDEKEITFIVHTVAPMAGKWLVCRRPKIHGSFTREAARDFDEYDVKSEAYTFRENSA---------------CTVDQSSA---SIIQQSRIQT

Query:  KPKDL-----FDSSDHPPLQSK--------------EKGK-----KIYTSDEENL--IKKTVSFTSPKNQTVFYNPSYAPAKPLTPLGQNPTQRPNQQQK
         P  L      DS+ H                    +KGK     ++  +   NL   K+ VSF SP N+T  +NP  APA     L     ++   +++
Subjt:  KPKDL-----FDSSDHPPLQSK--------------EKGK-----KIYTSDEENL--IKKTVSFTSPKNQTVFYNPSYAPAKPLTPLGQNPTQRPNQQQK

Query:  NTGKE--VYQP----KRYTGIRIQEPIPHMGQTQIFKENEEFTLSVDLGPLSPISDTLISSPEKTPSPLEKRPSESPSDIMHDSYKFLIPQDESEEEGDD
        +  K+    QP     +  G+ I +PI  +   +     +  +L+VDLG L  +       P K+   LE   +   ++++  +   ++P+    +   +
Subjt:  NTGKE--VYQP----KRYTGIRIQEPIPHMGQTQIFKENEEFTLSVDLGPLSPISDTLISSPEKTPSPLEKRPSESPSDIMHDSYKFLIPQDESEEEGDD

Query:  NNEGEDAGNTINDPKQTGNAS-------------QEESFKRQLNDWLTENNFCLIPIKSIPNCNSRKSVEDDNIATSRRASGNNNFVILSETKLASISKR
         N    +      PK                     E+FK+QL  WL +N      +K   + +S  +    N+  ++  SG           L   +KR
Subjt:  NNEGEDAGNTINDPKQTGNAS-------------QEESFKRQLNDWLTENNFCLIPIKSIPNCNSRKSVEDDNIATSRRASGNNNFVILSETKLASISKR

Query:  LIKSMWSSIRINWMALESTGSSGGILNMWDDLNFSVSDFVKGQFSLSIKISLADGFSWWLSGIYGPASRKYRGLLWRELYDLYGLCNDCWLLGGDFNVFR
        +IKS+W S  INW+A  ++GSSGGIL +WD  N S+    +G FSLS    L +  SWWL+G+YGP  R+ R   W EL++L  L +  W+LGGD NV R
Subjt:  LIKSMWSSIRINWMALESTGSSGGILNMWDDLNFSVSDFVKGQFSLSIKISLADGFSWWLSGIYGPASRKYRGLLWRELYDLYGLCNDCWLLGGDFNVFR

Query:  WSSETSSINPAKKSMRKFNNFISSTGLIDPPLLNGSYTRSNLRNSPIRSRLDRFLYSYNWALKFGDHTSKRLSRITSDHFPILLD--SPSVSWGPCPFRF
           E++S+  +  + R  NNFIS+  LIDPPL N  +T SNLRN P  SR+DRFLY+ +W   F  HT++ L R TSDHFP++ +  +P +SWGP PFR 
Subjt:  WSSETSSINPAKKSMRKFNNFISSTGLIDPPLLNGSYTRSNLRNSPIRSRLDRFLYSYNWALKFGDHTSKRLSRITSDHFPILLD--SPSVSWGPCPFRF

Query:  DNYLLKNKSFLDKIEHWWISSSQDGHPGYSFIRHLRQLAKSVKAWKKTNMENLKSRKKSIADEIDRIDSLEALGPIDDTLRQQRVALKTDILEVALQEAR
        ++  L +  F   +  WW +S Q G+PG+SFI+ L+ LA  +K W+K  + +L   K++I  E+D ID  E   P+      +R+ALK D+ E++L+E++
Subjt:  DNYLLKNKSFLDKIEHWWISSSQDGHPGYSFIRHLRQLAKSVKAWKKTNMENLKSRKKSIADEIDRIDSLEALGPIDDTLRQQRVALKTDILEVALQEAR

Query:  YWSQRCKKIWLSKGDENTSFFHKMCTARRRRNHITELMSKDGS
        +W QR KK+WL +GDEN+SFFH++C++R++R+ I E+  ++GS
Subjt:  YWSQRCKKIWLSKGDENTSFFHKMCTARRRRNHITELMSKDGS

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein6.8e-13932.65Show/hide
Query:  PESVEWLKVTFQSLLSNPRNNRFFQEKRYEDYCLWVDKTSNRRGN--IAEVFRIDDKGRKCCVMVPEGYDRNGWNSFLSMFIFK-EAKAKYVRPEAKNSR
        P+++ W++  F+ LL       FF E+R ED C+WV KT N+      AE+FRID+KGRKC ++VPEG D  GW SFL++  F+  A  K +R E +   
Subjt:  PESVEWLKVTFQSLLSNPRNNRFFQEKRYEDYCLWVDKTSNRRGN--IAEVFRIDDKGRKCCVMVPEGYDRNGWNSFLSMFIFK-EAKAKYVRPEAKNSR

Query:  PYRSQDSTS--SDNSRRSYVEVLSSKSHSNTETIPSVQEKDFNLSRSRTIAVR-IEEDEFSWNNIVVLSRRCFHDDWVKIFNKLKELNDYAASYNPFHAN
             DS S  SD+SR+SY +VLS  S  + +        D +  RS +I  +       S+   V+++RRCFHDDW +I   L++ ++ A SY PF A+
Subjt:  PYRSQDSTS--SDNSRRSYVEVLSSKSHSNTETIPSVQEKDFNLSRSRTIAVR-IEEDEFSWNNIVVLSRRCFHDDWVKIFNKLKELNDYAASYNPFHAN

Query:  KALLFMR---------------------------------------------------------NEEQVMHV----GGFVEAAKHTWRKLDLIEATIKIK
        KA+LF+                                                          N     H+    GGF++ AK T +   LI+A IK++
Subjt:  KALLFMR---------------------------------------------------------NEEQVMHV----GGFVEAAKHTWRKLDLIEATIKIK

Query:  DNFTGFIPASVRIVDEKEITFIVHTVAPMAGKWLVCRRPKIHGSFTREAARDFDEYDVKSEAYTFR--------------ENSACTVDQSSASIIQQSRI
         N+ GF+PAS+ I D +   FIV TV P   +WLV R  ++HGSF  +AA +FD+++  +E YT+               + S    D+ S S   Q++ 
Subjt:  DNFTGFIPASVRIVDEKEITFIVHTVAPMAGKWLVCRRPKIHGSFTREAARDFDEYDVKSEAYTFR--------------ENSACTVDQSSASIIQQSRI

Query:  QTKPKDLFDSSDHP-PLQSKEKGKKI----------YTSDEENLIKKTVSFTSP----KNQTVFYNPSYAPAKPLTPLGQNPTQRPNQQQKNTGKEVYQP
            +  +D  D     + KEKGK I          Y+   + +  + VSF SP     N +     +   +  ++ +     +R + +QK   K  Y+ 
Subjt:  QTKPKDLFDSSDHP-PLQSKEKGKKI----------YTSDEENLIKKTVSFTSP----KNQTVFYNPSYAPAKPLTPLGQNPTQRPNQQQKNTGKEVYQP

Query:  KRYTGIRIQEPIPHMGQTQIFKENEEFTLSVDLGPLSPISDTLISSPEKTPSPLEKRPSE--SPSDIMHDSYKFLIPQDESEEEGDDNNEGEDAGNTIND
        K+      ++    + +T   + +++  LSVD+GP+SP+   + S           +  +  S S    ++    +   E  ++    +     GN+  D
Subjt:  KRYTGIRIQEPIPHMGQTQIFKENEEFTLSVDLGPLSPISDTLISSPEKTPSPLEKRPSE--SPSDIMHDSYKFLIPQDESEEEGDDNNEGEDAGNTIND

Query:  PKQTGNASQEESFKRQLNDWLTENNFCLIP--IKSIPNCNSRKSVEDDNIATSRRASGNNNFVILSETKLASISKRLIKSMWSSIRINWMALESTGSSGG
         K       + +FK +L  WL EN   L P     +P+ +    +  D            N  I     L                         G  GG
Subjt:  PKQTGNASQEESFKRQLNDWLTENNFCLIP--IKSIPNCNSRKSVEDDNIATSRRASGNNNFVILSETKLASISKRLIKSMWSSIRINWMALESTGSSGG

Query:  ILNMWDDLNFSVSDFVKGQFSLSIKISLADGFSWWLSGIYGPASRKYRGLLWRELYDLYGLCNDCWLLGGDFNVFRWSSETSSINPAKKSMRKFNNFISS
        IL +WDD NF V+D   G +S+S+ I   +G +WWL+ +YGP     R  LW EL  L  LC   WL+ GDFN+ RW  ET++ +  K++M  FNNFIS 
Subjt:  ILNMWDDLNFSVSDFVKGQFSLSIKISLADGFSWWLSGIYGPASRKYRGLLWRELYDLYGLCNDCWLLGGDFNVFRWSSETSSINPAKKSMRKFNNFISS

Query:  TGLIDPPLLNGSYTRSNLRNSPIRSRLDRFLYSYNWALKFGDHTSKRLSRITSDHFPILLDSPSVSWGPCPFRFDNYLLKNKSFLDKIEHWWISSSQDGH
          LIDPP LN ++T SNLR +P  SRLDRFL S  W   FG HTS+ L R  SDHFPILL+SP + WGPCPFR +N  L++K F     +WW SS Q G 
Subjt:  TGLIDPPLLNGSYTRSNLRNSPIRSRLDRFLYSYNWALKFGDHTSKRLSRITSDHFPILLDSPSVSWGPCPFRFDNYLLKNKSFLDKIEHWWISSSQDGH

Query:  PGYSFIRHLRQLAKSVKAWKKTNMENLKSRKKSIADEIDRIDSLEALGPIDDTLRQQRVALKTDILEVALQEARYWSQRCKKIWLSKGDENTSFFHKMCT
        PGY+FI+ L  L+K +K W+   +    + KK++  EID ID LE  G +  T  Q+R++LK+D+L +   +A+ W QR ++ W   GDEN S+FH++CT
Subjt:  PGYSFIRHLRQLAKSVKAWKKTNMENLKSRKKSIADEIDRIDSLEALGPIDDTLRQQRVALKTDILEVALQEARYWSQRCKKIWLSKGDENTSFFHKMCT

Query:  ARRRRNHITELMSKDGSSIASDD
          +R+N I  +    G+S+ S D
Subjt:  ARRRRNHITELMSKDGSSIASDD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein5.0e-1728.24Show/hide
Query:  LLGGDFNVFRWSSETSSINPAKKSMR---KFNNFISSTGLIDPPLLNGSYTRSNLR-NSPIRSRLDRFLYSYNWALKFGDHTSKRLSRITSDHFP--ILL
        +L GDF+    +S+  S+      MR   +F N +  + L+D P     YT SN + ++PI  +LDR + + +W   F    +       SDH P  I+L
Subjt:  LLGGDFNVFRWSSETSSINPAKKSMR---KFNNFISSTGLIDPPLLNGSYTRSNLR-NSPIRSRLDRFLYSYNWALKFGDHTSKRLSRITSDHFP--ILL

Query:  DSPSVSWGPCPFRFDNYLLKNKSFLDKIEHWWISSSQDGHPGYSFIRHLRQLAKSVKAWKKTNMENLKSRKKSIADEIDRIDSLEALGPIDDTLRQQRVA
        ++       C FR+ ++L  + +FL  +   W      G   +S   HL+   K  K   +    N++ + K   D ++ I S     P D   R + VA
Subjt:  DSPSVSWGPCPFRFDNYLLKNKSFLDKIEHWWISSSQDGHPGYSFIRHLRQLAKSVKAWKKTNMENLKSRKKSIADEIDRIDSLEALGPIDDTLRQQRVA

Query:  LKTDILEVALQEARYWSQRCKKIWLSKGDENTSFFHKMCTARRRRNHITELMSKD
         K      A  E+ ++ Q+ +  WL  GD NT FFHK+  A + +N I  L   D
Subjt:  LKTDILEVALQEARYWSQRCKKIWLSKGDENTSFFHKMCTARRRRNHITELMSKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCTGCTTACGGAAATCGGGCCTTATAAATCGTTCTCCATGTCCCTGCTGCCCCGAATCAGTTGAATGGCTAAAGGTAACTTTTCAATCCCTGCTTTCTAATCCGCG
AAACAATCGTTTTTTTCAGGAAAAGCGCTACGAAGATTATTGTCTTTGGGTAGACAAAACATCAAATAGGAGAGGAAATATAGCAGAGGTTTTTCGTATTGACGATAAAG
GAAGAAAATGTTGTGTTATGGTCCCTGAAGGCTATGACAGAAATGGGTGGAACTCTTTCCTATCTATGTTTATCTTCAAGGAAGCCAAGGCAAAATATGTCAGACCTGAG
GCGAAAAATTCCAGACCTTATAGAAGTCAAGATTCTACAAGTAGTGATAATTCCAGAAGGTCTTATGTTGAAGTTCTATCTTCAAAGAGCCATTCCAACACAGAAACTAT
TCCTTCTGTTCAAGAAAAAGACTTTAATCTCAGCAGAAGTAGAACTATAGCAGTAAGAATAGAGGAAGACGAATTTTCTTGGAATAATATTGTAGTCTTAAGCCGCCGTT
GTTTTCATGATGATTGGGTGAAAATTTTTAACAAATTGAAGGAGCTGAACGATTATGCGGCTTCATACAACCCATTTCATGCCAATAAAGCTTTACTTTTTATGAGGAAT
GAGGAACAAGTAATGCATGTGGGGGGTTTTGTGGAAGCGGCCAAACATACATGGCGGAAGCTTGATCTCATCGAAGCCACCATCAAAATTAAAGATAATTTCACTGGCTT
TATTCCCGCATCAGTCAGAATTGTGGATGAAAAGGAAATTACCTTTATAGTTCACACCGTAGCTCCGATGGCCGGAAAATGGCTAGTCTGCAGAAGACCTAAGATACATG
GTTCGTTCACGCGAGAAGCTGCCAGAGATTTTGACGAATATGACGTCAAATCAGAGGCATACACTTTCAGAGAAAATTCGGCGTGCACAGTGGATCAATCGAGTGCCTCG
ATAATACAACAATCAAGAATTCAAACAAAGCCCAAAGACCTTTTTGATTCTTCAGATCATCCTCCTCTGCAATCGAAAGAAAAGGGTAAAAAGATCTACACGTCTGACGA
GGAAAATTTGATAAAAAAGACAGTTTCCTTCACATCTCCCAAAAATCAGACTGTCTTTTACAACCCTTCTTATGCCCCAGCCAAACCCTTAACTCCATTGGGCCAAAATC
CTACACAAAGGCCCAATCAGCAGCAAAAGAATACAGGTAAGGAAGTATATCAGCCCAAGAGATACACGGGGATCAGAATCCAAGAACCAATACCACATATGGGGCAGACC
CAAATCTTTAAAGAAAATGAAGAATTTACTTTGAGTGTTGACTTGGGCCCATTGTCTCCAATTTCAGACACTTTGATATCTAGTCCAGAGAAAACTCCATCACCTCTAGA
GAAAAGGCCAAGCGAATCCCCATCAGATATAATGCACGACAGTTATAAGTTCCTGATTCCACAAGATGAGTCTGAGGAAGAGGGCGACGATAATAACGAGGGGGAGGATG
CTGGAAACACAATCAATGATCCAAAGCAAACAGGAAATGCATCCCAAGAAGAGTCTTTCAAAAGACAGTTAAACGACTGGTTGACGGAGAACAATTTTTGTCTTATTCCC
ATTAAATCGATCCCAAATTGTAATAGCAGAAAGTCTGTTGAAGACGATAATATTGCAACTAGCCGTAGGGCTTCGGGGAACAACAATTTCGTAATTCTCTCTGAAACTAA
ATTAGCTTCCATCAGCAAGCGTTTGATTAAATCTATGTGGAGCTCCATTAGAATCAATTGGATGGCCCTTGAATCTACAGGATCGTCTGGTGGTATCTTGAATATGTGGG
ATGATCTTAATTTCTCAGTTTCGGACTTCGTCAAAGGTCAATTCTCTCTTTCCATTAAAATTTCTCTTGCAGATGGATTCTCTTGGTGGTTATCTGGTATATACGGCCCG
GCGAGCAGGAAGTACAGAGGTCTTTTATGGAGGGAGCTCTATGATCTTTATGGGCTTTGCAATGATTGCTGGCTACTTGGGGGAGACTTCAACGTGTTTCGTTGGTCTTC
TGAAACCTCCTCTATCAATCCAGCTAAGAAGAGTATGCGGAAATTTAACAACTTCATTTCTTCCACTGGCTTGATAGATCCCCCCCTCTTGAATGGATCCTATACCAGGT
CGAATCTTCGGAATTCTCCCATAAGATCGCGCCTAGACAGATTTTTGTATTCTTATAATTGGGCCTTAAAGTTCGGTGATCATACATCTAAAAGACTCTCCAGGATAACG
TCTGATCATTTTCCAATCTTATTAGATTCTCCGTCTGTGAGTTGGGGCCCCTGCCCTTTCCGTTTCGATAATTACCTGCTGAAAAACAAAAGTTTTCTTGACAAGATTGA
GCATTGGTGGATCTCCTCATCTCAAGACGGCCATCCGGGTTACTCCTTTATTCGTCATTTGAGGCAATTAGCGAAGTCGGTTAAGGCTTGGAAGAAAACAAACATGGAAA
ATCTGAAATCCAGGAAGAAATCTATTGCCGATGAGATTGACAGAATCGATTCCTTGGAAGCTCTCGGCCCTATTGACGACACCCTCAGGCAACAGAGAGTTGCTCTTAAA
ACAGACATCTTAGAAGTTGCGTTGCAAGAGGCCCGATATTGGAGTCAGCGTTGCAAAAAAATCTGGCTTTCTAAGGGTGATGAAAATACGTCTTTCTTTCACAAAATGTG
CACAGCTCGTAGAAGAAGAAACCACATCACGGAACTTATGTCTAAAGATGGTTCTAGTATAGCTTCGGATGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGTGCTGCTTACGGAAATCGGGCCTTATAAATCGTTCTCCATGTCCCTGCTGCCCCGAATCAGTTGAATGGCTAAAGGTAACTTTTCAATCCCTGCTTTCTAATCCGCG
AAACAATCGTTTTTTTCAGGAAAAGCGCTACGAAGATTATTGTCTTTGGGTAGACAAAACATCAAATAGGAGAGGAAATATAGCAGAGGTTTTTCGTATTGACGATAAAG
GAAGAAAATGTTGTGTTATGGTCCCTGAAGGCTATGACAGAAATGGGTGGAACTCTTTCCTATCTATGTTTATCTTCAAGGAAGCCAAGGCAAAATATGTCAGACCTGAG
GCGAAAAATTCCAGACCTTATAGAAGTCAAGATTCTACAAGTAGTGATAATTCCAGAAGGTCTTATGTTGAAGTTCTATCTTCAAAGAGCCATTCCAACACAGAAACTAT
TCCTTCTGTTCAAGAAAAAGACTTTAATCTCAGCAGAAGTAGAACTATAGCAGTAAGAATAGAGGAAGACGAATTTTCTTGGAATAATATTGTAGTCTTAAGCCGCCGTT
GTTTTCATGATGATTGGGTGAAAATTTTTAACAAATTGAAGGAGCTGAACGATTATGCGGCTTCATACAACCCATTTCATGCCAATAAAGCTTTACTTTTTATGAGGAAT
GAGGAACAAGTAATGCATGTGGGGGGTTTTGTGGAAGCGGCCAAACATACATGGCGGAAGCTTGATCTCATCGAAGCCACCATCAAAATTAAAGATAATTTCACTGGCTT
TATTCCCGCATCAGTCAGAATTGTGGATGAAAAGGAAATTACCTTTATAGTTCACACCGTAGCTCCGATGGCCGGAAAATGGCTAGTCTGCAGAAGACCTAAGATACATG
GTTCGTTCACGCGAGAAGCTGCCAGAGATTTTGACGAATATGACGTCAAATCAGAGGCATACACTTTCAGAGAAAATTCGGCGTGCACAGTGGATCAATCGAGTGCCTCG
ATAATACAACAATCAAGAATTCAAACAAAGCCCAAAGACCTTTTTGATTCTTCAGATCATCCTCCTCTGCAATCGAAAGAAAAGGGTAAAAAGATCTACACGTCTGACGA
GGAAAATTTGATAAAAAAGACAGTTTCCTTCACATCTCCCAAAAATCAGACTGTCTTTTACAACCCTTCTTATGCCCCAGCCAAACCCTTAACTCCATTGGGCCAAAATC
CTACACAAAGGCCCAATCAGCAGCAAAAGAATACAGGTAAGGAAGTATATCAGCCCAAGAGATACACGGGGATCAGAATCCAAGAACCAATACCACATATGGGGCAGACC
CAAATCTTTAAAGAAAATGAAGAATTTACTTTGAGTGTTGACTTGGGCCCATTGTCTCCAATTTCAGACACTTTGATATCTAGTCCAGAGAAAACTCCATCACCTCTAGA
GAAAAGGCCAAGCGAATCCCCATCAGATATAATGCACGACAGTTATAAGTTCCTGATTCCACAAGATGAGTCTGAGGAAGAGGGCGACGATAATAACGAGGGGGAGGATG
CTGGAAACACAATCAATGATCCAAAGCAAACAGGAAATGCATCCCAAGAAGAGTCTTTCAAAAGACAGTTAAACGACTGGTTGACGGAGAACAATTTTTGTCTTATTCCC
ATTAAATCGATCCCAAATTGTAATAGCAGAAAGTCTGTTGAAGACGATAATATTGCAACTAGCCGTAGGGCTTCGGGGAACAACAATTTCGTAATTCTCTCTGAAACTAA
ATTAGCTTCCATCAGCAAGCGTTTGATTAAATCTATGTGGAGCTCCATTAGAATCAATTGGATGGCCCTTGAATCTACAGGATCGTCTGGTGGTATCTTGAATATGTGGG
ATGATCTTAATTTCTCAGTTTCGGACTTCGTCAAAGGTCAATTCTCTCTTTCCATTAAAATTTCTCTTGCAGATGGATTCTCTTGGTGGTTATCTGGTATATACGGCCCG
GCGAGCAGGAAGTACAGAGGTCTTTTATGGAGGGAGCTCTATGATCTTTATGGGCTTTGCAATGATTGCTGGCTACTTGGGGGAGACTTCAACGTGTTTCGTTGGTCTTC
TGAAACCTCCTCTATCAATCCAGCTAAGAAGAGTATGCGGAAATTTAACAACTTCATTTCTTCCACTGGCTTGATAGATCCCCCCCTCTTGAATGGATCCTATACCAGGT
CGAATCTTCGGAATTCTCCCATAAGATCGCGCCTAGACAGATTTTTGTATTCTTATAATTGGGCCTTAAAGTTCGGTGATCATACATCTAAAAGACTCTCCAGGATAACG
TCTGATCATTTTCCAATCTTATTAGATTCTCCGTCTGTGAGTTGGGGCCCCTGCCCTTTCCGTTTCGATAATTACCTGCTGAAAAACAAAAGTTTTCTTGACAAGATTGA
GCATTGGTGGATCTCCTCATCTCAAGACGGCCATCCGGGTTACTCCTTTATTCGTCATTTGAGGCAATTAGCGAAGTCGGTTAAGGCTTGGAAGAAAACAAACATGGAAA
ATCTGAAATCCAGGAAGAAATCTATTGCCGATGAGATTGACAGAATCGATTCCTTGGAAGCTCTCGGCCCTATTGACGACACCCTCAGGCAACAGAGAGTTGCTCTTAAA
ACAGACATCTTAGAAGTTGCGTTGCAAGAGGCCCGATATTGGAGTCAGCGTTGCAAAAAAATCTGGCTTTCTAAGGGTGATGAAAATACGTCTTTCTTTCACAAAATGTG
CACAGCTCGTAGAAGAAGAAACCACATCACGGAACTTATGTCTAAAGATGGTTCTAGTATAGCTTCGGATGATTAG
Protein sequenceShow/hide protein sequence
MCCLRKSGLINRSPCPCCPESVEWLKVTFQSLLSNPRNNRFFQEKRYEDYCLWVDKTSNRRGNIAEVFRIDDKGRKCCVMVPEGYDRNGWNSFLSMFIFKEAKAKYVRPE
AKNSRPYRSQDSTSSDNSRRSYVEVLSSKSHSNTETIPSVQEKDFNLSRSRTIAVRIEEDEFSWNNIVVLSRRCFHDDWVKIFNKLKELNDYAASYNPFHANKALLFMRN
EEQVMHVGGFVEAAKHTWRKLDLIEATIKIKDNFTGFIPASVRIVDEKEITFIVHTVAPMAGKWLVCRRPKIHGSFTREAARDFDEYDVKSEAYTFRENSACTVDQSSAS
IIQQSRIQTKPKDLFDSSDHPPLQSKEKGKKIYTSDEENLIKKTVSFTSPKNQTVFYNPSYAPAKPLTPLGQNPTQRPNQQQKNTGKEVYQPKRYTGIRIQEPIPHMGQT
QIFKENEEFTLSVDLGPLSPISDTLISSPEKTPSPLEKRPSESPSDIMHDSYKFLIPQDESEEEGDDNNEGEDAGNTINDPKQTGNASQEESFKRQLNDWLTENNFCLIP
IKSIPNCNSRKSVEDDNIATSRRASGNNNFVILSETKLASISKRLIKSMWSSIRINWMALESTGSSGGILNMWDDLNFSVSDFVKGQFSLSIKISLADGFSWWLSGIYGP
ASRKYRGLLWRELYDLYGLCNDCWLLGGDFNVFRWSSETSSINPAKKSMRKFNNFISSTGLIDPPLLNGSYTRSNLRNSPIRSRLDRFLYSYNWALKFGDHTSKRLSRIT
SDHFPILLDSPSVSWGPCPFRFDNYLLKNKSFLDKIEHWWISSSQDGHPGYSFIRHLRQLAKSVKAWKKTNMENLKSRKKSIADEIDRIDSLEALGPIDDTLRQQRVALK
TDILEVALQEARYWSQRCKKIWLSKGDENTSFFHKMCTARRRRNHITELMSKDGSSIASDD