; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032657 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032657
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr11:35732445..35737008
RNA-Seq ExpressionLag0032657
SyntenyLag0032657
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG2711776.1 hypothetical protein I3760_04G092800 [Carya illinoinensis]1.9e-11231.25Show/hide
Query:  MNERLDRFVANEVFINLFPNASVLHLKWAQSDHRPIMLDGCYTAENVGRQRRARKFRFEEVWTQNEDCKRVVLEAVSRISNKNGLPSLKECLHQCSSRLQ
        + ERLDRF+AN  +  ++ N  V H   A SDH P+ LD   T   + R+R  R FRFE +W   ++C  ++  A  + +    L  +   + +C+  L 
Subjt:  MNERLDRFVANEVFINLFPNASVLHLKWAQSDHRPIMLDGCYTAENVGRQRRARKFRFEEVWTQNEDCKRVVLEAVSRISNKNGLPSLKECLHQCSSRLQ

Query:  KWGKGTSHSLRQNIMVHQRVLQELYSKPP-PWDFDEIKRLESQLDKALEDEEIYWKQRSRENWLHWG--------------------IEIRDDR------
        +W K +   +++N+   +R LQ L +        +E K+   ++ K LE +E+ WKQRSR  WL  G                    ++++D+       
Subjt:  KWGKGTSHSLRQNIMVHQRVLQELYSKPP-PWDFDEIKRLESQLDKALEDEEIYWKQRSRENWLHWG--------------------IEIRDDR------

Query:  QKMEQTFTSYFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSRS
         +M+   T YF NLF++   +++  D+ L  +  RVT EMN   + P+  EE+  A+KQMHP+KAPGPDG   LF+QK+W  +G+   +  L  LN    
Subjt:  QKMEQTFTSYFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSRS

Query:  VKDWNDTNIALIPKVI-------VNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDMSKAYDRVEWPFLERLLIELGF
         K  N T I LIPK +          +  IL DVIS++QSAFVPGR I DNV++ +E LH ++ ++KGR G+++LKLDMS AYDRVEW FLE+++  LGF
Subjt:  VKDWNDTNIALIPKVI-------VNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDMSKAYDRVEWPFLERLLIELGF

Query:  DARWVHLIMECVGTPNFSILLNGDP--------------------------------TRAMSKKNLTGFKPGKYCPAISHLFFADDSLL---------FK
        D + + L+M CV T +FS+L+NG+P                                 R + ++ + G +  +  P I+HL FADDS+           K
Subjt:  DARWVHLIMECVGTPNFSILLNGDP--------------------------------TRAMSKKNLTGFKPGKYCPAISHLFFADDSLL---------FK

Query:  LQLNMCGIYEIFYRCM-----------PIVDNLGR----------------YLGVPSAFSRKRKDDFREIKQRVWQTLQGWKGQFFSVGGKEILIKSIAQ
        +Q  +        +C+            + D+L R                YLG+P    R +K  F +IK+RVWQ LQ WKG   S GG+E+LIK++A 
Subjt:  LQLNMCGIYEIFYRCM-----------PIVDNLGR----------------YLGVPSAFSRKRKDDFREIKQRVWQTLQGWKGQFFSVGGKEILIKSIAQ

Query:  SIPSYSMSCFRLPKTLCDELHAMMARFWWGSTDSKGRSIGRNGPRCGGSFRIHRC-WH---LGSFRGDMRIRCPFCMRQLRPTVRCFGGVLFGREDVDVI
        SIP+Y+MSCF  PKTLC EL  MMA+FWWG    +               +IH C W    +  F+G M  R           +  F   L  ++   ++
Subjt:  SIPSYSMSCFRLPKTLCDELHAMMARFWWGSTDSKGRSIGRNGPRCGGSFRIHRC-WH---LGSFRGDMRIRCPFCMRQLRPTVRCFGGVLFGREDVDVI

Query:  EAIPISITNDEDKWIW------HYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKIPQKIKIF--IWKAYHGCLPTRSKQICDNMFPISI
                 +ED  ++      ++ S+ ++  K+G   A    V +      D      R W+    Q I+IF   W    G   +   ++ +N+   S+
Subjt:  EAIPISITNDEDKWIW------HYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKIPQKIKIF--IWKAYHGCLPTRSKQICDNMFPISI

Query:  TNDEDKWIWHYCSNGMAYGNVELTSLLFNKLELFNKALLEKQEDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARR--LLVDQQSPNTDDQRL
         +    W W          NV +   LFN               +  I  + IS+ N ED W W +  NG + VKS Y+  ++   L+  QS +   + +
Subjt:  TNDEDKWIWHYCSNGMAYGNVELTSLLFNKLELFNKALLEKQEDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARR--LLVDQQSPNTDDQRL

Query:  WWMRLWKAKIPQKIKIFIWKAYHGCLPT
        +W  LW  K+P+K+KIF W+A    LPT
Subjt:  WWMRLWKAKIPQKIKIFIWKAYHGCLPT

XP_017250619.1 PREDICTED: uncharacterized protein LOC108221234 [Daucus carota subsp. sativus]2.4e-11230.36Show/hide
Query:  MNERLDRFVANEVFINLFPNASVLHLKWAQSDHRPIMLDGCYTAENVGRQRRARKFRFEEVWTQNEDCKRVVLEAVSRISNKNGLPSLKECLHQCSSRLQ
        + ERLD  +ANE +++ +PN S+ HL +  SDHR + L           +R+  +FRFE +W +  +CK +V ++    +N+  L S    + QCS +L 
Subjt:  MNERLDRFVANEVFINLFPNASVLHLKWAQSDHRPIMLDGCYTAENVGRQRRARKFRFEEVWTQNEDCKRVVLEAVSRISNKNGLPSLKECLHQCSSRLQ

Query:  KWGKGTSHSLRQNIM-VHQRVLQELYSKPPPWDFDEIKRLESQLDKALEDEEIYWKQRSRENWLHWG---------------------------IEIRDD
         W +    SL + I   H ++ Q    +       E++  E +L++ L  EEI+WKQRSR  WL  G                            E   D
Subjt:  KWGKGTSHSLRQNIM-VHQRVLQELYSKPPPWDFDEIKRLESQLDKALEDEEIYWKQRSRENWLHWG---------------------------IEIRDD

Query:  RQKMEQTFTSYFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSR
          +       ++ NLF+++ P  + +D  L  +   VT +MN      FT  E+ RAI  M P K+PGPDG  A+F+Q+ W+ VG +     LD LN   
Subjt:  RQKMEQTFTSYFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSR

Query:  SVKDWNDTNIALIP-------------------------KVIVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDMSK
         ++  N T + LIP                         KV++NR+K IL  +I+  QSAFVPGR I DN ++ +ECLH ++  + G+  ++A+KLDMSK
Subjt:  SVKDWNDTNIALIP-------------------------KVIVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDMSK

Query:  AYDRVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLN---------------GDP-----------------TRAMSKKNLTGFKPGKYCPAISHL
        AYDRVEW F+E++L +LGF  +WV  IM+CV + N+S  +N               GDP                  +A ++ ++ G K  +  P+ISHL
Subjt:  AYDRVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLN---------------GDP-----------------TRAMSKKNLTGFKPGKYCPAISHL

Query:  FFADDSLLF-----KLQLNMCGIYEIFYRC-------------------------------MPIVDNLGRYLGVPSAFSRKRKDDFREIKQRVWQTLQGW
        FFADDSLLF     +   ++  I+ ++  C                               M   + +  YLG+P    + +K  FR IK++VW  L  W
Subjt:  FFADDSLLF-----KLQLNMCGIYEIFYRC-------------------------------MPIVDNLGRYLGVPSAFSRKRKDDFREIKQRVWQTLQGW

Query:  KGQFFSVGGKEILIKSIAQSIPSYSMSCFRLPKTLCDELHAMMARFWWGSTDSKGRSIGR-----NGPRCGG--SFR--IHRCWHLGSFRGDMRIRCPFC
        +   FS GGKEIL+K++ Q++P+Y MSCF++P+  C E+  ++AR+WWGS  SK +   R     + P+  G   FR  IH    L + +    +  P  
Subjt:  KGQFFSVGGKEILIKSIAQSIPSYSMSCFRLPKTLCDELHAMMARFWWGSTDSKGRSIGR-----NGPRCGG--SFR--IHRCWHLGSFRGDMRIRCPFC

Query:  MRQLRPTVRCFGGVLFGREDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKIPQKIKIFIWKAYHGCL
        +       + F    F    +D  E    S+T       W     G  ++  G  L RR+   Q + +  D        W A+ P               
Subjt:  MRQLRPTVRCFGGVLFGREDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKIPQKIKIFIWKAYHGCL

Query:  PTRSKQICDNMFPISITNDEDKWIWHYCSNGMAYGNVELTSLLFNKLELFNKALLEKQEDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRL
                 +  PI+  ++E+  +  Y + G            +N+ EL  +  L    D+ +I  IP+S  +  D W WHY S G Y VKSGYKL   L
Subjt:  PTRSKQICDNMFPISITNDEDKWIWHYCSNGMAYGNVELTSLLFNKLELFNKALLEKQEDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRL

Query:  LVDQQSPNTDDQRLWWMRLWKAKIPQKIKIFIWKAYHGCLPT
          D  S +      WW   W  KIP+KI IF W+ YH  LPT
Subjt:  LVDQQSPNTDDQRLWWMRLWKAKIPQKIKIFIWKAYHGCLPT

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]2.4e-11531.4Show/hide
Query:  RLDRFVANEVFINLFPNASVLHLKWAQSDHRPIMLDGCYTAENVGRQRRARKFRFEEVWTQNEDCKRVVLEAVSR-ISNKNGLPSLKECLHQCSSRLQKW
        RLDR VAN+ + + F  + V+HL    SDH P++L     + +  RQ   R F+FEE W   ++C  V+ EA      N++GL +++E +  C   L  W
Subjt:  RLDRFVANEVFINLFPNASVLHLKWAQSDHRPIMLDGCYTAENVGRQRRARKFRFEEVWTQNEDCKRVVLEAVSR-ISNKNGLPSLKECLHQCSSRLQKW

Query:  GKGTSHSLRQNIMVHQRVLQEL-YSKPPPWDFDEIKRLESQLDKALEDEEIYWKQRSRENWLHWGIE--------------------IRDDR-------Q
        G   +      I   Q+ L  L  ++       E   L  ++D  L+ +EIYW QRSR NWL  G                      IR+ +       +
Subjt:  GKGTSHSLRQNIMVHQRVLQEL-YSKPPPWDFDEIKRLESQLDKALEDEEIYWKQRSRENWLHWGIE--------------------IRDDR-------Q

Query:  KMEQTFTSYFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSRSV
        ++ Q    YF NLF +     + ++  L  + T+VT++M       FT EE+  A+ QM PTKAPGPDG  ALFYQKFW  VGD  +S  LD LN    +
Subjt:  KMEQTFTSYFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSRSV

Query:  KDWNDTNIALIP-------------------------KVIVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDMSKAY
         + N TNI LIP                         KV+ NR+K +L  +IS  QSAFVPGR I DNV++ +E LH + ARKKG+ G +ALKLD+SKAY
Subjt:  KDWNDTNIALIP-------------------------KVIVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDMSKAY

Query:  DRVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLNGDP----------------------------TRAMSKKNLTGFKPG----KYCPAISHLFF
        DRVEW FL+ ++ ++GF A W+  +M CV TP+FSIL+NG P                            T  ++K  L G   G    +  P I++L F
Subjt:  DRVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLNGDP----------------------------TRAMSKKNLTGFKPG----KYCPAISHLFF

Query:  ADDSLLFKLQLNMCG-----IYEIFYRC-------------------------------MPIVDNLGRYLGVPSAFSRKRKDDFREIKQRVWQTLQGWKG
        ADDSLLF       G     I +I+ R                                +  VD   +YLG+P+   R + + F E+K RVW+ LQGWKG
Subjt:  ADDSLLFKLQLNMCG-----IYEIFYRC-------------------------------MPIVDNLGRYLGVPSAFSRKRKDDFREIKQRVWQTLQGWKG

Query:  QFFSVGGKEILIKSIAQSIPSYSMSCFRLPKTLCDELHAMMARFWWGSTDSKGRSIGRNGPRCGGSFRIH-RCWH---LGSFRGDMRIRCPFCMRQLRPT
           S  GKEILIK++AQ+IP+Y+MS F++P  LC EL A+ ARFWWG              + G   +IH + W         G M  R           
Subjt:  QFFSVGGKEILIKSIAQSIPSYSMSCFRLPKTLCDELHAMMARFWWGSTDSKGRSIGRNGPRCGGSFRIH-RCWH---LGSFRGDMRIRCPFCMRQLRPT

Query:  VRCFGGVLFGREDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKIPQKIKIFIWKAYHGCLPTRSKQI
        +R F   +  ++   +++             + + C    Y  +S +      L  ++SP+                      F+W++     P      
Subjt:  VRCFGGVLFGREDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKIPQKIKIFIWKAYHGCLPTRSKQI

Query:  CDNM-FPISITNDEDKWIWHYCSNGMAY-----GNVELTSLLFN-KLELFN----KALLEKQEDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKL
        C  +    SI   +D+W+ ++ +N +       G+  L + L N +  ++N    +A+  + E  + I  IP+S  +  D   W Y   G++ VKS Y +
Subjt:  CDNM-FPISITNDEDKWIWHYCSNGMAY-----GNVELTSLLFN-KLELFN----KALLEKQEDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKL

Query:  ARRLLVDQQSPNTD---DQRLWWMRLWKAKIPQKIKIFIWKAYHGCLPT
        ARR+L D     T      +  W  +WK ++P K+K+F W+A H  LPT
Subjt:  ARRLLVDQQSPNTD---DQRLWWMRLWKAKIPQKIKIFIWKAYHGCLPT

XP_042962672.1 uncharacterized protein LOC122296942 [Carya illinoinensis]1.1e-11231.13Show/hide
Query:  MNERLDRFVANEVFINLFPNASVLHLKWAQSDHRPIMLDGCYTAENVGRQRRARKFRFEEVWTQNEDCKRVVLEAVSRISNKNGLPSLKECLHQCSSRLQ
        + ERLDRF+AN ++ ++FPN  V H   A SDH P+ LD   T   + R+R  R FRFE +W    +C  ++     R      L  +   +  C++ L 
Subjt:  MNERLDRFVANEVFINLFPNASVLHLKWAQSDHRPIMLDGCYTAENVGRQRRARKFRFEEVWTQNEDCKRVVLEAVSRISNKNGLPSLKECLHQCSSRLQ

Query:  KWGKGTSHSLRQNIMVHQRVLQELYSKPPPWD-FDEIKRLESQLDKALEDEEIYWKQRSRENWLHWG--------------------IEIRDDR------
        +W K +   +++N+   +R LQ L          +E K+   ++ K LE +E+ WKQRSR  WL  G                    ++++D+       
Subjt:  KWGKGTSHSLRQNIMVHQRVLQELYSKPPPWD-FDEIKRLESQLDKALEDEEIYWKQRSRENWLHWG--------------------IEIRDDR------

Query:  QKMEQTFTSYFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSRS
         +M+   T YF  LF  T      ++  L  +  RVT EMN   + P+  EE+  A+KQMHP+KAPGPDG P LF+QK+W  +G+   +  L  LN    
Subjt:  QKMEQTFTSYFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSRS

Query:  VKDWNDTNIALIP-------------------------KVIVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDMSKA
            N T I LIP                         KVI NR+K +L D+IS +QSAFVPGR I DNV++ +E LH ++ ++KGR G+++LKLDMSKA
Subjt:  VKDWNDTNIALIP-------------------------KVIVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDMSKA

Query:  YDRVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLNGDP--------------------------------TRAMSKKNLTGFKPGKYCPAISHLF
        YDRV+W FLE+++  LGFD + + LIM+CV T +FS+L+NG P                                 R  S++ + G +  +  P I+HL 
Subjt:  YDRVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLNGDP--------------------------------TRAMSKKNLTGFKPGKYCPAISHLF

Query:  FADDSLLF---------KLQLNMCGIYEIFYRCM-----------PIVDNLGR----------------YLGVPSAFSRKRKDDFREIKQRVWQTLQGWK
        FADDS++F         K+Q  +        +C+            + D+L R                YLG P    R +K  F +IK+RVWQ LQ WK
Subjt:  FADDSLLF---------KLQLNMCGIYEIFYRCM-----------PIVDNLGR----------------YLGVPSAFSRKRKDDFREIKQRVWQTLQGWK

Query:  GQFFSVGGKEILIKSIAQSIPSYSMSCFRLPKTLCDELHAMMARFWWGSTDSKGRSIGRNGPRCGGSFRIHRC-WH---LGSFRGDMRIRCPFCMRQLRP
        G   S GG+E+LIK++A SIP+Y+MSCF  PKTLC EL  MMARFWWG    +               +IH C W    +  FRG M  R          
Subjt:  GQFFSVGGKEILIKSIAQSIPSYSMSCFRLPKTLCDELHAMMARFWWGSTDSKGRSIGRNGPRCGGSFRIHRC-WH---LGSFRGDMRIRCPFCMRQLRP

Query:  TVRCFGGVLFGREDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKIPQKIKIFIWKAYHGCLPTRSKQ
         +  F   L  ++   ++  +                 +  Y V          L + +  +          +WK          IW+A   CL    + 
Subjt:  TVRCFGGVLFGREDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKIPQKIKIFIWKAYHGCLPTRSKQ

Query:  ICDNMFPISITNDEDKWIWHYCSNGMAYGNVELTSLLFNKLELFNKALLEKQEDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLAR--RLLVDQ
           N   + I   +D W+             E  S+          AL+E  E++ V   I  S T+ ED   W +  NG + VKS Y+  +  + L + 
Subjt:  ICDNMFPISITNDEDKWIWHYCSNGMAYGNVELTSLLFNKLELFNKALLEKQEDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLAR--RLLVDQ

Query:  QSPNTDDQRLWWMRLWKAKIPQKIKIFIWKAYHGCLPT
        QS     + ++W  LW  K+P+K+K+F W+A    LPT
Subjt:  QSPNTDDQRLWWMRLWKAKIPQKIKIFIWKAYHGCLPT

XP_042974784.1 uncharacterized protein LOC122306423 [Carya illinoinensis]8.1e-11629.02Show/hide
Query:  MNERLDRFVANEVFINLFPNASVLHLKWAQSDHRPIMLDGCYTAENVGRQRRARKFRFEEVWTQNEDCKRVVLEAVSRISNKNGLPSLKECLHQCSSRLQ
        + ERLDRF+AN  +  ++ N  V H   A SDH P+ LD   T   + R+R  R FRFE +W   ++C  ++  A  + +    L  +   + +C+  L 
Subjt:  MNERLDRFVANEVFINLFPNASVLHLKWAQSDHRPIMLDGCYTAENVGRQRRARKFRFEEVWTQNEDCKRVVLEAVSRISNKNGLPSLKECLHQCSSRLQ

Query:  KWGKGTSHSLRQNIMVHQRVLQELYSKPP-PWDFDEIKRLESQLDKALEDEEIYWKQRSRENWLHWG--------------------IEIRDDR------
        +W K +   +++N+   +R LQ L +        +E K+   ++ K LE +E+ WKQRSR  WL  G                    ++++D+       
Subjt:  KWGKGTSHSLRQNIMVHQRVLQELYSKPP-PWDFDEIKRLESQLDKALEDEEIYWKQRSRENWLHWG--------------------IEIRDDR------

Query:  QKMEQTFTSYFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSRS
         +M+   T YF NLF  T      +   L  +  RVT EMN   + P+  EE+  A+KQMHP+KAPGP+G   LF+QK+W  +G+   +  L  LN    
Subjt:  QKMEQTFTSYFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSRS

Query:  VKDWNDTNIALIP-------------------------KVIVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDMSKA
         K  N T I LIP                         KVI NR+K IL DVIS++QSAFVPGR I DNV++ +E LH ++ ++KGR G+++LKLDMSKA
Subjt:  VKDWNDTNIALIP-------------------------KVIVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDMSKA

Query:  YDRVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLNGDP--------------------------------TRAMSKKNLTGFKPGKYCPAISHLF
        YDRVEW FLE+++  LGFD + + L+M CV T +FS+L+NG+P                                 R + ++ + G +  +  P I+HL 
Subjt:  YDRVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLNGDP--------------------------------TRAMSKKNLTGFKPGKYCPAISHLF

Query:  FADDSLLF---------KLQLNMCGIYEIFYRCM-----------PIVDNLGR----------------YLGVPSAFSRKRKDDFREIKQRVWQTLQGWK
        FADDS+ F         K+Q  +        +C+            + D+L R                YLG+P    R +K  F +IK+RVWQ LQ WK
Subjt:  FADDSLLF---------KLQLNMCGIYEIFYRCM-----------PIVDNLGR----------------YLGVPSAFSRKRKDDFREIKQRVWQTLQGWK

Query:  GQFFSVGGKEILIKSIAQSIPSYSMSCFRLPKTLCDELHAMMARFWWGSTDSKGRSIGRNGPRCGGSFRIHRC-WH---LGSFRGDMRIRCPFCMRQLRP
        G   S GG+E+LIK++A SIP+Y+MSCF  PKTLC EL  MMA+FWWG    +               +IH C W    +  F+G M  R          
Subjt:  GQFFSVGGKEILIKSIAQSIPSYSMSCFRLPKTLCDELHAMMARFWWGSTDSKGRSIGRNGPRCGGSFRIHRC-WH---LGSFRGDMRIRCPFCMRQLRP

Query:  TVRCFGGVLFGREDVDVIEAIPISITNDEDKWIW------HYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKIPQKIKIF--IWKAYHG
         +  F   L  ++   ++         +ED  ++      ++ S+ ++  K+G   A    V +      D      R W+    Q I+IF   W    G
Subjt:  TVRCFGGVLFGREDVDVIEAIPISITNDEDKWIW------HYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKIPQKIKIF--IWKAYHG

Query:  CLPTRSKQICDNMFPISITNDEDKWIWHYCSNGMAYGNVELTSLLFNKLELFNKALLEKQEDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLAR
           +   ++ +N+   S+ +    W W          NV +   LFN               +  I  + IS+ N ED   W +  NG + VKS Y+  +
Subjt:  CLPTRSKQICDNMFPISITNDEDKWIWHYCSNGMAYGNVELTSLLFNKLELFNKALLEKQEDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLAR

Query:  R--LLVDQQSPNTDDQRLWWMRLWKAKIPQKIKIFIWKAYHGCLPT-----------------------------RSKQICDNMFPRLDVLVPVGNNFVD
        +   L+  QS +   + ++W  LW  K+P+K+KIF W+A    LPT                             R+KQ+ +N+    +V V   NN   
Subjt:  R--LLVDQQSPNTDDQRLWWMRLWKAKIPQKIKIFIWKAYHGCLPT-----------------------------RSKQICDNMFPRLDVLVPVGNNFVD

Query:  RVICLATGQRVETSQSPAGDRPSFGN-------QSTMVSLFTDAAVRLSSKGAGMRAVVVDNIGNLVAAMECLEEASLSVLAAEIRAIIEGLRLLQRLEI
            LA  +  E         P            S  + L  D A       AG+  V+ ++ G ++ A   +E+   S    E  A++ GL+L  +  I
Subjt:  RVICLATGQRVETSQSPAGDRPSFGN-------QSTMVSLFTDAAVRLSSKGAGMRAVVVDNIGNLVAAMECLEEASLSVLAAEIRAIIEGLRLLQRLEI

Query:  THAMVHFNSSNAIKMISGDIPISSEVYHWVQHIRVIGTSFQELSFIYVSKL
           M+  +    +  ++G+    ++    +Q IR +  +FQE+  ++V+ L
Subjt:  THAMVHFNSSNAIKMISGDIPISSEVYHWVQHIRVIGTSFQELSFIYVSKL

TrEMBL top hitse value%identityAlignment
A0A2N9EWI8 Uncharacterized protein9.7e-11530.25Show/hide
Query:  MNERLDRFVANEVFINLFPNASVLHLKWAQSDHRPIMLDGCYTAENVGRQRRARKFRFEEVWTQNEDCKRVVLEAVSRISNKNGLPSLKECLHQCSSRLQ
        ++ERLDR VA E +++LFP + + H+ +A SDH  ++L+     +   R  R R+F FE  W + E C+  + +A     +   +  L + + QC   L 
Subjt:  MNERLDRFVANEVFINLFPNASVLHLKWAQSDHRPIMLDGCYTAENVGRQRRARKFRFEEVWTQNEDCKRVVLEAVSRISNKNGLPSLKECLHQCSSRLQ

Query:  KWGKGTSHSLRQNIMVHQRVLQELYSKPPPW-DFDEIKRLESQLDKALEDEEIYWKQRSRENWLHWG--------------------IEIRDDRQKMEQT
         W K     + + I+  ++ L E+Y       ++ E + L  +L   L+ EEIYW+QRSR  WL  G                    + IRD +   ++ 
Subjt:  KWGKGTSHSLRQNIMVHQRVLQELYSKPPPW-DFDEIKRLESQLDKALEDEEIYWKQRSRENWLHWG--------------------IEIRDDRQKMEQT

Query:  FTS-------YFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSR
         T        YF  ++++T P   ++D  ++++   V+ +MN + + PFT+EE+  A+ QM P+KAPGPDG  ALF+QKFW  VG    +  LD LN+  
Subjt:  FTS-------YFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSR

Query:  SVKDWNDTNIALIP-------------------------KVIVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDMSK
         +K  N T+IALIP                         KV+VNRMK IL  V+S++QSAFVPGR I DN+++  E +H +K ++ G+   +A KLDMSK
Subjt:  SVKDWNDTNIALIP-------------------------KVIVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDMSK

Query:  AYDRVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLNGDP--------------------------------TRAMSKKNLTGFKPGKYCPAISHL
        AY+RVEW +L++++++LGF  +WV LIMECV + ++SIL+NGDP                                 +A  ++ + G    +  P +SHL
Subjt:  AYDRVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLNGDP--------------------------------TRAMSKKNLTGFKPGKYCPAISHL

Query:  FFADDSLLF---------KLQLNMCGIYE-------------IFY--RCMPIV-------------DNLGRYLGVPSAFSRKRKDDFREIKQRVWQTLQG
        FFADDSL+F          LQ  +  +YE             +F+     P +                 +YLG+P    R +K  F EIK R+W+ LQG
Subjt:  FFADDSLLF---------KLQLNMCGIYE-------------IFY--RCMPIV-------------DNLGRYLGVPSAFSRKRKDDFREIKQRVWQTLQG

Query:  WKGQFFSVGGKEILIKSIAQSIPSYSMSCFRLPKTLCDELHAMMARFWWGSTDSKGRSIGRNGPRCGGSFRIHRCWHLGSFRGDMRIRCPFCMRQLRPTV
        WK +F S  GKEILIK++ Q+IP+Y+MSCF+LP  LCDE+  M  RFWWG                G   +IH  W         ++      R L    
Subjt:  WKGQFFSVGGKEILIKSIAQSIPSYSMSCFRLPKTLCDELHAMMARFWWGSTDSKGRSIGRNGPRCGGSFRIHRCWHLGSFRGDMRIRCPFCMRQLRPTV

Query:  RCFGGVLFGREDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKIPQKIKIFIWKAYHGCLPTRSKQIC
        +CF   L  R+   +++        + +  ++  C             A+++L D                W+    + I+  IWK      P+  + I 
Subjt:  RCFGGVLFGREDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKIPQKIKIFIWKAYHGCLPTRSKQIC

Query:  DNMFPISITNDEDKWIWHYCSNGMAYGNVELTSLLFNKLELFNKALLEK---QEDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQ
            P+    +                N  +TSL+      +N  LL +     D+++I  IP+S+    D+ +W   S G++ V+S Y +     V  Q
Subjt:  DNMFPISITNDEDKWIWHYCSNGMAYGNVELTSLLFNKLELFNKALLEK---QEDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQ

Query:  SPNTDDQRL--WWMRLWKAKIPQKIKIFIWKAYHGCLPTRSK
        + ++  + L  +W RLW  +   KIK+FIW+A    LPT++K
Subjt:  SPNTDDQRL--WWMRLWKAKIPQKIKIFIWKAYHGCLPTRSK

A0A803PV25 Uncharacterized protein2.9e-11931.24Show/hide
Query:  ERLDRFVANEVFINLFPNASVLHLKWAQSDHR------PIMLDGCYTAENVGRQRRARKFRFEEVWTQNEDCKRVVLEAVSRISNKNGLPSLKECLHQCS
        ERLDR +  E +++ F  A +  L W +SDHR      P+ LDG    +  G+ +R  +F FEE W Q E+C  +V    S  +    + S +  +++C 
Subjt:  ERLDRFVANEVFINLFPNASVLHLKWAQSDHR------PIMLDGCYTAENVGRQRRARKFRFEEVWTQNEDCKRVVLEAVSRISNKNGLPSLKECLHQCS

Query:  SRLQKWGKGTSHSLRQNIMVHQRVLQELYSKPPPWDFDEIKRLESQLDKALEDEEIYWKQRSRENWLHWG-----------------IEIR---------
          LQ W +     L   +   ++ L EL     P  ++ I+++E++L+  LE +E YW+QRSR  WL WG                  EI+         
Subjt:  SRLQKWGKGTSHSLRQNIMVHQRVLQELYSKPPPWDFDEIKRLESQLDKALEDEEIYWKQRSRENWLHWG-----------------IEIR---------

Query:  -DDRQKMEQTFTSYFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILN
         DD+  + +    Y+  LF+S+      L+  L  +  +V+  MN   +A F +EE++RA+K+M+PTKAPG DG PALFYQKFW ++    ++  L++LN
Subjt:  -DDRQKMEQTFTSYFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILN

Query:  RSRSVKDWNDTNIALIPKV-------------------------IVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLD
            ++  NDT +ALIPKV                         + NRM+  L  V+S++QSAF+ GR I DN I+G+E LH ++  +      +ALKLD
Subjt:  RSRSVKDWNDTNIALIPKV-------------------------IVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLD

Query:  MSKAYDRVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLNGD------PTR--------------------------AMSKKNLTGFKPGKYCPAI
        M+KAYDRVEW FLE ++++LG+   WV  IM C+ +  FS ++NG+      P R                          A  +  L G   G+    +
Subjt:  MSKAYDRVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLNGD------PTR--------------------------AMSKKNLTGFKPGKYCPAI

Query:  SHLFFADDSLLF----------------KLQL-----------NMC---------GIYEIFYRCMPIVDNLGRYLGVPSAFSRKRKDDFREIKQRVWQTL
        SHLFFADDSL+F                K  +            MC           +   +  + +VDN G+YLG+PS   R +K  F E   +VW  L
Subjt:  SHLFFADDSLLF----------------KLQL-----------NMC---------GIYEIFYRCMPIVDNLGRYLGVPSAFSRKRKDDFREIKQRVWQTL

Query:  QGWKGQFFSVGGKEILIKSIAQSIPSYSMSCFRLPKTLCDELHAMMARFWWGSTDS-------KGRSIGRNGPRCGGSFRIHRCWHLGSFRGDMR-----
        +GWKG FFS  GKE+LIK+I Q+IP+Y+MSCFRLPK   + +H+M ARFWWGS++        K   + ++  + G  FR      LG F   +      
Subjt:  QGWKGQFFSVGGKEILIKSIAQSIPSYSMSCFRLPKTLCDELHAMMARFWWGSTDS-------KGRSIGRNGPRCGGSFRIHRCWHLGSFRGDMR-----

Query:  --IRCP--FCMRQLRPTVRCFGGVLFGREDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKIPQKIKI
          IR P   C + L+ +            +V V+EA       +   ++W     G  I+++GY+   R+         DD   W  R      P   KI
Subjt:  --IRCP--FCMRQLRPTVRCFGGVLFGREDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKIPQKIKI

Query:  FIWKAYHGCLPTRSKQICDNMFPISITNDEDKWIWHYCSNGMAYGNVELTSLLFNKLELFNKALLEKQEDVDVIEAIPISITNDEDKWIWHYCSNGMYIV
        +               + DN+  I +     +W            + E    +FN              D ++I  +  S  + EDK +WHY  +G Y V
Subjt:  FIWKAYHGCLPTRSKQICDNMFPISITNDEDKWIWHYCSNGMAYGNVELTSLLFNKLELFNKALLEKQEDVDVIEAIPISITNDEDKWIWHYCSNGMYIV

Query:  KSGYKLARRLLVDQQSPNTDDQRLWWMRLWKAKIPQKIKIFIWKAYHGCLPTRS
        +SGY++A  L V     NT+    WW +LWK KIP K+K F+WK  H  +PT S
Subjt:  KSGYKLARRLLVDQQSPNTDDQRLWWMRLWKAKIPQKIKIFIWKAYHGCLPTRS

A0A803PYI0 Uncharacterized protein8.5e-11934.41Show/hide
Query:  ERLDRFVANEVFINLFPNASVLHLKWAQSDHRPIMLDGCYTAENVGRQRRARK--FRFEEVWTQNEDCKRVVLEAVSRISNKNGLPSLKECLHQCSSRLQ
        ERLDR + NE ++  F  A +  L W  SDH+P+++D     +  G  +  +K  F FEE W + + CK +V +              K    Q    L 
Subjt:  ERLDRFVANEVFINLFPNASVLHLKWAQSDHRPIMLDGCYTAENVGRQRRARK--FRFEEVWTQNEDCKRVVLEAVSRISNKNGLPSLKECLHQCSSRLQ

Query:  KWGKGTSHSLRQNIMVHQRVLQELYSKPPPWDFDEIKRLESQLDKALEDEEIYWKQRSRENWLHWG--------------------IEIRDD-------R
         W +     L Q I   +  L EL S   P  + E+K++E QL+  LE +E YW Q SR  WL WG                      + DD        
Subjt:  KWGKGTSHSLRQNIMVHQRVLQELYSKPPPWDFDEIKRLESQLDKALEDEEIYWKQRSRENWLHWG--------------------IEIRDD-------R

Query:  QKMEQTFTSYFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSRS
          +++   +YF N+F S+    +  +  +  I  +VT +MN   +  FT +EIV+A+K M+PTKAPG +G PALFYQKFW  +    I  CL +LN+  +
Subjt:  QKMEQTFTSYFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSRS

Query:  VKDWNDTNIALIPKV-------------------------IVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDMSKA
        ++  N+T IALIPKV                         +V R+  ++  VIS+ QSAF+  R I DN I+G+E LH ++  +    G +ALKLDM+KA
Subjt:  VKDWNDTNIALIPKV-------------------------IVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDMSKA

Query:  YDRVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLNGD------PTRAM--------------------------SKKNLTGFKPGKYCPAISHLF
        YDRVEW FL  ++  LGF   WV  IM CV + +FS L+NG+      P R +                          S   L G + G+   ++SHLF
Subjt:  YDRVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLNGD------PTRAM--------------------------SKKNLTGFKPGKYCPAISHLF

Query:  FADDSLLF-KLQLNMCGIY--------------------EIFYRC---------------MPIVDNLGRYLGVPSAFSRKRKDDFREIKQRVWQTLQGWK
        FADDSL+F    ++ C  +                    E  + C               +  VDN G+YLG+PS+  R +K+   EIK +VW  ++GWK
Subjt:  FADDSLLF-KLQLNMCGIY--------------------EIFYRC---------------MPIVDNLGRYLGVPSAFSRKRKDDFREIKQRVWQTLQGWK

Query:  GQFFSVGGKEILIKSIAQSIPSYSMSCFRLPKTLCDELHAMMARFWWGSTDS-------KGRSIGRNGPRCGGSFRIHRCWHLGSFRGDMR-------IR
           FSV GKE+LIKSI Q+IP+Y+M+CF+L K     LH M +RFWWGS+D        K R + R   + G  FR      LG F   +        IR
Subjt:  GQFFSVGGKEILIKSIAQSIPSYSMSCFRLPKTLCDELHAMMARFWWGSTDS-------KGRSIGRNGPRCGGSFRIHRCWHLGSFRGDMR-------IR

Query:  CP--FCMRQLRPTVRCFGGVLFGREDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKIPQKIKIFIWK
         P   C R L+ +     GVL      ++I AIP S  + EDK +WHY  NG Y VKS Y++A  L  +Q   +      WW +LW+ KIP K+K F+WK
Subjt:  CP--FCMRQLRPTVRCFGGVLFGREDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKIPQKIKIFIWK

Query:  AYHGCLPT
          H  LPT
Subjt:  AYHGCLPT

M5VU98 Reverse transcriptase domain-containing protein5.3e-12131.7Show/hide
Query:  RLDRFVANEVFINLFPNASVLHLKWAQSDHRPIMLDGCYTAENVGRQRRARKFRFEEVWTQNEDCKRVVLEAVSRISNKNGLPSLKECLHQCSSRLQKWG
        RLDR +A   + NLFP  SV HL  ++SDH PI++          ++ R R+F FE +WT + DC++ + +    + N + +  L + + Q +  LQ+W 
Subjt:  RLDRFVANEVFINLFPNASVLHLKWAQSDHRPIMLDGCYTAENVGRQRRARKFRFEEVWTQNEDCKRVVLEAVSRISNKNGLPSLKECLHQCSSRLQKWG

Query:  KGTSHSLRQNIMVHQRVLQELYSKPPPWDFDEIKR-LESQLDKALEDEEIYWKQRSRENWLHWGIE---------------------------IRDDRQK
        K T   +++   V +  L  L+  P     +E +R ++  LD+ L   E+YW QRSRENWL  G +                            R  RQ 
Subjt:  KGTSHSLRQNIMVHQRVLQELYSKPPPWDFDEIKR-LESQLDKALEDEEIYWKQRSRENWLHWGIE---------------------------IRDDRQK

Query:  MEQTFTSYFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSRSVK
        +      YF +LF S+   +  ++  L  +  +VT +M    +A F+ +EI  A+ QM P+KAPGPDG P LFYQK+W  VGD  ++     L  +  ++
Subjt:  MEQTFTSYFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSRSVK

Query:  DWNDTNIALIPKV-------------------------IVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDMSKAYD
          N T + LIPKV                         + NRMK+++Q VISE+QSAFVPGR I DN I+  E  H +K R++GR G LALKLDMSKAYD
Subjt:  DWNDTNIALIPKV-------------------------IVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDMSKAYD

Query:  RVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLNGDPTR--------------------------------AMSKKNLTGFKPGKYCPAISHLFFA
        RVEW FLE++++ +GF   WV ++M+CV T ++S L+NG+PTR                                A  +  L G    +  P +SHLFFA
Subjt:  RVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLNGDPTR--------------------------------AMSKKNLTGFKPGKYCPAISHLFFA

Query:  DDSLLF-KLQLNMCG----IYEIFYRC-------------------------------MPIVDNLGRYLGVPSAFSRKRKDDFREIKQRVWQTLQGWKGQ
        DDS +F K   N CG    I+E++                                  +P VD+   YLG+P    R +   FR +K+RVW+ LQGW+ Q
Subjt:  DDSLLF-KLQLNMCG----IYEIFYRC-------------------------------MPIVDNLGRYLGVPSAFSRKRKDDFREIKQRVWQTLQGWKGQ

Query:  FFSVGGKEILIKSIAQSIPSYSMSCFRLPKTLCDELHAMMARFWWGSTDSKGRSIGRNGPRCGGSFRIH-RCWH---LGSFRGDMRIRCPFCMRQLRPTV
          S+ GKE+L+K +AQSIP Y MSCF LP+ LC E+  MMARFWWG                G + +IH   W         G M  RC          +
Subjt:  FFSVGGKEILIKSIAQSIPSYSMSCFRLPKTLCDELHAMMARFWWGSTDSKGRSIGRNGPRCGGSFRIH-RCWH---LGSFRGDMRIRCPFCMRQLRPTV

Query:  RCFGGVLFGREDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKI---PQKIKIFIWKAYHGCLPTRSK
        + F   +  ++                          G  +V + + LA RLL  +  P T+         W+A +   P  +   IW A          
Subjt:  RCFGGVLFGREDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKI---PQKIKIFIWKAYHGCLPTRSK

Query:  QICDNMFPISITNDEDKWIWHYCS--------NGMAYGNVELTSLLFNK------LELFNKALLEKQEDVDVIEAIPISITNDEDKWIWHYCSNGMYIVK
        QI D     S+    DKW+    +        +GM   N +++ L+ N+      L+  N   L    DV  I  IP+SI    D+ +W+Y  +G++ VK
Subjt:  QICDNMFPISITNDEDKWIWHYCS--------NGMAYGNVELTSLLFNK------LELFNKALLEKQEDVDVIEAIPISITNDEDKWIWHYCSNGMYIVK

Query:  SGYKLARRLL---VDQQSPNTDDQRLWWMRLWKAKIPQKIKIFIWKAYHGCLPTRSKQI
        S Y++A R+     D+ S +  D  + W  +W A +P K+KIF W+  H  LPT++  I
Subjt:  SGYKLARRLL---VDQQSPNTDDQRLWWMRLWKAKIPQKIKIFIWKAYHGCLPTRSKQI

M5XHI9 Reverse transcriptase domain-containing protein1.6e-12232.26Show/hide
Query:  RLDRFVANEVFINLFPNASVLHLKWAQSDHRPIMLDGCYTAENVGRQRRARKFRFEEVWTQNEDCKRVVLEAVSRISNKNGLPSLKECLHQCSSRLQKWG
        RLDR +A   + NLFP  SV HL  ++SDH PI++          ++ R  +F FE +WT + DC++ + +    + + + +  L + + Q +  LQ+W 
Subjt:  RLDRFVANEVFINLFPNASVLHLKWAQSDHRPIMLDGCYTAENVGRQRRARKFRFEEVWTQNEDCKRVVLEAVSRISNKNGLPSLKECLHQCSSRLQKWG

Query:  KGTSHSLRQNIMVHQRVLQELYSKPPPWDFDEIKR-LESQLDKALEDEEIYWKQRSRENWLHWGIE---------------------------IRDDRQK
        K T   +++   V +  L  L+  P     +E +R ++  LD+ L   E+YW QRSRENWL  G +                            R  RQ 
Subjt:  KGTSHSLRQNIMVHQRVLQELYSKPPPWDFDEIKR-LESQLDKALEDEEIYWKQRSRENWLHWGIE---------------------------IRDDRQK

Query:  MEQTFTSYFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSRSVK
        +      YF +LF S+   +  ++  L  +  +VT +M    +A F+ +EI  A+ QM P+KAPGPDG P LFYQK+W  VGD  ++     L  +  ++
Subjt:  MEQTFTSYFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSRSVK

Query:  DWNDTNIALIPKV-------------------------IVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDMSKAYD
          N T + LIPKV                         + NRMK+++Q VISE+QSAFVPGR I DN I+  E  H +K R++GR G LALKLDMSKAYD
Subjt:  DWNDTNIALIPKV-------------------------IVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDMSKAYD

Query:  RVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLNGDPTR--------------------------------AMSKKNLTGFKPGKYCPAISHLFFA
        RVEW FLE++++ +GF   WV ++M+CV T ++S L+NG+PTR                                A  +  L G    +  P +SHLFFA
Subjt:  RVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLNGDPTR--------------------------------AMSKKNLTGFKPGKYCPAISHLFFA

Query:  DDSLLF-KLQLNMCGIYEIFYRC---------MPIVDNLGRYLGVPSAFSRKRKDDFREIKQRVWQTLQGWKGQFFSVGGKEILIKSIAQSIPSYSMSCF
        DDS +F K   N CG+  I             +P VD+   YLG+P    R +   FR +K+RVW+ LQGW+ Q  S+ GKE+L+K +AQSIP Y MSCF
Subjt:  DDSLLF-KLQLNMCGIYEIFYRC---------MPIVDNLGRYLGVPSAFSRKRKDDFREIKQRVWQTLQGWKGQFFSVGGKEILIKSIAQSIPSYSMSCF

Query:  RLPKTLCDELHAMMARFWWGSTDSKGRSIGRNGPRCGGSFRIH-RCWH---LGSFRGDMRIRCPFCMRQLRPTVRCFGGVLFGREDVDVIEAIPISITND
         LP+ LC E+  MMARFWWG                G + +IH   W         G M  RC          ++ F   +  ++               
Subjt:  RLPKTLCDELHAMMARFWWGSTDSKGRSIGRNGPRCGGSFRIH-RCWH---LGSFRGDMRIRCPFCMRQLRPTVRCFGGVLFGREDVDVIEAIPISITND

Query:  EDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKI---PQKIKIFIWKAYHGCLPTRSKQICDNMFPISITNDEDKWIWHYCS--
                   G  +V + + LA RLL  +  P T+         W+A +   P  +   IW A          QI D     S+    DKW+    +  
Subjt:  EDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKI---PQKIKIFIWKAYHGCLPTRSKQICDNMFPISITNDEDKWIWHYCS--

Query:  ------NGMAYGNVELTSLLFNK------LELFNKALLEKQEDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLL---VDQQSPNTDDQRL
              +GM   N +++ L+ N+      L+  N   L    DV  I  IP+SI    D+ +W+Y  +G++ VKS Y++A R+     D+ S +  D  +
Subjt:  ------NGMAYGNVELTSLLFNK------LELFNKALLEKQEDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLL---VDQQSPNTDDQRL

Query:  WWMRLWKAKIPQKIKIFIWKAYHGCLPTRSKQI
         W  +W A +P K+KIF W+  H  LPT++  I
Subjt:  WWMRLWKAKIPQKIKIFIWKAYHGCLPTRSKQI

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein7.1e-2223.7Show/hide
Query:  KRVVLEAVSRISNKNGLPSLKECLHQCSSRLQKWGKGTSHSLRQNIMVHQRVLQEL--------YSKPPPWDFDEIKRLESQLDKALEDEEIYWKQRSRE
        K + L A  R   ++ + +L   L +   + Q   K    S RQ I   +  L+E+         ++   W F+ I +++  L + ++ +    ++   +
Subjt:  KRVVLEAVSRISNKNGLPSLKECLHQCSSRLQKWGKGTSHSLRQNIMVHQRVLQEL--------YSKPPPWDFDEIKRLESQLDKALEDEEIYWKQRSRE

Query:  NWLHWGIEIRDDRQKMEQTFTSYFSNLFSSTKPQLESLDLALQDIT-TRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDI
           +   +I  D  +++ T   Y+ +L+++    LE +D  L   T  R+ QE       P T  EIV  I  +   K+PGPDGF A FYQ++ +E+   
Subjt:  NWLHWGIEIRDDRQKMEQTFTSYFSNLFSSTKPQLESLDLALQDIT-TRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDI

Query:  TISNCLDILNRSRSVKDWNDTNIALIP--------------------------KVIVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECL-HAIKARK
         +     I         + + +I LIP                          K++ NR++  ++ +I  +Q  F+PG   + N+      + H  +A+ 
Subjt:  TISNCLDILNRSRSVKDWNDTNIALIP--------------------------KVIVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECL-HAIKARK

Query:  KGRCGWLALKLDMSKAYDRVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLNGDPTRAMSKKNLTGFKPGKYCPAISHLF
        K     + + +D  KA+D+++ PF+ + L +LG D  ++ +I      P  +I+LNG    A   K  TG + G  CP    LF
Subjt:  KGRCGWLALKLDMSKAYDRVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLNGDPTRAMSKKNLTGFKPGKYCPAISHLF

P08548 LINE-1 reverse transcriptase homolog1.7e-2023.8Show/hide
Query:  NIMVHQRVLQELYSKPPPWDFDEIKRLESQLDKALEDEEIYWKQRSRENWLHWGIEIRDDRQKMEQTFTSYFSNLFSSTKPQLESLDLALQDI-TTRVTQ
        N + ++R++Q++ +K   W F++I +++  L      + +     S  N      EI  D  ++++    Y+  L+S     L+ +D  L+     R++Q
Subjt:  NIMVHQRVLQELYSKPPPWDFDEIKRLESQLDKALEDEEIYWKQRSRENWLHWGIEIRDDRQKMEQTFTSYFSNLFSSTKPQLESLDLALQDI-TTRVTQ

Query:  EMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSRSVKDWNDTNIALIP-------------------------
        +       P +  EI   I+ +   K+PGPDGF + FYQ F +E+  I ++   +I         + + NI LIP                         
Subjt:  EMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSRSVKDWNDTNIALIP-------------------------

Query:  -KVIVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECL-HAIKARKKGRCGWLALKLDMSKAYDRVEWPFLERLLIELGFDARWVHLIMECVGTPNFS
         K++ NR++  ++ +I  +Q  F+PG   + N+      + H  K + K     + L +D  KA+D ++ PF+ R L ++G +  ++ LI      P  +
Subjt:  -KVIVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECL-HAIKARKKGRCGWLALKLDMSKAYDRVEWPFLERLLIELGFDARWVHLIMECVGTPNFS

Query:  ILLNGDPTRAMSKKNLTGFKPGKYCPAISHLF
        I+LNG   ++   +  +G + G  CP    LF
Subjt:  ILLNGDPTRAMSKKNLTGFKPGKYCPAISHLF

P11369 LINE-1 retrotransposable element ORF2 protein2.8e-1823.1Show/hide
Query:  QRVLQELYSKPPPWDFDEIKRLE---SQLDKALEDEEIYWKQRSRENWLHWGIEIRDDRQKMEQTFTSYFSNLFSSTKPQLESLDLALQDI-TTRVTQEM
        +R +Q + ++   W F++I +++   ++L K   D+ +  K R+ +       +I  D ++++ T  S++  L+S+    L+ +D  L      ++ Q+ 
Subjt:  QRVLQELYSKPPPWDFDEIKRLE---SQLDKALEDEEIYWKQRSRENWLHWGIEIRDDRQKMEQTFTSYFSNLFSSTKPQLESLDLALQDI-TTRVTQEM

Query:  NAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSRSVKDWNDTNIALIP--------------------------K
             +P + +EI   I  +   K+PGPDGF A FYQ F +++  I       I         + +  I LIP                          K
Subjt:  NAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSRSVKDWNDTNIALIP--------------------------K

Query:  VIVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDMSKAYDRVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILL
        ++ NR++  ++ +I  +Q  F+PG   + N+      +H I   K      + + LD  KA+D+++ PF+ ++L   G    ++++I      P  +I +
Subjt:  VIVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDMSKAYDRVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILL

Query:  NGDPTRAMSKKNLTGFKPGKYCPAISHLF
        NG+   A+  K  +G + G  CP   +LF
Subjt:  NGDPTRAMSKKNLTGFKPGKYCPAISHLF

P14381 Transposon TX1 uncharacterized 149 kDa protein8.1e-1827.34Show/hide
Query:  DDRQKMEQTFTSYFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEV--------------
        +D + +     S++ NLFS      ++ +  L D    V++    +   P T +E+ +A++ M   K+PG DG    F+Q FWD +              
Subjt:  DDRQKMEQTFTSYFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEV--------------

Query:  GDITIS---NCLDILNRS---RSVKDWN-----DTNIALIPKVIVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDM
        G++ +S     L +L +    R +K+W       T+  ++ K I  R+K +L +VI  +QS  VPGR+IFDNV L  + LH   AR+ G      L LD 
Subjt:  GDITIS---NCLDILNRS---RSVKDWN-----DTNIALIPKVIVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDM

Query:  SKAYDRVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLNGDPTRAMSKKNLTGFKPGKYCPAISHLF-FADDSLLFKLQLNMCGI
         KA+DRV+  +L   L    F  ++V  +     +    + +N   T  ++     G + G  CP    L+  A +  L  L+  + G+
Subjt:  SKAYDRVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLNGDPTRAMSKKNLTGFKPGKYCPAISHLF-FADDSLLFKLQLNMCGI

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein3.3e-0629.31Show/hide
Query:  KMEQTFTSYFSNLFSSTKPQL--ESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSR
        ++++   +Y+++L  S    L  +S+         R    + ++  A  + +EI  A+  M   KAPGPD F A F+ + W  V D TI+   +      
Subjt:  KMEQTFTSYFSNLFSSTKPQL--ESLDLALQDITTRVTQEMNAKFMAPFTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSR

Query:  SVKDWNDTNIALIPKV
         +K +N T I LIPKV
Subjt:  SVKDWNDTNIALIPKV

AT3G09510.1 Ribonuclease H-like superfamily protein6.9e-0428.83Show/hide
Query:  QEDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPNTDDQRL--------WWMRLWKAKIPQKIKIFIWKAYHGCLPTRSKQICD
        Q D   I  I ++ +   DK IW+Y + G Y V+SGY L          P+T+   +           R+W   I  K+K F+W+A    L T  +    
Subjt:  QEDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPNTDDQRL--------WWMRLWKAKIPQKIKIFIWKAYHGCLPTRSKQICD

Query:  NMFPRLDVLVP
         M  R+D   P
Subjt:  NMFPRLDVLVP

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.0e-1543.37Show/hide
Query:  IVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDMSKAYDRVEWPFLERLLIELGFDARWV
        +V R+K ++ ++I   Q++F+PGR   DN++   E +H+++ RKKG  GW+ LKLD+ KAYDR+ W +LE  LI  GF   W+
Subjt:  IVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKARKKGRCGWLALKLDMSKAYDRVEWPFLERLLIELGFDARWV

AT4G29090.1 Ribonuclease H-like superfamily protein2.9e-1020.37Show/hide
Query:  SIPSYSMSCFRLPKTLCDELHAMMARFWW-GSTDSKGR------SIGRNGPRCGGSFRIHRCWHLGSFRGDMRIRCPFCMRQLRPTVRCFGGVLFGREDV
        ++P+Y+M+CF LPKT+C ++ +++A FWW    ++KG        +       G  F+    ++L      M        R      + F    F + D 
Subjt:  SIPSYSMSCFRLPKTLCDELHAMMARFWW-GSTDSKGR------SIGRNGPRCGGSFRIHRCWHLGSFRGDMRIRCPFCMRQLRPTVRCFGGVLFGREDV

Query:  DVIEAIPISI-TNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKIPQKIKIFIWKAYHGCLPTRSKQICDNMFPISITNDE
              P++        ++W        I++ G   AR ++         +  + W   W    P    + + +     +P +      ++  +S   DE
Subjt:  DVIEAIPISI-TNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKIPQKIKIFIWKAYHGCLPTRSKQICDNMFPISITNDE

Query:  DKWIWHYCSNGMAYGNVELTSLLFNKLELFNKALLEKQEDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPNTDDQ---RLWWM
            W      M +  VE    L  +L    + +L                    D + W Y S+G Y VKSGY +  +++  + SP    +      + 
Subjt:  DKWIWHYCSNGMAYGNVELTSLLFNKLELFNKALLEKQEDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPNTDDQ---RLWWM

Query:  RLWKAKIPQKIKIFIWKAYHGCLP
        ++WK++   KI+ F+WK     LP
Subjt:  RLWKAKIPQKIKIFIWKAYHGCLP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGAGAGACTAGACCGTTTCGTTGCGAATGAAGTGTTCATAAATCTTTTTCCGAATGCTTCTGTTTTACATCTGAAATGGGCACAATCCGATCATCGTCCAATCAT
GCTGGATGGGTGTTATACAGCTGAAAATGTAGGGAGGCAAAGGAGAGCGAGGAAATTTCGATTTGAGGAAGTTTGGACCCAAAATGAGGATTGCAAGCGTGTGGTGTTGG
AGGCAGTTAGCAGAATCAGCAACAAGAATGGTCTTCCATCATTGAAAGAGTGTCTACATCAATGTAGCAGCCGACTTCAAAAGTGGGGTAAAGGTACCTCTCACTCCCTT
AGGCAAAATATTATGGTTCATCAACGCGTGCTTCAGGAGTTATATAGCAAGCCTCCACCATGGGATTTTGATGAAATTAAGCGGTTGGAGTCTCAGCTTGATAAAGCTTT
GGAGGATGAAGAGATTTATTGGAAGCAGCGGTCACGAGAGAATTGGCTTCATTGGGGGATAGAAATACGCGATGATAGGCAAAAGATGGAGCAGACTTTCACTTCTTATT
TTTCTAATTTATTTTCCTCCACTAAACCTCAACTGGAAAGTTTAGATTTGGCTTTGCAGGACATCACGACAAGGGTAACACAGGAGATGAATGCCAAATTTATGGCACCT
TTCACAAAAGAGGAGATTGTGAGGGCTATTAAACAAATGCATCCCACCAAAGCACCGGGGCCAGATGGATTTCCTGCTCTCTTCTATCAGAAATTTTGGGATGAGGTTGG
TGATATTACCATTTCCAATTGCTTAGATATCCTGAATCGTAGTAGGTCGGTTAAGGATTGGAATGATACTAATATTGCTTTGATTCCAAAGGTCATAGTGAATCGTATGA
AGTGGATTTTGCAAGATGTTATTTCTGAGAATCAATCTGCGTTTGTTCCTGGGCGGTCAATTTTTGATAATGTGATTTTGGGACATGAATGCTTGCATGCAATCAAGGCT
AGGAAAAAGGGTCGTTGTGGCTGGTTAGCTTTGAAGTTAGATATGAGTAAGGCCTATGACCGAGTTGAATGGCCTTTTTTGGAGAGATTGTTAATAGAACTAGGGTTTGA
TGCTCGATGGGTTCACCTAATAATGGAATGCGTTGGTACTCCAAACTTTTCCATTTTGCTTAATGGTGATCCTACGAGAGCCATGTCGAAAAAGAATTTAACTGGGTTTA
AGCCGGGAAAGTACTGTCCTGCTATTTCTCACCTTTTCTTTGCAGATGACAGCCTTTTATTTAAGCTTCAATTGAACATGTGTGGAATTTACGAAATATTTTATCGCTGT
ATGCCTATTGTGGACAATCTAGGTAGATACTTGGGAGTGCCTTCGGCCTTTAGTAGGAAAAGGAAAGATGACTTTCGAGAGATTAAGCAGCGAGTTTGGCAAACTCTTCA
GGGTTGGAAGGGTCAATTCTTTTCAGTGGGTGGTAAAGAAATTCTAATTAAGAGTATTGCCCAATCTATTCCTTCGTATAGTATGAGTTGTTTCCGCCTCCCAAAAACGC
TATGTGATGAACTACATGCTATGATGGCTCGATTTTGGTGGGGATCGACGGATTCAAAAGGAAGATCCATTGGAAGAAATGGTCCCAGGTGTGGAGGATCTTTTCGAATC
CATCGTTGCTGGCATCTAGGGTCATTCAGGGGAGATATGCGAATCAGATGTCCCTTTTGCATGCGCCAATTAAGGCCAACTGTTCGGTGTTTTGGAGGAGTTTTGTTTGG
GCGCGAAGACGTGGATGTTATAGAGGCAATTCCAATTAGCATAACTAATGACGAAGATAAGTGGATCTGGCATTACTGTTCAAATGGTATGTACATTGTTAAGAGTGGAT
ATAAACTAGCAAGAAGGTTGTTAGTTGATCAACAGTCCCCTAGTACTGATGATCAAAGATTGTGGTGGATGAGGCTATGGAAAGCAAAAATTCCACAGAAAATCAAAATT
TTTATCTGGAAAGCTTATCACGGTTGTCTGCCTACTAGGTCCAAACAGATATGTGATAATATGTTTCCAATTAGCATAACTAATGACGAAGATAAGTGGATCTGGCATTA
CTGTTCAAATGGTATGGCCTATGGAAACGTGGAATTGACATCTCTCCTGTTCAACAAATTAGAGCTATTCAACAAAGCTCTCTTGGAAAAACAGGAAGACGTGGATGTTA
TAGAGGCAATTCCAATTAGCATAACTAATGACGAAGATAAGTGGATCTGGCATTACTGTTCAAATGGTATGTACATTGTTAAGAGTGGATATAAACTAGCAAGAAGGTTG
TTAGTTGATCAACAGTCCCCTAATACTGATGATCAAAGATTGTGGTGGATGAGGCTATGGAAAGCAAAAATTCCACAGAAAATCAAAATTTTTATCTGGAAAGCTTATCA
CGGTTGTCTGCCTACTAGGTCCAAACAGATATGTGATAATATGTTTCCACGTTTGGATGTCCTTGTTCCGGTTGGTAATAATTTTGTCGATCGTGTCATTTGCCTTGCTA
CAGGACAGCGGGTGGAGACAAGTCAATCACCAGCTGGAGATAGGCCGTCGTTCGGGAATCAGAGTACGATGGTTTCTTTATTTACAGATGCAGCGGTTCGGCTTTCTTCA
AAAGGCGCAGGTATGAGGGCTGTCGTTGTTGATAATATTGGGAACTTAGTAGCAGCAATGGAATGTTTGGAGGAAGCATCACTTTCGGTTCTGGCAGCAGAGATTAGGGC
AATAATAGAAGGATTGCGTTTGTTACAACGCTTGGAAATTACTCATGCTATGGTTCACTTCAATTCTTCCAATGCAATAAAGATGATTAGTGGAGATATTCCTATTAGTT
CCGAGGTTTATCATTGGGTCCAGCACATACGGGTAATTGGTACATCGTTTCAAGAGTTATCTTTTATTTATGTATCGAAACTCTATGTTGTGGATACGAAATGTTCCTCA
ACAAGTTGCTTCAATGAGTGGTTCTACTCATTGCACTTGTATTGCTCTTTCTTCTTTCAATTAATGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAACGAGAGACTAGACCGTTTCGTTGCGAATGAAGTGTTCATAAATCTTTTTCCGAATGCTTCTGTTTTACATCTGAAATGGGCACAATCCGATCATCGTCCAATCAT
GCTGGATGGGTGTTATACAGCTGAAAATGTAGGGAGGCAAAGGAGAGCGAGGAAATTTCGATTTGAGGAAGTTTGGACCCAAAATGAGGATTGCAAGCGTGTGGTGTTGG
AGGCAGTTAGCAGAATCAGCAACAAGAATGGTCTTCCATCATTGAAAGAGTGTCTACATCAATGTAGCAGCCGACTTCAAAAGTGGGGTAAAGGTACCTCTCACTCCCTT
AGGCAAAATATTATGGTTCATCAACGCGTGCTTCAGGAGTTATATAGCAAGCCTCCACCATGGGATTTTGATGAAATTAAGCGGTTGGAGTCTCAGCTTGATAAAGCTTT
GGAGGATGAAGAGATTTATTGGAAGCAGCGGTCACGAGAGAATTGGCTTCATTGGGGGATAGAAATACGCGATGATAGGCAAAAGATGGAGCAGACTTTCACTTCTTATT
TTTCTAATTTATTTTCCTCCACTAAACCTCAACTGGAAAGTTTAGATTTGGCTTTGCAGGACATCACGACAAGGGTAACACAGGAGATGAATGCCAAATTTATGGCACCT
TTCACAAAAGAGGAGATTGTGAGGGCTATTAAACAAATGCATCCCACCAAAGCACCGGGGCCAGATGGATTTCCTGCTCTCTTCTATCAGAAATTTTGGGATGAGGTTGG
TGATATTACCATTTCCAATTGCTTAGATATCCTGAATCGTAGTAGGTCGGTTAAGGATTGGAATGATACTAATATTGCTTTGATTCCAAAGGTCATAGTGAATCGTATGA
AGTGGATTTTGCAAGATGTTATTTCTGAGAATCAATCTGCGTTTGTTCCTGGGCGGTCAATTTTTGATAATGTGATTTTGGGACATGAATGCTTGCATGCAATCAAGGCT
AGGAAAAAGGGTCGTTGTGGCTGGTTAGCTTTGAAGTTAGATATGAGTAAGGCCTATGACCGAGTTGAATGGCCTTTTTTGGAGAGATTGTTAATAGAACTAGGGTTTGA
TGCTCGATGGGTTCACCTAATAATGGAATGCGTTGGTACTCCAAACTTTTCCATTTTGCTTAATGGTGATCCTACGAGAGCCATGTCGAAAAAGAATTTAACTGGGTTTA
AGCCGGGAAAGTACTGTCCTGCTATTTCTCACCTTTTCTTTGCAGATGACAGCCTTTTATTTAAGCTTCAATTGAACATGTGTGGAATTTACGAAATATTTTATCGCTGT
ATGCCTATTGTGGACAATCTAGGTAGATACTTGGGAGTGCCTTCGGCCTTTAGTAGGAAAAGGAAAGATGACTTTCGAGAGATTAAGCAGCGAGTTTGGCAAACTCTTCA
GGGTTGGAAGGGTCAATTCTTTTCAGTGGGTGGTAAAGAAATTCTAATTAAGAGTATTGCCCAATCTATTCCTTCGTATAGTATGAGTTGTTTCCGCCTCCCAAAAACGC
TATGTGATGAACTACATGCTATGATGGCTCGATTTTGGTGGGGATCGACGGATTCAAAAGGAAGATCCATTGGAAGAAATGGTCCCAGGTGTGGAGGATCTTTTCGAATC
CATCGTTGCTGGCATCTAGGGTCATTCAGGGGAGATATGCGAATCAGATGTCCCTTTTGCATGCGCCAATTAAGGCCAACTGTTCGGTGTTTTGGAGGAGTTTTGTTTGG
GCGCGAAGACGTGGATGTTATAGAGGCAATTCCAATTAGCATAACTAATGACGAAGATAAGTGGATCTGGCATTACTGTTCAAATGGTATGTACATTGTTAAGAGTGGAT
ATAAACTAGCAAGAAGGTTGTTAGTTGATCAACAGTCCCCTAGTACTGATGATCAAAGATTGTGGTGGATGAGGCTATGGAAAGCAAAAATTCCACAGAAAATCAAAATT
TTTATCTGGAAAGCTTATCACGGTTGTCTGCCTACTAGGTCCAAACAGATATGTGATAATATGTTTCCAATTAGCATAACTAATGACGAAGATAAGTGGATCTGGCATTA
CTGTTCAAATGGTATGGCCTATGGAAACGTGGAATTGACATCTCTCCTGTTCAACAAATTAGAGCTATTCAACAAAGCTCTCTTGGAAAAACAGGAAGACGTGGATGTTA
TAGAGGCAATTCCAATTAGCATAACTAATGACGAAGATAAGTGGATCTGGCATTACTGTTCAAATGGTATGTACATTGTTAAGAGTGGATATAAACTAGCAAGAAGGTTG
TTAGTTGATCAACAGTCCCCTAATACTGATGATCAAAGATTGTGGTGGATGAGGCTATGGAAAGCAAAAATTCCACAGAAAATCAAAATTTTTATCTGGAAAGCTTATCA
CGGTTGTCTGCCTACTAGGTCCAAACAGATATGTGATAATATGTTTCCACGTTTGGATGTCCTTGTTCCGGTTGGTAATAATTTTGTCGATCGTGTCATTTGCCTTGCTA
CAGGACAGCGGGTGGAGACAAGTCAATCACCAGCTGGAGATAGGCCGTCGTTCGGGAATCAGAGTACGATGGTTTCTTTATTTACAGATGCAGCGGTTCGGCTTTCTTCA
AAAGGCGCAGGTATGAGGGCTGTCGTTGTTGATAATATTGGGAACTTAGTAGCAGCAATGGAATGTTTGGAGGAAGCATCACTTTCGGTTCTGGCAGCAGAGATTAGGGC
AATAATAGAAGGATTGCGTTTGTTACAACGCTTGGAAATTACTCATGCTATGGTTCACTTCAATTCTTCCAATGCAATAAAGATGATTAGTGGAGATATTCCTATTAGTT
CCGAGGTTTATCATTGGGTCCAGCACATACGGGTAATTGGTACATCGTTTCAAGAGTTATCTTTTATTTATGTATCGAAACTCTATGTTGTGGATACGAAATGTTCCTCA
ACAAGTTGCTTCAATGAGTGGTTCTACTCATTGCACTTGTATTGCTCTTTCTTCTTTCAATTAATGAAATGA
Protein sequenceShow/hide protein sequence
MNERLDRFVANEVFINLFPNASVLHLKWAQSDHRPIMLDGCYTAENVGRQRRARKFRFEEVWTQNEDCKRVVLEAVSRISNKNGLPSLKECLHQCSSRLQKWGKGTSHSL
RQNIMVHQRVLQELYSKPPPWDFDEIKRLESQLDKALEDEEIYWKQRSRENWLHWGIEIRDDRQKMEQTFTSYFSNLFSSTKPQLESLDLALQDITTRVTQEMNAKFMAP
FTKEEIVRAIKQMHPTKAPGPDGFPALFYQKFWDEVGDITISNCLDILNRSRSVKDWNDTNIALIPKVIVNRMKWILQDVISENQSAFVPGRSIFDNVILGHECLHAIKA
RKKGRCGWLALKLDMSKAYDRVEWPFLERLLIELGFDARWVHLIMECVGTPNFSILLNGDPTRAMSKKNLTGFKPGKYCPAISHLFFADDSLLFKLQLNMCGIYEIFYRC
MPIVDNLGRYLGVPSAFSRKRKDDFREIKQRVWQTLQGWKGQFFSVGGKEILIKSIAQSIPSYSMSCFRLPKTLCDELHAMMARFWWGSTDSKGRSIGRNGPRCGGSFRI
HRCWHLGSFRGDMRIRCPFCMRQLRPTVRCFGGVLFGREDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRLLVDQQSPSTDDQRLWWMRLWKAKIPQKIKI
FIWKAYHGCLPTRSKQICDNMFPISITNDEDKWIWHYCSNGMAYGNVELTSLLFNKLELFNKALLEKQEDVDVIEAIPISITNDEDKWIWHYCSNGMYIVKSGYKLARRL
LVDQQSPNTDDQRLWWMRLWKAKIPQKIKIFIWKAYHGCLPTRSKQICDNMFPRLDVLVPVGNNFVDRVICLATGQRVETSQSPAGDRPSFGNQSTMVSLFTDAAVRLSS
KGAGMRAVVVDNIGNLVAAMECLEEASLSVLAAEIRAIIEGLRLLQRLEITHAMVHFNSSNAIKMISGDIPISSEVYHWVQHIRVIGTSFQELSFIYVSKLYVVDTKCSS
TSCFNEWFYSLHLYCSFFFQLMK