; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0004183 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0004183
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr6:1672442..1674748
RNA-Seq ExpressionLag0004183
SyntenyLag0004183
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica]7.7e-15938.1Show/hide
Query:  MLRMGFASDWVDLIMRCVRSVSYSFNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDSLLFFKVK
        ML++GF++ WV  +M C+ + ++S    G  +GH+ P RGLRQG PLS YLFL+C +G S LL  AER   + G +++RG PS++HL FADDS+LF K  
Subjt:  MLRMGFASDWVDLIMRCVRSVSYSFNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDSLLFFKVK

Query:  GSEALAIRTVLEVYEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEIL
          + +A+ T+ + YE+ +GQ IN+ KS LS SPN     F  +  +L +     H+ YLGLPT   + +      +KD++W+ + GWKEKL S  GKEIL
Subjt:  GSEALAIRTVLEVYEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEIL

Query:  LKAVIQAVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPN
        +KAV+QA+P YSM+CFR+PK L   ++  MARFWW + + +  IHWVKW  +CK K  GGLGF++LE FNQ+LLAKQCW+IL  PESL++RI + RY P+
Subjt:  LKAVIQAVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPN

Query:  CDFLEANIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWIPRDFFLSVQSAVSLPLDSRVSDLLLGSGGWDERKISHHFDFEEARRILSIPL
          FLEA +G   SF+WRSL WGK+LL KGLRWR+G G S+ VY   W+P      + S   LPL +RV DL   SG W+   +   F  +E   IL IPL
Subjt:  CDFLEANIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWIPRDFFLSVQSAVSLPLDSRVSDLLLGSGGWDERKISHHFDFEEARRILSIPL

Query:  SPLGQVDRLIWHYEKEGKYTVKSGYRVGQNAILALRASSSSSAKI---GAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQG
        + L   D LIWHYE+ G Y+VKSGYR+    +   + S   SA++     +WK  W +++P KIK FLWR   D LP    L  R +    +C  C R+ 
Subjt:  SPLGQVDRLIWHYEKEGKYTVKSGYRVGQNAILALRASSSSSAKI---GAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQG

Query:  EDALHVFWFCKFARIQWDRSSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGLATFLWGLWNARNRLQLQG-----LEPIADIGPWAAQYVRIYKEAHCR
        E  LH  W C+ A+  W  S++ ++  ++   S   L   L+   S +E    A   WGLWN RN    +G     ++ ++ +   A ++       H  
Subjt:  EDALHVFWFCKFARIQWDRSSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGLATFLWGLWNARNRLQLQG-----LEPIADIGPWAAQYVRIYKEAHCR

Query:  YGVGAVERRVRTEVRWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCCPLQLETDSWRVFH
        +G          +   QAP  GW       A        G+G++VR++ G  + +  + +         E +A  EG R +++ G     LE D+    +
Subjt:  YGVGAVERRVRTEVRWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCCPLQLETDSWRVFH

Query:  LLQREIDDCSEVGILASDLRRQLSS-PGFLSLFSVREGNQAADLLAKLAIQNRWDQVWIEDFPLDLSNFLDVDVLRL
         +    +     G L  ++   L++    +  ++ R GN+ A  LA+ A        WIE+ P  L   L+ DVL L
Subjt:  LLQREIDDCSEVGILASDLRRQLSS-PGFLSLFSVREGNQAADLLAKLAIQNRWDQVWIEDFPLDLSNFLDVDVLRL

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]1.2e-16438.69Show/hide
Query:  MLRMGFASDWVDLIMRCVRSVSYSFNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDSLLFFKVK
        ML++GF++ WV  +M C+ + ++S    G  +GH+ P RGLRQG PLS YLFL+C +G S LL  AER   + G +++RGGPS++HL FADDS+LF K  
Subjt:  MLRMGFASDWVDLIMRCVRSVSYSFNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDSLLFFKVK

Query:  GSEALAIRTVLEVYEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEIL
             A+ T+ + YE+ SGQ IN+ KS  S SPN     F  ++ +L +     H+ YLGLPT   + +      +KD++W+ + GWKEKL S  GKEIL
Subjt:  GSEALAIRTVLEVYEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEIL

Query:  LKAVIQAVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPN
        +KAV+QA+P YSM+CFR+PK L   ++  MARFWW + + +  IHWVKW  +CK K  GGLGF++LE FNQ+LLAKQCW+IL  PESL++RI + RY P+
Subjt:  LKAVIQAVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPN

Query:  CDFLEANIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWIPRDFFLSVQSAVSLPLDSRVSDLLLGSGGWDERKISHHFDFEEARRILSIPL
          FLEA +G   SF+WRSL WGK+LL KGLRWR+G+G S+ VY   W+P   F  + S   LPL + V DL   SG W+   +   F  +E    L IPL
Subjt:  CDFLEANIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWIPRDFFLSVQSAVSLPLDSRVSDLLLGSGGWDERKISHHFDFEEARRILSIPL

Query:  SPLGQVDRLIWHYEKEGKYTVKSGYRVGQNAILALRASSSSSAKI---GAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQG
        + L   D LIWHYE+ G Y+VKSGYR+    +   + S   S ++     +WK  W +++P KIK FLWR   D LP    L  R +    +C  C R+ 
Subjt:  SPLGQVDRLIWHYEKEGKYTVKSGYRVGQNAILALRASSSSSAKI---GAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQG

Query:  EDALHVFWFCKFARIQWDRSSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGLATFLWGLWNARNRLQLQGLEPIAD-----IGPWAAQYVRIYKEAHCR
        E  LH  W C+ A+  W  S++ ++   +   S   L   L+   S +E    A   WGLWN RN    +G    A      +   A ++      +H  
Subjt:  EDALHVFWFCKFARIQWDRSSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGLATFLWGLWNARNRLQLQGLEPIAD-----IGPWAAQYVRIYKEAHCR

Query:  YGVGAVERRVRTEVRWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCCPLQLETDSWRVFH
        +G  +  +       W+ PP+G YK+NVD A        G+G++VR++ G  + +  + +         E +A  EG R +++ G     LE D+    +
Subjt:  YGVGAVERRVRTEVRWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCCPLQLETDSWRVFH

Query:  LLQREIDDCSEV-GILASDLRRQLSS-PGFLSLFSVREGNQAADLLAKLAIQNRWDQVWIEDFPLDLSNFLDVDVLRL
         +    ++C+ + G+L  ++   L +    +  ++ R GN+ A  LA+ A        WIE+ P  L   L+ DVL L
Subjt:  LLQREIDDCSEV-GILASDLRRQLSS-PGFLSLFSVREGNQAADLLAKLAIQNRWDQVWIEDFPLDLSNFLDVDVLRL

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]4.7e-16444.85Show/hide
Query:  YEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEILLKAVIQAVPCYSM
        Y K +GQA       L   P    +  S ++ IL +        YLGLPTFMPRN+    ++IKDR+W+ LQGWK KLFS+GGKE+L+KAV QA+PCY+M
Subjt:  YEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEILLKAVIQAVPCYSM

Query:  NCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPNCDFLEANIGYRAS
        +CFRLPK+LI       ARFWW   +   +IHWV W  +  PKC GG+GF++LE+FN++LLAKQCW+ILN P S+LSR+LKGRYF +C F+EA I    S
Subjt:  NCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPNCDFLEANIGYRAS

Query:  FVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWIPRDFFLSVQSAVSLPLDSRVSDLL-LGSGGWDERKISHHFDFEEARRILSIPLSPLGQVDRLIWH
        ++WRS++WG+DLL KGLRWRIG+G SV +Y  NW+P    L + S+  LPL SRVS L+    GGW    +   F  +EA+ ILSIP+    + DRLIW+
Subjt:  FVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWIPRDFFLSVQSAVSLPLDSRVSDLL-LGSGGWDERKISHHFDFEEARRILSIPLSPLGQVDRLIWH

Query:  YEKEGKYTVKSGYRVG-QNAILALRASSSSSAKIGAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQGEDALHVFWFCKFAR
        YEK G Y+V+SGY+V   N       SSSSS ++  WW GFW M +P KIKVFLWRL LDRLPT  NL+ RG+++ N C FCGR GED++H+FW CKFA 
Subjt:  YEKEGKYTVKSGYRVG-QNAILALRASSSSSAKIGAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQGEDALHVFWFCKFAR

Query:  IQWDRSSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGLATFLWGLWNARN-RLQLQGLEPIADIG----PWAAQYVRIYKEAHCRYGVGAVERRVRTEV
          W  S F  L       S  ++L++  + LS  +F  L   +WGLWN RN R      + +  IG     WA +Y   ++EA      G V      E+
Subjt:  IQWDRSSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGLATFLWGLWNARN-RLQLQGLEPIADIG----PWAAQYVRIYKEAHCRYGVGAVERRVRTEV

Query:  RWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCCPLQLETDSWRVFHLLQREIDDCSEVGI
         WQ P  G YK+N DA+F  S + AGLG+I+ + +G V+ +A+K+L  + SVD AE +AA EG +L+ E G              H    ++ +  E+ +
Subjt:  RWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCCPLQLETDSWRVFHLLQREIDDCSEVGI

Query:  LASDLRRQLSSPGFLSLFSVREGNQAADLLAKLAIQNRWDQVWIEDFPLDLSNFLDVDVL
         A +   Q     F   F  REGN+AA +LA+ A+      +W+ED+PL+L + L+++ L
Subjt:  LASDLRRQLSSPGFLSLFSVREGNQAADLLAKLAIQNRWDQVWIEDFPLDLSNFLDVDVL

XP_030483669.1 uncharacterized protein LOC115700241 [Cannabis sativa]1.8e-15539.09Show/hide
Query:  MLRMGFASDWVDLIMRCVRSVSYSFNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDSLLFFKVK
        ML+MGF S  +DLI+RC++SV+YSF LNG+ +G + P+RG+RQGDPLS YLFL+CA+GLS LL  +E+  S+ G K+SR  PS+SHLFFADDS+LF +  
Subjt:  MLRMGFASDWVDLIMRCVRSVSYSFNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDSLLFFKVK

Query:  GSEALAIRTVLEVYEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEIL
           A +I+ +L++Y +ASGQ +N +K +LS SPNT  Q     + +L +     H+ YLGLP+F  R+K+   S I D+IW+ L  WKE+LFSVGGKE+L
Subjt:  GSEALAIRTVLEVYEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEIL

Query:  LKAVIQAVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPN
        LKAV+QA+P Y+M+CFRLP  L N I   M+ FWW      N IHW  W  +CK K  GGLGF+N  +FNQ+LLAKQ W++L  P SLL R+L  RYF N
Subjt:  LKAVIQAVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPN

Query:  CDFLEANIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWIPRDFFLSVQSAVSLPLDSRVSDLLLGSGGWDERKISHHFDFEEARRILSIPL
         + L A +G   S  WRS++WGK+LLV+GL+WR+G G ++   + +W+P     +  S        +V+DL+     WD   IS +F   +  RIL+IPL
Subjt:  CDFLEANIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWIPRDFFLSVQSAVSLPLDSRVSDLLLGSGGWDERKISHHFDFEEARRILSIPL

Query:  SPLGQVDRLIWHYEKEGKYTVKSGYRVGQNAILALRASSSSSAKIGAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQGEDA
        S     D LIW+    G YTVKSGY+   +  LA    +++S  + +WW  FW ++LP KI++F+W++F + LP  S L+ + +     C  C    E  
Subjt:  SPLGQVDRLIWHYEKEGKYTVKSGYRVGQNAILALRASSSSSAKIGAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQGEDA

Query:  LHVFWFCKFARIQWDRSSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGLATFLWGLWNARN-RLQLQGLEPIADIGPWAAQYVRIYK--EAHCRYGVG-
         H  +FC  A+  W +S       +   T+++  L  +    S  EF       W +W  RN     +  +  A +  +A  Y+  Y+  +A   +  G 
Subjt:  LHVFWFCKFARIQWDRSSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGLATFLWGLWNARN-RLQLQGLEPIADIGPWAAQYVRIYK--EAHCRYGVG-

Query:  AVERRVRTEVR-----------WQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCCPLQLET
        A     R  V            W APP G  KLN DAA +++    G+G +VRDS G ++ + SK +      +  E +A A   + +   G     +ET
Subjt:  AVERRVRTEVR-----------WQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCCPLQLET

Query:  DSWRVFHLLQREIDDCSEVGILASDLRRQLS-SPGFLSLFSVREGNQAADLLAKLAIQNRWDQVWIEDFP
        DS  V   L+      S   ++ +D+   +S  P        R  N  AD+LAK A+    D  W+E+FP
Subjt:  DSWRVFHLLQREIDDCSEVGILASDLRRQLS-SPGFLSLFSVREGNQAADLLAKLAIQNRWDQVWIEDFP

XP_030498122.1 uncharacterized protein LOC115713779 [Cannabis sativa]1.5e-15439.92Show/hide
Query:  MLRMGFASDWVDLIMRCVRSVSYSFNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDSLLFFKVK
        ML+MGF +  V+LI+RC++SVSYSF LNG+ +G ++PSRG+RQGDPLS YLFL+CA+GLS LL   E   S+ G ++SR  PS+SHLFFADDS+LF +  
Subjt:  MLRMGFASDWVDLIMRCVRSVSYSFNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDSLLFFKVK

Query:  GSEALAIRTVLEVYEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEIL
           A +I+ VL++Y +ASGQA+N +K +LS SPNT     +  +Q+L +     H+ YLGLP+F  R+K+   S I D+IW+ L  WKE+LFS GGKE+L
Subjt:  GSEALAIRTVLEVYEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEIL

Query:  LKAVIQAVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPN
        LKAV+QA+P Y+M+CFRLP  L + I   MA FWW      N IHW  W  +CK K  GGLGF+N   FNQ+LLAKQ W++L  P SLLSRIL  RYF +
Subjt:  LKAVIQAVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPN

Query:  CDFLEANIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWIPRDFFLSVQSAVSLPLDSRVSDLLLGSGGWDERKISHHFDFEEARRILSIPL
           L A +G   S  WRS++WGK+LLVKGLRWR+G G  +      W+P     +  S +      +V+DL+     WD   ++ +F   +  RILSIPL
Subjt:  CDFLEANIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWIPRDFFLSVQSAVSLPLDSRVSDLLLGSGGWDERKISHHFDFEEARRILSIPL

Query:  SPLGQVDRLIWHYEKEGKYTVKSGYRVGQNAILALRASSSSSAKIGAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQGEDA
        S   + D LIW+    G Y+VKSGY    +  LA +  + SS     WW  FW +QLP K+++F+W++F + LP  S LN R +     C  C  Q E  
Subjt:  SPLGQVDRLIWHYEKEGKYTVKSGYRVGQNAILALRASSSSSAKIGAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQGEDA

Query:  LHVFWFCKFARIQWDRSSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGLATFLWGLWNARN-RLQLQGLEPIADIGPWAAQYVRIYKEAHCRYGVGAVE
         H  + C  A+  W  S+      L   +S    L  +    S  EF    T  W +W  RN     +  +    I  +A QY+  Y+  H +  V   +
Subjt:  LHVFWFCKFARIQWDRSSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGLATFLWGLWNARN-RLQLQGLEPIADIGPWAAQYVRIYKEAHCRYGVGAVE

Query:  -----------RRVRTEVRWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCCPLQLETDSW
                       +   W+APP G YKLN DAAFD++++  G+G ++RDS G +  + SK +      +  E  A     +     G     +ETDS 
Subjt:  -----------RRVRTEVRWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCCPLQLETDSW

Query:  RVFHLLQREIDDCSEVGILASDLRRQLSSPGFLSLFSV-REGNQAADLLAKLAI
         V   L+      S    + +D+   +S      +  V R  N  A +LAK A+
Subjt:  RVFHLLQREIDDCSEVGILASDLRRQLSSPGFLSLFSV-REGNQAADLLAKLAI

TrEMBL top hitse value%identityAlignment
A0A251NPF0 Reverse transcriptase domain-containing protein3.7e-15938.1Show/hide
Query:  MLRMGFASDWVDLIMRCVRSVSYSFNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDSLLFFKVK
        ML++GF++ WV  +M C+ + ++S    G  +GH+ P RGLRQG PLS YLFL+C +G S LL  AER   + G +++RG PS++HL FADDS+LF K  
Subjt:  MLRMGFASDWVDLIMRCVRSVSYSFNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDSLLFFKVK

Query:  GSEALAIRTVLEVYEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEIL
          + +A+ T+ + YE+ +GQ IN+ KS LS SPN     F  +  +L +     H+ YLGLPT   + +      +KD++W+ + GWKEKL S  GKEIL
Subjt:  GSEALAIRTVLEVYEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEIL

Query:  LKAVIQAVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPN
        +KAV+QA+P YSM+CFR+PK L   ++  MARFWW + + +  IHWVKW  +CK K  GGLGF++LE FNQ+LLAKQCW+IL  PESL++RI + RY P+
Subjt:  LKAVIQAVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPN

Query:  CDFLEANIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWIPRDFFLSVQSAVSLPLDSRVSDLLLGSGGWDERKISHHFDFEEARRILSIPL
          FLEA +G   SF+WRSL WGK+LL KGLRWR+G G S+ VY   W+P      + S   LPL +RV DL   SG W+   +   F  +E   IL IPL
Subjt:  CDFLEANIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWIPRDFFLSVQSAVSLPLDSRVSDLLLGSGGWDERKISHHFDFEEARRILSIPL

Query:  SPLGQVDRLIWHYEKEGKYTVKSGYRVGQNAILALRASSSSSAKI---GAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQG
        + L   D LIWHYE+ G Y+VKSGYR+    +   + S   SA++     +WK  W +++P KIK FLWR   D LP    L  R +    +C  C R+ 
Subjt:  SPLGQVDRLIWHYEKEGKYTVKSGYRVGQNAILALRASSSSSAKI---GAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQG

Query:  EDALHVFWFCKFARIQWDRSSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGLATFLWGLWNARNRLQLQG-----LEPIADIGPWAAQYVRIYKEAHCR
        E  LH  W C+ A+  W  S++ ++  ++   S   L   L+   S +E    A   WGLWN RN    +G     ++ ++ +   A ++       H  
Subjt:  EDALHVFWFCKFARIQWDRSSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGLATFLWGLWNARNRLQLQG-----LEPIADIGPWAAQYVRIYKEAHCR

Query:  YGVGAVERRVRTEVRWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCCPLQLETDSWRVFH
        +G          +   QAP  GW       A        G+G++VR++ G  + +  + +         E +A  EG R +++ G     LE D+    +
Subjt:  YGVGAVERRVRTEVRWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCCPLQLETDSWRVFH

Query:  LLQREIDDCSEVGILASDLRRQLSS-PGFLSLFSVREGNQAADLLAKLAIQNRWDQVWIEDFPLDLSNFLDVDVLRL
         +    +     G L  ++   L++    +  ++ R GN+ A  LA+ A        WIE+ P  L   L+ DVL L
Subjt:  LLQREIDDCSEVGILASDLRRQLSS-PGFLSLFSVREGNQAADLLAKLAIQNRWDQVWIEDFPLDLSNFLDVDVLRL

A0A5E4FZN9 PREDICTED: retrotransposon5.9e-16538.69Show/hide
Query:  MLRMGFASDWVDLIMRCVRSVSYSFNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDSLLFFKVK
        ML++GF++ WV  +M C+ + ++S    G  +GH+ P RGLRQG PLS YLFL+C +G S LL  AER   + G +++RGGPS++HL FADDS+LF K  
Subjt:  MLRMGFASDWVDLIMRCVRSVSYSFNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDSLLFFKVK

Query:  GSEALAIRTVLEVYEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEIL
             A+ T+ + YE+ SGQ IN+ KS  S SPN     F  ++ +L +     H+ YLGLPT   + +      +KD++W+ + GWKEKL S  GKEIL
Subjt:  GSEALAIRTVLEVYEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEIL

Query:  LKAVIQAVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPN
        +KAV+QA+P YSM+CFR+PK L   ++  MARFWW + + +  IHWVKW  +CK K  GGLGF++LE FNQ+LLAKQCW+IL  PESL++RI + RY P+
Subjt:  LKAVIQAVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPN

Query:  CDFLEANIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWIPRDFFLSVQSAVSLPLDSRVSDLLLGSGGWDERKISHHFDFEEARRILSIPL
          FLEA +G   SF+WRSL WGK+LL KGLRWR+G+G S+ VY   W+P   F  + S   LPL + V DL   SG W+   +   F  +E    L IPL
Subjt:  CDFLEANIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWIPRDFFLSVQSAVSLPLDSRVSDLLLGSGGWDERKISHHFDFEEARRILSIPL

Query:  SPLGQVDRLIWHYEKEGKYTVKSGYRVGQNAILALRASSSSSAKI---GAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQG
        + L   D LIWHYE+ G Y+VKSGYR+    +   + S   S ++     +WK  W +++P KIK FLWR   D LP    L  R +    +C  C R+ 
Subjt:  SPLGQVDRLIWHYEKEGKYTVKSGYRVGQNAILALRASSSSSAKI---GAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQG

Query:  EDALHVFWFCKFARIQWDRSSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGLATFLWGLWNARNRLQLQGLEPIAD-----IGPWAAQYVRIYKEAHCR
        E  LH  W C+ A+  W  S++ ++   +   S   L   L+   S +E    A   WGLWN RN    +G    A      +   A ++      +H  
Subjt:  EDALHVFWFCKFARIQWDRSSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGLATFLWGLWNARNRLQLQGLEPIAD-----IGPWAAQYVRIYKEAHCR

Query:  YGVGAVERRVRTEVRWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCCPLQLETDSWRVFH
        +G  +  +       W+ PP+G YK+NVD A        G+G++VR++ G  + +  + +         E +A  EG R +++ G     LE D+    +
Subjt:  YGVGAVERRVRTEVRWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCCPLQLETDSWRVFH

Query:  LLQREIDDCSEV-GILASDLRRQLSS-PGFLSLFSVREGNQAADLLAKLAIQNRWDQVWIEDFPLDLSNFLDVDVLRL
         +    ++C+ + G+L  ++   L +    +  ++ R GN+ A  LA+ A        WIE+ P  L   L+ DVL L
Subjt:  LLQREIDDCSEV-GILASDLRRQLSS-PGFLSLFSVREGNQAADLLAKLAIQNRWDQVWIEDFPLDLSNFLDVDVLRL

A0A6J1DAR4 uncharacterized protein LOC1110189542.3e-16444.85Show/hide
Query:  YEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEILLKAVIQAVPCYSM
        Y K +GQA       L   P    +  S ++ IL +        YLGLPTFMPRN+    ++IKDR+W+ LQGWK KLFS+GGKE+L+KAV QA+PCY+M
Subjt:  YEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEILLKAVIQAVPCYSM

Query:  NCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPNCDFLEANIGYRAS
        +CFRLPK+LI       ARFWW   +   +IHWV W  +  PKC GG+GF++LE+FN++LLAKQCW+ILN P S+LSR+LKGRYF +C F+EA I    S
Subjt:  NCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPNCDFLEANIGYRAS

Query:  FVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWIPRDFFLSVQSAVSLPLDSRVSDLL-LGSGGWDERKISHHFDFEEARRILSIPLSPLGQVDRLIWH
        ++WRS++WG+DLL KGLRWRIG+G SV +Y  NW+P    L + S+  LPL SRVS L+    GGW    +   F  +EA+ ILSIP+    + DRLIW+
Subjt:  FVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWIPRDFFLSVQSAVSLPLDSRVSDLL-LGSGGWDERKISHHFDFEEARRILSIPLSPLGQVDRLIWH

Query:  YEKEGKYTVKSGYRVG-QNAILALRASSSSSAKIGAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQGEDALHVFWFCKFAR
        YEK G Y+V+SGY+V   N       SSSSS ++  WW GFW M +P KIKVFLWRL LDRLPT  NL+ RG+++ N C FCGR GED++H+FW CKFA 
Subjt:  YEKEGKYTVKSGYRVG-QNAILALRASSSSSAKIGAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQGEDALHVFWFCKFAR

Query:  IQWDRSSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGLATFLWGLWNARN-RLQLQGLEPIADIG----PWAAQYVRIYKEAHCRYGVGAVERRVRTEV
          W  S F  L       S  ++L++  + LS  +F  L   +WGLWN RN R      + +  IG     WA +Y   ++EA      G V      E+
Subjt:  IQWDRSSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGLATFLWGLWNARN-RLQLQGLEPIADIG----PWAAQYVRIYKEAHCRYGVGAVERRVRTEV

Query:  RWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCCPLQLETDSWRVFHLLQREIDDCSEVGI
         WQ P  G YK+N DA+F  S + AGLG+I+ + +G V+ +A+K+L  + SVD AE +AA EG +L+ E G              H    ++ +  E+ +
Subjt:  RWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCCPLQLETDSWRVFHLLQREIDDCSEVGI

Query:  LASDLRRQLSSPGFLSLFSVREGNQAADLLAKLAIQNRWDQVWIEDFPLDLSNFLDVDVL
         A +   Q     F   F  REGN+AA +LA+ A+      +W+ED+PL+L + L+++ L
Subjt:  LASDLRRQLSSPGFLSLFSVREGNQAADLLAKLAIQNRWDQVWIEDFPLDLSNFLDVDVL

A0A803QC75 Uncharacterized protein1.6e-15738.51Show/hide
Query:  MLRMGFASDWVDLIMRCVRSVSYSFNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDSLLFFKVK
        M +MGFA  W+ LIM C+ + ++SF +NGE +G++ PSRGLRQG PLS YLFL+C++G S LL   +   ++ GFK++R  P ++HLFFADDSLLF +  
Subjt:  MLRMGFASDWVDLIMRCVRSVSYSFNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDSLLFFKVK

Query:  GSEALAIRTVLEVYEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEIL
            LAI+ VL+ Y KASGQ +N +KS++S SPNT   A     Q L +     H+ YLGLP++  R+K    S IK+RIW+ +  W EK+FS GGKEIL
Subjt:  GSEALAIRTVLEVYEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEIL

Query:  LKAVIQAVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPN
        LKAV+Q++P Y+M+CFRLP      +   MA FWW   E  ++IHW  W  +CK K  GG+GF++   FNQ+LLAKQ W+I  +P SLL RILK RYFPN
Subjt:  LKAVIQAVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPN

Query:  CDFLEANIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWIP-RDFFLSVQSAVSLPLDSRVSDLLLGSGGWDERKISHHFDFEEARRILSIP
         +FLEA++G+  S  W+ + W ++LL+KGLRW++GDGR +   +  WIP  + FL      S P +  VS+L+     WD   +   F   +  RILS+P
Subjt:  CDFLEANIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWIP-RDFFLSVQSAVSLPLDSRVSDLLLGSGGWDERKISHHFDFEEARRILSIP

Query:  LSPLGQVDRLIWHYEKEGKYTVKSGYRVGQNAILALRASSSSSAKIGAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQGED
        LS     D LIWH+   G YTVKSGY +   A + +   SSSS +   WWK FW++QLP K+K+F WR   D LP  ++L  R +   + CS C +  E 
Subjt:  LSPLGQVDRLIWHYEKEGKYTVKSGYRVGQNAILALRASSSSSAKIGAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQGED

Query:  ALHVFWFCKFARIQWDRSSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGLATFLWGLWNARNRL-QLQGLEPIADIGPWAAQYVRIYKEAHCR------
          H  + CK+A+  W  S+      L         +  L    S  E   +   LW +W  RNR+       P  D+  +A  Y+  Y+ A  R      
Subjt:  ALHVFWFCKFARIQWDRSSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGLATFLWGLWNARNRL-QLQGLEPIADIGPWAAQYVRIYKEAHCR------

Query:  ----------------YGVGAVERR---VRTEVRWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLS
                        Y + +   +   V    +W+ P +  YKLNVDAA D   +  G+G I+RDS GNV+ + SK  P   S    E +A  +     
Subjt:  ----------------YGVGAVERR---VRTEVRWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLS

Query:  LETGCCPLQLETDSWRVFHLLQREIDDCSEVGILASDLRRQLSSPGFLSLFSV-REGNQAADLLAKLAIQNRWDQVWIEDFPLDLSNFLDVD
        ++       +E+D+  V + L+   +  S    L  D+   LS    +++  V RE N AA  LAK A+       W E+ P  + + + V+
Subjt:  LETGCCPLQLETDSWRVFHLLQREIDDCSEVGILASDLRRQLSSPGFLSLFSV-REGNQAADLLAKLAIQNRWDQVWIEDFPLDLSNFLDVD

M5W5F3 Reverse transcriptase domain-containing protein (Fragment)3.7e-15938.1Show/hide
Query:  MLRMGFASDWVDLIMRCVRSVSYSFNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDSLLFFKVK
        ML++GF++ WV  +M C+ + ++S    G  +GH+ P RGLRQG PLS YLFL+C +G S LL  AER   + G +++RG PS++HL FADDS+LF K  
Subjt:  MLRMGFASDWVDLIMRCVRSVSYSFNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDSLLFFKVK

Query:  GSEALAIRTVLEVYEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEIL
          + +A+ T+ + YE+ +GQ IN+ KS LS SPN     F  +  +L +     H+ YLGLPT   + +      +KD++W+ + GWKEKL S  GKEIL
Subjt:  GSEALAIRTVLEVYEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEIL

Query:  LKAVIQAVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPN
        +KAV+QA+P YSM+CFR+PK L   ++  MARFWW + + +  IHWVKW  +CK K  GGLGF++LE FNQ+LLAKQCW+IL  PESL++RI + RY P+
Subjt:  LKAVIQAVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPN

Query:  CDFLEANIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWIPRDFFLSVQSAVSLPLDSRVSDLLLGSGGWDERKISHHFDFEEARRILSIPL
          FLEA +G   SF+WRSL WGK+LL KGLRWR+G G S+ VY   W+P      + S   LPL +RV DL   SG W+   +   F  +E   IL IPL
Subjt:  CDFLEANIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWIPRDFFLSVQSAVSLPLDSRVSDLLLGSGGWDERKISHHFDFEEARRILSIPL

Query:  SPLGQVDRLIWHYEKEGKYTVKSGYRVGQNAILALRASSSSSAKI---GAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQG
        + L   D LIWHYE+ G Y+VKSGYR+    +   + S   SA++     +WK  W +++P KIK FLWR   D LP    L  R +    +C  C R+ 
Subjt:  SPLGQVDRLIWHYEKEGKYTVKSGYRVGQNAILALRASSSSSAKI---GAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQG

Query:  EDALHVFWFCKFARIQWDRSSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGLATFLWGLWNARNRLQLQG-----LEPIADIGPWAAQYVRIYKEAHCR
        E  LH  W C+ A+  W  S++ ++  ++   S   L   L+   S +E    A   WGLWN RN    +G     ++ ++ +   A ++       H  
Subjt:  EDALHVFWFCKFARIQWDRSSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGLATFLWGLWNARNRLQLQG-----LEPIADIGPWAAQYVRIYKEAHCR

Query:  YGVGAVERRVRTEVRWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCCPLQLETDSWRVFH
        +G          +   QAP  GW       A        G+G++VR++ G  + +  + +         E +A  EG R +++ G     LE D+    +
Subjt:  YGVGAVERRVRTEVRWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCCPLQLETDSWRVFH

Query:  LLQREIDDCSEVGILASDLRRQLSS-PGFLSLFSVREGNQAADLLAKLAIQNRWDQVWIEDFPLDLSNFLDVDVLRL
         +    +     G L  ++   L++    +  ++ R GN+ A  LA+ A        WIE+ P  L   L+ DVL L
Subjt:  LLQREIDDCSEVGILASDLRRQLSS-PGFLSLFSVREGNQAADLLAKLAIQNRWDQVWIEDFPLDLSNFLDVDVLRL

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657508.5e-5225.59Show/hide
Query:  LPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEILLKAVIQAVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGG
        +P    R    T   I +R+   + GW+EK  S  G+  L KAV+ ++P +SM+   LP+ ++N + Q    F W     + + H VKW ++C PK  GG
Subjt:  LPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEILLKAVIQAVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGG

Query:  LGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRY----FPNCDFLEANIGYRASFVWRSLMWG-KDLLVKGLRWRIGDGRSVPVYASNWIPRDFFLS
        LG +  +  N++L++K  W++L E  SL + +L+ +Y      +  +L     +  S  WRS+  G +D++  G+ W  GDG+ +  +   W+     L 
Subjt:  LGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRY----FPNCDFLEANIGYRASFVWRSLMWG-KDLLVKGLRWRIGDGRSVPVYASNWIPRDFFLS

Query:  VQSAVSLPLDSR---VSDLLLGSGGWDERKISHHFDFEEARRILSIPLSPL-GQVDRLIWHYEKEGKYTVKSGYRVGQNAILALRASSSSSAKIGAWWKG
        + +    P D       DL +   GWD  KI  +        + ++ L  + G  DRL W + ++G+++V+S Y +       L         + +++  
Subjt:  VQSAVSLPLDSR---VSDLLLGSGGWDERKISHHFDFEEARRILSIPLSPL-GQVDRLIWHYEKEGKYTVKSGYRVGQNAILALRASSSSSAKIGAWWKG

Query:  FWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQGEDALHVFWFCKFARIQWDR-SSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGL
         W +++P ++K FLW +    + T    + R +   N+C  C    E  LHV   C      W R        G F+ +    L  +L DR   ++    
Subjt:  FWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQGEDALHVFWFCKFARIQWDR-SSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGL

Query:  ATFL----WG-LWNARNRL--QLQGLEPIADIGPWAAQYVRIYKEAHCRYGVGAVERRVRTEVRWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGN
          F     WG  W   N      +  + +  +  WA   V +Y+       VG  + RV   + W +P  GW K+N D A   +   A  G ++RD  G 
Subjt:  ATFL----WG-LWNARNRL--QLQGLEPIADIGPWAAQYVRIYKEAHCRYGVGAVERRVRTEVRWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGN

Query:  VLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCCPLQLETDSWRVFHLLQREIDDCSEVGILASDLRRQLSSPGFLSLFSV-REGNQAADLLAKLA
             S  +    S   AE      G   + E     ++LE DS  +   L+  I D   +  L       L     + +  V RE N+ AD LA  A
Subjt:  VLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCCPLQLETDSWRVFHLLQREIDDCSEVGILASDLRRQLSSPGFLSLFSV-REGNQAADLLAKLA

P11369 LINE-1 retrotransposable element ORF2 protein2.3e-1222.42Show/hide
Query:  RMGFASDWVDLIMRCVRSVSYSFNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDSLLFFKVKGS
        R G    ++++I         +  +NGE++  +    G RQG PLS YLF +  + L+  + +    K I G +I +    +S L  ADD +++     +
Subjt:  RMGFASDWVDLIMRCVRSVSYSFNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDSLLFFKVKGS

Query:  EALAIRTVLEVYEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSF--IKDRIWRCLQGWKEKLFSVGGKEIL
            +  ++  + +  G  IN  KSM         QA   +R+       + +  YLG+            +F  +K  I   L+ WK+   S  G+  +
Subjt:  EALAIRTVLEVYEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSF--IKDRIWRCLQGWKEKLFSVGGKEIL

Query:  LKAVIQAVPCYSMNC--FRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCW
        +K  I     Y  N    ++P +  N +  A+ +F WN  + R     +K       +  GG+   +L+++ ++++ K  W
Subjt:  LKAVIQAVPCYSMNC--FRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCW

P92555 Uncharacterized mitochondrial protein AtMg012502.3e-1250.72Show/hide
Query:  FNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDS
        F +NG   G V PSRGLRQGDPLS YLF+LC + LS L   A+    + G ++S   P ++HL FADD+
Subjt:  FNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDS

P93295 Uncharacterized mitochondrial protein AtMg003102.3e-3646.85Show/hide
Query:  AVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPK-CLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPNCDFLE
        A+P Y+M+CFRL K L   ++ AM  FWW+  E + +I WV W ++CK K   GGLGF++L  FNQ+LLAKQ ++I+++P +LLSR+L+ RYFP+   +E
Subjt:  AVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPK-CLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPNCDFLE

Query:  ANIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWI
         ++G R S+ WRS++ G++LL +GL   IGDG    V+   WI
Subjt:  ANIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWI

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein2.6e-3226.39Show/hide
Query:  LKGRYFPNCDFLEANIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWI----PRDFFLSVQSAVSLPLDSRVSDLLLGSGG---WDERKISH
        +K RYF +   L+A +  + S+ W SL+ G  LL KG R  IGDG+++ +   N +    PR   L+ +       +  +++L    G    WD+ KIS 
Subjt:  LKGRYFPNCDFLEANIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWI----PRDFFLSVQSAVSLPLDSRVSDLLLGSGG---WDERKISH

Query:  HFDFEEARRILSIPLSPLGQVDRLIWHYEKEGKYTVKSGY-RVGQNAILALRASSSSSAKIGAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGM
          D  +   I  I L+   + D++IW+Y   G+YTV+SGY  +  +    + A +     I    +  W + +  K+K FLWR     L T   L  RGM
Subjt:  HFDFEEARRILSIPLSPLGQVDRLIWHYEKEGKYTVKSGY-RVGQNAILALRASSSSSAKIGAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGM

Query:  DMLNLCSFCGRQGEDALHVFWFCKFARIQWDRSSFAHLTGLFTPT----STSILLQDLKDRLSWDEFSGLATFL-WGLWNARNRLQLQGL--EPIADIGP
         +   C  C R+ E   H  + C FA + W  S  + +           + S +L  ++D    D    L  +L W +W ARN +        P   +  
Subjt:  DMLNLCSFCGRQGEDALHVFWFCKFARIQWDRSSFAHLTGLFTPT----STSILLQDLKDRLSWDEFSGLATFL-WGLWNARNRLQLQGL--EPIADIGP

Query:  WAAQYVRIYKEAHCRYGVGAVERRV-RTEVRWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLET
          A+               +  R++   ++ W+ PP+ + K N DA FD     A  G I+R+  G  +   S  L    +   AE  A     + +   
Subjt:  WAAQYVRIYKEAHCRYGVGAVERRV-RTEVRWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLET

Query:  GCCPLQLETDSWRVFHLLQREIDDCSEVGILASDLRR-QLSSPGFLSL---FSVREGNQAADLLAK
        G   + +E D   + +L    I+  S    LA+ L      +  F S+   F  R+GN+ A +LAK
Subjt:  GCCPLQLETDSWRVFHLLQREIDDCSEVGILASDLRR-QLSSPGFLSL---FSVREGNQAADLLAK

AT3G25270.1 Ribonuclease H-like superfamily protein4.4e-1925.82Show/hide
Query:  WTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQGEDALHVFWFCKFARIQWDRSSFAH----LTGLFTPTSTSILLQDLKDRLSWDEFS
        W ++   KIK FLW+L    L T  NL  R +     C  C ++ E + H+F+ C +A+  W  S   H     TG+   T   +LL           F+
Subjt:  WTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQGEDALHVFWFCKFARIQWDRSSFAH----LTGLFTPTSTSILLQDLKDRLSWDEFS

Query:  GLATFLWGLWNARNRLQLQG---------LEPIADIGPW--AAQYVR-IYKEAHCRYGVGAVERRVRTEVRWQAPPSGWYKLNVDAAFDQSSRSAGLGMI
             LW LW +RN+L  Q               D+  W     YV+ + ++ H        ++      +WQ PPS W K N D AF+  +R+A  G +
Subjt:  GLATFLWGLWNARNRLQLQG---------LEPIADIGPW--AAQYVR-IYKEAHCRYGVGAVERRVRTEVRWQAPPSGWYKLNVDAAFDQSSRSAGLGMI

Query:  VRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCCPLQLETDSWRVFHLLQREIDDCSEVGILASDLRRQLSSPGFLSLFSVREGNQAADLL
        +RD  G  + S             +E  A     + +   G   +  E DS +V  L+  E  +      +      Q      +  +  R  NQ AD+L
Subjt:  VRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCCPLQLETDSWRVFHLLQREIDDCSEVGILASDLRRQLSSPGFLSLFSVREGNQAADLL

Query:  AKLAIQ
        AK  +Q
Subjt:  AKLAIQ

AT4G29090.1 Ribonuclease H-like superfamily protein7.6e-6431.55Show/hide
Query:  AVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPNCDFLEA
        A+P Y+M CF LPK +   I   +A FWW   +    +HW  W  +   K  GG+GFK++E FN +LL KQ W++L+ PESL++++ K RYF   D L A
Subjt:  AVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPNCDFLEA

Query:  NIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWI---PRDFFLSVQSA-----VSLPLDSRVSDLLLGSGGWDERKISHHFDFEEARRILSI
         +G R SFVW+S+   +++L +G R  +G+G  + ++   W+   P    L +Q        S+    +VSD L+   G + RK      F E  R L  
Subjt:  NIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWI---PRDFFLSVQSA-----VSLPLDSRVSDLLLGSGGWDERKISHHFDFEEARRILSI

Query:  PLSPLGQ--VDRLIWHYEKEGKYTVKSGYRVGQNAILALRASSS--SSAKIGAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCG
         L P G+  +D   W Y   G YTVKSGY V    I+  R+S    S   +   ++  W  Q   KI+ FLW+   + LP    L  R +   + C  C 
Subjt:  PLSPLGQ--VDRLIWHYEKEGKYTVKSGYRVGQNAILALRASSS--SSAKIGAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCG

Query:  RQGEDALHVFWFCKFARIQWDRSSFAHLTGLFTPTSTSILLQDL----KDRLSWDEFSGLATF-LWGLWNARNRLQLQGLEPIA---------DIGPWAA
           E   H+ + C FAR+ W  SS     G     S  + L  +         W++ S L  + LW LW  RN L  +G E  A         D+  W  
Subjt:  RQGEDALHVFWFCKFARIQWDRSSFAHLTGLFTPTSTSILLQDL----KDRLSWDEFSGLATF-LWGLWNARNRLQLQGLEPIA---------DIGPWAA

Query:  QYVRIYKEAHCRYGVGAVERRVRTEV-RWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCC
           RI  EA      G   +  R+   RW+ PP  W K N DA +++ +   G+G ++R+ +G V    ++ LP + SV  AE L A     LSL     
Subjt:  QYVRIYKEAHCRYGVGAVERRVRTEV-RWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDFAECLAAAEGFRLSLETGCC

Query:  PLQL-ETDSWRVFHLLQREIDDCSEVGILASDLRRQLSS-PGFLSLFSVREGNQAADLLAK
           + E+DS  +  +L  + +    +     DL+R LS       +F  REGN  A+ +A+
Subjt:  PLQL-ETDSWRVFHLLQREIDDCSEVGILASDLRRQLSS-PGFLSLFSVREGNQAADLLAK

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.6e-3746.85Show/hide
Query:  AVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPK-CLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPNCDFLE
        A+P Y+M+CFRL K L   ++ AM  FWW+  E + +I WV W ++CK K   GGLGF++L  FNQ+LLAKQ ++I+++P +LLSR+L+ RYFP+   +E
Subjt:  AVPCYSMNCFRLPKKLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPK-CLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPNCDFLE

Query:  ANIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWI
         ++G R S+ WRS++ G++LL +GL   IGDG    V+   WI
Subjt:  ANIGYRASFVWRSLMWGKDLLVKGLRWRIGDGRSVPVYASNWI

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.6e-1350.72Show/hide
Query:  FNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDS
        F +NG   G V PSRGLRQGDPLS YLF+LC + LS L   A+    + G ++S   P ++HL FADD+
Subjt:  FNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTACGTATGGGATTTGCTTCTGATTGGGTTGATTTAATTATGCGTTGTGTCAGATCAGTATCCTATTCTTTTAATCTGAATGGGGAGCGTATTGGTCATGTCCAGCC
CAGCCGAGGCCTAAGGCAGGGAGATCCATTGTCATCATATTTGTTTTTGCTTTGTGCCCAGGGATTGTCAAGTTTGTTAACAGAGGCAGAAAGATGTAAGAGCATTTCAG
GTTTCAAAATTTCGAGGGGAGGTCCTTCTCTCTCCCATTTGTTTTTTGCTGACGATAGTTTGTTATTTTTTAAAGTGAAGGGTTCTGAGGCCCTGGCCATCCGTACTGTT
CTTGAAGTGTATGAAAAGGCCTCTGGGCAAGCGATCAATTTTGAGAAATCGATGTTGTCTTGTAGCCCGAATACCGCACCCCAAGCTTTCAGTACTGTGCGACAGATTTT
GGGTATTCAGTCTGGGTCTGTTCATCAAATTTACCTTGGCCTCCCGACTTTCATGCCTAGAAACAAATCTGGCACGCTTTCTTTCATCAAAGATAGGATTTGGCGTTGTT
TGCAGGGTTGGAAAGAAAAGTTATTTTCAGTGGGGGGAAAGGAGATTTTGTTGAAAGCTGTAATTCAAGCGGTTCCATGTTATTCTATGAATTGTTTTCGGCTACCCAAG
AAGTTGATCAATTCTATTTCTCAAGCTATGGCCCGATTTTGGTGGAATGAGGTAGAGGGAAGGAATCAGATTCATTGGGTTAAATGGGGTGAAATGTGTAAACCAAAATG
TCTTGGGGGATTGGGTTTTAAGAACTTAGAGGTTTTTAACCAATCTCTCCTGGCTAAACAATGTTGGAAAATACTAAATGAGCCTGAATCTCTCCTCTCTCGGATTCTCA
AGGGGAGATATTTTCCAAATTGCGATTTTCTTGAGGCCAATATTGGTTACCGTGCTTCTTTTGTTTGGCGTAGTCTGATGTGGGGGAAAGATCTCCTCGTAAAGGGCCTT
CGATGGCGAATTGGAGACGGTCGCTCAGTTCCAGTTTACGCATCCAATTGGATTCCCCGAGATTTTTTTCTCTCTGTTCAATCAGCGGTGTCCCTCCCTTTGGATTCTCG
AGTCAGCGATTTGCTCTTGGGTTCTGGGGGCTGGGATGAGCGAAAGATTTCTCATCATTTTGATTTCGAGGAGGCGCGCCGAATTCTCTCGATTCCGTTATCTCCGTTGG
GGCAAGTTGATCGGCTGATTTGGCACTATGAAAAGGAGGGGAAATATACGGTAAAAAGTGGATATCGGGTGGGTCAGAATGCGATCCTGGCTTTGAGGGCTTCTTCATCG
AGTTCTGCGAAAATAGGGGCCTGGTGGAAGGGGTTTTGGACGATGCAACTTCCAGGAAAAATCAAGGTCTTTTTATGGCGGCTTTTCTTGGATCGACTCCCAACTTTGTC
GAATCTTAACGCTCGAGGGATGGACATGCTGAATCTGTGCAGTTTCTGTGGCCGTCAGGGGGAAGATGCACTCCATGTCTTTTGGTTTTGCAAATTCGCAAGAATTCAGT
GGGATCGCTCTTCTTTTGCTCATCTCACTGGCCTCTTCACTCCGACATCGACTTCGATCTTATTGCAAGATTTGAAGGATCGACTTTCATGGGATGAGTTCTCGGGTCTT
GCAACTTTTCTTTGGGGCCTCTGGAACGCCCGTAATCGACTCCAACTACAGGGGTTAGAGCCGATTGCAGATATAGGACCTTGGGCAGCTCAGTATGTTCGAATTTATAA
AGAGGCGCATTGCCGATATGGAGTTGGTGCGGTTGAGCGGCGAGTTCGAACAGAGGTGCGATGGCAGGCCCCACCGAGTGGTTGGTATAAATTGAATGTTGATGCAGCCT
TTGATCAGAGCTCTCGGTCCGCAGGTTTGGGTATGATTGTTCGAGATTCTCAAGGGAACGTTTTGCTATCAGCGTCCAAGTTTCTCCCATGTGTTTTGTCAGTGGATTTT
GCAGAATGCCTTGCCGCGGCTGAAGGATTTCGTTTGAGTTTAGAAACGGGGTGTTGCCCACTTCAATTGGAAACTGATTCTTGGCGAGTTTTTCATCTGTTGCAAAGGGA
AATTGATGACTGTTCTGAGGTTGGGATATTGGCTTCGGACTTAAGACGTCAACTATCCTCTCCTGGGTTTTTGTCTCTTTTTTCGGTGCGAGAAGGCAATCAAGCGGCAG
ACCTGTTGGCGAAGCTTGCAATTCAGAATCGCTGGGACCAAGTATGGATTGAAGATTTTCCTTTGGATCTTTCAAATTTTCTTGATGTTGATGTATTGCGTCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTACGTATGGGATTTGCTTCTGATTGGGTTGATTTAATTATGCGTTGTGTCAGATCAGTATCCTATTCTTTTAATCTGAATGGGGAGCGTATTGGTCATGTCCAGCC
CAGCCGAGGCCTAAGGCAGGGAGATCCATTGTCATCATATTTGTTTTTGCTTTGTGCCCAGGGATTGTCAAGTTTGTTAACAGAGGCAGAAAGATGTAAGAGCATTTCAG
GTTTCAAAATTTCGAGGGGAGGTCCTTCTCTCTCCCATTTGTTTTTTGCTGACGATAGTTTGTTATTTTTTAAAGTGAAGGGTTCTGAGGCCCTGGCCATCCGTACTGTT
CTTGAAGTGTATGAAAAGGCCTCTGGGCAAGCGATCAATTTTGAGAAATCGATGTTGTCTTGTAGCCCGAATACCGCACCCCAAGCTTTCAGTACTGTGCGACAGATTTT
GGGTATTCAGTCTGGGTCTGTTCATCAAATTTACCTTGGCCTCCCGACTTTCATGCCTAGAAACAAATCTGGCACGCTTTCTTTCATCAAAGATAGGATTTGGCGTTGTT
TGCAGGGTTGGAAAGAAAAGTTATTTTCAGTGGGGGGAAAGGAGATTTTGTTGAAAGCTGTAATTCAAGCGGTTCCATGTTATTCTATGAATTGTTTTCGGCTACCCAAG
AAGTTGATCAATTCTATTTCTCAAGCTATGGCCCGATTTTGGTGGAATGAGGTAGAGGGAAGGAATCAGATTCATTGGGTTAAATGGGGTGAAATGTGTAAACCAAAATG
TCTTGGGGGATTGGGTTTTAAGAACTTAGAGGTTTTTAACCAATCTCTCCTGGCTAAACAATGTTGGAAAATACTAAATGAGCCTGAATCTCTCCTCTCTCGGATTCTCA
AGGGGAGATATTTTCCAAATTGCGATTTTCTTGAGGCCAATATTGGTTACCGTGCTTCTTTTGTTTGGCGTAGTCTGATGTGGGGGAAAGATCTCCTCGTAAAGGGCCTT
CGATGGCGAATTGGAGACGGTCGCTCAGTTCCAGTTTACGCATCCAATTGGATTCCCCGAGATTTTTTTCTCTCTGTTCAATCAGCGGTGTCCCTCCCTTTGGATTCTCG
AGTCAGCGATTTGCTCTTGGGTTCTGGGGGCTGGGATGAGCGAAAGATTTCTCATCATTTTGATTTCGAGGAGGCGCGCCGAATTCTCTCGATTCCGTTATCTCCGTTGG
GGCAAGTTGATCGGCTGATTTGGCACTATGAAAAGGAGGGGAAATATACGGTAAAAAGTGGATATCGGGTGGGTCAGAATGCGATCCTGGCTTTGAGGGCTTCTTCATCG
AGTTCTGCGAAAATAGGGGCCTGGTGGAAGGGGTTTTGGACGATGCAACTTCCAGGAAAAATCAAGGTCTTTTTATGGCGGCTTTTCTTGGATCGACTCCCAACTTTGTC
GAATCTTAACGCTCGAGGGATGGACATGCTGAATCTGTGCAGTTTCTGTGGCCGTCAGGGGGAAGATGCACTCCATGTCTTTTGGTTTTGCAAATTCGCAAGAATTCAGT
GGGATCGCTCTTCTTTTGCTCATCTCACTGGCCTCTTCACTCCGACATCGACTTCGATCTTATTGCAAGATTTGAAGGATCGACTTTCATGGGATGAGTTCTCGGGTCTT
GCAACTTTTCTTTGGGGCCTCTGGAACGCCCGTAATCGACTCCAACTACAGGGGTTAGAGCCGATTGCAGATATAGGACCTTGGGCAGCTCAGTATGTTCGAATTTATAA
AGAGGCGCATTGCCGATATGGAGTTGGTGCGGTTGAGCGGCGAGTTCGAACAGAGGTGCGATGGCAGGCCCCACCGAGTGGTTGGTATAAATTGAATGTTGATGCAGCCT
TTGATCAGAGCTCTCGGTCCGCAGGTTTGGGTATGATTGTTCGAGATTCTCAAGGGAACGTTTTGCTATCAGCGTCCAAGTTTCTCCCATGTGTTTTGTCAGTGGATTTT
GCAGAATGCCTTGCCGCGGCTGAAGGATTTCGTTTGAGTTTAGAAACGGGGTGTTGCCCACTTCAATTGGAAACTGATTCTTGGCGAGTTTTTCATCTGTTGCAAAGGGA
AATTGATGACTGTTCTGAGGTTGGGATATTGGCTTCGGACTTAAGACGTCAACTATCCTCTCCTGGGTTTTTGTCTCTTTTTTCGGTGCGAGAAGGCAATCAAGCGGCAG
ACCTGTTGGCGAAGCTTGCAATTCAGAATCGCTGGGACCAAGTATGGATTGAAGATTTTCCTTTGGATCTTTCAAATTTTCTTGATGTTGATGTATTGCGTCTTTAG
Protein sequenceShow/hide protein sequence
MLRMGFASDWVDLIMRCVRSVSYSFNLNGERIGHVQPSRGLRQGDPLSSYLFLLCAQGLSSLLTEAERCKSISGFKISRGGPSLSHLFFADDSLLFFKVKGSEALAIRTV
LEVYEKASGQAINFEKSMLSCSPNTAPQAFSTVRQILGIQSGSVHQIYLGLPTFMPRNKSGTLSFIKDRIWRCLQGWKEKLFSVGGKEILLKAVIQAVPCYSMNCFRLPK
KLINSISQAMARFWWNEVEGRNQIHWVKWGEMCKPKCLGGLGFKNLEVFNQSLLAKQCWKILNEPESLLSRILKGRYFPNCDFLEANIGYRASFVWRSLMWGKDLLVKGL
RWRIGDGRSVPVYASNWIPRDFFLSVQSAVSLPLDSRVSDLLLGSGGWDERKISHHFDFEEARRILSIPLSPLGQVDRLIWHYEKEGKYTVKSGYRVGQNAILALRASSS
SSAKIGAWWKGFWTMQLPGKIKVFLWRLFLDRLPTLSNLNARGMDMLNLCSFCGRQGEDALHVFWFCKFARIQWDRSSFAHLTGLFTPTSTSILLQDLKDRLSWDEFSGL
ATFLWGLWNARNRLQLQGLEPIADIGPWAAQYVRIYKEAHCRYGVGAVERRVRTEVRWQAPPSGWYKLNVDAAFDQSSRSAGLGMIVRDSQGNVLLSASKFLPCVLSVDF
AECLAAAEGFRLSLETGCCPLQLETDSWRVFHLLQREIDDCSEVGILASDLRRQLSSPGFLSLFSVREGNQAADLLAKLAIQNRWDQVWIEDFPLDLSNFLDVDVLRL