; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035065 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035065
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:14336647..14338966
RNA-Seq ExpressionLag0035065
SyntenyLag0035065
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica]8.6e-12834.99Show/hide
Query:  MLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGLRQGDPLSHIYFCYVLNG--CLVCCVELSG---GFRLRDFGLDALAHRFRTFFLQMTASS
        ML+LGF+  WV  ++ C+S+ +FS    G  VG ++P RGLRQG PLS   F     G  CL+   E  G   G ++   G  ++ H        +   +
Subjt:  MLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGLRQGDPLSHIYFCYVLNG--CLVCCVELSG---GFRLRDFGLDALAHRFRTFFLQMTASS

Query:  SSRPSGVKRWLFGSCWNDMRGRQGQTINYEKSVVAFSPNTGEEAQQYISQVLAVSRCPCHQQYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVA
        +++       LF   + ++ G+Q   INY KS ++ SPN        I  VL V    CH+ YLGLP+   + R    + + D++W+ I GWK K  S A
Subjt:  SSRPSGVKRWLFGSCWNDMRGRQGQTINYEKSVVAFSPNTGEEAQQYISQVLAVSRCPCHQQYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVA

Query:  GKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKG
        GKE+L+K+++QAIP Y+M+CFR+P+ L KE++  MARFWW+ ++++R IHW+ W+ LC  K  GGLGFR++E FNQALLAKQCWR+L+ P SL+  + + 
Subjt:  GKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKG

Query:  RYFPQSGFLEAV--------------------RGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFTASGGWDVALLRTIFNGADCEAI
        RY P   FLEA                     +G RWR+G+G +  +Y   WLP     +I S P L  ++ V +LFT+SG W+V LL+ IF   + +AI
Subjt:  RYFPQSGFLEAV--------------------RGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFTASGGWDVALLRTIFNGADCEAI

Query:  LRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRM---WWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVL
        L+IPL   +G D LIWH+E++G +SVKSGYRLA     +D+     S RV +   +W  +W L +PNK +FFLWR   D LP    L  R +  +P+C  
Subjt:  LRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRM---WWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVL

Query:  CDDDTEDCLHLFWTCPVVKSMWLGSKFALFHQSFSHFRFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQSDG--------RDLWAYSSDYLH
        C    E  LH  W C   K +W  S +    + +    F E+  A++   +G +  L     W +W+ RN+  + G+S+           L    SD  +
Subjt:  CDDDTEDCLHLFWTCPVVKSMWLGSKFALFHQSFSHFRFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQSDG--------RDLWAYSSDYLH

Query:  AFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPN----------RVLKLNIDAS----------RCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLR-
          H   GR         QS  Q     WRPPP            V+  N +              +     E  A  +G++ A  +GF D ++E D+   
Subjt:  AFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPN----------RVLKLNIDAS----------RCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLR-

Query:  LVKILNGELHD------VSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEVSDVLRGDFASV
        L  I + E ++      + EV  L+++ R ++  W       TPR GNKVAH LA+ AF   + V W+EE PS +  VL  D  S+
Subjt:  LVKILNGELHD------VSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEVSDVLRGDFASV

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]6.6e-12833.92Show/hide
Query:  MLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGLRQGDPLSHIYFCYVLNG--CLVCCVELSG---GFRLRDFGLDALAHRFRTFFLQMTASS
        ML+LGF+  WV  ++ C+S+ +FS    G  VG ++P RGLRQG PLS   F     G  CL+   E  G   G ++   G       F    +    ++
Subjt:  MLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGLRQGDPLSHIYFCYVLNG--CLVCCVELSG---GFRLRDFGLDALAHRFRTFFLQMTASS

Query:  SSRPSGVKRWLFGSCWNDMRGRQGQTINYEKSVVAFSPNTGEEAQQYISQVLAVSRCPCHQQYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVA
        +     ++     + +       GQ INY KS  + SPN        I  VL V    CH++YLGLP+   + R    + + D++W+ I GWK K  S A
Subjt:  SSRPSGVKRWLFGSCWNDMRGRQGQTINYEKSVVAFSPNTGEEAQQYISQVLAVSRCPCHQQYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVA

Query:  GKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKG
        GKE+L+K+++QAIP Y+M+CFR+P+ L KE++  MARFWW+ ++++R IHW+ W+ LC  K  GGLGFR++E FNQALLAKQCWR+L+ P SL+  + + 
Subjt:  GKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKG

Query:  RYFPQSGFLEAV--------------------RGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFTASGGWDVALLRTIFNGADCEAI
        RY P   FLEA                     +G RWR+GNG +  +Y   WLP     +I S P L  ++ V +LFT+SG W+V LL+ IF   + +A 
Subjt:  RYFPQSGFLEAV--------------------RGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFTASGGWDVALLRTIFNGADCEAI

Query:  LRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRM---WWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVL
        L+IPL   +G D LIWH+E++G +SVKSGYRLA     +D+     S RV +   +W  +W L +PNK +FFLWR   D LP    L  R +  +P+C  
Subjt:  LRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRM---WWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVL

Query:  CDDDTEDCLHLFWTCPVVKSMWLGSKFALFHQSFSHFRFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQSDG-----------RDLWAYSSD
        C    E  LH  W C   K +W  S +    + +    F E+  A++   +G +  L     W +W+ RN+  + G+S+               ++ +++
Subjt:  CDDDTEDCLHLFWTCPVVKSMWLGSKFALFHQSFSHFRFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQSDG-----------RDLWAYSSD

Query:  YLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS------------------------------RCWSVDLAEGWAVYKGIQLARQLG
          H  H   GR         QS  Q     WRPPP  + K+N+D +                                +     E  A  +G++ A  +G
Subjt:  YLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS------------------------------RCWSVDLAEGWAVYKGIQLARQLG

Query:  FVDFVVETDSLRLV-KILNGELHD------VSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEVSDVLRGDFASV
        F   V+E D+   +  IL+ E  +      + EV  L+ + R ++  W       TPR GNKVAH LA+ AF   + V W+EE P  +  VL  D  S+
Subjt:  FVDFVVETDSLRLV-KILNGELHD------VSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEVSDVLRGDFASV

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]6.8e-13338.79Show/hide
Query:  PNTGEEAQQYISQVLAVSRCPCHQQYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMAR
        P   +     I  +L+V+   C  QYLGLP+FMPRNR     ++ DR+W+ +QGWK K FS+ GKEVL+K++ QAIPCYTM+CFRLP+ LI+E H   AR
Subjt:  PNTGEEAQQYISQVLAVSRCPCHQQYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMAR

Query:  FWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEA--------------------VRGCRW
        FWW  S+E+++IHW++W++L LPKC GG+GFR++ELFN+ALLAKQCWR+L  P+S+L  VLKGRYF    F+EA                     +G RW
Subjt:  FWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEA--------------------VRGCRW

Query:  RIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFT-ASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLA--H
        RIGNG +  IYG NW+PN+ +L+I S+P L   S VS L     GGW   ++R  F   + + IL IP+ +G+ EDRLIW++EK G +SV+SGY++A  +
Subjt:  RIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFT-ASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLA--H

Query:  TLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDTEDCLHLFWTCPVVKSMWLGSKFALFHQSFSHF
           +Q  P SS+SE VR WW+G W++++PNK + FLWRLC DRLPT  NL KRG+ ++  C  C  + ED +HLFW C   +++W+ SKF       S F
Subjt:  TLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDTEDCLHLFWTCPVVKSMWLGSKFALFHQSFSHF

Query:  RFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQSD-----GRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNI
            I+    + L+  DFE + +  W +W+ RN   +   +      G +L  +++ Y   F     R    + +  +     E  +W+PP   + K+N 
Subjt:  RFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQSD-----GRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNI

Query:  DAS------------------------------RCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVN
        DAS                                 SVD+AE  A  +G+QLA ++G                ++  L D+SE G ++   +   +  ++
Subjt:  DAS------------------------------RCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVN

Query:  GKVLFTPRQGNKVAHVLARLAFSYVD-RVWLEEWPSEVSDVL
            F  R+GNK AH+LAR A    +  +W+E+WP E+   L
Subjt:  GKVLFTPRQGNKVAHVLARLAFSYVD-RVWLEEWPSEVSDVL

XP_030508852.1 uncharacterized protein LOC115723496 [Cannabis sativa]2.3e-12034.15Show/hide
Query:  MLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGLRQGDPLSHIYFCYVLNGC--LVCCVELSGGFRLRDFGLDALAHRFRTFFLQMTASSSSR
        M+++GF    V+LILRC+ SVS+SF LNG   GQV+PSRG+RQGDPLS   F     G   L+   EL+G   L    +   A      F    +    R
Subjt:  MLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGLRQGDPLSHIYFCYVLNGC--LVCCVELSGGFRLRDFGLDALAHRFRTFFLQMTASSSSR

Query:  PSGVKRWLFGSCWNDMRGRQGQTINYEKSVVAFSPNTGEEAQQYISQVLAVSRCPCHQQYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVAGKE
         +         C        GQ IN EK V++FS NT +  Q +   +L +   PCH+QYLGLPSF  +N+      +TD+IW+ +  WK   FS  GKE
Subjt:  PSGVKRWLFGSCWNDMRGRQGQTINYEKSVVAFSPNTGEEAQQYISQVLAVSRCPCHQQYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVAGKE

Query:  VLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKGRYF
        VLLK++VQAIP Y M+CFRLP  L  +I   MARFWW  +   + IHW +W+ LC  K  GGLGFRN   FNQALLAKQ WR+L+ P+SLL  +L+ RYF
Subjt:  VLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKGRYF

Query:  PQSGFLEA--------------------VRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFTASGGWDVALLRTIFNGADCEAILRI
            +L A                    ++G RWR+G+G        +WLP   + +        P   V++L T    WD+  L T FN AD   +L I
Subjt:  PQSGFLEA--------------------VRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFTASGGWDVALLRTIFNGADCEAILRI

Query:  PLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDTE
        PL     +D LIW+    G ++VKSGY  A +LA QD    SNS  +  WWS  W+L +P K R F+W++ H  LP    L +R +  SP C +C+   E
Subjt:  PLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDTE

Query:  DCLHLFWTCPVVKSMWLGSKFALFHQSFSHFRFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQ-SDGRDLWAYSSDYLHAFHVGGGR-----
           H  ++CP  K++W  S F++  Q+       + +  +   L+  + EL ++  WS+W  RN ++ G        + AY+  YL  F     +     
Subjt:  DCLHLFWTCPVVKSMWLGSKFALFHQSFSHFRFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQ-SDGRDLWAYSSDYLHAFHVGGGR-----

Query:  -CGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS---------------RCWSVDLAEGWAVYKG---------IQLARQLGF-------VDFVVETD
              +    S E      W  PP   LKLN DA+                   + +A     ++G         + LA  L +       VDF +ETD
Subjt:  -CGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS---------------RCWSVDLAEGWAVYKG---------IQLARQLGF-------VDFVVETD

Query:  SLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSY-VDRVWLEEWPSEVSDVL
        SL +V+ L      +S    L+++I  ++S +   ++    R  N  AH LA+ A +   D +WL  +PS +  ++
Subjt:  SLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSY-VDRVWLEEWPSEVSDVL

XP_030939975.1 uncharacterized protein LOC115964883 [Quercus lobata]1.9e-11934.39Show/hide
Query:  MLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGLRQGDPLSHIYFCYVLNG--CLVCCVELSG---GFRLRDFGLDALAHRFRTFFLQMTASS
        M +LGF  +W+ L+  C+ SVSFS  +NGE  G   P+RGLRQGDPLS   F     G   L+   E+SG   G  L   G       F    L    ++
Subjt:  MLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGLRQGDPLSHIYFCYVLNG--CLVCCVELSG---GFRLRDFGLDALAHRFRTFFLQMTASS

Query:  SSRPSGVKRWLFGSCWNDMRGRQGQTINYEKSVVAFSPNTGEEAQQYISQVLAVSRCPCHQQYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVA
        S   S +   L            GQ IN EK+ + FSPNT    Q+ I  +L V+    +++YLGLPSF+ R +  +  ++ +RIW ++QGWK +  S  
Subjt:  SSRPSGVKRWLFGSCWNDMRGRQGQTINYEKSVVAFSPNTGEEAQQYISQVLAVSRCPCHQQYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVA

Query:  GKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKG
        G+EVL+K+++QA+P +TM CF++P+ L K+I   + +FWW    E R+IHW+ W  LC  K  GGLGF+++ELFN A+L KQ WR++ +  SL   V K 
Subjt:  GKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKG

Query:  RYFPQSGFLE-------------------AVR-GCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTST-VSELFTASGG-WDVALLRTIFNGADCE
        ++FP    L+                    VR G +WRIG+G +  I G  WLP+ FS ++ S     P +T V  L       W    +R  F   + E
Subjt:  RYFPQSGFLE-------------------AVR-GCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTST-VSELFTASGG-WDVALLRTIFNGADCE

Query:  AILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLC
        AIL +PL     EDRLIW    +G ++ KS YRL    A    PG+SN    + +W  LW LNVPNK R FLWR  +D LPTK NLLKR +     C  C
Subjt:  AILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLC

Query:  DDDTEDCLHLFWTCPVVKSMW---------LGSKFALFHQSFSHFRFEEIIGAMRDKLTGPDF-ELVVIFWWSVWSLRNNLFWGGQS-DGRDLWAYSSDY
          + ED +H  W C ++K +W         L  KFA FH        + + G +  K+  P+F EL     WS+W  RN    G  S     ++  + + 
Subjt:  DDDTEDCLHLFWTCPVVKSMW---------LGSKFALFHQSFSHFRFEEIIGAMRDKLTGPDF-ELVVIFWWSVWSLRNNLFWGGQS-DGRDLWAYSSDY

Query:  LHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDASRCWSVDLA------------------------------EGWAVYKGIQLARQLGF
        L  FH       V++    Q         W PP   V K+N D +    +  A                              E  A  + I  AR+LG 
Subjt:  LHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDASRCWSVDLA------------------------------EGWAVYKGIQLARQLGF

Query:  VDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFS-YVDRVWLEEWPSEVSDVLRGDFASV
         D V E DS  + K+L  E   ++  G ++D+ R + + + +     T RQGNKVA  LA+LA + Y  +VWLEE    V++++  D  S+
Subjt:  VDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFS-YVDRVWLEEWPSEVSDVLRGDFASV

TrEMBL top hitse value%identityAlignment
A0A251NPF0 Reverse transcriptase domain-containing protein4.2e-12834.99Show/hide
Query:  MLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGLRQGDPLSHIYFCYVLNG--CLVCCVELSG---GFRLRDFGLDALAHRFRTFFLQMTASS
        ML+LGF+  WV  ++ C+S+ +FS    G  VG ++P RGLRQG PLS   F     G  CL+   E  G   G ++   G  ++ H        +   +
Subjt:  MLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGLRQGDPLSHIYFCYVLNG--CLVCCVELSG---GFRLRDFGLDALAHRFRTFFLQMTASS

Query:  SSRPSGVKRWLFGSCWNDMRGRQGQTINYEKSVVAFSPNTGEEAQQYISQVLAVSRCPCHQQYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVA
        +++       LF   + ++ G+Q   INY KS ++ SPN        I  VL V    CH+ YLGLP+   + R    + + D++W+ I GWK K  S A
Subjt:  SSRPSGVKRWLFGSCWNDMRGRQGQTINYEKSVVAFSPNTGEEAQQYISQVLAVSRCPCHQQYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVA

Query:  GKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKG
        GKE+L+K+++QAIP Y+M+CFR+P+ L KE++  MARFWW+ ++++R IHW+ W+ LC  K  GGLGFR++E FNQALLAKQCWR+L+ P SL+  + + 
Subjt:  GKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKG

Query:  RYFPQSGFLEAV--------------------RGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFTASGGWDVALLRTIFNGADCEAI
        RY P   FLEA                     +G RWR+G+G +  +Y   WLP     +I S P L  ++ V +LFT+SG W+V LL+ IF   + +AI
Subjt:  RYFPQSGFLEAV--------------------RGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFTASGGWDVALLRTIFNGADCEAI

Query:  LRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRM---WWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVL
        L+IPL   +G D LIWH+E++G +SVKSGYRLA     +D+     S RV +   +W  +W L +PNK +FFLWR   D LP    L  R +  +P+C  
Subjt:  LRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRM---WWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVL

Query:  CDDDTEDCLHLFWTCPVVKSMWLGSKFALFHQSFSHFRFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQSDG--------RDLWAYSSDYLH
        C    E  LH  W C   K +W  S +    + +    F E+  A++   +G +  L     W +W+ RN+  + G+S+           L    SD  +
Subjt:  CDDDTEDCLHLFWTCPVVKSMWLGSKFALFHQSFSHFRFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQSDG--------RDLWAYSSDYLH

Query:  AFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPN----------RVLKLNIDAS----------RCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLR-
          H   GR         QS  Q     WRPPP            V+  N +              +     E  A  +G++ A  +GF D ++E D+   
Subjt:  AFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPN----------RVLKLNIDAS----------RCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLR-

Query:  LVKILNGELHD------VSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEVSDVLRGDFASV
        L  I + E ++      + EV  L+++ R ++  W       TPR GNKVAH LA+ AF   + V W+EE PS +  VL  D  S+
Subjt:  LVKILNGELHD------VSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEVSDVLRGDFASV

A0A5E4FZN9 PREDICTED: retrotransposon3.2e-12833.92Show/hide
Query:  MLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGLRQGDPLSHIYFCYVLNG--CLVCCVELSG---GFRLRDFGLDALAHRFRTFFLQMTASS
        ML+LGF+  WV  ++ C+S+ +FS    G  VG ++P RGLRQG PLS   F     G  CL+   E  G   G ++   G       F    +    ++
Subjt:  MLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGLRQGDPLSHIYFCYVLNG--CLVCCVELSG---GFRLRDFGLDALAHRFRTFFLQMTASS

Query:  SSRPSGVKRWLFGSCWNDMRGRQGQTINYEKSVVAFSPNTGEEAQQYISQVLAVSRCPCHQQYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVA
        +     ++     + +       GQ INY KS  + SPN        I  VL V    CH++YLGLP+   + R    + + D++W+ I GWK K  S A
Subjt:  SSRPSGVKRWLFGSCWNDMRGRQGQTINYEKSVVAFSPNTGEEAQQYISQVLAVSRCPCHQQYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVA

Query:  GKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKG
        GKE+L+K+++QAIP Y+M+CFR+P+ L KE++  MARFWW+ ++++R IHW+ W+ LC  K  GGLGFR++E FNQALLAKQCWR+L+ P SL+  + + 
Subjt:  GKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKG

Query:  RYFPQSGFLEAV--------------------RGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFTASGGWDVALLRTIFNGADCEAI
        RY P   FLEA                     +G RWR+GNG +  +Y   WLP     +I S P L  ++ V +LFT+SG W+V LL+ IF   + +A 
Subjt:  RYFPQSGFLEAV--------------------RGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFTASGGWDVALLRTIFNGADCEAI

Query:  LRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRM---WWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVL
        L+IPL   +G D LIWH+E++G +SVKSGYRLA     +D+     S RV +   +W  +W L +PNK +FFLWR   D LP    L  R +  +P+C  
Subjt:  LRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRM---WWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVL

Query:  CDDDTEDCLHLFWTCPVVKSMWLGSKFALFHQSFSHFRFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQSDG-----------RDLWAYSSD
        C    E  LH  W C   K +W  S +    + +    F E+  A++   +G +  L     W +W+ RN+  + G+S+               ++ +++
Subjt:  CDDDTEDCLHLFWTCPVVKSMWLGSKFALFHQSFSHFRFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQSDG-----------RDLWAYSSD

Query:  YLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS------------------------------RCWSVDLAEGWAVYKGIQLARQLG
          H  H   GR         QS  Q     WRPPP  + K+N+D +                                +     E  A  +G++ A  +G
Subjt:  YLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS------------------------------RCWSVDLAEGWAVYKGIQLARQLG

Query:  FVDFVVETDSLRLV-KILNGELHD------VSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEVSDVLRGDFASV
        F   V+E D+   +  IL+ E  +      + EV  L+ + R ++  W       TPR GNKVAH LA+ AF   + V W+EE P  +  VL  D  S+
Subjt:  FVDFVVETDSLRLV-KILNGELHD------VSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEVSDVLRGDFASV

A0A6J1DAR4 uncharacterized protein LOC1110189543.3e-13338.79Show/hide
Query:  PNTGEEAQQYISQVLAVSRCPCHQQYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMAR
        P   +     I  +L+V+   C  QYLGLP+FMPRNR     ++ DR+W+ +QGWK K FS+ GKEVL+K++ QAIPCYTM+CFRLP+ LI+E H   AR
Subjt:  PNTGEEAQQYISQVLAVSRCPCHQQYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMAR

Query:  FWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEA--------------------VRGCRW
        FWW  S+E+++IHW++W++L LPKC GG+GFR++ELFN+ALLAKQCWR+L  P+S+L  VLKGRYF    F+EA                     +G RW
Subjt:  FWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEA--------------------VRGCRW

Query:  RIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFT-ASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLA--H
        RIGNG +  IYG NW+PN+ +L+I S+P L   S VS L     GGW   ++R  F   + + IL IP+ +G+ EDRLIW++EK G +SV+SGY++A  +
Subjt:  RIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFT-ASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLA--H

Query:  TLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDTEDCLHLFWTCPVVKSMWLGSKFALFHQSFSHF
           +Q  P SS+SE VR WW+G W++++PNK + FLWRLC DRLPT  NL KRG+ ++  C  C  + ED +HLFW C   +++W+ SKF       S F
Subjt:  TLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDTEDCLHLFWTCPVVKSMWLGSKFALFHQSFSHF

Query:  RFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQSD-----GRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNI
            I+    + L+  DFE + +  W +W+ RN   +   +      G +L  +++ Y   F     R    + +  +     E  +W+PP   + K+N 
Subjt:  RFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQSD-----GRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNI

Query:  DAS------------------------------RCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVN
        DAS                                 SVD+AE  A  +G+QLA ++G                ++  L D+SE G ++   +   +  ++
Subjt:  DAS------------------------------RCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVN

Query:  GKVLFTPRQGNKVAHVLARLAFSYVD-RVWLEEWPSEVSDVL
            F  R+GNK AH+LAR A    +  +W+E+WP E+   L
Subjt:  GKVLFTPRQGNKVAHVLARLAFSYVD-RVWLEEWPSEVSDVL

A0A803Q1K6 Uncharacterized protein3.8e-12936.11Show/hide
Query:  LGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGLRQGDPLSH--IYFCYVLNGCLVCCVELSG---GFRLRDFGLDALAHRF----RTFFLQMTAS
        LGFAQ WVD I+RCVSS SFSF +NGE  G+++P RGLRQGDPLS     FC      L+   E +G   G R    G+ +++H F       F+     
Subjt:  LGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGLRQGDPLSH--IYFCYVLNGCLVCCVELSG---GFRLRDFGLDALAHRF----RTFFLQMTAS

Query:  SSSRPSGVKRWLFGSCWNDMRGRQGQTINYEKSVVAFSPNTGEEAQQYISQVLAVSRCPCHQQYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSV
        S  +        F    +      GQ +NY KS   F  N   E +  ++ +L V     H +YLGLPSF+ RN+   L  + +++W +++GWK   FSV
Subjt:  SSSRPSGVKRWLFGSCWNDMRGRQGQTINYEKSVVAFSPNTGEEAQQYISQVLAVSRCPCHQQYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSV

Query:  AGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLK
        AGKEVL+K+IVQAIP YTM+CF+LP+  I  +HR  +RFWW  S++E++IHW  W  LC PK  GGLGFR++ +FNQALLAKQ WR L+ P  L   VLK
Subjt:  AGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLK

Query:  GRYFPQSGFLEA--------------------VRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFTASGGWDVALLRTIFNGADCEA
          YFP+ G LEA                    ++G RWR+GNG +  +    WLP   + ++   P L     V++L  A G WD   +R+IFN  D + 
Subjt:  GRYFPQSGFLEA--------------------VRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFTASGGWDVALLRTIFNGADCEA

Query:  ILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCD
        IL IP      ED+++WH+ K+G +SVKSGYR+A +L  +     SN   +  WW  LWRLN P K + F+W++ H+ LP  VNL KRG+  S +C  C 
Subjt:  ILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCD

Query:  DDTEDCL-HLFWTCPVVKSMWLGS------KFALFHQSFSHFRFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGG-QSDGRDLWAYSSDYLHAF
           E+ + H  W C   K  W  S      K  L   + S       I A  DK      E  ++  W++W++RN +  GG      ++  +  ++L  F
Subjt:  DDTEDCL-HLFWTCPVVKSMWLGS------KFALFHQSFSHFRFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGG-QSDGRDLWAYSSDYLHAF

Query:  HVGGGRCGVRDSLWAQSGEQEERGVWRPPPN------RVLKLNIDASRCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGL
            GR         +S    E   W PP          L L++  S   +    E  A+ KGIQ+  Q     F V+TD L+ V ++  + +   ++  
Subjt:  HVGGGRCGVRDSLWAQSGEQEERGVWRPPPN------RVLKLNIDASRCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGL

Query:  LMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSY-VDRVWL
        L++ IR ++S      + F  R+ N+VAH LA  A  +    +W+
Subjt:  LMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSY-VDRVWL

M5W5F3 Reverse transcriptase domain-containing protein (Fragment)4.2e-12834.99Show/hide
Query:  MLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGLRQGDPLSHIYFCYVLNG--CLVCCVELSG---GFRLRDFGLDALAHRFRTFFLQMTASS
        ML+LGF+  WV  ++ C+S+ +FS    G  VG ++P RGLRQG PLS   F     G  CL+   E  G   G ++   G  ++ H        +   +
Subjt:  MLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGLRQGDPLSHIYFCYVLNG--CLVCCVELSG---GFRLRDFGLDALAHRFRTFFLQMTASS

Query:  SSRPSGVKRWLFGSCWNDMRGRQGQTINYEKSVVAFSPNTGEEAQQYISQVLAVSRCPCHQQYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVA
        +++       LF   + ++ G+Q   INY KS ++ SPN        I  VL V    CH+ YLGLP+   + R    + + D++W+ I GWK K  S A
Subjt:  SSRPSGVKRWLFGSCWNDMRGRQGQTINYEKSVVAFSPNTGEEAQQYISQVLAVSRCPCHQQYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVA

Query:  GKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKG
        GKE+L+K+++QAIP Y+M+CFR+P+ L KE++  MARFWW+ ++++R IHW+ W+ LC  K  GGLGFR++E FNQALLAKQCWR+L+ P SL+  + + 
Subjt:  GKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKG

Query:  RYFPQSGFLEAV--------------------RGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFTASGGWDVALLRTIFNGADCEAI
        RY P   FLEA                     +G RWR+G+G +  +Y   WLP     +I S P L  ++ V +LFT+SG W+V LL+ IF   + +AI
Subjt:  RYFPQSGFLEAV--------------------RGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFTASGGWDVALLRTIFNGADCEAI

Query:  LRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRM---WWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVL
        L+IPL   +G D LIWH+E++G +SVKSGYRLA     +D+     S RV +   +W  +W L +PNK +FFLWR   D LP    L  R +  +P+C  
Subjt:  LRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRM---WWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVL

Query:  CDDDTEDCLHLFWTCPVVKSMWLGSKFALFHQSFSHFRFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQSDG--------RDLWAYSSDYLH
        C    E  LH  W C   K +W  S +    + +    F E+  A++   +G +  L     W +W+ RN+  + G+S+           L    SD  +
Subjt:  CDDDTEDCLHLFWTCPVVKSMWLGSKFALFHQSFSHFRFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQSDG--------RDLWAYSSDYLH

Query:  AFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPN----------RVLKLNIDAS----------RCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLR-
          H   GR         QS  Q     WRPPP            V+  N +              +     E  A  +G++ A  +GF D ++E D+   
Subjt:  AFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPN----------RVLKLNIDAS----------RCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLR-

Query:  LVKILNGELHD------VSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEVSDVLRGDFASV
        L  I + E ++      + EV  L+++ R ++  W       TPR GNKVAH LA+ AF   + V W+EE PS +  VL  D  S+
Subjt:  LVKILNGELHD------VSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEVSDVLRGDFASV

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657504.3e-4523.59Show/hide
Query:  LPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPKCLGG
        +P    R        + +R+  ++ GW+ K  S AG+  L K+++ ++P ++M+   LP+ ++  + +    F W  + E+++ H + W  +C PK  GG
Subjt:  LPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPKCLGG

Query:  LGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKGRY-----------FPQSGFLEAVR------------GCRWRIGNGRATPIYGSNWLPNEFSLQIQ
        LG R  +  N+AL++K  WR+LQ+ +SL   VL+ +Y            P+  +    R            G  W  G+G+    +   W+  +  L++ 
Subjt:  LGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKGRY-----------FPQSGFLEAVR------------GCRWRIGNGRATPIYGSNWLPNEFSLQIQ

Query:  SA--PVLSPTSTVSELFTASGGWDVALLRTIFNGADCEAILRIPLQQGSG-EDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRMWWSGLWR
        +   P    T    +L+    GWD A +           +  + L   +G  DRL W F + G FSV+S Y +   L + + P       +  +++ LW+
Subjt:  SA--PVLSPTSTVSELFTASGGWDVALLRTIFNGADCEAILRIPLQQGSG-EDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRMWWSGLWR

Query:  LNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDTEDCLHLFWTCPVVKSMWLGSKFALFHQS-FSHFRFEEIIGAMRDKLTGPDFE----L
        + VP + + FLW + +  + T+    +R L+ S +C +C    E  LH+   CP    +W+        Q  FS   FE +   + D+    D       
Subjt:  LNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDTEDCLHLFWTCPVVKSMWLGSKFALFHQS-FSHFRFEEIIGAMRDKLTGPDFE----L

Query:  VVIFWWSVWSLRNNLFWGGQSDGRDLWAYSSDY---LHAFHVGGGRCGVRDSLWAQSGEQEERGV-WRPPPNRVLKLNIDAS------------------
         VI WW  W  R    +G  +  RD   +  ++   ++  H G    G+       +  + ER + W  P    +K+N D +                  
Subjt:  VVIFWWSVWSLRNNLFWGGQSDGRDLWAYSSDY---LHAFHVGGGRCGVRDSLWAQSGEQEERGV-WRPPPNRVLKLNIDAS------------------

Query:  ------------RCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLA
                    RC S   AE W VY G+  A +       +E DS  +V  L   + D   +  L+      L      +++   R+ N++A  LA  A
Subjt:  ------------RCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLA

Query:  FS
        FS
Subjt:  FS

P93295 Uncharacterized mitochondrial protein AtMg003102.2e-2540.27Show/hide
Query:  AIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPK-CLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLE
        A+P Y M+CFRL + L K++  +M  FWWS  E +R+I W++W  LC  K   GGLGFR++  FNQALLAKQ +R++  P +LL  +L+ RYFP S  +E
Subjt:  AIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPK-CLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLE

Query:  AVRGCR----WR----------------IGNGRATPIYGSNWLPNEFSL
           G R    WR                IG+G  T ++   W+ +E  L
Subjt:  AVRGCR----WR----------------IGNGRATPIYGSNWLPNEFSL

Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein8.6e-1724.43Show/hide
Query:  VKSGYRLA------HTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDTEDCLHLFWTCPVVKSMW
        ++SGY +A         AIQ  PGS+  ++       +W+L+V  K + FLWR     L T   L  R +   P+C  C  + E   H+ + CP  +S+W
Subjt:  VKSGYRLA------HTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDTEDCLHLFWTCPVVKSMW

Query:  LGSKFALFHQSFSHFRFEEIIG---AMRDKLTGPDFELVVIFW--WSVWSLRNNLFWGGQSDGRDLWAYSSDYLHAFHVGGGRCGVRDSL-WAQSGEQEE
          +   + +Q      FE+ +     +    T    +  + FW  W +W  RN   +  +    D  A              R G++D+  W  + E  E
Subjt:  LGSKFALFHQSFSHFRFEEIIG---AMRDKLTGPDFELVVIFW--WSVWSLRNNLFWGGQSDGRDLWAYSSDYLHAFHVGGGRCGVRDSL-WAQSGEQEE

Query:  R-----------------GVWRPPPNRVLKLNIDASRC---------WSVDLAEGWAVYKG---IQL------ARQLGFVDFV------------VETDS
                            W PPP   +K N D+            W++    G  V  G   +Q       A  LGF+  +             E+DS
Subjt:  R-----------------GVWRPPPNRVLKLNIDASRC---------WSVDLAEGWAVYKG---IQL------ARQLGFVDFV------------VETDS

Query:  LRLVKIL-NGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLA
          LV ++ NGE H  S +G L+ DIR  +       + F  R+ N  A  LA
Subjt:  LRLVKIL-NGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLA

AT3G09510.1 Ribonuclease H-like superfamily protein6.3e-2823.89Show/hide
Query:  RVLQDPSSLLGCVLKGRYFPQSGFLEAV----RGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFTASGG---WDVALLRTIFNGADC
        R  +D S L   V K + +  +  L+ +    +G R  IG+G+   I   N + +     + +        T++ LF   G    WD + +    + +D 
Subjt:  RVLQDPSSLLGCVLKGRYFPQSGFLEAV----RGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFTASGG---WDVALLRTIFNGADC

Query:  EAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRL------AHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTV
          I RI L +    D++IW++   G ++V+SGY L       +  AI    GS + +      + +W L +  K + FLWR     L T   L  RG+ +
Subjt:  EAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRL------AHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTV

Query:  SPLCVLCDDDTEDCLHLFWTCPVVKSMWLGSKFALFHQSFSHFRFEE----IIGAMRDKLTGPDFELVVIFW--WSVWSLRNNLFWG--GQSDGRDLW--
         P C  C  + E   H  +TCP     W  S  +L         FEE    I+  ++D  T  DF  ++  W  W +W  RNN+ +    +S  + +   
Subjt:  SPLCVLCDDDTEDCLHLFWTCPVVKSMWLGSKFALFHQSFSHFRFEE----IIGAMRDKLTGPDFELVVIFW--WSVWSLRNNLFWG--GQSDGRDLW--

Query:  -AYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS-RCWSVDLAEGW-----------------------------AVYKGIQ
         A + D+L+A          + +        E +  WR PP   +K N DA      ++   GW                             A+   +Q
Subjt:  -AYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS-RCWSVDLAEGW-----------------------------AVYKGIQ

Query:  LARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSY
             G+    +E D   L+ ++NG +   S +   ++DI    + + + +  F  R+GNK+AHVLA+   +Y
Subjt:  LARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSY

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein8.9e-2226.59Show/hide
Query:  QYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPK
        +YLGLP    +  +     + ++I  +I  W  +  S AG+  L+ S++ ++  + M+ FRLP   IKEI    + F WSG E   +   ++W  +C PK
Subjt:  QYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPK

Query:  CLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGC-----VLKGRYFPQSGFLEAVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSEL
          GGLG R+++  N+       W +    ++ LG      +LK R    SGF+      +  I NG  T  +  NW     S   +   V      +   
Subjt:  CLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGC-----VLKGRYFPQSGFLEAVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSEL

Query:  FTASGGWDVALLRTIFNGADCEAILRIP------LQQG--SGEDRLIWHFEKHGNFSV-KSGYRLAHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHR
         T       A++         + +LRI         QG  SGED + W     GN  + K  +    T A    P    +     W+ G+W  +   K+ 
Subjt:  FTASGGWDVALLRTIFNGADCEAILRIP------LQQG--SGEDRLIWHFEKHGNFSV-KSGYRLAHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHR

Query:  FFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDTEDCLHLFWTCP
           W    +RL T   +L         CVLC    E   HLF+TCP
Subjt:  FFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDTEDCLHLFWTCP

AT4G29090.1 Ribonuclease H-like superfamily protein1.5e-5327.48Show/hide
Query:  AIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEA
        A+P YTM CF LP+ + K+I   +A FWW   +E + +HW +WD L   K  GG+GF+++E FN ALL KQ WR+L  P SL+  V K RYF +S  L A
Subjt:  AIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEA

Query:  VRGCR----WR----------------IGNGRATPIYGSNWL---PNEFSLQIQSAP-----VLSPTSTVSELFTASG-GWDVALLRTIFNGADCEAILR
          G R    W+                +GNG    I+   WL   P   +L++Q  P      +S    VS+L   SG  W   ++  +F   + E  L 
Subjt:  VRGCR----WR----------------IGNGRATPIYGSNWL---PNEFSLQIQSAP-----VLSPTSTVSELFTASG-GWDVALLRTIFNGADCEAILR

Query:  IPLQQGSGE--DRLIWHFEKHGNFSVKSGY-RLAHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCD
          L+ G     D   W +   G+++VKSGY  L   +  +  P   +   +   +  +W+     K + FLW+   + LP    L  R L+    C+ C 
Subjt:  IPLQQGSGE--DRLIWHFEKHGNFSVKSGY-RLAHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCD

Query:  DDTEDCLHLFWTCPVVKSMWLGSKFAL-FHQSFSHFRFEEIIGAMRDKLTGPDFE----LVVIFWWSVWSLRNNL-FWGGQSDGRDLWAYSSDYLHAFHV
           E   HL + C   +  W  S   +     ++   +  +          P +E    LV    W +W  RN L F G + + +++   + D L  + +
Subjt:  DDTEDCLHLFWTCPVVKSMWLGSKFAL-FHQSFSHFRFEEIIGAMRDKLTGPDFE----LVVIFWWSVWSLRNNL-FWGGQSDGRDLWAYSSDYLHAFHV

Query:  --GGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS------RC---WSVDLAEGWAVYKGIQLARQLGFV--------------------DFVV
              CG +  +      +   G WRPPP++ +K N DA+      RC   W +   +G   + G +   +L  V                    ++V+
Subjt:  --GGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS------RC---WSVDLAEGWAVYKGIQLARQLGFV--------------------DFVV

Query:  -ETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVD
         E+DS  L++ILN +      +   + D++R+LS +   K +F PR+GN +A  +AR + S+++
Subjt:  -ETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVD

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.6e-2640.27Show/hide
Query:  AIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPK-CLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLE
        A+P Y M+CFRL + L K++  +M  FWWS  E +R+I W++W  LC  K   GGLGFR++  FNQALLAKQ +R++  P +LL  +L+ RYFP S  +E
Subjt:  AIPCYTMNCFRLPRCLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPK-CLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLE

Query:  AVRGCR----WR----------------IGNGRATPIYGSNWLPNEFSL
           G R    WR                IG+G  T ++   W+ +E  L
Subjt:  AVRGCR----WR----------------IGNGRATPIYGSNWLPNEFSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTACGACTGGGTTTTGCGCAGCAGTGGGTTGATCTGATCCTCCGGTGTGTCAGCTCGGTTTCATTTTCCTTCAATCTGAATGGGGAAAAGGTGGGGCAGGTGGTACC
GTCTAGGGGTCTCCGGCAGGGGGATCCCCTCTCCCATATCTATTTCTGCTATGTGCTGAATGGTTGTCTAGTCTGCTGCGTGGAGCTGAGCGGAGGTTTCAGATTACGGG
ATTTCGGGTTGGACGCTCTAGCCCATCGATTTCGCACCTTTTTTTTGCAGATGACAGCCTCCTCTTCTTCAAGGCCATCGGGAGTGAAGCGCTGGTTATTCGGGAGCTGC
TGGAACGATATGAGAGGGCGTCAGGGGCAGACTATCAATTATGAGAAGTCTGTTGTTGCTTTTAGCCCAAATACGGGGGAAGAGGCTCAGCAGTATATCAGTCAAGTTCT
TGCCGTGTCCCGCTGCCCTTGTCATCAGCAGTATCTAGGCCTGCCCTCGTTCATGCCACGGAACCGGTCGGGGGCGTTGAAGTTTGTGACGGATCGGATTTGGCGCCAAA
TTCAGGGATGGAAGGGCAAGTTCTTCTCAGTGGCGGGGAAGGAGGTTCTCCTCAAGTCCATAGTTCAGGCGATTCCTTGCTACACGATGAATTGTTTTCGGCTGCCCAGG
TGTTTGATCAAGGAGATTCACAGGTCTATGGCTAGATTTTGGTGGAGTGGATCTGAAGAAGAGAGACGAATACATTGGCTGAGTTGGGATGCTCTATGTCTCCCAAAGTG
CTTGGGTGGGTTGGGGTTCCGTAATATGGAGCTTTTTAACCAAGCCCTGCTGGCTAAACAGTGTTGGCGTGTTCTCCAGGATCCTTCCTCCCTTCTAGGCTGTGTGCTCA
AGGGCCGCTATTTTCCCCAGTCGGGTTTCTTGGAGGCAGTTCGTGGATGTCGTTGGAGGATTGGTAATGGGCGTGCTACGCCCATATATGGCTCAAACTGGCTGCCGAAT
GAGTTCTCGCTTCAAATACAGTCGGCTCCAGTACTTTCTCCTACTAGTACGGTGAGTGAGTTGTTCACTGCGTCTGGTGGATGGGATGTGGCTCTACTCAGGACGATTTT
CAATGGGGCTGATTGTGAGGCTATTTTGAGAATTCCTCTACAACAGGGCTCGGGGGAGGACCGCTTAATCTGGCACTTTGAGAAGCATGGGAATTTTTCGGTGAAGAGTG
GGTATCGGCTTGCTCATACATTGGCTATTCAGGACCGACCTGGTTCCTCGAATTCCGAGAGAGTGCGCATGTGGTGGTCCGGCCTCTGGAGGTTGAATGTGCCCAATAAG
CATAGGTTCTTCCTCTGGCGTCTGTGCCACGACCGCTTGCCAACTAAGGTAAACCTTCTCAAACGTGGACTCACTGTATCCCCTTTGTGTGTTTTGTGCGATGACGATAC
AGAAGATTGCCTCCATCTGTTCTGGACCTGCCCTGTGGTTAAGAGTATGTGGTTGGGTTCCAAATTTGCCCTCTTCCACCAATCCTTTTCCCATTTCAGGTTCGAGGAAA
TCATTGGGGCGATGAGGGACAAACTGACAGGGCCTGATTTTGAGCTTGTGGTGATTTTTTGGTGGTCTGTGTGGAGCCTACGAAATAACCTGTTTTGGGGTGGGCAGTCA
GACGGTCGGGATCTCTGGGCATATTCGAGTGATTACCTCCATGCCTTCCATGTCGGTGGGGGACGTTGCGGGGTAAGGGACTCCTTATGGGCTCAATCGGGGGAGCAAGA
AGAGCGCGGTGTATGGAGACCGCCCCCTAATAGGGTGCTGAAACTTAATATTGATGCTTCACGGTGTTGGAGCGTGGATTTGGCTGAGGGTTGGGCTGTGTATAAAGGGA
TCCAACTTGCTCGACAGTTGGGGTTTGTGGATTTTGTGGTGGAGACTGACTCTCTAAGACTGGTCAAAATTTTGAATGGGGAGCTGCATGATGTGTCGGAAGTGGGGCTG
CTGATGGATGACATTCGACGGATCCTCAGTCCTTGGGTCAACGGTAAGGTGTTGTTTACTCCACGTCAGGGGAATAAGGTGGCGCATGTTCTGGCCCGCCTGGCCTTTTC
ATATGTTGATCGTGTATGGCTTGAGGAGTGGCCTAGCGAGGTCTCGGATGTCCTGAGGGGTGATTTTGCTTCAGTTGCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTACGACTGGGTTTTGCGCAGCAGTGGGTTGATCTGATCCTCCGGTGTGTCAGCTCGGTTTCATTTTCCTTCAATCTGAATGGGGAAAAGGTGGGGCAGGTGGTACC
GTCTAGGGGTCTCCGGCAGGGGGATCCCCTCTCCCATATCTATTTCTGCTATGTGCTGAATGGTTGTCTAGTCTGCTGCGTGGAGCTGAGCGGAGGTTTCAGATTACGGG
ATTTCGGGTTGGACGCTCTAGCCCATCGATTTCGCACCTTTTTTTTGCAGATGACAGCCTCCTCTTCTTCAAGGCCATCGGGAGTGAAGCGCTGGTTATTCGGGAGCTGC
TGGAACGATATGAGAGGGCGTCAGGGGCAGACTATCAATTATGAGAAGTCTGTTGTTGCTTTTAGCCCAAATACGGGGGAAGAGGCTCAGCAGTATATCAGTCAAGTTCT
TGCCGTGTCCCGCTGCCCTTGTCATCAGCAGTATCTAGGCCTGCCCTCGTTCATGCCACGGAACCGGTCGGGGGCGTTGAAGTTTGTGACGGATCGGATTTGGCGCCAAA
TTCAGGGATGGAAGGGCAAGTTCTTCTCAGTGGCGGGGAAGGAGGTTCTCCTCAAGTCCATAGTTCAGGCGATTCCTTGCTACACGATGAATTGTTTTCGGCTGCCCAGG
TGTTTGATCAAGGAGATTCACAGGTCTATGGCTAGATTTTGGTGGAGTGGATCTGAAGAAGAGAGACGAATACATTGGCTGAGTTGGGATGCTCTATGTCTCCCAAAGTG
CTTGGGTGGGTTGGGGTTCCGTAATATGGAGCTTTTTAACCAAGCCCTGCTGGCTAAACAGTGTTGGCGTGTTCTCCAGGATCCTTCCTCCCTTCTAGGCTGTGTGCTCA
AGGGCCGCTATTTTCCCCAGTCGGGTTTCTTGGAGGCAGTTCGTGGATGTCGTTGGAGGATTGGTAATGGGCGTGCTACGCCCATATATGGCTCAAACTGGCTGCCGAAT
GAGTTCTCGCTTCAAATACAGTCGGCTCCAGTACTTTCTCCTACTAGTACGGTGAGTGAGTTGTTCACTGCGTCTGGTGGATGGGATGTGGCTCTACTCAGGACGATTTT
CAATGGGGCTGATTGTGAGGCTATTTTGAGAATTCCTCTACAACAGGGCTCGGGGGAGGACCGCTTAATCTGGCACTTTGAGAAGCATGGGAATTTTTCGGTGAAGAGTG
GGTATCGGCTTGCTCATACATTGGCTATTCAGGACCGACCTGGTTCCTCGAATTCCGAGAGAGTGCGCATGTGGTGGTCCGGCCTCTGGAGGTTGAATGTGCCCAATAAG
CATAGGTTCTTCCTCTGGCGTCTGTGCCACGACCGCTTGCCAACTAAGGTAAACCTTCTCAAACGTGGACTCACTGTATCCCCTTTGTGTGTTTTGTGCGATGACGATAC
AGAAGATTGCCTCCATCTGTTCTGGACCTGCCCTGTGGTTAAGAGTATGTGGTTGGGTTCCAAATTTGCCCTCTTCCACCAATCCTTTTCCCATTTCAGGTTCGAGGAAA
TCATTGGGGCGATGAGGGACAAACTGACAGGGCCTGATTTTGAGCTTGTGGTGATTTTTTGGTGGTCTGTGTGGAGCCTACGAAATAACCTGTTTTGGGGTGGGCAGTCA
GACGGTCGGGATCTCTGGGCATATTCGAGTGATTACCTCCATGCCTTCCATGTCGGTGGGGGACGTTGCGGGGTAAGGGACTCCTTATGGGCTCAATCGGGGGAGCAAGA
AGAGCGCGGTGTATGGAGACCGCCCCCTAATAGGGTGCTGAAACTTAATATTGATGCTTCACGGTGTTGGAGCGTGGATTTGGCTGAGGGTTGGGCTGTGTATAAAGGGA
TCCAACTTGCTCGACAGTTGGGGTTTGTGGATTTTGTGGTGGAGACTGACTCTCTAAGACTGGTCAAAATTTTGAATGGGGAGCTGCATGATGTGTCGGAAGTGGGGCTG
CTGATGGATGACATTCGACGGATCCTCAGTCCTTGGGTCAACGGTAAGGTGTTGTTTACTCCACGTCAGGGGAATAAGGTGGCGCATGTTCTGGCCCGCCTGGCCTTTTC
ATATGTTGATCGTGTATGGCTTGAGGAGTGGCCTAGCGAGGTCTCGGATGTCCTGAGGGGTGATTTTGCTTCAGTTGCGTGA
Protein sequenceShow/hide protein sequence
MLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGLRQGDPLSHIYFCYVLNGCLVCCVELSGGFRLRDFGLDALAHRFRTFFLQMTASSSSRPSGVKRWLFGSC
WNDMRGRQGQTINYEKSVVAFSPNTGEEAQQYISQVLAVSRCPCHQQYLGLPSFMPRNRSGALKFVTDRIWRQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPR
CLIKEIHRSMARFWWSGSEEERRIHWLSWDALCLPKCLGGLGFRNMELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAVRGCRWRIGNGRATPIYGSNWLPN
EFSLQIQSAPVLSPTSTVSELFTASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNK
HRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDTEDCLHLFWTCPVVKSMWLGSKFALFHQSFSHFRFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQS
DGRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDASRCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGL
LMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRVWLEEWPSEVSDVLRGDFASVA