; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010379 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010379
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr9:46803696..46805936
RNA-Seq ExpressionLag0010379
SyntenyLag0010379
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017250619.1 PREDICTED: uncharacterized protein LOC108221234 [Daucus carota subsp. sativus]6.1e-14536.81Show/hide
Query:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAMLVSARNRS-MAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNK
        +NG+ +G + PSRG+RQGDPLSPYLFLIC EG SA+L  A NRS + G+ +AR+ P ISHLFFADDSL+FLKA+      ++ I   Y + SGQ +NFNK
Subjt:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAMLVSARNRS-MAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNK

Query:  STVCYSKNVIGESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKI
        S + +S N   + +      + M+ ++++ TYLGLP      K + F  + +KVWA L  W+  +FSQGGKE+L+K++IQA+PTY M CF+IP+G   +I
Subjt:  STVCYSKNVIGESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKI

Query:  SSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELL
          L A++WWGS   KRK+HW+ W ++  PK  GGL FR  +++NQALLAKQ WR+L  P   + ++L+ KYF +S+ LD+ +    S  W+  +WG  LL
Subjt:  SSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELL

Query:  KNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIENAVVAEFLTPSFQWDMEKLNQHLMKEDVDMIKCLPISPL-APDRWIWHYDSRGEYTVKSGY
          GLR+ +G+  S   F+DPWL R  +F   +      E   V E++T    W+ E + Q  +  D+ +I  +P+S     D W WHY+S+G YTVKSGY
Subjt:  KNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIENAVVAEFLTPSFQWDMEKLNQHLMKEDVDMIKCLPISPL-APDRWIWHYDSRGEYTVKSGY

Query:  KLSMVNRQEDSLSDRGRESRWWKKLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWGIISPPSDNVSYY
        KL     ++ S S      +WWK  W   +P KI +F W+ +H  +PT   L    + +   C +C    ++  HA+F C  A+ VW +++ P       
Subjt:  KLSMVNRQEDSLSDRGRESRWWKKLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWGIISPPSDNVSYY

Query:  QMNVKDRWLGLAECKDR-DLECICVGAWAIWNDRNSLLY--NRPVPDVVRRYEWIAKYLNDYRN--VNLKGAVLYQSKEEVSCIISESEGIILHTDASFI
        +++ KD  L + E  ++ D++ + +  W IW +RN L++   R  P  ++   W++ Y  + +N  V+   A+          +   S G  L  DA+  
Subjt:  QMNVKDRWLGLAECKDR-DLECICVGAWAIWNDRNSLLY--NRPVPDVVRRYEWIAKYLNDYRN--VNLKGAVLYQSKEEVSCIISESEGIILHTDASFI

Query:  GEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVRFKY
           E  G+G  I   N   +AT S       S L AEA+A++ GL+ A         +L+DS ++++++N E +  + +  +I D +E  +YF  V+  +
Subjt:  GEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVRFKY

Query:  ISRRFNCFAHSLARVGMSDTTLVLWLQN
        +SR  N  AH+LAR  +     V WL++
Subjt:  ISRRFNCFAHSLARVGMSDTTLVLWLQN

XP_023880912.1 uncharacterized protein LOC111993298 [Quercus suber]2.1e-13736.66Show/hide
Query:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAML-VSARNRSMAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNK
        +NGE  G I+PSRG+RQGDPLSPYLFL+C+EGL+ ML  +A N  + G S+ +  PKISHLFFADDSL+F +A+  +   ++ IL  YE+ASGQ +N  K
Subjt:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAML-VSARNRSMAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNK

Query:  STVCYSKNVIGESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKI
        +T+ +SK V  E+K  L + + + E      YLGLP+     +    +F+ ++VW+ LQGWK+KL SQ G+EVL+K+++QAIPT+AMGCF++P G+  +I
Subjt:  STVCYSKNVIGESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKI

Query:  SSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELL
         +L  KF+WG   ++RK+HWK+W+ LC PK  GGL F+DL  FN A+LAKQVWR+L + +   +R+ + KYFP  ++ +A      S+ W+  +   +++
Subjt:  SSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELL

Query:  KNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIENAVVAEFLTPSFQ-WDMEKLNQHLMKEDVDMIKCLPIS-PLAPDRWIWHYDSRGEYTVKSG
          G+R  +G   SI ++ + WLP   + ++ S     ++ A VA  + P+ + W+++ LNQH +  +V  I  +P+S     DR IW     GEY VKSG
Subjt:  KNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIENAVVAEFLTPSFQ-WDMEKLNQHLMKEDVDMIKCLPIS-PLAPDRWIWHYDSRGEYTVKSG

Query:  YKL--SMVNRQEDSLSDRGRESRWWKKLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWG-IISPPSDN
        Y++     N    S SD    S +WK+LWKL VPNKIK F+W+   N +PTK NL    +L D  C  C ++ ET  HAL+ C   K +W    S     
Subjt:  YKL--SMVNRQEDSLSDRGRESRWWKKLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWG-IISPPSDN

Query:  VSYYQMNVKDRWLGLAECKDRDLECICVGAWAIWNDRNSLLYNRPVPDVVRRYEWIAKYLNDYRNVNLKGAVLY-QSKEEVSCIISESEGII-LHTDASF
        V Y Q  ++   + L   +   LE   V AW IW  RN L  N       + +E    YL D+++   K  V   Q ++E +  +   +G+   + D + 
Subjt:  VSYYQMNVKDRWLGLAECKDRDLECICVGAWAIWNDRNSLLYNRPVPDVVRRYEWIAKYLNDYRNVNLKGAVLY-QSKEEVSCIISESEGII-LHTDASF

Query:  IGEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVRFK
          E E  GIG+++RD  G++ A  +       S    EA+A     +   +L +       DS  V  ++    +  S++  II D         +  F 
Subjt:  IGEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVRFK

Query:  YISRRFNCFAHSLARVGMSDTTLVLWLQNYP
        +  R+ NC AH+LA+  +    L++W+++ P
Subjt:  YISRRFNCFAHSLARVGMSDTTLVLWLQNYP

XP_030479133.1 uncharacterized protein LOC115696372 [Cannabis sativa]2.1e-13735.27Show/hide
Query:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAML-VSARNRSMAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNK
        +NGE  G + P RG+RQGDPLSPYLFLIC+EGLS +L    +   + G++V+R  P ISHLFFADDSL+F +A     G +K  L  Y +ASGQ +N +K
Subjt:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAML-VSARNRSMAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNK

Query:  STVCYSKNVIGESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKI
        S + +S N     +     I+ M   +    YLGLP+     KS  F  + +K+W ++  W  K+FS GGKEVL+K+++Q+IPTYAM CFR+P  + ++I
Subjt:  STVCYSKNVIGESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKI

Query:  SSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELL
         ++ AKFWWGS+   +K+HWK+W  LC  K  GG+ FR  V+FNQALLAKQ WR+  +P   + R+L+G YF  +  + A     SS  W+  +WG ELL
Subjt:  SSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELL

Query:  KNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIENAVVAEFLTPSFQWDMEKLNQHLMKEDVDMIKCLPISPL-APDRWIWHYDSRGEYTVKSGY
          GLR  VG+ ++I    D W+P    FK + +      +  VA+++T + +W++E L       DVD I  +P+S L   DRWIWHY+  G+Y+V SGY
Subjt:  KNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIENAVVAEFLTPSFQWDMEKLNQHLMKEDVDMIKCLPISPL-APDRWIWHYDSRGEYTVKSGY

Query:  KLSMVNRQEDSLSDRGRESRWWKKLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWGIISPPSDNVSYY
         L+    +ED  S    +  WWK  WKL +P+K+K+F WK   +SIP   +L++  +L    C +C+   E+  HALF C  AK VW       D  +  
Subjt:  KLSMVNRQEDSLSDRGRESRWWKKLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWGIISPPSDNVSYY

Query:  QMNVKDRWLGLAECKDRD-LECICVGAWAIWNDRNSLLYNRPVPDVVRRYEWIAKYLNDYRNVNLKGAVLYQSKEEVSCIIS----ESEGIILHTDASFI
        ++   D  + L+   ++   E I    W IW+DRN+ ++ + V   ++ +     Y++ YR++         ++   + + S          L+ DA+  
Subjt:  QMNVKDRWLGLAECKDRD-LECICVGAWAIWNDRNSLLYNRPVPDVVRRYEWIAKYLNDYRNVNLKGAVLYQSKEEVSCIIS----ESEGIILHTDASFI

Query:  GEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVRFKY
              GIG+++R+  G ++A  S  A+      E EA A+  GL  A    +P   + +D L ++ ++N ++   S    ++ D+K     F +    +
Subjt:  GEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVRFKY

Query:  ISRRFNCFAHSLARVGMSDTTLVLWLQNYPQWMVSLALQE
        I R  N  AH LAR  +      +WL++ P  + S+ + +
Subjt:  ISRRFNCFAHSLARVGMSDTTLVLWLQNYPQWMVSLALQE

XP_030483669.1 uncharacterized protein LOC115700241 [Cannabis sativa]6.8e-13635.38Show/hide
Query:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAML-VSARNRSMAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNK
        +NG+  G + P+RGIRQGDPLSPYLFLIC EGLS +L  S +N S+ G+ V+R  P +SHLFFADDS++F++A  +    +K IL  Y +ASGQ VN +K
Subjt:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAML-VSARNRSMAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNK

Query:  STVCYSKNVIGESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKI
          + +S N   +++ +  +++ M        YLGLPS     K+  F  + DK+W +L  WK++LFS GGKEVL+K+++QAIPTYAM CFR+P  + ++I
Subjt:  STVCYSKNVIGESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKI

Query:  SSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELL
         S+ + FWWGS      +HWK W+ LC  K  GGL FR+ + FNQALLAKQ WR+L +PN  + R+L  +YF N  +L A      S  W+  +WG ELL
Subjt:  SSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELL

Query:  KNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIENAVVAEFLTPSFQWDMEKLNQHLMKEDVDMIKCLPISPL-APDRWIWHYDSRGEYTVKSGY
          GL+  VG+  +I    D WLP  +TF  FS    +  +  VA+ +    QWD+  ++ +    D D I  +P+S   A D  IW+  + G YTVKSGY
Subjt:  KNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIENAVVAEFLTPSFQWDMEKLNQHLMKEDVDMIKCLPISPL-APDRWIWHYDSRGEYTVKSGY

Query:  KLSMVNRQEDSLSDRGRESRWWKKLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVW-----------GI
        + ++        +       WW K WKL +P+KI++FVWK FHN++P    L   H+    +C +CK   ET +HALF C RAK VW             
Subjt:  KLSMVNRQEDSLSDRGRESRWWKKLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVW-----------GI

Query:  ISPPSDNVSYYQMNVKDRWLGLAECKDRDLECICVGAWAIWNDRNSLLYNRPVPDVVRRYEWIAKYLNDYRNVNLKGAVLYQSKEEVSCIISESE-----
         +  +D + Y   N              + E   V  W+IW +RN+  +N+P        ++   YL  Y+N     +    S    +   S ++     
Subjt:  ISPPSDNVSYYQMNVKDRWLGLAECKDRDLECICVGAWAIWNDRNSLLYNRPVPDVVRRYEWIAKYLNDYRNVNLKGAVLYQSKEEVSCIISESE-----

Query:  ------------GIILHTDASFIGEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQS
                     + L++DA+        GIG V+RD  G + A  S     C  P E EA+A+ + L+ A +L +    I +DSL V++ +     C S
Subjt:  ------------GIILHTDASFIGEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQS

Query:  SIATIIWDIKEASEYFESVRFKYISRRFNCFAHSLARVGMSDTTLVLWLQNYPQWMVSLALQERDT
        +   I+ D+     +F   +  ++ R  N +A  LA+  ++  T V WL+ +P  ++++ +  R+T
Subjt:  SIATIIWDIKEASEYFESVRFKYISRRFNCFAHSLARVGMSDTTLVLWLQNYPQWMVSLALQERDT

XP_030505432.1 uncharacterized protein LOC115720422 [Cannabis sativa]5.8e-13536.15Show/hide
Query:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAML-VSARNRSMAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNK
        +NG     I PSRG+RQGDPLSPYLFL+C+EGL+A L +  +  +  G+S+AR  P +SHL FADD+L+F KA       L+A L  Y +A+GQ VNF K
Subjt:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAML-VSARNRSMAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNK

Query:  STVCYSKNVIGESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKI
        S++ +S N   +   +      +     +  YLG+P  F   K   F+F+LD+V + L+ W +K FS+ GKEVL+K++IQAIP+YAM C+++P  I  KI
Subjt:  STVCYSKNVIGESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKI

Query:  SSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELL
         SL A+FWWGS   + K HWK W  LC  K  GG+ FR L++ NQALLAKQ WRVLT P+    ++L+ +YF +S+ LDA   ++ S+ W   LWG +LL
Subjt:  SSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELL

Query:  KNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIENAVVAEFLTPSFQWDMEKLNQHLMKEDVDMIKCLPISPL-APDRWIWHYDSRGEYTVKSGY
        KNGL   VG+  +I   +D W+P L   +        + +  V+ F+  +  WD+ +L+ +   + V+ I  +PI      D  IW  DS G  TVKS Y
Subjt:  KNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIENAVVAEFLTPSFQWDMEKLNQHLMKEDVDMIKCLPISPL-APDRWIWHYDSRGEYTVKSGY

Query:  KLSMVNRQEDSLSDRGRESRWWKKLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWGIISPPSDNVSYY
         L+  +    S S+    +RWWK  W   +P KIK F W++FH+ +PT  NL+   V+    C  C   +ET  HAL  C+R + VW +       +S+ 
Subjt:  KLSMVNRQEDSLSDRGRESRWWKKLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWGIISPPSDNVSYY

Query:  QMNVKDRWL-GLAECKDRDLECICVGAWAIWNDRNSLLYNRPVPDVVRRYEWIAKYLNDYRNVNL---KGAVLYQSKEEVSCIISESEGIILHTDASFIG
          ++KD  L    +    D   +    W+IW  RN  L+    PD     +WIA YL+ YR   +   + A   Q+  + S          L TDA+   
Subjt:  QMNVKDRWL-GLAECKDRDLECICVGAWAIWNDRNSLLYNRPVPDVVRRYEWIAKYLNDYRNVNL---KGAVLYQSKEEVSCIISESEGIILHTDASFIG

Query:  EGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVRFKYI
             G+G VI+D NG + A  S+       PL AEA+A+  GL    ++ +P  +I +D   ++  I    + +S++A +I DIK +  +       +I
Subjt:  EGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVRFKYI

Query:  SRRFNCFAHSLARVGMSDTTLVLWLQNYPQWMV
         R  N  AH +A+  +     ++W    P  +V
Subjt:  SRRFNCFAHSLARVGMSDTTLVLWLQNYPQWMV

TrEMBL top hitse value%identityAlignment
A0A2N9I509 Uncharacterized protein7.8e-13837.11Show/hide
Query:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAMLVSA-RNRSMAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNK
        +NGE  G++ PSRG+RQGDPLSPYLFLIC EGLSA++  A R+  + GIS+ R  P+ISHLFFADDS++F +A   +   ++ IL  YEKASGQ VN +K
Subjt:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAMLVSA-RNRSMAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNK

Query:  STVCYSKNVIGESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKI
        + + +S N     +  + N+           YLGLP      K   FH + D++W  LQGWK+KL SQ GKEVLIK++IQA+PTYAM CF+ P G+ ++I
Subjt:  STVCYSKNVIGESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKI

Query:  SSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELL
        SS+   FWWG  +  RK+HW    +L  PK  GG+ FRDL  FN+ALLA+Q WR+L +P   + R L+ KYFPNS+ L+A    N+SF W+      ++L
Subjt:  SSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELL

Query:  KNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIENAVVAEFL-TPSFQWDMEKLNQHLMKEDVDMIKCLPISPLAP-DRWIWHYDSRGEYTVKSG
          GLR  VG+   I+++ D WLP  STFKV S P    + A V + +   + +W ++ L++     D ++I+ +P+S   P D  IW     G ++V+S 
Subjt:  KNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIENAVVAEFL-TPSFQWDMEKLNQHLMKEDVDMIKCLPISPLAP-DRWIWHYDSRGEYTVKSG

Query:  YK--LSMVNRQEDSL-SDRGRESRWWKKLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWGIISPPSDN
        Y   LS  NR E S+ S R  ES++W  LW + VP K+K+F+WK+  N +PT+  L++  +     CH C EE ET  H L+ C  A+ VW   S P   
Subjt:  YK--LSMVNRQEDSL-SDRGRESRWWKKLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWGIISPPSDN

Query:  VSYYQMNVKDRWLG-LAECKDRDLECICVGAWAIWNDRNSLLYNRPVPDVVRRYEWIAKYLNDYRNVNLKG--AVLYQSKEEVSCIISESEGIILHTDAS
            +MN ++     L+  +   LE     AWA+WN RN   ++  VP+V       A    D+    LKG   +   S  +      +     L+    
Subjt:  VSYYQMNVKDRWLG-LAECKDRDLECICVGAWAIWNDRNSLLYNRPVPDVVRRYEWIAKYLNDYRNVNLKG--AVLYQSKEEVSCIISESEGIILHTDAS

Query:  FIGEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVRF
         I      G+G++IRD  G++ A+     + C   L+  A   L  ++ A+D+ + +L +      +++ I +   C + I  II D+   +  F+ + F
Subjt:  FIGEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVRF

Query:  KYISRRFNCFAHSLARVGMSDTTLVLWLQNYPQWMVSLALQ
         +I +  N  A  LA   +S + L +WL ++P   +SL +Q
Subjt:  KYISRRFNCFAHSLARVGMSDTTLVLWLQNYPQWMVSLALQ

A0A803NM27 Uncharacterized protein2.8e-14336.61Show/hide
Query:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAMLVSARN-RSMAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNK
        +NG   G +KP RG+RQGDPLSPYLFLIC+EGLS +L    +   + G++V+R  P +SHL FADDSL+F +A     G +K +L  Y KASGQ +N +K
Subjt:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAMLVSARN-RSMAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNK

Query:  STVCYSKNVIGESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKI
        S + +S N    SK+   NI+ M   +   +YLGLP+     K   F+ + +++W +L  W  K+FS GGKEVL+K++IQ+IPTYAM CF++P     +I
Subjt:  STVCYSKNVIGESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKI

Query:  SSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELL
         SL + FWWGS   K+K+HWK+W  LC  K  GGL FR+ ++FNQALLAKQ WR+  NP   +FR+L+G+YF  S  L A     SS  W+ F WG ELL
Subjt:  SSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELL

Query:  KNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIENAVVAEFLTPSFQWDMEKLNQHLMKEDVDMIKCLPISPLA-PDRWIWHYDSRGEYTVKSGY
        K GLR  VG+   I    DPW+P  S F              VA+++TP  +W++ KLN      DV+ I  LP+S  A  D W+WH  + G+Y VKSGY
Subjt:  KNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIENAVVAEFLTPSFQWDMEKLNQHLMKEDVDMIKCLPISPLA-PDRWIWHYDSRGEYTVKSGY

Query:  KLSMVNRQEDSLSDRGRESRWWKKLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWGIISPPSDNVSYY
         ++ +   E+ +S       WWK  W+L +P K+K+F WK+ HN++P    L+    L    C +C    E+  HA+F C  A+ VW I     +N +  
Subjt:  KLSMVNRQEDSLSDRGRESRWWKKLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWGIISPPSDNVSYY

Query:  QMNVKDRWLGLAECKDR-DLECICVGAWAIWNDRNSLLYNR--PVPDVVRRYEWIAKYLNDYRNVNL----KGAVLYQSKEEVSCIISESEGII-LHTDA
         M ++D    ++EC  + +LE I    W+IW+DRN++L+ +    P V+      A +L+ +++        G     +             ++ L+ DA
Subjt:  QMNVKDRWLGLAECKDR-DLECICVGAWAIWNDRNSLLYNR--PVPDVVRRYEWIAKYLNDYRNVNL----KGAVLYQSKEEVSCIISESEGII-LHTDA

Query:  SFIGEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVR
        +F    +  G G +IRD  GN++A  S     C  P + EA  +   L+ A  LN     + +DSL ++ ++ +     SS   +I+D++    Y  +V 
Subjt:  SFIGEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVR

Query:  FKYISRRFNCFAHSLARVGMSDTTLVLWLQNYPQWMVSLALQE
          ++ R  N  AH LA+  +    +  WL+++P  ++S+ +++
Subjt:  FKYISRRFNCFAHSLARVGMSDTTLVLWLQNYPQWMVSLALQE

A0A803Q8J4 Uncharacterized protein6.0e-14636.84Show/hide
Query:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAMLVSARN-RSMAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNK
        +NG   G +KP RG+RQGDPLSPYLFLIC+EGLS +L    +   + G++++R  P ISHL FADDSL+F +A     G +K +L  Y KASGQ +N +K
Subjt:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAMLVSARN-RSMAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNK

Query:  STVCYSKNVIGESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKI
        S + +S N    +K    NI+ M   D   +YLGLP+     K   F+ + +++W +L  W  K+FS GGKEVL+K++IQ+IPTYAM CF++P     +I
Subjt:  STVCYSKNVIGESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKI

Query:  SSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELL
         S+ + +WWG+   K+K+HWK+W  LC+ K  GGL FR+ ++FNQALLAKQ WR+  N N  +FR+L+G+YFP +  L A     SS  W+   WG ELL
Subjt:  SSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELL

Query:  KNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIENAVVAEFLTPSFQWDMEKLNQHLMKEDVDMIKCLPISPLA-PDRWIWHYDSRGEYTVKSGY
        K G+RK VG+  SI    DPW+P    F    +  +   N+VVA+++TP  +W+  KL+      DV  I  LP+S  A PD WIWH  + GEY VKSGY
Subjt:  KNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIENAVVAEFLTPSFQWDMEKLNQHLMKEDVDMIKCLPISPLA-PDRWIWHYDSRGEYTVKSGY

Query:  KLSMVNRQEDSLSDRGRESRWWKKLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWGIISPPSDNVSYY
          +  +  + + S     + WWK  W+L +P K+K+F WK+ HN++P    L+    L    C +C    E+  HALF C  A+ VW +     +N +  
Subjt:  KLSMVNRQEDSLSDRGRESRWWKKLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWGIISPPSDNVSYY

Query:  QMNVKDRWLGLAECKDR-DLECICVGAWAIWNDRNSLLYNRPVPDVVRRYEWIAKYLNDYRN---VNLKGAVLYQSKEEVSCIISESEG--IILHTDASF
         MN++D    ++E   + +LE I    W+IW+DRN++++ +              +LN+Y++   ++L   +   +   +S   S      + L+ DA+F
Subjt:  QMNVKDRWLGLAECKDR-DLECICVGAWAIWNDRNSLLYNRPVPDVVRRYEWIAKYLNDYRN---VNLKGAVLYQSKEEVSCIISESEG--IILHTDASF

Query:  IGEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVRFK
               G G +IRD NGN++A  S     C  P E EA  +   L+ A  LN     + +DSL +  ++ +    +SS   +I+D++    Y  +V   
Subjt:  IGEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVRFK

Query:  YISRRFNCFAHSLARVGMSDTTLVLWLQNYPQWMVSLALQE
        ++ R  N  AH LA+  +    +  WL+++P  ++S+ +++
Subjt:  YISRRFNCFAHSLARVGMSDTTLVLWLQNYPQWMVSLALQE

A0A803QGT2 Uncharacterized protein1.0e-13735.27Show/hide
Query:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAML-VSARNRSMAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNK
        +NGE  G + P RG+RQGDPLSPYLFLIC+EGLS +L    +   + G++V+R  P ISHLFFADDSL+F +A     G +K  L  Y +ASGQ +N +K
Subjt:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAML-VSARNRSMAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNK

Query:  STVCYSKNVIGESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKI
        S + +S N     +     I+ M   +    YLGLP+     KS  F  + +K+W ++  W  K+FS GGKEVL+K+++Q+IPTYAM CFR+P  + ++I
Subjt:  STVCYSKNVIGESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKI

Query:  SSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELL
         ++ AKFWWGS+   +K+HWK+W  LC  K  GG+ FR  V+FNQALLAKQ WR+  +P   + R+L+G YF  +  + A     SS  W+  +WG ELL
Subjt:  SSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELL

Query:  KNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIENAVVAEFLTPSFQWDMEKLNQHLMKEDVDMIKCLPISPL-APDRWIWHYDSRGEYTVKSGY
          GLR  VG+ ++I    D W+P    FK + +      +  VA+++T + +W++E L       DVD I  +P+S L   DRWIWHY+  G+Y+V SGY
Subjt:  KNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIENAVVAEFLTPSFQWDMEKLNQHLMKEDVDMIKCLPISPL-APDRWIWHYDSRGEYTVKSGY

Query:  KLSMVNRQEDSLSDRGRESRWWKKLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWGIISPPSDNVSYY
         L+    +ED  S    +  WWK  WKL +P+K+K+F WK   +SIP   +L++  +L    C +C+   E+  HALF C  AK VW       D  +  
Subjt:  KLSMVNRQEDSLSDRGRESRWWKKLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWGIISPPSDNVSYY

Query:  QMNVKDRWLGLAECKDRD-LECICVGAWAIWNDRNSLLYNRPVPDVVRRYEWIAKYLNDYRNVNLKGAVLYQSKEEVSCIIS----ESEGIILHTDASFI
        ++   D  + L+   ++   E I    W IW+DRN+ ++ + V   ++ +     Y++ YR++         ++   + + S          L+ DA+  
Subjt:  QMNVKDRWLGLAECKDRD-LECICVGAWAIWNDRNSLLYNRPVPDVVRRYEWIAKYLNDYRNVNLKGAVLYQSKEEVSCIIS----ESEGIILHTDASFI

Query:  GEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVRFKY
              GIG+++R+  G ++A  S  A+      E EA A+  GL  A    +P   + +D L ++ ++N ++   S    ++ D+K     F +    +
Subjt:  GEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVRFKY

Query:  ISRRFNCFAHSLARVGMSDTTLVLWLQNYPQWMVSLALQE
        I R  N  AH LAR  +      +WL++ P  + S+ + +
Subjt:  ISRRFNCFAHSLARVGMSDTTLVLWLQNYPQWMVSLALQE

A0A803QJN9 Uncharacterized protein8.1e-13535.44Show/hide
Query:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAML-VSARNRSMAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNK
        +NG+  G + P+RGIRQGDPLSPYLFLIC EGLS +L  S +N S+ G+ V+R  P +SHLFFADDS++F++A  +    +K IL  Y +ASGQ VN +K
Subjt:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAML-VSARNRSMAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNK

Query:  STVCYSKNVIGESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKI
          + +S N   +++ +  +++ M        YLGLPS     K+  F  + DK+W +L  WK++LFS GGKEVL+K+++QAIPTYAM CFR+P  + ++I
Subjt:  STVCYSKNVIGESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKI

Query:  SSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELL
         S+ + FWWGS      +HWK W+ LC  K  GGL FR+ + FNQALLAKQ WR+L +PN  + R+L  +YF N  +L A      S  W+  +WG ELL
Subjt:  SSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELL

Query:  KNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIENAVVAEFLTPSFQWDMEKLNQHLMKEDVDMIKCLPISPL-APDRWIWHYDSRGEYTVKSGY
          GL+  VG+  +I    D WLP  +TF  FS    +  +  VA+ +    QWD+  ++ +    D D I  +P+S   A D  IW+  + G YTVKSGY
Subjt:  KNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIENAVVAEFLTPSFQWDMEKLNQHLMKEDVDMIKCLPISPL-APDRWIWHYDSRGEYTVKSGY

Query:  KLSMVNRQEDSLSDRGRESRWWKKLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVW-----------GI
        + ++        +       WW K WKL +P+KI++FVWK FHN++P    L   H+    +C +CK   ET +HALF C RAK VW             
Subjt:  KLSMVNRQEDSLSDRGRESRWWKKLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVW-----------GI

Query:  ISPPSDNVSYYQMNVKDRWLGLAECKDRDLECICVGAWAIWNDRNSLLYNRPVPDVVRRYEWIAKYLNDYRNVNLKGAVLYQSKEEVSCIISESE-----
         +  +D + Y   N              + E   V  W+IW +RN+  +N+P        ++   YL  Y+N     +    S    +   S ++     
Subjt:  ISPPSDNVSYYQMNVKDRWLGLAECKDRDLECICVGAWAIWNDRNSLLYNRPVPDVVRRYEWIAKYLNDYRNVNLKGAVLYQSKEEVSCIISESE-----

Query:  ------------GIILHTDASFIGEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQS
                     + L++DA+        GIG V+RD  G + A  S     C  P E EA+A+ + L+ A +L +    I +DSL V++ +     C S
Subjt:  ------------GIILHTDASFIGEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQS

Query:  SIATIIWDIKEASEYFESVRFKYISRRFNCFAHSLARVGMSDTTLVLWLQNYPQWMVSL
        +   I+ D+     +F   +  ++ R  N +A  LA+  ++  T V WL+ +P  ++++
Subjt:  SIATIIWDIKEASEYFESVRFKYISRRFNCFAHSLARVGMSDTTLVLWLQNYPQWMVSL

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657506.8e-4623.05Show/hide
Query:  FHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKISSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQA
        F  +L++V + + GW++K  S  G+  L K+++ ++P ++M    +P+ IL+++  L   F WGS  +K+K H  +W ++C+PK+ GGL  R   + N+A
Subjt:  FHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKISSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQA

Query:  LLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDA---VDQYNSSFFWKVFLWGM-ELLKNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIEN--
        L++K  WR+L   N +++ L+  K +    + D+   + + + S  W+    G+ +++ +G+    G    I  + D W        V   P  E++N  
Subjt:  LLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDA---VDQYNSSFFWKVFLWGM-ELLKNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIEN--

Query:  -------AVVAEFLTPSFQWDMEKLNQHLMKEDVDMIKCLPISPL--APDRWIWHYDSRGEYTVKSGYKLSMVNRQEDSLSDRGRESRWWKKLWKLCVPN
                V  +   P   WD  K++ +        ++ + +  +  A DR  W +   G+++V+S Y++  V+        R   + ++  LWK+ VP 
Subjt:  -------AVVAEFLTPSFQWDMEKLNQHLMKEDVDMIKCLPISPL--APDRWIWHYDSRGEYTVKSGYKLSMVNRQEDSLSDRGRESRWWKKLWKLCVPN

Query:  KIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWGIISPPSDNVSYYQMNVKDRWL-----GLAECKDRDLECI-CVGA
        ++K F+W   + ++ T+      H+     C VCK  +E+  H L  C     +W  + P      ++  ++ + WL       + C+D     I  V  
Subjt:  KIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWGIISPPSDNVSYYQMNVKDRWL-----GLAECKDRDLECI-CVGA

Query:  WAIWNDRNSLLY--NRPVPDVVRRY-EWIAKYLNDYRNVNLKGAVLYQSKEEVSCIISESEGIILHTDASFIGEGEHCGIGLVIRDRNGNLRATQSVGAV
        W  W  R   ++  N    D V+   EW  +    +    L G    + +  +  +      + ++TD +  G       G V+RD  G      S+   
Subjt:  WAIWNDRNSLLY--NRPVPDVVRRY-EWIAKYLNDYRNVNLKGAVLYQSKEEVSCIISESEGIILHTDASFIGEGEHCGIGLVIRDRNGNLRATQSVGAV

Query:  VCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVRFKYISRRFNCFAHSLARVGMS
         C++P +AE   V  GL  AW+  VPR+ +  DS  ++  +   +     ++ ++       +    VR  ++ R  N  A  LA    S
Subjt:  VCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVRFKYISRRFNCFAHSLARVGMS

P11369 LINE-1 retrotransposable element ORF2 protein4.9e-1227.63Show/hide
Query:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAMLVSARNRSMAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNKS
        +NGE    I    G RQG PLSPYLF I  E L+  +   + + + GI + ++  KIS L  ADD +V++         L  ++  + +  G  +N NKS
Subjt:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAMLVSARNRSMAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNKS

Query:  -TVCYSKNVIGESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHF--LLDKVWAILQGWKQKLFSQGGKEVLIKSII--QAIPTYAMGCFRIPKGI
            Y+KN   E +   T   ++  ++    YLG+  T       D +F  L  ++   L+ WK    S  G+  ++K  I  +AI  +     +IP   
Subjt:  -TVCYSKNVIGESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHF--LLDKVWAILQGWKQKLFSQGGKEVLIKSII--QAIPTYAMGCFRIPKGI

Query:  LSKISSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVW
         +++     KF W  N KK ++       L + +  GG+   DL  + +A++ K  W
Subjt:  LSKISSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVW

P92555 Uncharacterized mitochondrial protein AtMg012509.9e-1352.24Show/hide
Query:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAMLVSARNRS-MAGISVARDCPKISHLFFADDS
        +NG   G + PSRG+RQGDPLSPYLF++CTE LS +   A+ +  + GI V+ + P+I+HL FADD+
Subjt:  MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAMLVSARNRS-MAGISVARDCPKISHLFFADDS

P93295 Uncharacterized mitochondrial protein AtMg003101.4e-3043.36Show/hide
Query:  AIPTYAMGCFRIPKGILSKISSLCAKFWWGSNEKKRKMHWKRWDELCNPKE-LGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLD
        A+P YAM CFR+ K +  K++S   +FWW S E KRK+ W  W +LC  KE  GGL FRDL  FNQALLAKQ +R++  P+  + RLLR +YFP+S++++
Subjt:  AIPTYAMGCFRIPKGILSKISSLCAKFWWGSNEKKRKMHWKRWDELCNPKE-LGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLD

Query:  AVDQYNSSFFWKVFLWGMELLKNGLRKNVGSDHSIEMFRDPWL
               S+ W+  + G ELL  GL + +G     +++ D W+
Subjt:  AVDQYNSSFFWKVFLWGMELLKNGLRKNVGSDHSIEMFRDPWL

Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein8.0e-1824.69Show/hide
Query:  LWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVW---GII-----SPPSDNVSYYQMNVKDRWLGLAECKD
        +WKL V  KIK F+W+    ++ T   L + ++  D  C  C  E ET  H +F C   ++VW    II      PPS         ++          D
Subjt:  LWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVW---GII-----SPPSDNVSYYQMNVKDRWLGLAECKD

Query:  RDLECICVGAWAIWNDRNSLLYNRPV--PDVVRR------YEWI-AKYLNDYRNVNLKGAVLYQSKEEVSCIISESEG-IILHTDASFIGEGEHCGIGLV
        R L    +  W +W  RN  L+ +    PD   R       EW+ A    +  NV++    +  S+ + S      EG +  + D+ +     +   G  
Subjt:  RDLECICVGAWAIWNDRNSLLYNRPV--PDVVRR------YEWI-AKYLNDYRNVNLKGAVLYQSKEEVSCIISESEG-IILHTDASFIGEGEHCGIGLV

Query:  IRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVRFKYISRRFNCFAHS
        IR+ NG++    +      T  L AEA+  L+ L++ W   +  +   SDS +++  IN   +  S + T+I+DI+           ++++R  N  A +
Subjt:  IRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVRFKYISRRFNCFAHS

Query:  LARVGMSDTTLVLWLQNYPQWMVS
        LA    +   L       P W+V+
Subjt:  LARVGMSDTTLVLWLQNYPQWMVS

AT3G09510.1 Ribonuclease H-like superfamily protein3.6e-3424.19Show/hide
Query:  LRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELLKNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIENAVVAEFLTPS---------FQWDMEK
        ++ +YF + ++LDA  +   S+ W   L G+ LLK G R  +G   +I +  D          V SHP   +      + +T +         + WD  K
Subjt:  LRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELLKNGLRKNVGSDHSIEMFRDPWLPRLSTFKVFSHPCHEIENAVVAEFLTPS---------FQWDMEK

Query:  LNQHLMKEDVDMIKCLPIS-PLAPDRWIWHYDSRGEYTVKSGY---------KLSMVNRQEDSLSDRGRESRWWKKLWKLCVPNKIKVFVWKSFHNSIPT
        ++Q + + D   I  + ++    PD+ IW+Y++ GEYTV+SGY          +  +N    S+  + R       +W L +  K+K F+W++   ++ T
Subjt:  LNQHLMKEDVDMIKCLPIS-PLAPDRWIWHYDSRGEYTVKSGY---------KLSMVNRQEDSLSDRGRESRWWKKLWKLCVPNKIKVFVWKSFHNSIPT

Query:  KVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWGIISPP-----------SDNVSYYQMNVKDRWLGLAECKDRDLECICVGAWAIWNDRNSLL
           L    + +D  C  C  E E+ +HALF C  A   W +                +N+S     V+D  +      D          W IW  RN+++
Subjt:  KVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWGIISPP-----------SDNVSYYQMNVKDRWLGLAECKDRDLECICVGAWAIWNDRNSLL

Query:  YNR----PVPDVVRRYEWIAKYLNDYRNVNLKGAVLYQ-SKEEVSCIISESEGIILHTDASFIGEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAE
        +N+    P   V+        +LN  ++     +   Q ++ ++      +  +  + DA F  +      G +IR+  G   +  S+     ++PLEAE
Subjt:  YNR----PVPDVVRRYEWIAKYLNDYRNVNLKGAVLYQ-SKEEVSCIISESEGIILHTDASFIGEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAE

Query:  AIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVRFKYISRRFNCFAHSLARVGMSDTTLVLWLQNYPQWM
          A+L  L+  W     ++ +  D   +I  IN  +   SS+A  + DI   +  F S++F +I R+ N  AH LA+ G + +T      + P W+
Subjt:  AIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVRFKYISRRFNCFAHSLARVGMSDTTLVLWLQNYPQWM

AT3G25270.1 Ribonuclease H-like superfamily protein9.8e-1624.59Show/hide
Query:  KLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWGIISPPSDNVSYYQMNVKDRW-LGLAEC---KDRDL
        K+WKL    KIK F+WK    ++ T  NL   H+     CH C +E ET+ H  F C  A+ VW     P   +    + ++ +  L L+ C   +   L
Subjt:  KLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWGIISPPSDNVSYYQMNVKDRW-LGLAEC---KDRDL

Query:  ECICVG-AWAIWNDRNSLLYNRPVPDVVRRYEWIAKYLNDYRNVNLKGAVLYQ----SKEEVSCII------SESEGIILHTDASFIGEGEHCGIGLVIR
          + +   W +W  RN L++ +         +     + ++ + N     L Q    S+ +   +         S  I  + D +F  +  +   G ++R
Subjt:  ECICVG-AWAIWNDRNSLLYNRPVPDVVRRYEWIAKYLNDYRNVNLKGAVLYQ----SKEEVSCII------SESEGIILHTDASFIGEGEHCGIGLVIR

Query:  DRNG-NLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEA---SEYFESVRFKYISRRFNCFA
        D NG  + + Q++G+    S LE+E  A++  ++ AW     ++    DS  V   +N E   + +     W I+E     + FE   FK++ R  N  A
Subjt:  DRNG-NLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEA---SEYFESVRFKYISRRFNCFA

Query:  HSLAR
          LA+
Subjt:  HSLAR

AT4G29090.1 Ribonuclease H-like superfamily protein1.8e-5728.7Show/hide
Query:  AIPTYAMGCFRIPKGILSKISSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDA
        A+PTY M CF +PK +  +I S+ A FWW + ++ + MHWK WD L   K  GG+ F+D+  FN ALL KQ+WR+L+ P   + ++ + +YF  S  L+A
Subjt:  AIPTYAMGCFRIPKGILSKISSLCAKFWWGSNEKKRKMHWKRWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDA

Query:  VDQYNSSFFWKVFLWGMELLKNGLRKNVGSDHSIEMFRDPWL---PRLSTFKVFSHPCHEIENAVVAEFLTPSFQWDMEKLNQHLMKEDVDM----IKCL
              SF WK      E+L+ G R  VG+   I ++R  WL   P  +  ++   P  E   A V+  L  S    +++  +   K+ ++M    ++  
Subjt:  VDQYNSSFFWKVFLWGMELLKNGLRKNVGSDHSIEMFRDPWL---PRLSTFKVFSHPCHEIENAVVAEFLTPSFQWDMEKLNQHLMKEDVDM----IKCL

Query:  PISPLAP------DRWIWHYDSRGEYTVKSGY-KLSMVNRQEDSLSDRGRES--RWWKKLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHV
         I  L P      D + W Y S G+YTVKSGY  L+ +  +  S  +    S    ++K+WK     KI+ F+WK   NS+P    L   H+  +  C  
Subjt:  PISPLAP------DRWIWHYDSRGEYTVKSGY-KLSMVNRQEDSLSDRGRES--RWWKKLWKLCVPNKIKVFVWKSFHNSIPTKVNLWNHHVLVDGYCHV

Query:  CKEEIETTDHALFQCNRAKAVWGIISPP---------SDNVSYYQM----NVKDRWLGLAECKDRDLECICVGAWAIWNDRNSLLYNR---PVPDVVRRY
        C    ET +H LF+C  A+  W I S P         S  V+ Y +    N   +W       ++  + +    W +W +RN L++        +V+RR 
Subjt:  CKEEIETTDHALFQCNRAKAVWGIISPP---------SDNVSYYQM----NVKDRWLGLAECKDRDLECICVGAWAIWNDRNSLLYNR---PVPDVVRRY

Query:  EWIAKYLNDYR-NVNLKGAVLYQSKEEVSC---IISESEGIILHTDASFIGEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWD
        E     L ++R     +           SC        + +  +TDA++  + E CGIG V+R+  G ++   +       S LEAE  A+   +     
Subjt:  EWIAKYLNDYR-NVNLKGAVLYQSKEEVSC---IISESEGIILHTDASFIGEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWD

Query:  LNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVRFKYISRRFNCFAHSLARVGMS
             +   SDS  +I  +N + +   S+   I D++     F  V+F +I R  N  A  +AR  +S
Subjt:  LNVPRLTILSDSLNVIRSINEELQCQSSIATIIWDIKEASEYFESVRFKYISRRFNCFAHSLARVGMS

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein9.8e-3243.36Show/hide
Query:  AIPTYAMGCFRIPKGILSKISSLCAKFWWGSNEKKRKMHWKRWDELCNPKE-LGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLD
        A+P YAM CFR+ K +  K++S   +FWW S E KRK+ W  W +LC  KE  GGL FRDL  FNQALLAKQ +R++  P+  + RLLR +YFP+S++++
Subjt:  AIPTYAMGCFRIPKGILSKISSLCAKFWWGSNEKKRKMHWKRWDELCNPKE-LGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLD

Query:  AVDQYNSSFFWKVFLWGMELLKNGLRKNVGSDHSIEMFRDPWL
               S+ W+  + G ELL  GL + +G     +++ D W+
Subjt:  AVDQYNSSFFWKVFLWGMELLKNGLRKNVGSDHSIEMFRDPWL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGGTGAATCCTTTGGTTTTATTAAACCATCTCGTGGGATCCGACAAGGTGATCCTCTATCTCCATATTTGTTCTTAATTTGTACGGAAGGTCTTTCAGCAATGTT
AGTGTCTGCCAGGAATAGATCTATGGCAGGTATATCAGTGGCTCGAGATTGTCCAAAAATATCCCACCTCTTTTTCGCTGATGACAGTCTGGTGTTTCTAAAAGCAGCGG
CAGAAGAATTTGGCTTTTTAAAAGCCATATTGAGGGATTACGAAAAGGCGTCTGGTCAGTGTGTTAATTTTAACAAATCAACTGTGTGCTATTCGAAGAATGTCATTGGT
GAATCTAAGAGATATCTGACCAATATTATGGCAATGAAAGAGTCTGATTCAGTGGGAACCTATCTTGGGCTGCCATCGACCTTTCATCATGGTAAGTCTCTTGATTTCCA
CTTTTTACTTGATAAAGTTTGGGCTATTTTGCAAGGATGGAAACAAAAGTTATTTTCTCAAGGGGGTAAAGAGGTGTTAATTAAGAGCATTATTCAAGCTATCCCTACAT
ATGCGATGGGTTGTTTCAGAATTCCTAAAGGAATTCTCTCAAAGATTTCGTCTTTATGTGCAAAATTTTGGTGGGGTTCAAATGAGAAGAAGCGGAAGATGCATTGGAAG
AGGTGGGACGAGCTTTGCAATCCTAAAGAGTTAGGGGGATTAAATTTTAGGGATTTGGTCAATTTCAATCAGGCTTTGTTGGCAAAGCAGGTGTGGAGGGTTTTGACCAA
TCCAAATTTAACGATCTTCAGGCTTCTACGTGGGAAGTATTTCCCGAATTCAACAGTTTTGGATGCGGTGGATCAGTATAATTCATCTTTCTTTTGGAAGGTTTTTTTAT
GGGGTATGGAATTGCTGAAGAATGGGTTAAGGAAGAATGTGGGCAGTGATCATTCAATTGAAATGTTTAGGGACCCGTGGCTTCCAAGGCTTAGTACATTCAAAGTGTTC
TCCCATCCGTGTCATGAAATTGAGAACGCGGTTGTGGCGGAATTCCTTACTCCGTCGTTTCAATGGGATATGGAAAAACTAAACCAACATTTGATGAAAGAAGATGTTGA
CATGATTAAATGTCTCCCGATTAGCCCTTTGGCACCGGATCGGTGGATTTGGCACTATGATAGTCGAGGAGAATATACCGTAAAGAGTGGGTATAAGCTCAGCATGGTAA
ATAGGCAGGAGGATTCGTTGTCTGATCGAGGTCGAGAATCAAGGTGGTGGAAGAAATTGTGGAAGCTTTGTGTGCCAAATAAAATAAAAGTTTTTGTTTGGAAATCCTTC
CATAATTCTATTCCTACTAAGGTTAATTTATGGAACCATCATGTTCTGGTTGATGGGTATTGTCATGTGTGTAAAGAGGAGATTGAGACGACTGATCATGCTCTGTTTCA
ATGCAACAGGGCAAAGGCAGTGTGGGGAATCATTAGTCCGCCTTCGGATAATGTTTCCTATTATCAAATGAACGTAAAAGATAGATGGTTGGGCCTAGCTGAGTGTAAGG
ACCGTGATCTAGAGTGCATTTGTGTTGGGGCATGGGCTATATGGAATGATCGAAATAGTTTGCTTTATAATCGACCTGTCCCGGATGTGGTAAGGCGGTATGAATGGATA
GCTAAGTATTTGAACGATTACAGGAATGTGAATCTGAAAGGAGCAGTGCTATATCAATCCAAGGAAGAAGTTTCTTGTATTATTTCAGAGAGTGAAGGTATAATTTTACA
TACGGATGCATCTTTCATTGGAGAAGGAGAGCATTGTGGGATTGGTCTTGTTATACGTGACAGGAATGGGAATCTAAGGGCCACTCAGTCAGTAGGAGCGGTGGTGTGTA
CTTCTCCGTTGGAAGCAGAAGCGATTGCAGTGCTGAATGGTCTTCGTATGGCTTGGGATTTGAATGTGCCTAGATTGACGATCTTATCCGATTCGCTAAACGTTATCAGA
TCTATTAATGAAGAACTCCAATGCCAATCTAGTATTGCAACGATCATCTGGGATATTAAAGAAGCAAGCGAGTATTTTGAGTCAGTTCGGTTCAAATATATTAGTCGTAG
GTTTAATTGTTTTGCCCATAGTTTGGCCCGTGTTGGTATGTCGGACACGACACTAGTTTTGTGGTTACAAAACTATCCTCAATGGATGGTTAGTTTAGCGCTCCAGGAGC
GAGATACTTTTGTATCCCATCGGGATTTGTTTGTTTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATGGTGAATCCTTTGGTTTTATTAAACCATCTCGTGGGATCCGACAAGGTGATCCTCTATCTCCATATTTGTTCTTAATTTGTACGGAAGGTCTTTCAGCAATGTT
AGTGTCTGCCAGGAATAGATCTATGGCAGGTATATCAGTGGCTCGAGATTGTCCAAAAATATCCCACCTCTTTTTCGCTGATGACAGTCTGGTGTTTCTAAAAGCAGCGG
CAGAAGAATTTGGCTTTTTAAAAGCCATATTGAGGGATTACGAAAAGGCGTCTGGTCAGTGTGTTAATTTTAACAAATCAACTGTGTGCTATTCGAAGAATGTCATTGGT
GAATCTAAGAGATATCTGACCAATATTATGGCAATGAAAGAGTCTGATTCAGTGGGAACCTATCTTGGGCTGCCATCGACCTTTCATCATGGTAAGTCTCTTGATTTCCA
CTTTTTACTTGATAAAGTTTGGGCTATTTTGCAAGGATGGAAACAAAAGTTATTTTCTCAAGGGGGTAAAGAGGTGTTAATTAAGAGCATTATTCAAGCTATCCCTACAT
ATGCGATGGGTTGTTTCAGAATTCCTAAAGGAATTCTCTCAAAGATTTCGTCTTTATGTGCAAAATTTTGGTGGGGTTCAAATGAGAAGAAGCGGAAGATGCATTGGAAG
AGGTGGGACGAGCTTTGCAATCCTAAAGAGTTAGGGGGATTAAATTTTAGGGATTTGGTCAATTTCAATCAGGCTTTGTTGGCAAAGCAGGTGTGGAGGGTTTTGACCAA
TCCAAATTTAACGATCTTCAGGCTTCTACGTGGGAAGTATTTCCCGAATTCAACAGTTTTGGATGCGGTGGATCAGTATAATTCATCTTTCTTTTGGAAGGTTTTTTTAT
GGGGTATGGAATTGCTGAAGAATGGGTTAAGGAAGAATGTGGGCAGTGATCATTCAATTGAAATGTTTAGGGACCCGTGGCTTCCAAGGCTTAGTACATTCAAAGTGTTC
TCCCATCCGTGTCATGAAATTGAGAACGCGGTTGTGGCGGAATTCCTTACTCCGTCGTTTCAATGGGATATGGAAAAACTAAACCAACATTTGATGAAAGAAGATGTTGA
CATGATTAAATGTCTCCCGATTAGCCCTTTGGCACCGGATCGGTGGATTTGGCACTATGATAGTCGAGGAGAATATACCGTAAAGAGTGGGTATAAGCTCAGCATGGTAA
ATAGGCAGGAGGATTCGTTGTCTGATCGAGGTCGAGAATCAAGGTGGTGGAAGAAATTGTGGAAGCTTTGTGTGCCAAATAAAATAAAAGTTTTTGTTTGGAAATCCTTC
CATAATTCTATTCCTACTAAGGTTAATTTATGGAACCATCATGTTCTGGTTGATGGGTATTGTCATGTGTGTAAAGAGGAGATTGAGACGACTGATCATGCTCTGTTTCA
ATGCAACAGGGCAAAGGCAGTGTGGGGAATCATTAGTCCGCCTTCGGATAATGTTTCCTATTATCAAATGAACGTAAAAGATAGATGGTTGGGCCTAGCTGAGTGTAAGG
ACCGTGATCTAGAGTGCATTTGTGTTGGGGCATGGGCTATATGGAATGATCGAAATAGTTTGCTTTATAATCGACCTGTCCCGGATGTGGTAAGGCGGTATGAATGGATA
GCTAAGTATTTGAACGATTACAGGAATGTGAATCTGAAAGGAGCAGTGCTATATCAATCCAAGGAAGAAGTTTCTTGTATTATTTCAGAGAGTGAAGGTATAATTTTACA
TACGGATGCATCTTTCATTGGAGAAGGAGAGCATTGTGGGATTGGTCTTGTTATACGTGACAGGAATGGGAATCTAAGGGCCACTCAGTCAGTAGGAGCGGTGGTGTGTA
CTTCTCCGTTGGAAGCAGAAGCGATTGCAGTGCTGAATGGTCTTCGTATGGCTTGGGATTTGAATGTGCCTAGATTGACGATCTTATCCGATTCGCTAAACGTTATCAGA
TCTATTAATGAAGAACTCCAATGCCAATCTAGTATTGCAACGATCATCTGGGATATTAAAGAAGCAAGCGAGTATTTTGAGTCAGTTCGGTTCAAATATATTAGTCGTAG
GTTTAATTGTTTTGCCCATAGTTTGGCCCGTGTTGGTATGTCGGACACGACACTAGTTTTGTGGTTACAAAACTATCCTCAATGGATGGTTAGTTTAGCGCTCCAGGAGC
GAGATACTTTTGTATCCCATCGGGATTTGTTTGTTTCTTAA
Protein sequenceShow/hide protein sequence
MNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSAMLVSARNRSMAGISVARDCPKISHLFFADDSLVFLKAAAEEFGFLKAILRDYEKASGQCVNFNKSTVCYSKNVIG
ESKRYLTNIMAMKESDSVGTYLGLPSTFHHGKSLDFHFLLDKVWAILQGWKQKLFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILSKISSLCAKFWWGSNEKKRKMHWK
RWDELCNPKELGGLNFRDLVNFNQALLAKQVWRVLTNPNLTIFRLLRGKYFPNSTVLDAVDQYNSSFFWKVFLWGMELLKNGLRKNVGSDHSIEMFRDPWLPRLSTFKVF
SHPCHEIENAVVAEFLTPSFQWDMEKLNQHLMKEDVDMIKCLPISPLAPDRWIWHYDSRGEYTVKSGYKLSMVNRQEDSLSDRGRESRWWKKLWKLCVPNKIKVFVWKSF
HNSIPTKVNLWNHHVLVDGYCHVCKEEIETTDHALFQCNRAKAVWGIISPPSDNVSYYQMNVKDRWLGLAECKDRDLECICVGAWAIWNDRNSLLYNRPVPDVVRRYEWI
AKYLNDYRNVNLKGAVLYQSKEEVSCIISESEGIILHTDASFIGEGEHCGIGLVIRDRNGNLRATQSVGAVVCTSPLEAEAIAVLNGLRMAWDLNVPRLTILSDSLNVIR
SINEELQCQSSIATIIWDIKEASEYFESVRFKYISRRFNCFAHSLARVGMSDTTLVLWLQNYPQWMVSLALQERDTFVSHRDLFVS