; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027780 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027780
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNase H domain-containing protein
Genome locationchr8:4930325..4945360
RNA-Seq ExpressionLag0027780
SyntenyLag0027780
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150130.1 uncharacterized protein LOC111018384 [Momordica charantia]3.3e-13061.96Show/hide
Query:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD
        ME DKD YYVVQKG V GFY+S KE+EAQ G                         C IFDPNATIYKG HLSKEAEQYLAS GLQSATYSISAANVT+D
Subjt:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD

Query:  LFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTCGQ
        LFGKL+AC +EQPS+ RG+  EE       QV    E GFVGANWVSTDSPK+EIIWDHGFEAVPASSSC  ++S    +    SG +   +     C  
Subjt:  LFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTCGQ

Query:  QPSVSRGKRAEEYSEAKRPQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLVVL
        + S SRGK AE YS AKRP  HET+E  +IGG   TSTPLP       D G    APSS C +++                                   
Subjt:  QPSVSRGKRAEEYSEAKRPQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLVVL

Query:  FSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVA
           TYFLEFDGASKGNPGLAGAGAVLRA DGSTVCRLQEGVGIATNNVAEYRAVILGLKHALK+GFKHI V+GDSKLVCMQVQGLWK+KN NM  LCKVA
Subjt:  FSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVA

Query:  KELKDKFVSFEINHIPREQNSDADALANRAIHLRDPINV
        KELKDKF SFEI+HIPREQNSDADALANRAIHLRD + V
Subjt:  KELKDKFVSFEINHIPREQNSDADALANRAIHLRDPINV

XP_022989146.1 uncharacterized protein LOC111486303 [Cucurbita maxima]5.0e-11055.35Show/hide
Query:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD
        ME DKD YYVV+KGDVFGFYRS+KE EAQAG                           IFDPNATIYKG HLSKE+EQYLAS GL+SATYSISAANVT D
Subjt:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD

Query:  LFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTCGQ
        LFGKL+ACP+EQPS+TRGKMAEE P+  RQ+ +ENTESGFV A+ VSTDSPK+EI  DHG EAVPASSS                      RV       
Subjt:  LFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTCGQ

Query:  QPSVSRGKRAEEYSEAKRPQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLVVL
                                                                                                            
Subjt:  QPSVSRGKRAEEYSEAKRPQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLVVL

Query:  FSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVA
            YFLEFDGASKGNPGLAGAGAVLRA DG+T+CRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHI VRGDSKLVCMQVQGLWKLKNQNMA LCKVA
Subjt:  FSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVA

Query:  KELKDKFVSFEINHIPREQNSDADALANRAIHLRDPINV
        K+LKDKFVSFEINHIPREQNSDADALANRAI+LRD I V
Subjt:  KELKDKFVSFEINHIPREQNSDADALANRAIHLRDPINV

XP_023529762.1 uncharacterized protein LOC111792488 [Cucurbita pepo subsp. pepo]6.5e-11055.35Show/hide
Query:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD
        ME DKD YYVV+KGDVFGFYRS+KE EAQAG                           IFDPNATIYKG HLSKE+EQYLAS GLQSATYSISAANVT D
Subjt:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD

Query:  LFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTCGQ
        LFGKL+ACP+EQPS+T GKMAEE P+  RQ+ +ENTESGFVG + VSTDSPK+EI  DHG EAVPASSS                      RV       
Subjt:  LFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTCGQ

Query:  QPSVSRGKRAEEYSEAKRPQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLVVL
                                                                                                            
Subjt:  QPSVSRGKRAEEYSEAKRPQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLVVL

Query:  FSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVA
            YFLEFDGASKGNPGLAGAGAVLRA DG+T+CRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHI VRGDSKLVCMQVQGLWKLKNQNMA LCKVA
Subjt:  FSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVA

Query:  KELKDKFVSFEINHIPREQNSDADALANRAIHLRDPINV
        K+LKDKFVSFEINHIPREQNSDADALANRAI+LRD I V
Subjt:  KELKDKFVSFEINHIPREQNSDADALANRAIHLRDPINV

XP_038906127.1 uncharacterized protein LOC120092011 isoform X1 [Benincasa hispida]1.3e-12159.5Show/hide
Query:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD
        ME DKD YYVVQKGDVFGFYRS KE +AQAG                           IFDPNATIYKG HLSKEAE YLAS GLQSATYSISAANVTKD
Subjt:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD

Query:  LFGKLVACPHEQPSATRGKMAEENPRVSRQ-QVLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCRE-GLRVCLGTC
        LFGKL+ACP+EQP A RGKMAEE P+  RQ QV EN E  +VGA  VS +SPK+EIIWDHGFEAVPASS     ++ V+          + G      + 
Subjt:  LFGKLVACPHEQPSATRGKMAEENPRVSRQ-QVLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCRE-GLRVCLGTC

Query:  GQQPSVSRGKRAEEYSEAKR-PQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFL
          QPSVSRGK AE+YSEA R  Q HET++                                                                       
Subjt:  GQQPSVSRGKRAEEYSEAKR-PQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFL

Query:  VVLFSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLC
             ITYFLEFDGASKGNPGLAGAGAVLRA DGSTVCRLQEGVG+ATNNVAEYRAVILGLKHALK G KHICV+GDSKLVCMQVQGLWKLKNQNMANLC
Subjt:  VVLFSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLC

Query:  KVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDPINV
        KVAKELKDKFVSFEI+HIPREQNSDAD LANRAIHLRD + V
Subjt:  KVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDPINV

XP_038906129.1 uncharacterized protein LOC120092011 isoform X3 [Benincasa hispida]5.9e-11155.28Show/hide
Query:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD
        ME DKD YYVVQKGDVFGFYRS KE +AQAG                           IFDPNATIYKG HLSKEAE YLAS GLQSATYSISAANVTKD
Subjt:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD

Query:  LFGKLVACPHEQPSATRGKMAEENPRVS-----RQQVLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCL
        LFGKL+ACP+E  +    +++ E+P+         + +  +  GFVGANWVS DSPK+E I DHGF A+PAS S                          
Subjt:  LFGKLVACPHEQPSATRGKMAEENPRVS-----RQQVLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCL

Query:  GTCGQQPSVSRGKRAEEYSEAKR-PQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRM
             QPSVSRGK AE+YSEA R  Q HET++                                                                    
Subjt:  GTCGQQPSVSRGKRAEEYSEAKR-PQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRM

Query:  GFLVVLFSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMA
                ITYFLEFDGASKGNPGLAGAGAVLRA DGSTVCRLQEGVG+ATNNVAEYRAVILGLKHALK G KHICV+GDSKLVCMQVQGLWKLKNQNMA
Subjt:  GFLVVLFSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMA

Query:  NLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDPINV
        NLCKVAKELKDKFVSFEI+HIPREQNSDAD LANRAIHLRD + V
Subjt:  NLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDPINV

TrEMBL top hitse value%identityAlignment
A0A0A0LHV8 RNase H domain-containing protein4.9e-7945.35Show/hide
Query:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD
        ME DKD YYVV KGDVFGFYR+ KE     G                            FDP+ATIYKG HLSKEAE+YL + GLQSATYSISAANVTKD
Subjt:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD

Query:  LFGKLVAC-PHEQPSATRGKMAEENPRVSRQQ-VLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTC
        LFGK++ C P+EQPSATRGKMAEE  +  RQ+ VLENTE                                                             
Subjt:  LFGKLVAC-PHEQPSATRGKMAEENPRVSRQQ-VLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTC

Query:  GQQPSVSRGKRAEEYSEAKRPQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLV
                                                                                                            
Subjt:  GQQPSVSRGKRAEEYSEAKRPQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLV

Query:  VLFSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCK
             TYFLEFDGASKGNPGLAGAGAVLRA DGSTVC+LQEGVGIAT NVAEYRAVILGLKHALK+G KHI V+GDSKLVCMQVQGLWKLKN NMA  CK
Subjt:  VLFSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCK

Query:  VAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDPINV
        VAKELKDKFVSFEI+H PR+QNSDADALAN AI L+D + V
Subjt:  VAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDPINV

A0A1S3B7N6 uncharacterized protein LOC103487060 isoform X16.0e-8546.94Show/hide
Query:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD
        ME DKD YYVVQKGDVFGFYRS KE   Q                    HG+       FDPNATIYKG HLSKEAE+YL S GLQSATYSISAANVTKD
Subjt:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD

Query:  LFGKLVAC-PHEQPSATRGKMAEENPRVSRQQ-VLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTC
        LFGK++ C P+EQPS TRGKMAEE P+  RQ+ VL+NTE                                                             
Subjt:  LFGKLVAC-PHEQPSATRGKMAEENPRVSRQQ-VLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTC

Query:  GQQPSVSRGKRAEEYSEAKRPQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLV
                                                                                                            
Subjt:  GQQPSVSRGKRAEEYSEAKRPQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLV

Query:  VLFSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCK
             TYFLEFDGASKGNPGLAGAGA+LRA DGSTVCRLQEGVGIAT NVAEYRA+ILGLKHALK+G KHI V+GDSKLVCMQVQGLWKLKNQNMA LCK
Subjt:  VLFSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCK

Query:  VAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDPINV
        VAKELKDKFVSFEI+H+PR +NSDADALANRAIHL+D + V
Subjt:  VAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDPINV

A0A6J1D7M4 uncharacterized protein LOC1110183841.6e-13061.96Show/hide
Query:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD
        ME DKD YYVVQKG V GFY+S KE+EAQ G                         C IFDPNATIYKG HLSKEAEQYLAS GLQSATYSISAANVT+D
Subjt:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD

Query:  LFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTCGQ
        LFGKL+AC +EQPS+ RG+  EE       QV    E GFVGANWVSTDSPK+EIIWDHGFEAVPASSSC  ++S    +    SG +   +     C  
Subjt:  LFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTCGQ

Query:  QPSVSRGKRAEEYSEAKRPQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLVVL
        + S SRGK AE YS AKRP  HET+E  +IGG   TSTPLP       D G    APSS C +++                                   
Subjt:  QPSVSRGKRAEEYSEAKRPQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLVVL

Query:  FSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVA
           TYFLEFDGASKGNPGLAGAGAVLRA DGSTVCRLQEGVGIATNNVAEYRAVILGLKHALK+GFKHI V+GDSKLVCMQVQGLWK+KN NM  LCKVA
Subjt:  FSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVA

Query:  KELKDKFVSFEINHIPREQNSDADALANRAIHLRDPINV
        KELKDKF SFEI+HIPREQNSDADALANRAIHLRD + V
Subjt:  KELKDKFVSFEINHIPREQNSDADALANRAIHLRDPINV

A0A6J1EJI9 uncharacterized protein LOC1114350642.7e-10955.13Show/hide
Query:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD
        ME DKD YYVV+KGDVFGFYRS+KE EAQAG                           IFDPNATIYKG HLSKE+EQYLAS GL+SATYSISAANVT D
Subjt:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD

Query:  LFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTCGQ
        LFGKL+ACP+EQPS+TRG+MAEE P+  RQ+ +ENTESGFVGA+ VSTDS K+EI  DH  EAVPASSS                      RV       
Subjt:  LFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTCGQ

Query:  QPSVSRGKRAEEYSEAKRPQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLVVL
                                                                                                            
Subjt:  QPSVSRGKRAEEYSEAKRPQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLVVL

Query:  FSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVA
            YFLEFDGASKGNPGLAGAGAVLRA DG+T+CRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHI VRGDSKLVCMQVQGLWKLKNQNMA LCKVA
Subjt:  FSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVA

Query:  KELKDKFVSFEINHIPREQNSDADALANRAIHLRDPINV
        K+LKDKFVSFEINHIPREQNSDADALANRAIHLRD I V
Subjt:  KELKDKFVSFEINHIPREQNSDADALANRAIHLRDPINV

A0A6J1JJ82 uncharacterized protein LOC1114863032.4e-11055.35Show/hide
Query:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD
        ME DKD YYVV+KGDVFGFYRS+KE EAQAG                           IFDPNATIYKG HLSKE+EQYLAS GL+SATYSISAANVT D
Subjt:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD

Query:  LFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTCGQ
        LFGKL+ACP+EQPS+TRGKMAEE P+  RQ+ +ENTESGFV A+ VSTDSPK+EI  DHG EAVPASSS                      RV       
Subjt:  LFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTCGQ

Query:  QPSVSRGKRAEEYSEAKRPQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLVVL
                                                                                                            
Subjt:  QPSVSRGKRAEEYSEAKRPQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLVVL

Query:  FSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVA
            YFLEFDGASKGNPGLAGAGAVLRA DG+T+CRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHI VRGDSKLVCMQVQGLWKLKNQNMA LCKVA
Subjt:  FSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVA

Query:  KELKDKFVSFEINHIPREQNSDADALANRAIHLRDPINV
        K+LKDKFVSFEINHIPREQNSDADALANRAI+LRD I V
Subjt:  KELKDKFVSFEINHIPREQNSDADALANRAIHLRDPINV

SwissProt top hitse value%identityAlignment
P54162 14.7 kDa ribonuclease H-like protein3.3e-0834.13Show/hide
Query:  DGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVAKELKDKFVS
        DGAS GNPG +G G  ++    +    +   +G+ TN  AE+ A+I G+K     G++ +  R DS +V  +   L  +KN       +    LK  F  
Subjt:  DGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVAKELKDKFVS

Query:  FEINHIPREQNSDADALANRAIHLRD
        F I  IP +QN  AD LA  AI L +
Subjt:  FEINHIPREQNSDADALANRAIHLRD

P64956 Uncharacterized protein Mb2253c1.1e-1941.56Show/hide
Query:  LEFDGASKGNPGLAGAGAVLRAIDGSTV-CRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVAKELKD
        +E DG S+GNPG AG GAV+   D STV    ++ +G ATNNVAEYR +I GL  A+K G     V  DSKLV  Q+ G WK+K+ ++  L   A+ L  
Subjt:  LEFDGASKGNPGLAGAGAVLRAIDGSTV-CRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVAKELKD

Query:  KFVSFEINHIPREQNSDADALANRAIHLRDPINVMAPKKAPAKVTTSNDSYTGP
        +F       +PR +N+ AD LAN A+         A    PAK+  + +S T P
Subjt:  KFVSFEINHIPREQNSDADALANRAIHLRDPINVMAPKKAPAKVTTSNDSYTGP

P9WLH4 Uncharacterized protein MT22871.1e-1941.56Show/hide
Query:  LEFDGASKGNPGLAGAGAVLRAIDGSTV-CRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVAKELKD
        +E DG S+GNPG AG GAV+   D STV    ++ +G ATNNVAEYR +I GL  A+K G     V  DSKLV  Q+ G WK+K+ ++  L   A+ L  
Subjt:  LEFDGASKGNPGLAGAGAVLRAIDGSTV-CRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVAKELKD

Query:  KFVSFEINHIPREQNSDADALANRAIHLRDPINVMAPKKAPAKVTTSNDSYTGP
        +F       +PR +N+ AD LAN A+         A    PAK+  + +S T P
Subjt:  KFVSFEINHIPREQNSDADALANRAIHLRDPINVMAPKKAPAKVTTSNDSYTGP

P9WLH5 Bifunctional protein Rv2228c1.1e-1941.56Show/hide
Query:  LEFDGASKGNPGLAGAGAVLRAIDGSTV-CRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVAKELKD
        +E DG S+GNPG AG GAV+   D STV    ++ +G ATNNVAEYR +I GL  A+K G     V  DSKLV  Q+ G WK+K+ ++  L   A+ L  
Subjt:  LEFDGASKGNPGLAGAGAVLRAIDGSTV-CRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVAKELKD

Query:  KFVSFEINHIPREQNSDADALANRAIHLRDPINVMAPKKAPAKVTTSNDSYTGP
        +F       +PR +N+ AD LAN A+         A    PAK+  + +S T P
Subjt:  KFVSFEINHIPREQNSDADALANRAIHLRDPINVMAPKKAPAKVTTSNDSYTGP

Q9HSF6 Ribonuclease HI4.2e-1945.53Show/hide
Query:  FDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVAKELKDKFV
        FDGAS+GNPG A  G VL + DG  V    + +G ATNN AEY A+I  L+ A   GF  I +RGDS+LV  Q+ G W   + ++      A+EL   F 
Subjt:  FDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVAKELKDKFV

Query:  SFEINHIPREQNSDADALANRAI
         + I H+PR  N  ADALAN A+
Subjt:  SFEINHIPREQNSDADALANRAI

Arabidopsis top hitse value%identityAlignment
AT1G24090.1 RNase H family protein1.7e-4430.02Show/hide
Query:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD
        ++++KD ++VV+KGDV G Y+   + +AQ G                           +FD   ++YKG  L K+ E+YL+S GL+   YS+ A+++  D
Subjt:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD

Query:  LFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTCGQ
        +FG L  C  ++P+    K++E+          E T      +   S D  K+++                                             
Subjt:  LFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTCGQ

Query:  QPSVSRGKRAEEYSEAKRPQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLVVL
         PS S                                     ++   L+  SK + PS+   D                                     
Subjt:  QPSVSRGKRAEEYSEAKRPQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLVVL

Query:  FSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVA
           T F+EFDGASKGNPGL+GA AVL+  DGS +CR+++G+GIATNN AEY A+ILGLK+A++ G+K+I V+GDSKLVCMQ++G WK+ ++ +A L K A
Subjt:  FSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVA

Query:  KELKDKFVSFEINHIPREQNSDADALANRAIHL
        K L +K VSFEI+H+ R  N+DAD  AN A+ L
Subjt:  KELKDKFVSFEINHIPREQNSDADALANRAIHL

AT3G01410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.0e-4933.03Show/hide
Query:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD
        ME++KD +Y+V+KGD+ G YRS  E + QAG                           +  P  ++YKG    K AE  L+S G+++A +S++A++V  D
Subjt:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD

Query:  LFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESGFVGANWVSTDSPKEEI-IWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTCG
         FGKL+ CP +QPS+++G+   ++    R Q + + ESG       S   P++++ I +     +P        SSL+                      
Subjt:  LFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESGFVGANWVSTDSPKEEI-IWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTCG

Query:  QQPSVSRGKRAEEYSEAKRPQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLVV
                                            T TP               I  + +C                                      
Subjt:  QQPSVSRGKRAEEYSEAKRPQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLVV

Query:  LFSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKV
               +EFDGASKGNPG AGAGAVLRA D S +  L+EGVG ATNNVAEYRA++LGL+ AL  GFK++ V GDS LVCMQVQG WK  +  MA LCK 
Subjt:  LFSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKV

Query:  AKELKDKFVSFEINHIPREQNSDADALANRAIHLRD
        AKEL + F +F+I HI RE+NS+AD  AN AI L D
Subjt:  AKELKDKFVSFEINHIPREQNSDADALANRAIHLRD

AT3G01410.2 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.0e-4933.03Show/hide
Query:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD
        ME++KD +Y+V+KGD+ G YRS  E + QAG                           +  P  ++YKG    K AE  L+S G+++A +S++A++V  D
Subjt:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD

Query:  LFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESGFVGANWVSTDSPKEEI-IWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTCG
         FGKL+ CP +QPS+++G+   ++    R Q + + ESG       S   P++++ I +     +P        SSL+                      
Subjt:  LFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESGFVGANWVSTDSPKEEI-IWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTCG

Query:  QQPSVSRGKRAEEYSEAKRPQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLVV
                                            T TP               I  + +C                                      
Subjt:  QQPSVSRGKRAEEYSEAKRPQVHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLVV

Query:  LFSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKV
               +EFDGASKGNPG AGAGAVLRA D S +  L+EGVG ATNNVAEYRA++LGL+ AL  GFK++ V GDS LVCMQVQG WK  +  MA LCK 
Subjt:  LFSITYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKV

Query:  AKELKDKFVSFEINHIPREQNSDADALANRAIHLRD
        AKEL + F +F+I HI RE+NS+AD  AN AI L D
Subjt:  AKELKDKFVSFEINHIPREQNSDADALANRAIHLRD

AT5G51080.1 RNase H family protein3.5e-3756.06Show/hide
Query:  TYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVAKEL
        T  +EFDGASKGNPGL+GA AVL+  DGS + ++++G+GIATNN AEY  +ILGLKHA++ G+  I V+ DSKLVCMQ++G WK+ ++ ++ L K AK+L
Subjt:  TYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVAKEL

Query:  KDKFVSFEINHIPREQNSDADALANRAIHLRD
         DK +SFEI+H+ R  NSDAD  AN A  L +
Subjt:  KDKFVSFEINHIPREQNSDADALANRAIHLRD

AT5G51080.1 RNase H family protein3.1e-0928.91Show/hide
Query:  EEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKDL
        +++KD ++VV+KGD+ G Y+   + +AQ G                           ++DP  ++YKG  L K+ E+ L++ GL+   Y   A ++ +D+
Subjt:  EEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKDL

Query:  FGKLVAC--PHEQPSATRG--KMAEENP
        FG L  C    + PSA+    K+AE  P
Subjt:  FGKLVAC--PHEQPSATRG--KMAEENP

AT5G51080.2 RNase H family protein3.5e-3756.06Show/hide
Query:  TYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVAKEL
        T  +EFDGASKGNPGL+GA AVL+  DGS + ++++G+GIATNN AEY  +ILGLKHA++ G+  I V+ DSKLVCMQ++G WK+ ++ ++ L K AK+L
Subjt:  TYFLEFDGASKGNPGLAGAGAVLRAIDGSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVAKEL

Query:  KDKFVSFEINHIPREQNSDADALANRAIHLRD
         DK +SFEI+H+ R  NSDAD  AN A  L +
Subjt:  KDKFVSFEINHIPREQNSDADALANRAIHLRD

AT5G51080.2 RNase H family protein3.1e-0928.91Show/hide
Query:  EEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKDL
        +++KD ++VV+KGD+ G Y+   + +AQ G                           ++DP  ++YKG  L K+ E+ L++ GL+   Y   A ++ +D+
Subjt:  EEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKDL

Query:  FGKLVAC--PHEQPSATRG--KMAEENP
        FG L  C    + PSA+    K+AE  P
Subjt:  FGKLVAC--PHEQPSATRG--KMAEENP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAAGATAAAGATACCTATTACGTTGTACAGAAAGGGGATGTTTTTGGATTTTACAGGAGTTGGAAGGAGTTCGAGGCTCAAGCTGGATGTTTTGTGATATTGTG
GCGATTTTTCTCATCTGTCGTTGGAGTTGCTCCTGACCATGGACAGTCACTGTCTATATGCTGTATATTTGATCCTAATGCAACGATCTACAAAGGGTGTCACTTATCTA
AAGAAGCAGAGCAATACCTTGCATCACGTGGACTTCAGAGTGCAACTTACTCTATAAGTGCTGCCAATGTGACAAAGGATCTATTCGGAAAACTAGTTGCTTGCCCTCAT
GAGCAACCATCTGCCACCAGAGGAAAAATGGCCGAAGAGAACCCCAGAGTTAGTAGACAACAAGTCCTTGAGAATACTGAATCTGGTTTTGTAGGTGCCAACTGGGTCTC
AACAGATTCTCCGAAGGAAGAAATTATCTGGGATCACGGCTTTGAAGCTGTACCTGCTTCTTCGAGTTGTGGCACCGCATCATCGCTAGTAAATACGGTTTTCACCCTCA
TGAGTGGGTGTCGGGAGGGGTTAAGGGTATGTCTAGGAACCTGTGGACAGCAACCATCTGTTTCTAGAGGAAAAAGGGCTGAAGAGTACTCTGAAGCTAAGAGACCACAA
GTCCATGAGACTATTGAAAGTGGTTATATAGGTGGCTGTGATAAGACCTCAACACCTTTACCAATAGTTGCTTCAAGGCATTTGGATACTGGTTCTAAAACCATAGCTCC
TTCATCTACTTGCGGAGACAACAGAAGTGTTGACCCTTCTCCTTTTGCTGGGATGCCTCAGTTGACCCTAGGGGTAGGGATGTGTGTTTCTGGTCTTTTAATCCTTCGAA
TGGGTTTTCTTGTAGTTCTTTTCTCCATCACCTATTTTCTTGAGTTTGATGGTGCCTCAAAGGGAAATCCTGGGCTAGCAGGTGCTGGAGCTGTTTTGCGTGCTATCGAT
GGAAGTACGGTCTGTAGGTTGCAAGAAGGGGTTGGGATTGCCACAAATAACGTCGCTGAATATCGTGCTGTTATTTTAGGACTGAAACATGCTTTAAAGAGTGGCTTTAA
ACACATTTGTGTGCGAGGAGACTCCAAGCTTGTTTGTATGCAGGTTCAGGGTCTATGGAAGCTCAAAAACCAAAATATGGCTAATTTGTGTAAAGTGGCAAAGGAGCTCA
AGGATAAGTTTGTGTCATTTGAGATCAACCATATCCCTAGGGAACAAAATTCTGATGCCGATGCCCTAGCGAACCGTGCCATACATCTTCGAGATCCAATCAATGTGATG
GCACCAAAGAAAGCTCCCGCGAAGGTTACAACATCAAACGACTCTTATACAGGTCCTGTCACTCGTAGTTGTTCTCAAGGAATTGAGATCAAGGAAGACCACACTCCTCT
CGCTGTTGCAAGTAGGATCTCAAAATTGATTGAAGAATCCTCTAAGGACAAGGTTGCAGTCAAAGACAACCCGTTGTTCGAATCTGTCACTCCAACATCTGAGCAGTCAA
AGGATACACTAAATCCTCATGTGATGTTCGTCATGATGGCTGATGTATTCCAAGATGAAAGAATGACAGAGATAGAAAGAAAACTCAATCGCCTAATGAAGACAGTTGAT
GAAAGAGATCATGAGATTGCCTATTTAAAGAACCAGCTGCAAAATCGAGAGACTGCTGAATCTAACCAGACCCTTGCTGCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGAAGATAAAGATACCTATTACGTTGTACAGAAAGGGGATGTTTTTGGATTTTACAGGAGTTGGAAGGAGTTCGAGGCTCAAGCTGGATGTTTTGTGATATTGTG
GCGATTTTTCTCATCTGTCGTTGGAGTTGCTCCTGACCATGGACAGTCACTGTCTATATGCTGTATATTTGATCCTAATGCAACGATCTACAAAGGGTGTCACTTATCTA
AAGAAGCAGAGCAATACCTTGCATCACGTGGACTTCAGAGTGCAACTTACTCTATAAGTGCTGCCAATGTGACAAAGGATCTATTCGGAAAACTAGTTGCTTGCCCTCAT
GAGCAACCATCTGCCACCAGAGGAAAAATGGCCGAAGAGAACCCCAGAGTTAGTAGACAACAAGTCCTTGAGAATACTGAATCTGGTTTTGTAGGTGCCAACTGGGTCTC
AACAGATTCTCCGAAGGAAGAAATTATCTGGGATCACGGCTTTGAAGCTGTACCTGCTTCTTCGAGTTGTGGCACCGCATCATCGCTAGTAAATACGGTTTTCACCCTCA
TGAGTGGGTGTCGGGAGGGGTTAAGGGTATGTCTAGGAACCTGTGGACAGCAACCATCTGTTTCTAGAGGAAAAAGGGCTGAAGAGTACTCTGAAGCTAAGAGACCACAA
GTCCATGAGACTATTGAAAGTGGTTATATAGGTGGCTGTGATAAGACCTCAACACCTTTACCAATAGTTGCTTCAAGGCATTTGGATACTGGTTCTAAAACCATAGCTCC
TTCATCTACTTGCGGAGACAACAGAAGTGTTGACCCTTCTCCTTTTGCTGGGATGCCTCAGTTGACCCTAGGGGTAGGGATGTGTGTTTCTGGTCTTTTAATCCTTCGAA
TGGGTTTTCTTGTAGTTCTTTTCTCCATCACCTATTTTCTTGAGTTTGATGGTGCCTCAAAGGGAAATCCTGGGCTAGCAGGTGCTGGAGCTGTTTTGCGTGCTATCGAT
GGAAGTACGGTCTGTAGGTTGCAAGAAGGGGTTGGGATTGCCACAAATAACGTCGCTGAATATCGTGCTGTTATTTTAGGACTGAAACATGCTTTAAAGAGTGGCTTTAA
ACACATTTGTGTGCGAGGAGACTCCAAGCTTGTTTGTATGCAGGTTCAGGGTCTATGGAAGCTCAAAAACCAAAATATGGCTAATTTGTGTAAAGTGGCAAAGGAGCTCA
AGGATAAGTTTGTGTCATTTGAGATCAACCATATCCCTAGGGAACAAAATTCTGATGCCGATGCCCTAGCGAACCGTGCCATACATCTTCGAGATCCAATCAATGTGATG
GCACCAAAGAAAGCTCCCGCGAAGGTTACAACATCAAACGACTCTTATACAGGTCCTGTCACTCGTAGTTGTTCTCAAGGAATTGAGATCAAGGAAGACCACACTCCTCT
CGCTGTTGCAAGTAGGATCTCAAAATTGATTGAAGAATCCTCTAAGGACAAGGTTGCAGTCAAAGACAACCCGTTGTTCGAATCTGTCACTCCAACATCTGAGCAGTCAA
AGGATACACTAAATCCTCATGTGATGTTCGTCATGATGGCTGATGTATTCCAAGATGAAAGAATGACAGAGATAGAAAGAAAACTCAATCGCCTAATGAAGACAGTTGAT
GAAAGAGATCATGAGATTGCCTATTTAAAGAACCAGCTGCAAAATCGAGAGACTGCTGAATCTAACCAGACCCTTGCTGCATGA
Protein sequenceShow/hide protein sequence
MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKDLFGKLVACPH
EQPSATRGKMAEENPRVSRQQVLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCGTASSLVNTVFTLMSGCREGLRVCLGTCGQQPSVSRGKRAEEYSEAKRPQ
VHETIESGYIGGCDKTSTPLPIVASRHLDTGSKTIAPSSTCGDNRSVDPSPFAGMPQLTLGVGMCVSGLLILRMGFLVVLFSITYFLEFDGASKGNPGLAGAGAVLRAID
GSTVCRLQEGVGIATNNVAEYRAVILGLKHALKSGFKHICVRGDSKLVCMQVQGLWKLKNQNMANLCKVAKELKDKFVSFEINHIPREQNSDADALANRAIHLRDPINVM
APKKAPAKVTTSNDSYTGPVTRSCSQGIEIKEDHTPLAVASRISKLIEESSKDKVAVKDNPLFESVTPTSEQSKDTLNPHVMFVMMADVFQDERMTEIERKLNRLMKTVD
ERDHEIAYLKNQLQNRETAESNQTLAA