CuGenDBv2

HG10018041 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10018041
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Protein TAR1
Genome location: Chr03:29813655..29819174
RNA-Seq Expression: HG10018041
Synteny: HG10018041
Gene Ontology terms: NA
InterPro domains: NA
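
The genome location field follows a chromosome:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the genomic span of the gene could be computed as below. `parse_location` is a hypothetical helper written for illustration, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a string such as 'Chr03:29813655..29819174' into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("Chr03:29813655..29819174")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 5520 bp on Chr03.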


Homology
GenBank top hits (e value, % identity, alignment)
KAG6385696.1 hypothetical protein SASPL_154568 [Salvia splendens] | e value: 8.6e-19 | % identity: 34.2
Query:  SELTVRRPGEAPKERSQSVPRPARATRSRPGAARAVHRQPTGRDW---TPVPSPQANP-SRSYGS-FSDSLAYIVPSTKAVT-LENLIVMSTTVREWHRP
        SELTVR P +AP E +   P P R   +R     +    P    +   TPVPSPQ+   SRSYGS    SLAYIVPST+  +      VMSTT R  H  
Subjt:  SELTVRRPGEAPKERSQSVPRPARATRSRPGAARAVHRQPTGRDW---TPVPSPQANP-SRSYGS-FSDSLAYIVPSTKAVT-LENLIVMSTTVREWHRP

Query:  PDFQGPPGPPTPRDVRCSSSHWTLPPAEPFPDSCPCSRRVEWGAHRPMPGARNAKPARRRALPATIGRRRLHRR-NKAWLGRRLNPHRPAPS-RSATGSS
        P                  S     P   F D    S     G HR       A+   RRAL ATI  R  H+  N    GRR NP    P     TGS 
Subjt:  PDFQGPPGPPTPRDVRCSSSHWTLPPAEPFPDSCPCSRRVEWGAHRPMPGARNAKPARRRALPATIGRRRLHRR-NKAWLGRRLNPHRPAPS-RSATGSS

Query:  PFHIRLGTSPAPIRFPPDNFKHYLTLFSNPF-IFPR---------------------------------------GPSATGSTLSGAP----------SR
               TSPAPIRFPPDNFKH LTLFS  F  FPR                                       GP  TG + S AP           R
Subjt:  PFHIRLGTSPAPIRFPPDNFKHYLTLFSNPF-IFPR---------------------------------------GPSATGSTLSGAP----------SR

Query:  TCARSAAEDASPDYNSDARTPDSQAGLFPVRS-----VLGNPLSRRR--------------AEDSNLSHPRTVRTGGQRVPPPAQRAHMGVGAGSDADAP
        T  ++    A P        PDSQ GLFPVRS      L +  +RRR              AE S+L  P      G+ +  P   A   V   +    P
Subjt:  TCARSAAEDASPDYNSDARTPDSQAGLFPVRS-----VLGNPLSRRR--------------AEDSNLSHPRTVRTGGQRVPPPAQRAHMGVGAGSDADAP

Query:  ADVLGQK--LGRNLRSKTRWF
             +    GRNLRSKTRWF
Subjt:  ADVLGQK--LGRNLRSKTRWF

KAG9444720.1 hypothetical protein H6P81_016060 [Aristolochia fimbriata] | e value: 3.0e-16 | % identity: 34.12
Query:  PCPYQSELTVRRPGEAPKERSQSVPRPARATRSRPGAARAVHRQPTGRD-WTPVPSPQANPSRSYGSFSDSLAYIVPSTKAVTLENLIVMSTTVREWHRP
        PCP     +  R  +APK RS+SVPRPAR      GAARAVHRQPTG     P P+ +ANP      F ++ A +                        P
Subjt:  PCPYQSELTVRRPGEAPKERSQSVPRPARATRSRPGAARAVHRQPTGRD-WTPVPSPQANPSRSYGSFSDSLAYIVPSTKAVTLENLIVMSTTVREWHRP

Query:  PDFQGPPGP-PTPRDVRCSSSHWTLPPAEPFP------DSCP----------------------------------------------------------
        PDFQGPPG   TPRDVRCSSS WTLPPAEPFP       S P                                                          
Subjt:  PDFQGPPGP-PTPRDVRCSSSHWTLPPAEPFP------DSCP----------------------------------------------------------

Query:  ---------------------------CSRRVEW---GAHRPMP-GARNAKPARRRALPATIGRR------RLHRRNKAWLG---RRLNPHRPAPSRSAT
                                    SRRVEW   G  +  P GA  A+P  R A    +G R       + RR  AW+    RR  P   A  R A 
Subjt:  ---------------------------CSRRVEW---GAHRPMP-GARNAKPARRRALPATIGRR------RLHRRNKAWLG---RRLNPHRPAPSRSAT

Query:  GSSPFHIRLGTSPAPIRFPPDNFKHYLTLFSNPF-IFPRG
          S      G SP PIRFPPDNFK  LTLFS  F  FPRG
Subjt:  GSSPFHIRLGTSPAPIRFPPDNFKHYLTLFSNPF-IFPRG

PHT25065.1 hypothetical protein CQW23_35288 [Capsicum baccatum] | e value: 1.1e-18 | % identity: 35.37
Query:  SELTVRRPGEAPKERSQSVPRPAR--ATRSRPGAARAVHRQPTGRDW---TPVPSPQANP-SRSYGS-FSDSLAYIVPSTKAVT-LENLIVMSTTVREWH
        SEL VRR G+AP E +   P P R  ATRSR G++ +    PT   +   TPVPSPQ+   SRSYGS    SLAYIVPST+  +      VMSTT R  H
Subjt:  SELTVRRPGEAPKERSQSVPRPAR--ATRSRPGAARAVHRQPTGRDW---TPVPSPQANP-SRSYGS-FSDSLAYIVPSTKAVT-LENLIVMSTTVREWH

Query:  RP-PDFQG--------------PPGPPTPRDVR-------CSSSH--------WTLPPAEPFPDSCPCSRRV-----------EW-----GAHRPMPGAR
             F+G              P   P  R  R       CS S+        W   P     +  PC+ RV            W     G++    G R
Subjt:  RP-PDFQG--------------PPGPPTPRDVR-------CSSSH--------WTLPPAEPFPDSCPCSRRV-----------EW-----GAHRPMPGAR

Query:  ----NAKPARRRALPATIGRRRLHRRNK-AWLGRRLNPHRPAPSRSATGSSPFHIR----LGTSPAPIRFPPDNFKHYLTLFSNPF-IFPRGPSATGS-T
            +A+  RRRALPATI     H R K    GR   P   A  R  +   P   R     GTS APIRFPPDNFKH LTLF+  F  FPRG    G+ T
Subjt:  ----NAKPARRRALPATIGRRRLHRRNK-AWLGRRLNPHRPAPSRSATGSSPFHIR----LGTSPAPIRFPPDNFKHYLTLFSNPF-IFPRGPSATGS-T

Query:  LSGAPSRTCARSAAEDASPDYNSDARTPDSQAGLFPVRS---------------------------VLGNPLSRRRAE-DSNLSHPRTVRTGGQRVPPPA
        LSG P RT  ++              +PDS+AGLFP+RS                            L  PLS    + DS+L  PR VR   +    PA
Subjt:  LSGAPSRTCARSAAEDASPDYNSDARTPDSQAGLFPVRS---------------------------VLGNPLSRRRAE-DSNLSHPRTVRTGGQRVPPPA

Query:  QRAHMGVGAGSDADAPADVLGQK------LGRNLRSKTRWF
                A  +      +  Q        GRNL SKTRWF
Subjt:  QRAHMGVGAGSDADAPADVLGQK------LGRNLRSKTRWF

PHT26754.1 Protein TAR1 [Capsicum baccatum] | e value: 5.0e-19 | % identity: 37.2
Query:  SELTVRRPGEAPKERSQSVPRPAR--ATRSRPGAARAVHRQPTGRDW---TPVPSPQANP-SRSYGS-FSDSLAYIVPSTKAVTLENLIVMSTTVREW--
        SEL VRR G+AP E +   P P R  ATRSR G++ +    PT   +   TPVPSPQ+   S++Y S    SLA+IVPST+  +      + +T + W  
Subjt:  SELTVRRPGEAPKERSQSVPRPAR--ATRSRPGAARAVHRQPTGRDW---TPVPSPQANP-SRSYGS-FSDSLAYIVPSTKAVTLENLIVMSTTVREW--

Query:  HRPPDFQGPPGP-PTPRDVRCSSSHWTLPPAEPFP-------------------------------DSCPCSRRV-----------EWGAHR--------
          PPDFQGP G   TPRDVRCSSS WTLPPAEP P                               +  PC+ RV            W   +        
Subjt:  HRPPDFQGPPGP-PTPRDVRCSSSHWTLPPAEPFP-------------------------------DSCPCSRRV-----------EWGAHR--------

Query:  -PMPGARNAKPARRRALPATIGRRRLHRRNKAWLGRRLNPHRPAPSRSATGSSPFHIRLGTSPAPIRFPPDNFKHYLTLFSNPF-IFPRGPSA
           P   +A+  RRRALPATI                       P+R  + S       G SPAPIRFP  NFK  LTLFS  F  FPR  S+
Subjt:  -PMPGARNAKPARRRALPATIGRRRLHRRNKAWLGRRLNPHRPAPSRSATGSSPFHIRLGTSPAPIRFPPDNFKHYLTLFSNPF-IFPRGPSA

TKS17466.1 hypothetical protein D5086_0000012900 [Populus alba] | e value: 1.2e-20 | % identity: 36.96
Query:  QGPPGPPTPRDVRCSS---SHWTLPPAEPFPDSCPCSRRVEWGAHRPMPGAR----NAKPARRRALPATIGRRRLHRRNKAWLGRRLNPHRPAP-SRSAT
        +G P    P  +R  S   SH    P   F      SRR EWGAHRPMPGAR    +A  AR        G      + +A    R +PHR  P +   T
Subjt:  QGPPGPPTPRDVRCSS---SHWTLPPAEPFPDSCPCSRRVEWGAHRPMPGAR----NAKPARRRALPATIGRRRLHRRNKAWLGRRLNPHRPAP-SRSAT

Query:  GSSPFHIRLGTSPAPIRFPPDNFKHYLTLFSNPF-IFPRG----------PSATGSTLSGAPSR-TCARSAAEDASPDYNSDARTPDSQAGLFP------
        G +      G SPAPI FPPDNFKH LTLFS  F  FPRG              GS LSGAP + T A SAAEDASPDYNSDA      +  FP      
Subjt:  GSSPFHIRLGTSPAPIRFPPDNFKHYLTLFSNPF-IFPRG----------PSATGSTLSGAPSR-TCARSAAEDASPDYNSDARTPDSQAGLFP------

Query:  -----------VRSVLGNPLS-------RRRAEDSNLSHPRTVRTGGQRVPPPAQRAHMGVGAGSDADAPADVLGQKLGRNLRSKTRWFADPAIH-----
                    R+ +   LS       RRR  D   +   + R    R  P   R        + AD P+        R  R++   F D  +H     
Subjt:  -----------VRSVLGNPLS-------RRRAEDSNLSHPRTVRTGGQRVPPPAQRAHMGVGAGSDADAPADVLGQKLGRNLRSKTRWFADPAIH-----

Query:  -TKYRIRYVLHRCESRDIRCRD
              RYVLHRCESRDIRCR+
Subjt:  -TKYRIRYVLHRCESRDIRCRD

TrEMBL top hits (e value, % identity, alignment)
A0A2G2UWC5 Uncharacterized protein | e value: 5.4e-19 | % identity: 35.37
Query:  SELTVRRPGEAPKERSQSVPRPAR--ATRSRPGAARAVHRQPTGRDW---TPVPSPQANP-SRSYGS-FSDSLAYIVPSTKAVT-LENLIVMSTTVREWH
        SEL VRR G+AP E +   P P R  ATRSR G++ +    PT   +   TPVPSPQ+   SRSYGS    SLAYIVPST+  +      VMSTT R  H
Subjt:  SELTVRRPGEAPKERSQSVPRPAR--ATRSRPGAARAVHRQPTGRDW---TPVPSPQANP-SRSYGS-FSDSLAYIVPSTKAVT-LENLIVMSTTVREWH

Query:  RP-PDFQG--------------PPGPPTPRDVR-------CSSSH--------WTLPPAEPFPDSCPCSRRV-----------EW-----GAHRPMPGAR
             F+G              P   P  R  R       CS S+        W   P     +  PC+ RV            W     G++    G R
Subjt:  RP-PDFQG--------------PPGPPTPRDVR-------CSSSH--------WTLPPAEPFPDSCPCSRRV-----------EW-----GAHRPMPGAR

Query:  ----NAKPARRRALPATIGRRRLHRRNK-AWLGRRLNPHRPAPSRSATGSSPFHIR----LGTSPAPIRFPPDNFKHYLTLFSNPF-IFPRGPSATGS-T
            +A+  RRRALPATI     H R K    GR   P   A  R  +   P   R     GTS APIRFPPDNFKH LTLF+  F  FPRG    G+ T
Subjt:  ----NAKPARRRALPATIGRRRLHRRNK-AWLGRRLNPHRPAPSRSATGSSPFHIR----LGTSPAPIRFPPDNFKHYLTLFSNPF-IFPRGPSATGS-T

Query:  LSGAPSRTCARSAAEDASPDYNSDARTPDSQAGLFPVRS---------------------------VLGNPLSRRRAE-DSNLSHPRTVRTGGQRVPPPA
        LSG P RT  ++              +PDS+AGLFP+RS                            L  PLS    + DS+L  PR VR   +    PA
Subjt:  LSGAPSRTCARSAAEDASPDYNSDARTPDSQAGLFPVRS---------------------------VLGNPLSRRRAE-DSNLSHPRTVRTGGQRVPPPA

Query:  QRAHMGVGAGSDADAPADVLGQK------LGRNLRSKTRWF
                A  +      +  Q        GRNL SKTRWF
Subjt:  QRAHMGVGAGSDADAPADVLGQK------LGRNLRSKTRWF

A0A2G2V192 Protein TAR1 | e value: 2.4e-19 | % identity: 37.2
Query:  SELTVRRPGEAPKERSQSVPRPAR--ATRSRPGAARAVHRQPTGRDW---TPVPSPQANP-SRSYGS-FSDSLAYIVPSTKAVTLENLIVMSTTVREW--
        SEL VRR G+AP E +   P P R  ATRSR G++ +    PT   +   TPVPSPQ+   S++Y S    SLA+IVPST+  +      + +T + W  
Subjt:  SELTVRRPGEAPKERSQSVPRPAR--ATRSRPGAARAVHRQPTGRDW---TPVPSPQANP-SRSYGS-FSDSLAYIVPSTKAVTLENLIVMSTTVREW--

Query:  HRPPDFQGPPGP-PTPRDVRCSSSHWTLPPAEPFP-------------------------------DSCPCSRRV-----------EWGAHR--------
          PPDFQGP G   TPRDVRCSSS WTLPPAEP P                               +  PC+ RV            W   +        
Subjt:  HRPPDFQGPPGP-PTPRDVRCSSSHWTLPPAEPFP-------------------------------DSCPCSRRV-----------EWGAHR--------

Query:  -PMPGARNAKPARRRALPATIGRRRLHRRNKAWLGRRLNPHRPAPSRSATGSSPFHIRLGTSPAPIRFPPDNFKHYLTLFSNPF-IFPRGPSA
           P   +A+  RRRALPATI                       P+R  + S       G SPAPIRFP  NFK  LTLFS  F  FPR  S+
Subjt:  -PMPGARNAKPARRRALPATIGRRRLHRRNKAWLGRRLNPHRPAPSRSATGSSPFHIRLGTSPAPIRFPPDNFKHYLTLFSNPF-IFPRGPSA

A0A2N9IKI0 Uncharacterized protein | e value: 2.6e-21 | % identity: 35.16
Query:  SGRRVPAGTTGTAPSGPSRRRTVNSELEGLDFGFAVGFPCPYQSELTVRRPGEAPKERSQSV-PRPARATRSRPGAARAVHRQPTGRDW---TPVPSPQA
        +G   PA +    P       +   + E L F  +V    P  S  +VRRPG+  +    ++ P    ATRSR G++ +    PT   +   TPVPSPQ+
Subjt:  SGRRVPAGTTGTAPSGPSRRRTVNSELEGLDFGFAVGFPCPYQSELTVRRPGEAPKERSQSV-PRPARATRSRPGAARAVHRQPTGRDW---TPVPSPQA

Query:  NP-SRSYGS-FSDSLAYIVPSTKAVT-LENLIVMSTTVREWHRP-PDFQGPPGP-PTPRDVRCSSSHWTLPPAEPFPDSCPC------------------
           SR YGS    SLAYIVPST+  +      VMSTT R WH   PDFQGPPG   TPRDVRCSSS WTLPPAEPFP S P                   
Subjt:  NP-SRSYGS-FSDSLAYIVPSTKAVT-LENLIVMSTTVREWHRP-PDFQGPPGP-PTPRDVRCSSSHWTLPPAEPFPDSCPC------------------

Query:  -------------------SRRVEWGAH-RPMPGARNAKPARRRA----------LPATI----------GRRRLHRRNKAWLGRRLNPH----------
                           S++++ G    P     N  P   R            P ++          GRR+ H   +A    R   H          
Subjt:  -------------------SRRVEWGAH-RPMPGARNAKPARRRA----------LPATI----------GRRRLHRRNKAWLGRRLNPH----------

Query:  -----------RPAPSRSATGSSPFHIR----LGTSPAPIRFPPDNFKHYLTLFSNPF-IFPRG
                   R  P   + G  P H R     G SPAPIRFPPDNFKH LTLFS  F  FPRG
Subjt:  -----------RPAPSRSATGSSPFHIR----LGTSPAPIRFPPDNFKHYLTLFSNPF-IFPRG

A0A4V6XX95 Uncharacterized protein | e value: 5.8e-21 | % identity: 36.96
Query:  QGPPGPPTPRDVRCSS---SHWTLPPAEPFPDSCPCSRRVEWGAHRPMPGAR----NAKPARRRALPATIGRRRLHRRNKAWLGRRLNPHRPAP-SRSAT
        +G P    P  +R  S   SH    P   F      SRR EWGAHRPMPGAR    +A  AR        G      + +A    R +PHR  P +   T
Subjt:  QGPPGPPTPRDVRCSS---SHWTLPPAEPFPDSCPCSRRVEWGAHRPMPGAR----NAKPARRRALPATIGRRRLHRRNKAWLGRRLNPHRPAP-SRSAT

Query:  GSSPFHIRLGTSPAPIRFPPDNFKHYLTLFSNPF-IFPRG----------PSATGSTLSGAPSR-TCARSAAEDASPDYNSDARTPDSQAGLFP------
        G +      G SPAPI FPPDNFKH LTLFS  F  FPRG              GS LSGAP + T A SAAEDASPDYNSDA      +  FP      
Subjt:  GSSPFHIRLGTSPAPIRFPPDNFKHYLTLFSNPF-IFPRG----------PSATGSTLSGAPSR-TCARSAAEDASPDYNSDARTPDSQAGLFP------

Query:  -----------VRSVLGNPLS-------RRRAEDSNLSHPRTVRTGGQRVPPPAQRAHMGVGAGSDADAPADVLGQKLGRNLRSKTRWFADPAIH-----
                    R+ +   LS       RRR  D   +   + R    R  P   R        + AD P+        R  R++   F D  +H     
Subjt:  -----------VRSVLGNPLS-------RRRAEDSNLSHPRTVRTGGQRVPPPAQRAHMGVGAGSDADAPADVLGQKLGRNLRSKTRWFADPAIH-----

Query:  -TKYRIRYVLHRCESRDIRCRD
              RYVLHRCESRDIRCR+
Subjt:  -TKYRIRYVLHRCESRDIRCRD

A0A6N2N9P9 Uncharacterized protein | e value: 1.9e-24 | % identity: 39.87
Query:  PCSRRVEWGAHRPMPGARNAKPARR---RALPATIGRRRLHRRNKA-WLGRRLNPHRPAP-SRSATGSSPFHIRLGTSPAPIRFPPDNFKHYLTLFSNPF
        PC +    GA    P  R    ARR   R  P +  RR LHRR K   LGR  +PHR  P +   TG        G SPAPI FPPDNFKH LTLFS  F
Subjt:  PCSRRVEWGAHRPMPGARNAKPARR---RALPATIGRRRLHRRNKA-WLGRRLNPHRPAP-SRSATGSSPFHIRLGTSPAPIRFPPDNFKHYLTLFSNPF

Query:  -IFPRGPSATGS-TLSGAPSR-TCARSAAEDASPDYNSDARTPDSQAGLFPVRSVLG-----------------NPLSRRRAEDSNLS------------
          FPRG S  G+ TLSGAP + T A SAAEDASPDYNS+A  PDSQAG FP R   G                    S R      LS            
Subjt:  -IFPRGPSATGS-TLSGAPSR-TCARSAAEDASPDYNSDARTPDSQAGLFPVRSVLG-----------------NPLSRRRAEDSNLS------------

Query:  --HPRTVRTGGQRVPPPAQRAHMGVGAGSDADAPADVLGQKLGRNLRSKTRWFADPAIH------TKYRIRYVLHRCESRDIRCRDLPTIAMIYPHHDEI
           P T  +  +    P  R   GVGA +  D  ADV   +  R   +    F D  +H           RYVLHRCESRDIRC             D  
Subjt:  --HPRTVRTGGQRVPPPAQRAHMGVGAGSDADAPADVLGQKLGRNLRSKTRWFADPAIH------TKYRIRYVLHRCESRDIRCRDLPTIAMIYPHHDEI

Query:  SKITRPVGQGY
         K  RP  +GY
Subjt:  SKITRPVGQGY

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
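
Each alignment block above has an unlabeled middle line between the Query and Subjt lines. As a rough sketch, assuming that line follows the usual BLAST pairwise convention (a letter marks an identical residue, `+` marks a conservative substitution, a space marks a mismatch or gap), the identities in one block can be counted as below. `midline_stats` is a hypothetical helper. The example uses only the final block of the KAG6385696.1 alignment, so its percentage is a per-block figure and is not expected to equal the overall % identity reported for the hit.

```python
def midline_stats(midline):
    """Return (identities, positives) for one BLAST-style midline string."""
    # Letters mark identical residues; '+' marks a conservative substitution.
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives

# Final block of the KAG6385696.1 alignment, with the 8-character
# "Query:  " prefix stripped from both lines.
query   = "ADVLGQK--LGRNLRSKTRWF"
midline = "     +    GRNLRSKTRWF"

identities, positives = midline_stats(midline)
percent_identity = 100.0 * identities / len(query)
print(identities, positives, round(percent_identity, 1))
```

Dividing by the block width (gaps included) mirrors how BLAST reports identity over the alignment length; other denominators are possible.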

Sequences
CDS sequence
ATGGCTCTGAGCGCGGCACGGTCCCAGTCCCGAACCCGTCGGCTGTCGGTGGACTGCTCGAGCTGCTCCCGCGGCGAGAGCGGTCGCCGCGTGCCGGCCGGGACGACTGG
AACGGCTCCTTCGGGGCCTTCCCGGCGTCGAACAGTCAACTCAGAACTAGAAGGGTTGGATTTTGGTTTTGCGGTCGGATTCCCTTGTCCGTACCAGTCTGAGTTGACTG
TTCGACGCCCGGGTGAAGCCCCGAAGGAGCGTTCCCAGTCCGTCCCCCGGCCGGCACGCGCGACCCGCTCTCGCCCGGGAGCAGCTCGAGCAGTCCACCGACAGCCGACG
GGTCGGGACTGGACCCCCGTGCCCAGCCCTCAAGCCAATCCTTCCCGAAGTTACGGATCATTTTCCGACTCCCTTGCCTACATTGTTCCATCGACCAAGGCTGTCACCTT
GGAGAACCTGATCGTTATGAGTACGACCGTGCGTGAGTGGCACCGTCCTCCGGATTTTCAAGGGCCGCCGGGGCCACCGACACCACGCGACGTGCGGTGCTCTTCCAGCC
ACTGGACCCTACCTCCGGCTGAGCCGTTTCCAGACTCCTGTCCGTGTTCAAGACGGGTCGAATGGGGAGCCCACAGGCCGATGCCAGGAGCGCGCAATGCCAAGCCCGCC
CGAAGGCGCGCGCTGCCAGCCACGATCGGACGACGACGTCTCCACAGGCGTAACAAAGCCTGGTTAGGCCGCCGTCTCAATCCGCATCGTCCAGCCCCAAGTCGATCGGC
GACCGGCTCATCACCGTTCCACATCCGACTGGGCACATCGCCGGCCCCCATCCGCTTCCCTCCCGACAATTTCAAGCACTATTTGACTCTCTTTTCAAATCCTTTCATCT
TTCCTCGGGGTCCGAGCGCAACGGGCTCCACCCTCTCTGGCGCCCCCTCCAGGACTTGTGCCCGGTCCGCCGCTGAGGACGCTTCTCCAGACTACAATTCGGACGCGCGG
ACGCCCGATTCTCAAGCTGGGCTCTTCCCGGTTCGCTCCGTACTAGGGAATCCTTTGTCGCGACGACGCGCCGAGGACTCGAATTTAAGCCATCCGCGCACGGTGCGCAC
GGGAGGCCAGCGTGTGCCCCCGCCCGCGCAACGAGCCCACATGGGGGTTGGTGCGGGCAGCGATGCTGACGCCCCAGCAGACGTGCTCGGCCAGAAGCTCGGGCGCAACT
TGCGTTCAAAGACTCGGTGGTTCGCGGATCCTGCAATTCACACCAAGTATCGCATTCGCTACGTTCTTCATCGATGCGAGAGCCGAGATATCCGTTGCCGAGATCTACCA
ACAATTGCAATGATCTATCCCCATCACGATGAAATTTCAAAGATTACCCGGCCTGTCGGCCAAGGCTATAGACTCGTTGAATACATCAGGCACGTCAATGAGGAGGAGCT
GACGCCGACAGTTCGATGCCCGAGCACCGAGCCTACCGACCCAAACTACAGAATCACCACTCACGCGCCGTACGCATTCGAGCCCGGGCAACGCTCGACTATCAGCACCG
AGCCTACAGAAAAGCAAGGGTGCAGAGTCGTCGGGCGAGCCGAGCACGCAACGCCGTGCGCGGTCTTTCCTTCCCTTTCCATTCTCGATCACTTTAGTTTTGTTTCCAAT
GGTTGA
mRNA sequence
ATGGCTCTGAGCGCGGCACGGTCCCAGTCCCGAACCCGTCGGCTGTCGGTGGACTGCTCGAGCTGCTCCCGCGGCGAGAGCGGTCGCCGCGTGCCGGCCGGGACGACTGG
AACGGCTCCTTCGGGGCCTTCCCGGCGTCGAACAGTCAACTCAGAACTAGAAGGGTTGGATTTTGGTTTTGCGGTCGGATTCCCTTGTCCGTACCAGTCTGAGTTGACTG
TTCGACGCCCGGGTGAAGCCCCGAAGGAGCGTTCCCAGTCCGTCCCCCGGCCGGCACGCGCGACCCGCTCTCGCCCGGGAGCAGCTCGAGCAGTCCACCGACAGCCGACG
GGTCGGGACTGGACCCCCGTGCCCAGCCCTCAAGCCAATCCTTCCCGAAGTTACGGATCATTTTCCGACTCCCTTGCCTACATTGTTCCATCGACCAAGGCTGTCACCTT
GGAGAACCTGATCGTTATGAGTACGACCGTGCGTGAGTGGCACCGTCCTCCGGATTTTCAAGGGCCGCCGGGGCCACCGACACCACGCGACGTGCGGTGCTCTTCCAGCC
ACTGGACCCTACCTCCGGCTGAGCCGTTTCCAGACTCCTGTCCGTGTTCAAGACGGGTCGAATGGGGAGCCCACAGGCCGATGCCAGGAGCGCGCAATGCCAAGCCCGCC
CGAAGGCGCGCGCTGCCAGCCACGATCGGACGACGACGTCTCCACAGGCGTAACAAAGCCTGGTTAGGCCGCCGTCTCAATCCGCATCGTCCAGCCCCAAGTCGATCGGC
GACCGGCTCATCACCGTTCCACATCCGACTGGGCACATCGCCGGCCCCCATCCGCTTCCCTCCCGACAATTTCAAGCACTATTTGACTCTCTTTTCAAATCCTTTCATCT
TTCCTCGGGGTCCGAGCGCAACGGGCTCCACCCTCTCTGGCGCCCCCTCCAGGACTTGTGCCCGGTCCGCCGCTGAGGACGCTTCTCCAGACTACAATTCGGACGCGCGG
ACGCCCGATTCTCAAGCTGGGCTCTTCCCGGTTCGCTCCGTACTAGGGAATCCTTTGTCGCGACGACGCGCCGAGGACTCGAATTTAAGCCATCCGCGCACGGTGCGCAC
GGGAGGCCAGCGTGTGCCCCCGCCCGCGCAACGAGCCCACATGGGGGTTGGTGCGGGCAGCGATGCTGACGCCCCAGCAGACGTGCTCGGCCAGAAGCTCGGGCGCAACT
TGCGTTCAAAGACTCGGTGGTTCGCGGATCCTGCAATTCACACCAAGTATCGCATTCGCTACGTTCTTCATCGATGCGAGAGCCGAGATATCCGTTGCCGAGATCTACCA
ACAATTGCAATGATCTATCCCCATCACGATGAAATTTCAAAGATTACCCGGCCTGTCGGCCAAGGCTATAGACTCGTTGAATACATCAGGCACGTCAATGAGGAGGAGCT
GACGCCGACAGTTCGATGCCCGAGCACCGAGCCTACCGACCCAAACTACAGAATCACCACTCACGCGCCGTACGCATTCGAGCCCGGGCAACGCTCGACTATCAGCACCG
AGCCTACAGAAAAGCAAGGGTGCAGAGTCGTCGGGCGAGCCGAGCACGCAACGCCGTGCGCGGTCTTTCCTTCCCTTTCCATTCTCGATCACTTTAGTTTTGTTTCCAAT
GGTTGA
Protein sequence
MALSAARSQSRTRRLSVDCSSCSRGESGRRVPAGTTGTAPSGPSRRRTVNSELEGLDFGFAVGFPCPYQSELTVRRPGEAPKERSQSVPRPARATRSRPGAARAVHRQPT
GRDWTPVPSPQANPSRSYGSFSDSLAYIVPSTKAVTLENLIVMSTTVREWHRPPDFQGPPGPPTPRDVRCSSSHWTLPPAEPFPDSCPCSRRVEWGAHRPMPGARNAKPA
RRRALPATIGRRRLHRRNKAWLGRRLNPHRPAPSRSATGSSPFHIRLGTSPAPIRFPPDNFKHYLTLFSNPFIFPRGPSATGSTLSGAPSRTCARSAAEDASPDYNSDAR
TPDSQAGLFPVRSVLGNPLSRRRAEDSNLSHPRTVRTGGQRVPPPAQRAHMGVGAGSDADAPADVLGQKLGRNLRSKTRWFADPAIHTKYRIRYVLHRCESRDIRCRDLP
TIAMIYPHHDEISKITRPVGQGYRLVEYIRHVNEEELTPTVRCPSTEPTDPNYRITTHAPYAFEPGQRSTISTEPTEKQGCRVVGRAEHATPCAVFPSLSILDHFSFVSN
G
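
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code (the page does not name a translation table) and, to stay short, translates only the first 30 and last 18 bases of the CDS listed above. `translate` is a hypothetical helper, not a CuGenDB function.

```python
BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

head = translate("ATGGCTCTGAGCGCGGCACGGTCCCAGTCC")  # first 30 bases of the CDS
tail = translate("TTTGTTTCCAATGGTTGA")              # last 18 bases of the CDS
print(head, tail)
```

The two fragments translate to MALSAARSQS and FVSNG followed by a stop, matching the start and end of the protein sequence above.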