; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10005155 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10005155
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function (DUF1685)
Genome locationChr07:92744..93436
RNA-Seq ExpressionHG10005155
SyntenyHG10005155
Gene Ontology termsNA
InterPro domainsIPR012881 - Protein of unknown function DUF1685


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572385.1 hypothetical protein SDJN03_29113, partial [Cucurbita argyrosperma subsp. sororia]2.3e-9079.57Show/hide
Query:  MAAEEILPLFDLFWFQRAILAGEPLLQTCSSAPENRFQSPVMQVTKGRSQSEYLLSSKNFPPPETAVYSTGSIISTNQKLETILSGKVTEFSGNGEGKPA
        MAAEEILPLFDLFWFQ A+  G+PLL T   +PENR QSPVMQV K RSQSEY LSS NF PPETA Y      S NQKL+ ILSGKVTEFSG GEGKPA
Subjt:  MAAEEILPLFDLFWFQRAILAGEPLLQTCSSAPENRFQSPVMQVTKGRSQSEYLLSSKNFPPPETAVYSTGSIISTNQKLETILSGKVTEFSGNGEGKPA

Query:  KKKLEGNENKRRRKRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEKRILM
        KKK EG+E +RRRKRG+GLSKSLSDLEFEELKGFMDLGFVFSEEDK +S+LASIIPGLQRLG+KT   EEE GIE+G+SRPYLSEAW+AVEEE EKR LM
Subjt:  KKKLEGNENKRRRKRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEKRILM

Query:  KWRVPGLGATKMDMKDHLKFWAHTVASTVR
        KWRVP LGAT+MDMKDHLKFWAHTVASTVR
Subjt:  KWRVPGLGATKMDMKDHLKFWAHTVASTVR

XP_022952129.1 uncharacterized protein LOC111454895 [Cucurbita moschata]1.7e-9079.57Show/hide
Query:  MAAEEILPLFDLFWFQRAILAGEPLLQTCSSAPENRFQSPVMQVTKGRSQSEYLLSSKNFPPPETAVYSTGSIISTNQKLETILSGKVTEFSGNGEGKPA
        MAAEEILPLFDLFWFQ A+  G+PLL T   +PENR QSPVMQV K RSQSEY LSS NF PPETA Y      S NQKL+ ILSGKVTEFSG GEGKPA
Subjt:  MAAEEILPLFDLFWFQRAILAGEPLLQTCSSAPENRFQSPVMQVTKGRSQSEYLLSSKNFPPPETAVYSTGSIISTNQKLETILSGKVTEFSGNGEGKPA

Query:  KKKLEGNENKRRRKRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEKRILM
        KKK EG+E +RRRKRG+GLSKSLSDLEFEELKGFMDLGFVFSEEDK +S+LASIIPGLQRLG+KT   EEE GIE+G+SRPYLSEAW+AVEEE EKR LM
Subjt:  KKKLEGNENKRRRKRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEKRILM

Query:  KWRVPGLGATKMDMKDHLKFWAHTVASTVR
        KWRVP LGAT+MDMKDHLKFWAHTVASTVR
Subjt:  KWRVPGLGATKMDMKDHLKFWAHTVASTVR

XP_022969253.1 uncharacterized protein LOC111468311 [Cucurbita maxima]3.0e-9079.13Show/hide
Query:  MAAEEILPLFDLFWFQRAILAGEPLLQTCSSAPENRFQSPVMQVTKGRSQSEYLLSSKNFPPPETAVYSTGSIISTNQKLETILSGKVTEFSGNGEGKPA
        MAAEEIL LFDLFWFQ A+  G PLL T   +PENR QSPVMQV K RSQSEY LSS NF PPETA Y      STNQKL+ ILSGKVTEFSG G GKPA
Subjt:  MAAEEILPLFDLFWFQRAILAGEPLLQTCSSAPENRFQSPVMQVTKGRSQSEYLLSSKNFPPPETAVYSTGSIISTNQKLETILSGKVTEFSGNGEGKPA

Query:  KKKLEGNENKRRRKRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEKRILM
        KKK EG+E +RRRKRG+GLSKSLSDLEFEELKGFMDLGFVFSEEDK +S+LASIIPGLQRLG++T E EEE GIE+G+SRPYLSEAW+AVEEE EKR LM
Subjt:  KKKLEGNENKRRRKRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEKRILM

Query:  KWRVPGLGATKMDMKDHLKFWAHTVASTVR
        KWRVP LGAT+MDMKDHLKFWAHTVASTVR
Subjt:  KWRVPGLGATKMDMKDHLKFWAHTVASTVR

XP_023554329.1 uncharacterized protein LOC111811624 [Cucurbita pepo subsp. pepo]1.2e-8878.26Show/hide
Query:  MAAEEILPLFDLFWFQRAILAGEPLLQTCSSAPENRFQSPVMQVTKGRSQSEYLLSSKNFPPPETAVYSTGSIISTNQKLETILSGKVTEFSGNGEGKPA
        MAAEEILPLFDLFWFQ A+  G+PLL T   +PENR QSPVMQV K RSQSEY LSS NF PPET  Y      S NQKL+ ILSGKVTEF+G  EGKPA
Subjt:  MAAEEILPLFDLFWFQRAILAGEPLLQTCSSAPENRFQSPVMQVTKGRSQSEYLLSSKNFPPPETAVYSTGSIISTNQKLETILSGKVTEFSGNGEGKPA

Query:  KKKLEGNENKRRRKRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEKRILM
        KKK EG+E +RRRKRG+GLSKSLSDLEFEELKGFMDLGFVFSEEDK +S+LASIIPGLQRLG+KT   EEE GIE+G+SRPYLSEAW+AVEEE EKR LM
Subjt:  KKKLEGNENKRRRKRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEKRILM

Query:  KWRVPGLGATKMDMKDHLKFWAHTVASTVR
        KWRVP LGAT+MDMKDHLKFWAHTVASTVR
Subjt:  KWRVPGLGATKMDMKDHLKFWAHTVASTVR

XP_038887878.1 uncharacterized protein LOC120077867 [Benincasa hispida]1.1e-10889.32Show/hide
Query:  MAAEEILPLFDLFWFQRAILAGEPLLQTCSSAPENRFQSPVMQVTKGRSQSEYLLSSKNFPPPETAVYSTGSIISTNQKLETILSGKVTEFSGNGEGK--
        MAAEEILPLFDLFWFQRAI  G+PLLQT SSAPE RFQSPV QV K RSQSEYLLSSK+FPPPETAVYSTGS+IST+QKL+TILSGKV EF+GNGEGK  
Subjt:  MAAEEILPLFDLFWFQRAILAGEPLLQTCSSAPENRFQSPVMQVTKGRSQSEYLLSSKNFPPPETAVYSTGSIISTNQKLETILSGKVTEFSGNGEGK--

Query:  --PAKKKLEGNENKRRRKRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEK
          PAKKKLEGNENKRR+KRGKGLSKSLSDLEFEELKGFMDLGFVF EEDKNDSNLASIIPGLQRLGQKTGE EEE  IENG+SRPYLSEAWEAVEEENEK
Subjt:  --PAKKKLEGNENKRRRKRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEK

Query:  RILMKWRVPGLGATKMDMKDHLKFWAHTVASTVR
        RILMKWRVPGLGAT+MDMKDHLKFWAHTVASTVR
Subjt:  RILMKWRVPGLGATKMDMKDHLKFWAHTVASTVR

TrEMBL top hitse value%identityAlignment
A0A0A0K3T7 Uncharacterized protein9.3e-8274.04Show/hide
Query:  MAAEEILPLFDLFWFQRAILAGEPLLQTCSSAPENRFQSPVMQVTKGRSQSEYLLSSKNFPPPETAVYSTGSIISTNQKLETILSGKVTEFSGNGEG---
        MAAEEILPLFDLFWFQRAI + +  L+TC       FQSPV QV K RSQSEYLL+SK+FPPPETA       +++NQKLETILSGKVTEF GN EG   
Subjt:  MAAEEILPLFDLFWFQRAILAGEPLLQTCSSAPENRFQSPVMQVTKGRSQSEYLLSSKNFPPPETAVYSTGSIISTNQKLETILSGKVTEFSGNGEG---

Query:  KPAKKKLEGNENK-RRRKRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLG-QKTGEKEEETGIENGISRPYLSEAWEAVEEENE
        K  KKKLEGNE+K RR+K+GKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNL SIIPGL RLG +KT EK  E G+   + RPYLSEAW+A+EEENE
Subjt:  KPAKKKLEGNENK-RRRKRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLG-QKTGEKEEETGIENGISRPYLSEAWEAVEEENE

Query:  KRILMKWRVPGLGATKMDMKDHLKFWAHTVASTVR
        K ILMKWRVP LGAT+MD+K HLKFWAHTVASTVR
Subjt:  KRILMKWRVPGLGATKMDMKDHLKFWAHTVASTVR

A0A1S3C1U3 uncharacterized protein LOC1034954951.1e-7772.03Show/hide
Query:  MAAEEILPLFDLFWFQRAILAGEPLLQTCSSAPENRFQ-SPVMQVTKGRSQSEYLLSSKNFPPPETAVYSTGSIISTNQKLETILSGKVTEFSGNGEGKP
        MAAEEILPLFDLFWFQ+AI   +PLL+TC       FQ SPVM   K RSQSEYLL+SK+FPPP T        +++NQKLET+LSG+VTEF G+GEGK 
Subjt:  MAAEEILPLFDLFWFQRAILAGEPLLQTCSSAPENRFQ-SPVMQVTKGRSQSEYLLSSKNFPPPETAVYSTGSIISTNQKLETILSGKVTEFSGNGEGKP

Query:  AK---KKLEGNENK-RRRKRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLG-QKTGEKEEETGIENGISRPYLSEAWEAVEEEN
         K   KKLEGNENK RR+K+ KGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNL SIIPGL RLG Q T EK  E G+   + RPYLSEAWEA+EEEN
Subjt:  AK---KKLEGNENK-RRRKRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLG-QKTGEKEEETGIENGISRPYLSEAWEAVEEEN

Query:  EKRILMKWRVPGLGATKMDMKDHLKFWAHTVASTVR
        EK +LMKWRVP LGAT+MD+K HLKFWAHTVASTVR
Subjt:  EKRILMKWRVPGLGATKMDMKDHLKFWAHTVASTVR

A0A6J1D2N3 uncharacterized protein LOC1110167788.2e-8674.9Show/hide
Query:  MAAEEILPLFDLFWFQRAILAGEPLLQT-----CSSAPENRFQSPVMQVTKGRSQSEYLLSSKNFPPPETAVYSTGSIISTNQKLETILSGKVTEFSGNG
        MA+EEIL LFD FWFQ  + AG+PLL+T      S+APEN  +SP+MQV +GRSQSEYLL S +FP PETA YSTGS I TN+KL+TILSG+VTEFSG  
Subjt:  MAAEEILPLFDLFWFQRAILAGEPLLQT-----CSSAPENRFQSPVMQVTKGRSQSEYLLSSKNFPPPETAVYSTGSIISTNQKLETILSGKVTEFSGNG

Query:  EGKPAKKKLEGNENKRRRKRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTG-------EKEEETGIENGISRPYLSEAWE
         GKPAKKKL GNE K R++RG+GLSKSLSDLEFEELKGFMDLGFVFSEEDKN S+LASIIPGLQRLG+KTG       EK EE G E G+SRPYLSEAWE
Subjt:  EGKPAKKKLEGNENKRRRKRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTG-------EKEEETGIENGISRPYLSEAWE

Query:  AVEEENEKRILMKWRVPGLG-ATKMDMKDHLKFWAHTVASTVR
        A +EENEKRILMKWRVP LG AT+MDMKDHLKFWAHTVASTVR
Subjt:  AVEEENEKRILMKWRVPGLG-ATKMDMKDHLKFWAHTVASTVR

A0A6J1GKQ8 uncharacterized protein LOC1114548958.4e-9179.57Show/hide
Query:  MAAEEILPLFDLFWFQRAILAGEPLLQTCSSAPENRFQSPVMQVTKGRSQSEYLLSSKNFPPPETAVYSTGSIISTNQKLETILSGKVTEFSGNGEGKPA
        MAAEEILPLFDLFWFQ A+  G+PLL T   +PENR QSPVMQV K RSQSEY LSS NF PPETA Y      S NQKL+ ILSGKVTEFSG GEGKPA
Subjt:  MAAEEILPLFDLFWFQRAILAGEPLLQTCSSAPENRFQSPVMQVTKGRSQSEYLLSSKNFPPPETAVYSTGSIISTNQKLETILSGKVTEFSGNGEGKPA

Query:  KKKLEGNENKRRRKRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEKRILM
        KKK EG+E +RRRKRG+GLSKSLSDLEFEELKGFMDLGFVFSEEDK +S+LASIIPGLQRLG+KT   EEE GIE+G+SRPYLSEAW+AVEEE EKR LM
Subjt:  KKKLEGNENKRRRKRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEKRILM

Query:  KWRVPGLGATKMDMKDHLKFWAHTVASTVR
        KWRVP LGAT+MDMKDHLKFWAHTVASTVR
Subjt:  KWRVPGLGATKMDMKDHLKFWAHTVASTVR

A0A6J1I0F9 uncharacterized protein LOC1114683111.4e-9079.13Show/hide
Query:  MAAEEILPLFDLFWFQRAILAGEPLLQTCSSAPENRFQSPVMQVTKGRSQSEYLLSSKNFPPPETAVYSTGSIISTNQKLETILSGKVTEFSGNGEGKPA
        MAAEEIL LFDLFWFQ A+  G PLL T   +PENR QSPVMQV K RSQSEY LSS NF PPETA Y      STNQKL+ ILSGKVTEFSG G GKPA
Subjt:  MAAEEILPLFDLFWFQRAILAGEPLLQTCSSAPENRFQSPVMQVTKGRSQSEYLLSSKNFPPPETAVYSTGSIISTNQKLETILSGKVTEFSGNGEGKPA

Query:  KKKLEGNENKRRRKRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEKRILM
        KKK EG+E +RRRKRG+GLSKSLSDLEFEELKGFMDLGFVFSEEDK +S+LASIIPGLQRLG++T E EEE GIE+G+SRPYLSEAW+AVEEE EKR LM
Subjt:  KKKLEGNENKRRRKRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEKRILM

Query:  KWRVPGLGATKMDMKDHLKFWAHTVASTVR
        KWRVP LGAT+MDMKDHLKFWAHTVASTVR
Subjt:  KWRVPGLGATKMDMKDHLKFWAHTVASTVR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05870.1 Protein of unknown function (DUF1685)1.7e-0629.41Show/hide
Query:  SKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQR--------LGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEKRILMKWRVPGLGATK
        SKSL+D + E+L+G +DLGF FS ++  +  L + +P L+         L  K  +  E + +E+  S P ++              +  W++   G   
Subjt:  SKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQR--------LGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEKRILMKWRVPGLGATK

Query:  MDMKDHLKFWAHTVASTVR
         D+K  LK+WA  VA TV+
Subjt:  MDMKDHLKFWAHTVASTVR

AT1G05870.2 Protein of unknown function (DUF1685)1.7e-0629.41Show/hide
Query:  SKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQR--------LGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEKRILMKWRVPGLGATK
        SKSL+D + E+L+G +DLGF FS ++  +  L + +P L+         L  K  +  E + +E+  S P ++              +  W++   G   
Subjt:  SKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQR--------LGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEKRILMKWRVPGLGATK

Query:  MDMKDHLKFWAHTVASTVR
         D+K  LK+WA  VA TV+
Subjt:  MDMKDHLKFWAHTVASTVR

AT1G05870.3 Protein of unknown function (DUF1685)1.7e-0629.41Show/hide
Query:  SKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQR--------LGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEKRILMKWRVPGLGATK
        SKSL+D + E+L+G +DLGF FS ++  +  L + +P L+         L  K  +  E + +E+  S P ++              +  W++   G   
Subjt:  SKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQR--------LGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEKRILMKWRVPGLGATK

Query:  MDMKDHLKFWAHTVASTVR
         D+K  LK+WA  VA TV+
Subjt:  MDMKDHLKFWAHTVASTVR

AT1G05870.4 Protein of unknown function (DUF1685)1.7e-0629.41Show/hide
Query:  SKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQR--------LGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEKRILMKWRVPGLGATK
        SKSL+D + E+L+G +DLGF FS ++  +  L + +P L+         L  K  +  E + +E+  S P ++              +  W++   G   
Subjt:  SKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQR--------LGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEKRILMKWRVPGLGATK

Query:  MDMKDHLKFWAHTVASTVR
         D+K  LK+WA  VA TV+
Subjt:  MDMKDHLKFWAHTVASTVR

AT2G42760.1 unknown protein7.4e-3137.32Show/hide
Query:  MAAEEILPLFDLFWFQRAILAGEPLLQTCSSAPENRFQSPVMQVTKGRSQSEYLLSSKNFP-----------------PPETAVYSTGS-----------
        MA EE+L LF+  W +R I   +      +   ++R +    ++ + R + E L   KNFP                   +T+++S+ S           
Subjt:  MAAEEILPLFDLFWFQRAILAGEPLLQTCSSAPENRFQSPVMQVTKGRSQSEYLLSSKNFP-----------------PPETAVYSTGS-----------

Query:  -IISTNQKLETILSGKVTEFSGNGEGKPAKKKLEGNENKRRRKRGKG-----LSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQK--
         +  T  KL+TILSGK        E    ++ L   E +R++K+ K        KS+SDLE+EELKGFMDLGFVFSE+D  DS+L SI+PGLQRL +K  
Subjt:  -IISTNQKLETILSGKVTEFSGNGEGKPAKKKLEGNENKRRRKRGKG-----LSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQK--

Query:  --TGEKEEETGIE----NGISRPYLSEAWEAVEEENEKRIL---MKWRVPG-LGATKMDMKDHLKFWAHTVASTVR
          T E+EEE   +    N  +RPYLSEAW+       K+ +   +KWRVP    A+++D+KD+L+ WAH VAST+R
Subjt:  --TGEKEEETGIE----NGISRPYLSEAWEAVEEENEKRIL---MKWRVPG-LGATKMDMKDHLKFWAHTVASTVR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCCGAAGAAATCCTCCCTCTTTTTGATCTCTTCTGGTTTCAACGGGCAATTTTAGCCGGAGAACCGCTTTTGCAGACATGCTCCTCGGCGCCGGAAAACCGCTT
TCAGAGTCCTGTAATGCAAGTGACGAAGGGAAGATCTCAAAGCGAGTATCTTCTGAGCTCGAAGAATTTCCCACCTCCCGAAACCGCTGTTTACTCCACCGGCTCGATTA
TTTCCACCAATCAAAAGCTTGAAACCATTCTTTCCGGTAAGGTAACGGAATTTTCCGGCAACGGAGAGGGGAAACCGGCGAAGAAGAAATTGGAAGGAAATGAAAATAAA
AGAAGAAGGAAAAGGGGGAAAGGGTTGAGTAAGAGCTTATCAGACCTTGAATTTGAAGAGTTGAAAGGTTTTATGGATTTGGGATTTGTGTTCAGTGAAGAAGATAAGAA
TGATTCAAATTTGGCTTCAATAATTCCAGGGTTACAGAGATTAGGTCAAAAAACAGGGGAAAAAGAAGAGGAAACAGGGATTGAAAATGGGATTTCAAGGCCATATTTGT
CTGAAGCTTGGGAAGCTGTTGAAGAAGAAAATGAGAAAAGGATTTTGATGAAATGGAGAGTTCCAGGTTTGGGAGCAACTAAAATGGATATGAAAGATCATCTCAAGTTC
TGGGCTCATACAGTGGCTTCAACTGTGAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGCCGAAGAAATCCTCCCTCTTTTTGATCTCTTCTGGTTTCAACGGGCAATTTTAGCCGGAGAACCGCTTTTGCAGACATGCTCCTCGGCGCCGGAAAACCGCTT
TCAGAGTCCTGTAATGCAAGTGACGAAGGGAAGATCTCAAAGCGAGTATCTTCTGAGCTCGAAGAATTTCCCACCTCCCGAAACCGCTGTTTACTCCACCGGCTCGATTA
TTTCCACCAATCAAAAGCTTGAAACCATTCTTTCCGGTAAGGTAACGGAATTTTCCGGCAACGGAGAGGGGAAACCGGCGAAGAAGAAATTGGAAGGAAATGAAAATAAA
AGAAGAAGGAAAAGGGGGAAAGGGTTGAGTAAGAGCTTATCAGACCTTGAATTTGAAGAGTTGAAAGGTTTTATGGATTTGGGATTTGTGTTCAGTGAAGAAGATAAGAA
TGATTCAAATTTGGCTTCAATAATTCCAGGGTTACAGAGATTAGGTCAAAAAACAGGGGAAAAAGAAGAGGAAACAGGGATTGAAAATGGGATTTCAAGGCCATATTTGT
CTGAAGCTTGGGAAGCTGTTGAAGAAGAAAATGAGAAAAGGATTTTGATGAAATGGAGAGTTCCAGGTTTGGGAGCAACTAAAATGGATATGAAAGATCATCTCAAGTTC
TGGGCTCATACAGTGGCTTCAACTGTGAGATAA
Protein sequenceShow/hide protein sequence
MAAEEILPLFDLFWFQRAILAGEPLLQTCSSAPENRFQSPVMQVTKGRSQSEYLLSSKNFPPPETAVYSTGSIISTNQKLETILSGKVTEFSGNGEGKPAKKKLEGNENK
RRRKRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGEKEEETGIENGISRPYLSEAWEAVEEENEKRILMKWRVPGLGATKMDMKDHLKF
WAHTVASTVR