; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g20730 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g20730
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionWAT1-related protein
Genome locationchr4:15048580..15069571
RNA-Seq ExpressionMoc04g20730
SyntenyMoc04g20730
Gene Ontology termsGO:0055085 - transmembrane transport (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0022857 - transmembrane transporter activity (molecular function)
InterPro domainsIPR000620 - EamA domain
IPR030184 - WAT1-related protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8651785.1 hypothetical protein Csa_006410 [Cucumis sativus]1.9e-6680.5Show/hide
Query:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD
        +LQ KM +IYPC YSSTALMCVMGAIQGVAIS+CVERDWKQWKLGWNIRL+TV +AGIV SGA+V +MAWCVR RGPLYVSVFSPLMLL+VAIAGSL LD
Subjt:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD

Query:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVVTTPKPQND
        EKLHLGSV+GA+LIVCGLYMV WGKSKEMN  LQL  +ES+G+LELKD+ VTTP P N+
Subjt:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVVTTPKPQND

XP_008457201.1 PREDICTED: WAT1-related protein At1g25270-like [Cucumis melo]3.1e-6479.25Show/hide
Query:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD
        +LQ KM +IYPC YSSTALMCVMGAIQGVAIS+C ERDWKQWKLGWNIRLLTV +AGIV +GA V I AWCVR +GPLYVSVFSPLMLL+VAIAGSL LD
Subjt:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD

Query:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVVTTPKPQND
        EKLHLGSV+GA+LIVCGLYMV WGKSKEMN  LQL  +ES+G+L LKDV VTTP P N+
Subjt:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVVTTPKPQND

XP_022155995.1 WAT1-related protein At1g25270-like [Momordica charantia]1.7e-8699.41Show/hide
Query:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD
        +LQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD
Subjt:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD

Query:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVVTTPKPQNDCLKNNSTTST
        EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVVTTPKPQNDCLKNNSTTST
Subjt:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVVTTPKPQNDCLKNNSTTST

XP_031736703.1 WAT1-related protein At1g68170 [Cucumis sativus]1.9e-6680.5Show/hide
Query:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD
        +LQ KM +IYPC YSSTALMCVMGAIQGVAIS+CVERDWKQWKLGWNIRL+TV +AGIV SGA+V +MAWCVR RGPLYVSVFSPLMLL+VAIAGSL LD
Subjt:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD

Query:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVVTTPKPQND
        EKLHLGSV+GA+LIVCGLYMV WGKSKEMN  LQL  +ES+G+LELKD+ VTTP P N+
Subjt:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVVTTPKPQND

XP_038875637.1 WAT1-related protein At1g25270-like [Benincasa hispida]1.2e-6375.86Show/hide
Query:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD
        +LQTKMT+IYPC YSSTA+MCVMGAIQG+ IS+CVERD KQWKLGWNIRLLTVA+AGIV +GAVV +MAWCVR RGPLYVS+FSPLMLLLVAIAGSLCLD
Subjt:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD

Query:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGEL-ELKDV-VVTTPKPQNDC----LKNNSTTS
        EKLHLGSV+GAVLIVCGLYMV WGKSKEMN  LQL  ++S+ +L +LKD+ VVTTPK QN+     + NN+  S
Subjt:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGEL-ELKDV-VVTTPKPQNDC----LKNNSTTS

TrEMBL top hitse value%identityAlignment
A0A1S3C690 WAT1-related protein1.5e-6479.25Show/hide
Query:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD
        +LQ KM +IYPC YSSTALMCVMGAIQGVAIS+C ERDWKQWKLGWNIRLLTV +AGIV +GA V I AWCVR +GPLYVSVFSPLMLL+VAIAGSL LD
Subjt:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD

Query:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVVTTPKPQND
        EKLHLGSV+GA+LIVCGLYMV WGKSKEMN  LQL  +ES+G+L LKDV VTTP P N+
Subjt:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVVTTPKPQND

A0A6A1V714 WAT1-related protein4.1e-5463.91Show/hide
Query:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD
        ++Q KM+E YPCHYSSTALMC+MGAIQ    ++C+ERDW QWKLGWNIRLL V+Y GIVASG +V ++AWCV  RGPL+VS+FSPLML+ VAI GSL LD
Subjt:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD

Query:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVVTTPKPQNDCLKNNSTTST
        EKLHLGS++GAVLIVCGLY+V WGK KEM    QLA ++S  E EL D+V+T P    +    + T ST
Subjt:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVVTTPKPQNDCLKNNSTTST

A0A6J1DP24 WAT1-related protein8.1e-8799.41Show/hide
Query:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD
        +LQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD
Subjt:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD

Query:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVVTTPKPQNDCLKNNSTTST
        EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVVTTPKPQNDCLKNNSTTST
Subjt:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVVTTPKPQNDCLKNNSTTST

A0A6J1EL42 WAT1-related protein5.7e-6476.3Show/hide
Query:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD
        +LQTKMT+IYPC YSSTALMCVMGAIQG+AIS+CVERDWKQWKLGWNIRLLTVA+AGIVASGA+V +MAWCVR RGPLYVS+FSPLMLLLVAIAGSLCL 
Subjt:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD

Query:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVVTTPKPQNDC----LKNNSTTST
        E LHLGSVIGAVLIVCGLYMV WGKS+EMN+            L LKDV VTTPKPQN+     +KNN++ +T
Subjt:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVVTTPKPQNDC----LKNNSTTST

A0A6J1I222 WAT1-related protein1.1e-6275.72Show/hide
Query:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD
        +LQTKMT+IYPC YSSTALMCVMGAIQG+AIS+CVERDWKQWKLGWNIRLLTVA+AGIVASGA+V +MAWCVR RGPLYVS FSPLMLLLVAIAGSLCL 
Subjt:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD

Query:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVVTTPKPQND----CLKNNSTTST
        E LHLGSVIGAVLIVCGLYMV WGK +EMN+            L LKDV VTTPKPQN+     +KNN++ +T
Subjt:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVVTTPKPQND----CLKNNSTTST

SwissProt top hitse value%identityAlignment
F4HVM3 WAT1-related protein At1g681702.6e-3449.01Show/hide
Query:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD
        LLQ K+++ +   Y +  LM +MG +  + +++C E D  +W+LGWNIRLLT+AYA I+ SG VV + AWC+ +RGPL+VSVFSP+ L++VA+ GS  LD
Subjt:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD

Query:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVV
        E LHLGS+IG V+IV  LY+V W K+KEM S L  +      +   KD+ V
Subjt:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVV

Q4PT23 WAT1-related protein At1g252701.0e-3049.24Show/hide
Query:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD
        LLQ K+ +     Y +T+LM  +G++  V I++C + DW+QW+LGW+I LL   Y+GIV SG VV ++AWC+  +GPL+V+VFSP+ L++VA+ GS  L+
Subjt:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD

Query:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSS
        E LHLGS+IGA+++V G+Y+V W K KE  S+
Subjt:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSS

Q8GXB4 WAT1-related protein At1g093802.7e-2645.38Show/hide
Query:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD
        ++QTKM+E +   Y+ST LMC+MG+IQ  AI++  +     W L   +R ++  YAG+VAS     +M+W ++ +GPLYVSVFSPL+L++VAI     L+
Subjt:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD

Query:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMN
        EKL+ G+ +G+ L+V GLY V WGK +E++
Subjt:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMN

Q9FGG3 WAT1-related protein At5g647001.0e-2539.39Show/hide
Query:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD
        +LQ ++ ++YP     T L C++ +IQ   I++ +ERD   WKLGWN+RL+ V Y G + +G    + +W +  RGP+++S+F+PL LL   ++ ++ L 
Subjt:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD

Query:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSS
        E + LGS++G +L++ GLY V WGKS+E  +S
Subjt:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSS

Q9FL41 WAT1-related protein At5g070501.3e-2544.37Show/hide
Query:  LLQTKMTEIYPCH-YSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCL
        +LQ K+ + Y  H  S T L+C +G +Q VA++  +E +   W++GW++ LL  AY+GIVAS     +    ++ RGP++ + FSPLM+++VA+ GS  L
Subjt:  LLQTKMTEIYPCH-YSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCL

Query:  DEKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSL-QLATAES
         EK+ LG VIGAVLIV GLY V WGK KE   ++ +LA  +S
Subjt:  DEKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSL-QLATAES

Arabidopsis top hitse value%identityAlignment
AT1G09380.1 nodulin MtN21 /EamA-like transporter family protein1.9e-2745.38Show/hide
Query:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD
        ++QTKM+E +   Y+ST LMC+MG+IQ  AI++  +     W L   +R ++  YAG+VAS     +M+W ++ +GPLYVSVFSPL+L++VAI     L+
Subjt:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD

Query:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMN
        EKL+ G+ +G+ L+V GLY V WGK +E++
Subjt:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMN

AT1G25270.1 nodulin MtN21 /EamA-like transporter family protein7.4e-3249.24Show/hide
Query:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD
        LLQ K+ +     Y +T+LM  +G++  V I++C + DW+QW+LGW+I LL   Y+GIV SG VV ++AWC+  +GPL+V+VFSP+ L++VA+ GS  L+
Subjt:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD

Query:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSS
        E LHLGS+IGA+++V G+Y+V W K KE  S+
Subjt:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSS

AT1G68170.1 nodulin MtN21 /EamA-like transporter family protein1.9e-3549.01Show/hide
Query:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD
        LLQ K+++ +   Y +  LM +MG +  + +++C E D  +W+LGWNIRLLT+AYA I+ SG VV + AWC+ +RGPL+VSVFSP+ L++VA+ GS  LD
Subjt:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD

Query:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVV
        E LHLGS+IG V+IV  LY+V W K+KEM S L  +      +   KD+ V
Subjt:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVV

AT5G07050.1 nodulin MtN21 /EamA-like transporter family protein9.4e-2744.37Show/hide
Query:  LLQTKMTEIYPCH-YSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCL
        +LQ K+ + Y  H  S T L+C +G +Q VA++  +E +   W++GW++ LL  AY+GIVAS     +    ++ RGP++ + FSPLM+++VA+ GS  L
Subjt:  LLQTKMTEIYPCH-YSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCL

Query:  DEKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSL-QLATAES
         EK+ LG VIGAVLIV GLY V WGK KE   ++ +LA  +S
Subjt:  DEKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSL-QLATAES

AT5G64700.1 nodulin MtN21 /EamA-like transporter family protein7.2e-2739.39Show/hide
Query:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD
        +LQ ++ ++YP     T L C++ +IQ   I++ +ERD   WKLGWN+RL+ V Y G + +G    + +W +  RGP+++S+F+PL LL   ++ ++ L 
Subjt:  LLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVVIMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLD

Query:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSS
        E + LGS++G +L++ GLY V WGKS+E  +S
Subjt:  EKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCAGTTCACGCGGACTGGGTGCCGGGGTCTGTGGTCGAGGCTTGCTGCGCTGCTGCCGGAGGTCATTAAGGCCGCCCGGTTCGTCGGACCTGTTACGCCGGGATGG
CCACCAAATTATGCGGGTCTCTGGATCTAGTGTTGCTGTACTGTCGAAGTTTGCCGCTCGAGTAGTATCGCCGGAAGCCGCGAGAGGTTGTCGGGGTCATGGTCCTGCCT
CAAAACTTATTGATTTGCTTTTCCCTCGGCTACCTCAGGTAGGGGCAGCCCCTGAGACATATGACATTCGTCATCAAAGAGTTGTGCATGAGAGGAAAGTTAGAGGGGAA
GTGAACAGAGGTGAGGGCGGCAGTGATGAGGGAGGGACGGAAGCATCGTTGGCCTGCTCATGCTGGCCAATTTATGCAAATGATGAAGAGAATGAGGGAGAAGATGGAGA
AGAGAGAAACTTACCAGTCGAAGAGGTCGAGGAAGGTGATGGACTAGGCAGAAGAAAGATAAGAGTGGTGGAGAGAGTGTTGGTTGACTTTTTACTGCAGAAACATGTGT
CGGTTACCCTAGCCTCTGCAGCAATCAAAAGCGGTAGTGAAAGAGTAGAACTCAAATTCCAAGAAAAGTCAGGAATTGCGCCTGGTTCATTTTCCCAACATAGTGTGTTT
TCCATGTTTTGCATCAAAACTAAGGGGAACGTTGAGTCAAAATTTATACGTTCAGAGTCTGGCGATGACGTCAGCAACATTCGTGTCAGCCATGCAGAATCTGTGTCCAG
CCATTACCTTCCTTCTCGCCCTCTCCTTCAGACAAAAATGACGGAAATATACCCGTGCCACTACTCGAGCACGGCGCTGATGTGCGTAATGGGAGCAATTCAGGGAGTGG
CAATATCAATGTGCGTGGAGAGGGACTGGAAGCAATGGAAACTTGGCTGGAATATCAGGCTCCTCACAGTGGCATACGCCGGAATCGTGGCTTCAGGAGCCGTAGTAGTG
ATAATGGCGTGGTGCGTACGCGCGAGAGGCCCGTTGTATGTTTCGGTTTTCAGCCCTCTCATGCTCCTGCTCGTAGCTATTGCAGGATCTCTCTGTCTCGACGAAAAATT
ACACCTTGGCAGTGTAATTGGAGCAGTGTTGATTGTATGTGGTTTATACATGGTGTCGTGGGGCAAAAGCAAAGAGATGAACAGCAGTCTTCAATTAGCAACAGCTGAAA
GCGTTGGGGAACTCGAATTAAAAGACGTTGTGGTTACGACTCCAAAGCCTCAAAATGACTGCCTCAAGAATAACAGCACCACTAGTACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGCAGTTCACGCGGACTGGGTGCCGGGGTCTGTGGTCGAGGCTTGCTGCGCTGCTGCCGGAGGTCATTAAGGCCGCCCGGTTCGTCGGACCTGTTACGCCGGGATGG
CCACCAAATTATGCGGGTCTCTGGATCTAGTGTTGCTGTACTGTCGAAGTTTGCCGCTCGAGTAGTATCGCCGGAAGCCGCGAGAGGTTGTCGGGGTCATGGTCCTGCCT
CAAAACTTATTGATTTGCTTTTCCCTCGGCTACCTCAGGTAGGGGCAGCCCCTGAGACATATGACATTCGTCATCAAAGAGTTGTGCATGAGAGGAAAGTTAGAGGGGAA
GTGAACAGAGGTGAGGGCGGCAGTGATGAGGGAGGGACGGAAGCATCGTTGGCCTGCTCATGCTGGCCAATTTATGCAAATGATGAAGAGAATGAGGGAGAAGATGGAGA
AGAGAGAAACTTACCAGTCGAAGAGGTCGAGGAAGGTGATGGACTAGGCAGAAGAAAGATAAGAGTGGTGGAGAGAGTGTTGGTTGACTTTTTACTGCAGAAACATGTGT
CGGTTACCCTAGCCTCTGCAGCAATCAAAAGCGGTAGTGAAAGAGTAGAACTCAAATTCCAAGAAAAGTCAGGAATTGCGCCTGGTTCATTTTCCCAACATAGTGTGTTT
TCCATGTTTTGCATCAAAACTAAGGGGAACGTTGAGTCAAAATTTATACGTTCAGAGTCTGGCGATGACGTCAGCAACATTCGTGTCAGCCATGCAGAATCTGTGTCCAG
CCATTACCTTCCTTCTCGCCCTCTCCTTCAGACAAAAATGACGGAAATATACCCGTGCCACTACTCGAGCACGGCGCTGATGTGCGTAATGGGAGCAATTCAGGGAGTGG
CAATATCAATGTGCGTGGAGAGGGACTGGAAGCAATGGAAACTTGGCTGGAATATCAGGCTCCTCACAGTGGCATACGCCGGAATCGTGGCTTCAGGAGCCGTAGTAGTG
ATAATGGCGTGGTGCGTACGCGCGAGAGGCCCGTTGTATGTTTCGGTTTTCAGCCCTCTCATGCTCCTGCTCGTAGCTATTGCAGGATCTCTCTGTCTCGACGAAAAATT
ACACCTTGGCAGTGTAATTGGAGCAGTGTTGATTGTATGTGGTTTATACATGGTGTCGTGGGGCAAAAGCAAAGAGATGAACAGCAGTCTTCAATTAGCAACAGCTGAAA
GCGTTGGGGAACTCGAATTAAAAGACGTTGTGGTTACGACTCCAAAGCCTCAAAATGACTGCCTCAAGAATAACAGCACCACTAGTACTTGA
Protein sequenceShow/hide protein sequence
MCSSRGLGAGVCGRGLLRCCRRSLRPPGSSDLLRRDGHQIMRVSGSSVAVLSKFAARVVSPEAARGCRGHGPASKLIDLLFPRLPQVGAAPETYDIRHQRVVHERKVRGE
VNRGEGGSDEGGTEASLACSCWPIYANDEENEGEDGEERNLPVEEVEEGDGLGRRKIRVVERVLVDFLLQKHVSVTLASAAIKSGSERVELKFQEKSGIAPGSFSQHSVF
SMFCIKTKGNVESKFIRSESGDDVSNIRVSHAESVSSHYLPSRPLLQTKMTEIYPCHYSSTALMCVMGAIQGVAISMCVERDWKQWKLGWNIRLLTVAYAGIVASGAVVV
IMAWCVRARGPLYVSVFSPLMLLLVAIAGSLCLDEKLHLGSVIGAVLIVCGLYMVSWGKSKEMNSSLQLATAESVGELELKDVVVTTPKPQNDCLKNNSTTST