CuGenDBv2

MS022071 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS022071
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Organ-specific protein S2
Genome location: scaffold47:485335..502530
RNA-Seq Expression: MS022071
Synteny: MS022071
Gene Ontology terms: NA
InterPro domains: IPR024489 - Organ specific protein


Homology
GenBank top hits | e value | % identity | Alignment
XP_022135723.1 uncharacterized protein LOC111007615 [Momordica charantia] | 2.4e-133 | 93.88
Query:  IESRREPGDRKWRNVIKDESILPQANNYDDSKPHLESGEPLFKDVKLQSNFWNTRESGEPLFKDVKSQPNFWYTRESREPLFKDVKPQPNFWNTRESGES
        IESRREPGDRKWRNVIKDESILPQANNYDDSKPHLESGEPLFKDVKLQSNFWNTRESGEPLFKDVKSQPNFWYTRESREPLFKDVKPQPNFWNTRESGES
Subjt:  IESRREPGDRKWRNVIKDESILPQANNYDDSKPHLESGEPLFKDVKLQSNFWNTRESGEPLFKDVKSQPNFWYTRESREPLFKDVKPQPNFWNTRESGES

Query:  LFKDVKPQPNFWYTRALREPLFKDVKSQPNFWNIRVSGEPLFKDVKPQPNFWNTHESIEPLFKDVKPQPNFWNTRESVESRFKDVKPQPNFWYTRESGEP
        LFKDVK QPNFWYTRALREPLFKDVKSQPNFWNIRVSGEPLFK VKPQPN WNTHE+ E  FKDVKPQPNFWNT ES+E  FKDVKPQPNFWYTRESGEP
Subjt:  LFKDVKPQPNFWYTRALREPLFKDVKSQPNFWNIRVSGEPLFKDVKPQPNFWNTHESIEPLFKDVKPQPNFWNTRESVESRFKDVKPQPNFWYTRESGEP

Query:  LFKDVKPQPNFWFTHDVKPEESSSSDQQKPLFKLDIKPQPNLSKR
        LF+DVKPQPNFWFTHDVKPEESSS DQQKPLFKLDIKPQPNL K+
Subjt:  LFKDVKPQPNFWFTHDVKPEESSSSDQQKPLFKLDIKPQPNLSKR

XP_022952150.1 uncharacterized protein LOC111454914 isoform X1 [Cucurbita moschata] | 1.4e-29 | 32.64
Query:  IESRREPGDRKWRNVIKDESILPQANNYDDSKPHLESGEPLFKDVKLQSNFWNTRESGEPLFKDVKSQPNFWYTRE-SREPLFK-DVKPQP--NFWNTRE
        IESR EPGD  WRN I D+S+  +   +  S P+    E   +D           E G+   K+++ +P   +  +  +  LF  D+KP+P  +F+    
Subjt:  IESRREPGDRKWRNVIKDESILPQANNYDDSKPHLESGEPLFKDVKLQSNFWNTRESGEPLFKDVKSQPNFWYTRE-SREPLFK-DVKPQP--NFWNTRE

Query:  SGESLFKDVKPQPNF-WYTRALREPLFKDVKSQPNFWNIRVSGEPLFKDVKPQP--NFWNTHESIEPLFKDVKPQPNFWNTRESVESRF--KDVKPQPNF
          + + +D++P+PN  +Y   ++  LF                    KD++P+P  +F+      + + +D++P+PN     + V+++   KD++P+P+ 
Subjt:  SGESLFKDVKPQPNF-WYTRALREPLFKDVKSQPNFWNIRVSGEPLFKDVKPQP--NFWNTHESIEPLFKDVKPQPNFWNTRESVESRF--KDVKPQPNF

Query:  WYTRESGEPLF--KDVKPQPNFWFTHDVKPEESSSSDQQKPLFKLDIKPQPNLSKRPEDHEKLF-AKDIEPRPQISFYPNDIKTKESGNEKPFVKDIEPR
         +  +  +  F  +D++P+PN  F  DV          +  LF  DI+P+P+ S  P+D +K F A+DIEPRP +SFYP+ +KT      K F KDIEPR
Subjt:  WYTRESGEPLF--KDVKPQPNFWFTHDVKPEESSSSDQQKPLFKLDIKPQPNLSKRPEDHEKLF-AKDIEPRPQISFYPNDIKTKESGNEKPFVKDIEPR

Query:  PQASFYPNDIKTKESGSGTNEKPFAEDFEPRPNLSIY
        P ASFYP+D         T +K  AED EPRPNLS Y
Subjt:  PQASFYPNDIKTKESGSGTNEKPFAEDFEPRPNLSIY

XP_023554803.1 protein PELPK1-like isoform X2 [Cucurbita pepo subsp. pepo] | 1.9e-26 | 32.06
Query:  IESRREPGDRKWRNVIKDESILPQANNYDDSKPHLESGEPLFKDVKLQSNFWNTRESGEPLFKDVKSQPNFWYTRE-SREPLFK-DVKPQP--NFWNTRE
        ++SR EPGD  WRN I ++ +  +   +  S P+    E   +D           E G+   K+++ +P   +  +  +  LF  D++P+P  +F+    
Subjt:  IESRREPGDRKWRNVIKDESILPQANNYDDSKPHLESGEPLFKDVKLQSNFWNTRESGEPLFKDVKSQPNFWYTRE-SREPLFK-DVKPQP--NFWNTRE

Query:  SGESLFKDVKPQPN-FWYTRALREPLFKDVKS-QPN--FWNIRVSGEPLFKDVKPQPNFWNTHESIEPLF--KDVKPQPN--FWNTRESVESRFKDVKPQ
          +    D++P+P+  +Y    ++    +VK  +PN  F+   V  +   KD++P+P+     ++ + +F  +D++P PN  F+     VE   KD++P+
Subjt:  SGESLFKDVKPQPN-FWYTRALREPLFKDVKS-QPN--FWNIRVSGEPLFKDVKPQPNFWNTHESIEPLF--KDVKPQPN--FWNTRESVESRFKDVKPQ

Query:  P--NFWYTRESGEPLFKDVKPQPNFWFTHDVKPEESSSSDQQKPLFKLDIKPQPNLSKRPEDHEKLF-AKDIEPRPQISFYPNDIKTKESGNEKPFVKDI
        P  +F+      E + +D++P+PN  F  DV          +  LF  DI+P+P+ S  P+D EK F  +DIEPRP +SFYP+ +KT      K F KDI
Subjt:  P--NFWYTRESGEPLFKDVKPQPNFWFTHDVKPEESSSSDQQKPLFKLDIKPQPNLSKRPEDHEKLF-AKDIEPRPQISFYPNDIKTKESGNEKPFVKDI

Query:  EPRPQASFYPNDIKTKESGSGTNEKPFAEDFEPRPNLSIY
        EPRP ASFYP++         T +   AED EPRPNLS Y
Subjt:  EPRPQASFYPNDIKTKESGSGTNEKPFAEDFEPRPNLSIY

XP_031745285.1 proteoglycan 4 isoform X2 [Cucumis sativus] | 1.3e-30 | 34
Query:  IESRREPGDRKWRNVIKDESILPQANNYDDSKPHLESGEPLFKDVKLQSNFWNTRESGEPLFKDVKSQPN--FWYTRESREPLF-KDVKPQPN--FWNTR
        IESR EPG  +W+NVI+D+S+   +   +D           +K +K ++ F+N          D+K +P+  F+    S++ LF KD++P+P+  F+   
Subjt:  IESRREPGDRKWRNVIKDESILPQANNYDDSKPHLESGEPLFKDVKLQSNFWNTRESGEPLFKDVKSQPN--FWYTRESREPLF-KDVKPQPN--FWNTR

Query:  ESGESLF-KDVKPQPN--FWYTRALREPLF-KDVKSQPN--FWNIRVSGEPLF-KDVKPQPN--FWNTHESIEPLF-KDVKPQPN--FWNTRESVESRF-
        ES +  F KD++P+P+  F+     +  LF KD++ +P+  F+    S +  F KD++P+P+  F+   ++   LF KD++P+P+  F+   ES +  F 
Subjt:  ESGESLF-KDVKPQPN--FWYTRALREPLF-KDVKSQPN--FWNIRVSGEPLF-KDVKPQPN--FWNTHESIEPLF-KDVKPQPN--FWNTRESVESRF-

Query:  KDVKPQPN--FWYTRESGEPLF-KDVKPQPNFWFTHDVKPEESSSSDQQKPLFKLDIKPQPNLSKRPED--HEKLFAKDIEPRPQISFYPNDIKTKESGN
        KD++P+P+  F+   ++   LF KD++P+P+  F          + + +   F  DI+P+P+ +  P D    KLF KDIEPRP  +FYPND        
Subjt:  KDVKPQPN--FWYTRESGEPLF-KDVKPQPNFWFTHDVKPEESSSSDQQKPLFKLDIKPQPNLSKRPED--HEKLFAKDIEPRPQISFYPNDIKTKESGN

Query:  EKPFVKDIEPRPQASFYPNDIKTKESGSGTNEKPFAEDFEPRPNLSIYNN
        ++ F KDIEPRP A+FYPND         T  K F +D EPRP+ + Y N
Subjt:  EKPFVKDIEPRPQASFYPNDIKTKESGSGTNEKPFAEDFEPRPNLSIYNN

XP_038888700.1 uncharacterized protein LOC120078501 [Benincasa hispida] | 3.5e-28 | 33.43
Query:  IESRREPGDRKWRNVIKDESI----------LPQANNYDDSKPHLESGE-PLF-KDVKLQSNFWNTR-ESGEPLF---KDVKSQPNFWYTRESREPLFKD
        IESR EPGD  WRN+IKDE+I          L   N  DD    L+ G+  LF ++++ Q +    R ++   LF    + +S   F       +P+ K+
Subjt:  IESRREPGDRKWRNVIKDESI----------LPQANNYDDSKPHLESGE-PLF-KDVKLQSNFWNTR-ESGEPLF---KDVKSQPNFWYTRESREPLFKD

Query:  VKPQPN-FWNTRESGESLFKDVKPQPN-FWYTRALREPLF-KDVKSQPN--FWNIRVSGEPLFKDVKPQPN--FWNTHESIEPLFKDVKPQPNFWNTRES
        ++P+P+  ++  +      KD++ QP+  +Y   ++  LF KD++ +PN  F+   V  +   KD++P+PN  F+      +   KD++P+PN     + 
Subjt:  VKPQPN-FWNTRESGESLFKDVKPQPN-FWYTRALREPLF-KDVKSQPN--FWNIRVSGEPLFKDVKPQPN--FWNTHESIEPLFKDVKPQPNFWNTRES

Query:  VESRF--KDVKPQPNFWYTRESGEPLF--KDVKPQPNFWFTHDVKPEESSSSDQQKPLFKLDIKPQPNLS-KRPEDHEKLFAKDIEPRPQISFYPNDIKT
        V+++   KD++P+PN  +  +  +  F  KD++P+PN  F  D         D +  LF  D++PQP LS  R +   KLF KD+EPRP +SFYP+D+KT
Subjt:  VESRF--KDVKPQPNFWYTRESGEPLF--KDVKPQPNFWFTHDVKPEESSSSDQQKPLFKLDIKPQPNLS-KRPEDHEKLFAKDIEPRPQISFYPNDIKT

Query:  KESGNEKPFVKDIEPRPQASFYPNDIKTKESGSGTNEKPFAEDFEPRPNLSIY
              K F KD+E RP  S +P+ +KTK          F +D E +PNLS Y
Subjt:  KESGNEKPFVKDIEPRPQASFYPNDIKTKESGSGTNEKPFAEDFEPRPNLSIY

TrEMBL top hits | e value | % identity | Alignment
A0A0A0K3R4 Uncharacterized protein | 1.2e-34 | 34.32
Query:  IESRREPGDRKWRNVIKDESILPQANNYDDSKPH--LESGEPLFKDVKLQSN--FWNTRESGEPLF-KDVKSQPN--FWYTRESREPLF-KDVKPQPN--
        IESR EPG  +W+NVI+D+S+   +   +D   +  L++    F D K + +  F+   ES +  F KD++ +P+  F+   ES++  F KD++P+P+  
Subjt:  IESRREPGDRKWRNVIKDESILPQANNYDDSKPH--LESGEPLFKDVKLQSN--FWNTRESGEPLF-KDVKSQPN--FWYTRESREPLF-KDVKPQPN--

Query:  FWNTRESGESLF-KDVKPQPN--FWYTRALREPLF-KDVKSQPN---FWNIRVSGEPLFKDVKPQPN--FWNTHESIEPLF-KDVKPQPN--FWNTRESV
        F+   ++   LF KD++P+P+  F+     +  LF KD++ +P+   + N     +   KD++P+P+  F+   ++   LF KD++P+P+  F+   ++ 
Subjt:  FWNTRESGESLF-KDVKPQPN--FWYTRALREPLF-KDVKSQPN---FWNIRVSGEPLFKDVKPQPN--FWNTHESIEPLF-KDVKPQPN--FWNTRESV

Query:  ESRF-KDVKPQPN--FWYTRESGEPLF-KDVKPQP-----------NFWFTHDVKPEESS----SSDQQKPLFKLDIKPQPNLSKRPED--HEKLFAKDI
           F KD++P+P+  F+   ++   LF KD++P+P           N  FT D++P  S+    + D +  LF  DI+P+P+ +  P D    KLF KDI
Subjt:  ESRF-KDVKPQPN--FWYTRESGEPLF-KDVKPQP-----------NFWFTHDVKPEESS----SSDQQKPLFKLDIKPQPNLSKRPED--HEKLFAKDI

Query:  EPRPQISFYPNDIKTKESGNEKPFVKDIEPRPQASFYPNDIKTKESGSGTNEKPFAEDFEPRPNLSIYNN
        EPRP  +FYPND         K F KDIEPRP A+FYPND         TN+K F +D EPRP+++ Y N
Subjt:  EPRPQISFYPNDIKTKESGNEKPFVKDIEPRPQASFYPNDIKTKESGSGTNEKPFAEDFEPRPNLSIYNN

A0A0A0K5Y0 Uncharacterized protein | 2.7e-26 | 35.36
Query:  IESRREPGDRKWRNVIKDESILPQANNYDDSKPHL--ESGEPLFKDVKLQSNFWNTRESGEPLFKDVKSQPNFWY-TRESREPLFK---DVKPQPNFWNT
        IESR EPGD  WRN++KD+         DD    L  E G+ LF + + Q+ F    ++ + L KD++ +P+  +   ++R  LF    ++ P   F+  
Subjt:  IESRREPGDRKWRNVIKDESILPQANNYDDSKPHL--ESGEPLFKDVKLQSNFWNTRESGEPLFKDVKSQPNFWY-TRESREPLFK---DVKPQPNFWNT

Query:  RESGESLFKDVK-PQPNFWYTRALREPLF-KDVKSQPNFWNIRVSGE-PLFKDVKPQPN--FWNTHESIEPLFKDVKPQPN--FWNTRESVESRF-KDVK
         E    L KD   P     Y   ++   F KD++ Q      R   +  L KD++P+PN  F+      +   +D++P+PN  F+   E+    F +DV+
Subjt:  RESGESLFKDVK-PQPNFWYTRALREPLF-KDVKSQPNFWNIRVSGE-PLFKDVKPQPN--FWNTHESIEPLFKDVKPQPN--FWNTRESVESRF-KDVK

Query:  PQPN--FWYTRESGEPLF-KDVKPQPNFWFTHDVKPEESSSSDQQKPLFKLDIKPQPNLSKRPEDH--EKLFAKDIEPRPQISFYP-NDIKTKESGNEKP
        P+PN  F+   E+   LF +DV+P+PN  F  D         D +  LF  D++P+PN+S  P+D    KLFA+D+EPRP   FYP +DIKT      K 
Subjt:  PQPN--FWYTRESGEPLF-KDVKPQPNFWFTHDVKPEESSSSDQQKPLFKLDIKPQPNLSKRPEDH--EKLFAKDIEPRPQISFYP-NDIKTKESGNEKP

Query:  FVKDIEPRPQASFYPNDIKTKESGSGTNEKPFAEDFEPRPNLSIY
         V++IEPRP  SFYP+D         T  K  AED EPRPN+S Y
Subjt:  FVKDIEPRPQASFYPNDIKTKESGSGTNEKPFAEDFEPRPNLSIY

A0A6J1C5P1 uncharacterized protein LOC111007615 | 1.1e-133 | 93.88
Query:  IESRREPGDRKWRNVIKDESILPQANNYDDSKPHLESGEPLFKDVKLQSNFWNTRESGEPLFKDVKSQPNFWYTRESREPLFKDVKPQPNFWNTRESGES
        IESRREPGDRKWRNVIKDESILPQANNYDDSKPHLESGEPLFKDVKLQSNFWNTRESGEPLFKDVKSQPNFWYTRESREPLFKDVKPQPNFWNTRESGES
Subjt:  IESRREPGDRKWRNVIKDESILPQANNYDDSKPHLESGEPLFKDVKLQSNFWNTRESGEPLFKDVKSQPNFWYTRESREPLFKDVKPQPNFWNTRESGES

Query:  LFKDVKPQPNFWYTRALREPLFKDVKSQPNFWNIRVSGEPLFKDVKPQPNFWNTHESIEPLFKDVKPQPNFWNTRESVESRFKDVKPQPNFWYTRESGEP
        LFKDVK QPNFWYTRALREPLFKDVKSQPNFWNIRVSGEPLFK VKPQPN WNTHE+ E  FKDVKPQPNFWNT ES+E  FKDVKPQPNFWYTRESGEP
Subjt:  LFKDVKPQPNFWYTRALREPLFKDVKSQPNFWNIRVSGEPLFKDVKPQPNFWNTHESIEPLFKDVKPQPNFWNTRESVESRFKDVKPQPNFWYTRESGEP

Query:  LFKDVKPQPNFWFTHDVKPEESSSSDQQKPLFKLDIKPQPNLSKR
        LF+DVKPQPNFWFTHDVKPEESSS DQQKPLFKLDIKPQPNL K+
Subjt:  LFKDVKPQPNFWFTHDVKPEESSSSDQQKPLFKLDIKPQPNLSKR

A0A6J1GKY7 uncharacterized protein LOC111454914 isoform X1 | 6.9e-30 | 32.64
Query:  IESRREPGDRKWRNVIKDESILPQANNYDDSKPHLESGEPLFKDVKLQSNFWNTRESGEPLFKDVKSQPNFWYTRE-SREPLFK-DVKPQP--NFWNTRE
        IESR EPGD  WRN I D+S+  +   +  S P+    E   +D           E G+   K+++ +P   +  +  +  LF  D+KP+P  +F+    
Subjt:  IESRREPGDRKWRNVIKDESILPQANNYDDSKPHLESGEPLFKDVKLQSNFWNTRESGEPLFKDVKSQPNFWYTRE-SREPLFK-DVKPQP--NFWNTRE

Query:  SGESLFKDVKPQPNF-WYTRALREPLFKDVKSQPNFWNIRVSGEPLFKDVKPQP--NFWNTHESIEPLFKDVKPQPNFWNTRESVESRF--KDVKPQPNF
          + + +D++P+PN  +Y   ++  LF                    KD++P+P  +F+      + + +D++P+PN     + V+++   KD++P+P+ 
Subjt:  SGESLFKDVKPQPNF-WYTRALREPLFKDVKSQPNFWNIRVSGEPLFKDVKPQP--NFWNTHESIEPLFKDVKPQPNFWNTRESVESRF--KDVKPQPNF

Query:  WYTRESGEPLF--KDVKPQPNFWFTHDVKPEESSSSDQQKPLFKLDIKPQPNLSKRPEDHEKLF-AKDIEPRPQISFYPNDIKTKESGNEKPFVKDIEPR
         +  +  +  F  +D++P+PN  F  DV          +  LF  DI+P+P+ S  P+D +K F A+DIEPRP +SFYP+ +KT      K F KDIEPR
Subjt:  WYTRESGEPLF--KDVKPQPNFWFTHDVKPEESSSSDQQKPLFKLDIKPQPNLSKRPEDHEKLF-AKDIEPRPQISFYPNDIKTKESGNEKPFVKDIEPR

Query:  PQASFYPNDIKTKESGSGTNEKPFAEDFEPRPNLSIY
        P ASFYP+D         T +K  AED EPRPNLS Y
Subjt:  PQASFYPNDIKTKESGSGTNEKPFAEDFEPRPNLSIY

A0A6L5B7E4 Uncharacterized protein | 1.6e-26 | 31.7
Query:  ESRREPGDRKWRNVIKDESILPQANNYDDSKPHLESGEPLFKDVKLQSNFW-----NTRESGEPLFKDVKSQPNF--WYTRE---SREPLFKDVKPQPNF
        + +RE G+    + + D    P  ++Y D +  L   E   KD++ + N         RE GE    D++ +PN   ++  E     E   KD++P+PN 
Subjt:  ESRREPGDRKWRNVIKDESILPQANNYDDSKPHLESGEPLFKDVKLQSNFW-----NTRESGEPLFKDVKSQPNF--WYTRE---SREPLFKDVKPQPNF

Query:  W-----NTRESGESLFKDVKPQPN---FWYTRALR--EPLFKDVKSQPNFW-----NIRVSGEPLFKDVKPQPNFWNTHE-----SIEPLFKDVKPQPNF
                RE GES   D++P+PN   +     LR  E   KD++ +PN         R  GE    D++P+PN  + H+       E   KD++P+PN 
Subjt:  W-----NTRESGESLFKDVKPQPN---FWYTRALR--EPLFKDVKSQPNFW-----NIRVSGEPLFKDVKPQPNFWNTHE-----SIEPLFKDVKPQPNF

Query:  W-----NTRESVESRFKDVKPQPNF--WYTRE---SGEPLFKDVKPQPNF-------------WFTHDVKPEESSSS--DQQK----PLFKLDIKPQPNL
                 +  ES   D++P+PN   ++  E     E   KD++P+PN               F  D++P  + S+  D +K      F  DI+P+PN+
Subjt:  W-----NTRESVESRFKDVKPQPNF--WYTRE---SGEPLFKDVKPQPNF-------------WFTHDVKPEESSSS--DQQK----PLFKLDIKPQPNL

Query:  SKRPEDHEKL-----FAKDIEPRPQISFYPNDIKTKESGNEKPFVKDIEPRPQASFYPNDIKTKESGSGTNEKPFAEDFEPRPNLSIY
        S    D +KL     F++DIEPRP IS Y +D K  ++ +   FVKDIEPRP  S Y +D K KE      EK  A+D +PRPN+SIY
Subjt:  SKRPEDHEKL-----FAKDIEPRPQISFYPNDIKTKESGNEKPFVKDIEPRPQASFYPNDIKTKESGSGTNEKPFAEDFEPRPNLSIY

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATAGAGTCAAGACGTGAGCCAGGAGATCGTAAGTGGAGAAATGTGATAAAAGATGAGTCGATATTGCCTCAAGCAAATAACTATGACGATTCAAAGCCTCATCTGGAAAG
TGGAGAGCCTCTTTTTAAAGATGTAAAACTTCAATCAAACTTCTGGAACACTCGTGAAAGTGGGGAGCCTCTCTTTAAGGATGTAAAGTCTCAGCCAAATTTCTGGTATA
CTCGTGAAAGTAGAGAGCCACTCTTTAAAGATGTAAAGCCTCAACCAAACTTCTGGAACACTCGTGAAAGTGGAGAGTCTCTCTTTAAAGATGTAAAGCCTCAACCAAAC
TTCTGGTACACTCGTGCCCTTAGAGAGCCTCTCTTTAAAGATGTAAAGTCTCAACCAAACTTTTGGAACATTCGTGTTAGTGGAGAGCCTCTCTTTAAAGATGTAAAGCC
TCAACCAAACTTCTGGAACACTCATGAAAGTATAGAGCCTCTCTTTAAAGATGTAAAGCCTCAACCAAACTTTTGGAATACTCGTGAAAGTGTAGAGTCTCGCTTTAAAG
ATGTAAAGCCTCAACCAAACTTCTGGTACACTCGTGAAAGTGGAGAGCCTCTCTTTAAAGATGTAAAACCTCAACCAAACTTCTGGTTCACACATGATGTTAAACCAGAA
GAGTCGTCCTCCAGTGATCAACAAAAACCTCTCTTTAAGCTGGATATAAAGCCTCAACCAAATTTATCAAAACGACCTGAAGATCATGAAAAGCTTTTCGCTAAGGATAT
CGAGCCTCGACCACAAATCTCATTTTATCCAAACGACATCAAAACGAAAGAGTCTGGCAATGAAAAGCCTTTCGTTAAGGATATCGAGCCTCGACCGCAAGCCTCATTTT
ATCCAAACGACATCAAAACGAAAGAGTCTGGCAGTGGCACCAATGAGAAGCCATTTGCTGAAGATTTTGAGCCAAGACCTAATCTGTCTATCTACAACAAC
mRNA sequence
ATAGAGTCAAGACGTGAGCCAGGAGATCGTAAGTGGAGAAATGTGATAAAAGATGAGTCGATATTGCCTCAAGCAAATAACTATGACGATTCAAAGCCTCATCTGGAAAG
TGGAGAGCCTCTTTTTAAAGATGTAAAACTTCAATCAAACTTCTGGAACACTCGTGAAAGTGGGGAGCCTCTCTTTAAGGATGTAAAGTCTCAGCCAAATTTCTGGTATA
CTCGTGAAAGTAGAGAGCCACTCTTTAAAGATGTAAAGCCTCAACCAAACTTCTGGAACACTCGTGAAAGTGGAGAGTCTCTCTTTAAAGATGTAAAGCCTCAACCAAAC
TTCTGGTACACTCGTGCCCTTAGAGAGCCTCTCTTTAAAGATGTAAAGTCTCAACCAAACTTTTGGAACATTCGTGTTAGTGGAGAGCCTCTCTTTAAAGATGTAAAGCC
TCAACCAAACTTCTGGAACACTCATGAAAGTATAGAGCCTCTCTTTAAAGATGTAAAGCCTCAACCAAACTTTTGGAATACTCGTGAAAGTGTAGAGTCTCGCTTTAAAG
ATGTAAAGCCTCAACCAAACTTCTGGTACACTCGTGAAAGTGGAGAGCCTCTCTTTAAAGATGTAAAACCTCAACCAAACTTCTGGTTCACACATGATGTTAAACCAGAA
GAGTCGTCCTCCAGTGATCAACAAAAACCTCTCTTTAAGCTGGATATAAAGCCTCAACCAAATTTATCAAAACGACCTGAAGATCATGAAAAGCTTTTCGCTAAGGATAT
CGAGCCTCGACCACAAATCTCATTTTATCCAAACGACATCAAAACGAAAGAGTCTGGCAATGAAAAGCCTTTCGTTAAGGATATCGAGCCTCGACCGCAAGCCTCATTTT
ATCCAAACGACATCAAAACGAAAGAGTCTGGCAGTGGCACCAATGAGAAGCCATTTGCTGAAGATTTTGAGCCAAGACCTAATCTGTCTATCTACAACAAC
Protein sequence
IESRREPGDRKWRNVIKDESILPQANNYDDSKPHLESGEPLFKDVKLQSNFWNTRESGEPLFKDVKSQPNFWYTRESREPLFKDVKPQPNFWNTRESGESLFKDVKPQPN
FWYTRALREPLFKDVKSQPNFWNIRVSGEPLFKDVKPQPNFWNTHESIEPLFKDVKPQPNFWNTRESVESRFKDVKPQPNFWYTRESGEPLFKDVKPQPNFWFTHDVKPE
ESSSSDQQKPLFKLDIKPQPNLSKRPEDHEKLFAKDIEPRPQISFYPNDIKTKESGNEKPFVKDIEPRPQASFYPNDIKTKESGSGTNEKPFAEDFEPRPNLSIYNN
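
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal Python sketch, assuming the standard genetic code and using only the first 60 nucleotides of the CDS; the names `CDS_PREFIX`, `CODON_TABLE`, and `translate` are illustrative and not part of CuGenDB.

```python
# Standard genetic code, built from the conventional TCAG ordering
# (first base, then second base, then third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 60 nucleotides of the CDS sequence shown in this record.
CDS_PREFIX = "ATAGAGTCAAGACGTGAGCCAGGAGATCGTAAGTGGAGAAATGTGATAAAAGATGAGTCG"

print(translate(CDS_PREFIX))  # IESRREPGDRKWRNVIKDES
```

The printed peptide matches the first 20 residues of the protein sequence in this record. Pasting the full CDS into `translate` extends the same check to the whole sequence.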