; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS008595 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS008595
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationscaffold4:2466810..2467631
RNA-Seq ExpressionMS008595
SyntenyMS008595
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608324.1 hypothetical protein SDJN03_01666, partial [Cucurbita argyrosperma subsp. sororia]5.9e-6152.32Show/hide
Query:  MNSRDQI--------------PHGQQKKQVRRRRQSRRLYKERPLNMAEARREIATALKLHRASTRE-RQQNQKQH---------------PYFEPEGRM
        MNS DQ+              PHG++KKQVRRRRQSRRLYK+ PLNMAEARREI TALKLHRAST+E ++Q QKQ                P FEPE RM
Subjt:  MNSRDQI--------------PHGQQKKQVRRRRQSRRLYKERPLNMAEARREIATALKLHRASTRE-RQQNQKQH---------------PYFEPEGRM

Query:  KSRRNPRIYPGCSLYLDNRSDFSHVSSPSPKAISLPSCPVPPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVVSNNSHSFCSLSFLPPSSYLCPPVS
        KSRRNPRIYP CS Y +N S F                 + P P++Q+L+++ PIQTLGLN NF D+++V    V +NN+HSF SLSFLPPSSY+CP   
Subjt:  KSRRNPRIYPGCSLYLDNRSDFSHVSSPSPKAISLPSCPVPPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVVSNNSHSFCSLSFLPPSSYLCPPVS

Query:  CAA-TRQEVVESVTLS-EGGKLIASSDVDCSNFPSG---KDTQGAVEEKEAVAEKAM-EIHVM--KALEIDG--HHSF--------DKAVEFPDWLSIND
         AA T QEV +S++LS E G+L+AS     +NFP+G   K+  GAVEE+E   E  + EI  M  K LEIDG  H +F        ++A+EFPDWLSIND
Subjt:  CAA-TRQEVVESVTLS-EGGKLIASSDVDCSNFPSG---KDTQGAVEEKEAVAEKAM-EIHVM--KALEIDG--HHSF--------DKAVEFPDWLSIND

Query:  DFLQQHSNHHCA-EDYVQDPDLS
        DFLQ  SN+  + EDY+QDPDLS
Subjt:  DFLQQHSNHHCA-EDYVQDPDLS

KAG7037674.1 hypothetical protein SDJN02_01304, partial [Cucurbita argyrosperma subsp. argyrosperma]2.2e-6053.97Show/hide
Query:  PHGQQKKQVRRRRQSRRLYKERPLNMAEARREIATALKLHRASTRE-RQQNQKQH---------------PYFEPEGRMKSRRNPRIYPGCSLYLDNRSD
        PHG++KKQVRRRRQSRRLYK+ PLNMAEARREI TALKLHRAST+E ++Q QKQ                P FEPE RMKSRRNPRIYP CS Y +N S 
Subjt:  PHGQQKKQVRRRRQSRRLYKERPLNMAEARREIATALKLHRASTRE-RQQNQKQH---------------PYFEPEGRMKSRRNPRIYPGCSLYLDNRSD

Query:  FSHVSSPSPKAISLPSCPVPPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVVSNNSHSFCSLSFLPPSSYLCPPVSCAA-TRQEVVESVTLS-EGGK
        F                 + P P++Q+L+++ PIQTLGLN NF D+++V   +  +NN+HSF SLSFLPPSSY+CP    AA T QEV +S++LS E G+
Subjt:  FSHVSSPSPKAISLPSCPVPPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVVSNNSHSFCSLSFLPPSSYLCPPVSCAA-TRQEVVESVTLS-EGGK

Query:  LIASSDVDCSNFPSG---KDTQGAVEEKEAVAEKAM-EIHVM--KALEIDG--HHSF--------DKAVEFPDWLSINDDFLQQHSNHHCA-EDYVQDPD
        L+AS     +NFP+G   K+  GAVEE+E   E  + EI  M  K LEIDG  H +F        ++A+EFPDWLSINDDFLQ  SN+  + EDY+QDPD
Subjt:  LIASSDVDCSNFPSG---KDTQGAVEEKEAVAEKAM-EIHVM--KALEIDG--HHSF--------DKAVEFPDWLSINDDFLQQHSNHHCA-EDYVQDPD

Query:  LS
        LS
Subjt:  LS

XP_022132656.1 uncharacterized protein LOC111005459 [Momordica charantia]8.3e-14899.64Show/hide
Query:  MNSRDQIPHGQQKKQVRRRRQSRRLYKERPLNMAEARREIATALKLHRASTRERQQNQKQHPYFEPEGRMKSRRNPRIYPGCSLYLDNRSDFSHVSSPSP
        MNSRDQIPHGQQKKQVRRRRQSRRLYKERPLNMAEARREIATALKLHRASTRERQQNQKQHPYFEPEGRMKSRRNPRIYPGCSLYLDNRSDFSHVSSPSP
Subjt:  MNSRDQIPHGQQKKQVRRRRQSRRLYKERPLNMAEARREIATALKLHRASTRERQQNQKQHPYFEPEGRMKSRRNPRIYPGCSLYLDNRSDFSHVSSPSP

Query:  KAISLPSCPVPPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVVSNNSHSFCSLSFLPPSSYLCPPVSCAATRQEVVESVTLSEGGKLIASSDVDCSN
        KAISLPSCPVPPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVVSNNSHSFCSLSFL PSSYLCPPVSCAATRQEVVESVTLSEGGKLIASSDVDCSN
Subjt:  KAISLPSCPVPPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVVSNNSHSFCSLSFLPPSSYLCPPVSCAATRQEVVESVTLSEGGKLIASSDVDCSN

Query:  FPSGKDTQGAVEEKEAVAEKAMEIHVMKALEIDGHHSFDKAVEFPDWLSINDDFLQQHSNHHCAEDYVQDPDLS
        FPSGKDTQGAVEEKEAVAEKAMEIHVMKALEIDGHHSFDKAVEFPDWLSINDDFLQQHSNHHCAEDYVQDPDLS
Subjt:  FPSGKDTQGAVEEKEAVAEKAMEIHVMKALEIDGHHSFDKAVEFPDWLSINDDFLQQHSNHHCAEDYVQDPDLS

XP_022940715.1 uncharacterized protein LOC111446225 [Cucurbita moschata]3.4e-6151.85Show/hide
Query:  MNSRDQI--------------PHGQQKKQVRRRRQSRRLYKERPLNMAEARREIATALKLHRASTRE-RQQNQKQH---------------PYFEPEGRM
        MNS DQ+              PHG++KKQVRRRRQSRRLYK+ PLNMAEARREI TALKLHRAST+E ++Q QKQ                P FEPE RM
Subjt:  MNSRDQI--------------PHGQQKKQVRRRRQSRRLYKERPLNMAEARREIATALKLHRASTRE-RQQNQKQH---------------PYFEPEGRM

Query:  KSRRNPRIYPGCSLYLDNRSDFSHVSSPSPKAISLPSCPVPPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVVSNNSHSFCSLSFLPPSSYLCPPVS
        KSRRNPRIYP CS Y +N SDF                 + P P++Q+L+++ PIQTLGLN NF D+++V  ++  +NN+HSF SLSFLPPSSY+CP   
Subjt:  KSRRNPRIYPGCSLYLDNRSDFSHVSSPSPKAISLPSCPVPPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVVSNNSHSFCSLSFLPPSSYLCPPVS

Query:  CAA-TRQEVVESVTLS-EGGKLIASSDVDCSNFPSG---KDTQGAVEEKEAVAEKAMEIHVM----KALEIDG--HHSF--------DKAVEFPDWLSIN
         AA T QEV +S++LS E G+L+AS     +NFP+G   K+  GAVEE+E   E+AM   +     K LEIDG  H +F        ++A+EFPDWLSIN
Subjt:  CAA-TRQEVVESVTLS-EGGKLIASSDVDCSNFPSG---KDTQGAVEEKEAVAEKAMEIHVM----KALEIDG--HHSF--------DKAVEFPDWLSIN

Query:  DDFLQQHSNHHCA-EDYVQDPDLS
        DDFLQ  SN+  + EDY+QDPDLS
Subjt:  DDFLQQHSNHHCA-EDYVQDPDLS

XP_022981721.1 uncharacterized protein LOC111480786 [Cucurbita maxima]4.2e-5953.62Show/hide
Query:  PHGQQKKQVRRRRQSRRLYKERPLNMAEARREIATALKLHRASTRE-RQQNQKQH---------------PYFEPEGRMKSRRNPRIYPGCSLYLDNRSD
        PHG++KKQVRRRR++RRLYK+ PLNMAEARREI TALKLHRAST+E ++Q QKQ                P FEPE RMKSRRNPRIYP CS Y  N SD
Subjt:  PHGQQKKQVRRRRQSRRLYKERPLNMAEARREIATALKLHRASTRE-RQQNQKQH---------------PYFEPEGRMKSRRNPRIYPGCSLYLDNRSD

Query:  FSHVSSPSPKAISLPSCPVPPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVV--SNNSHSFCSLSFLPPSSYLCPPVSCAA-TRQEVVESVTLS-EG
        F                 + P P++Q+L+++ PIQTLGLN NF      DT+SVV  +NN+HSF SLSFL PSSY+CP    AA T +EV +S++LS E 
Subjt:  FSHVSSPSPKAISLPSCPVPPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVV--SNNSHSFCSLSFLPPSSYLCPPVSCAA-TRQEVVESVTLS-EG

Query:  GKLIASSDVDCSNFPSG---KDTQGAVEEKEAVAEKAM-EIHVM--KALEIDG--HHSF--------DKAVEFPDWLSINDDFLQQHSNHHCA-EDYVQD
        G+L+AS     +NFP+G   K+  GAVEE+E   E  + EI  M  K LEIDG  H +F        ++A+EFPDWLSINDDFLQ  SN+  + EDY+QD
Subjt:  GKLIASSDVDCSNFPSG---KDTQGAVEEKEAVAEKAM-EIHVM--KALEIDG--HHSF--------DKAVEFPDWLSINDDFLQQHSNHHCA-EDYVQD

Query:  PDLS
        PDLS
Subjt:  PDLS

TrEMBL top hitse value%identityAlignment
A0A1S4DZY0 uncharacterized protein LOC1034937177.5e-4650.16Show/hide
Query:  MNSRDQI----------PHGQQKKQVRRRRQS-RRLYKERPLNMAEARREIATALKLHRAST-----RERQQNQKQH------------PYFEPEGRMKS
        MNS DQ+          P  + KKQVRRRR S RRLYKE PL+MAEARREI TALKLHRAS+     RE+QQ Q Q               FE EGR KS
Subjt:  MNSRDQI----------PHGQQKKQVRRRRQS-RRLYKERPLNMAEARREIATALKLHRAST-----RERQQNQKQH------------PYFEPEGRMKS

Query:  RRNPRIYPG----CSLYLDNRSDFSHVSSPSPKAISLPSCPVPPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVVSNNSHSFCSLSFL-PPSSYLCP
        +RNPRIYP     CS YL+N S F                 V P P  +NLN E PIQT        D  T+DT S       SFCSLSF  PPSSY+CP
Subjt:  RRNPRIYPG----CSLYLDNRSDFSHVSSPSPKAISLPSCPVPPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVVSNNSHSFCSLSFL-PPSSYLCP

Query:  PVSCAAT-RQEVVESVTL-SEGGKLIASSDVDCSNFPSG---KD-TQGAVEEKEAVAE-----KAMEIHVMKALEIDGHHSFDKAVEFPDWLSINDDFLQ
         VSC  T  QE  +SV+L  E G L+AS     +N P+G   KD  Q AV E+EA+A      K+M + V KALEID HHS D A+ FPDW+SINDD LQ
Subjt:  PVSCAAT-RQEVVESVTL-SEGGKLIASSDVDCSNFPSG---KD-TQGAVEEKEAVAE-----KAMEIHVMKALEIDGHHSFDKAVEFPDWLSINDDFLQ

Query:  QHSNHHCA-EDYVQDPDLS
        Q+SN+HC  ED +Q+PDLS
Subjt:  QHSNHHCA-EDYVQDPDLS

A0A5A7V8V7 Putative WRKY transcription factor protein 1 isoform X27.5e-4650.16Show/hide
Query:  MNSRDQI----------PHGQQKKQVRRRRQS-RRLYKERPLNMAEARREIATALKLHRAST-----RERQQNQKQH------------PYFEPEGRMKS
        MNS DQ+          P  + KKQVRRRR S RRLYKE PL+MAEARREI TALKLHRAS+     RE+QQ Q Q               FE EGR KS
Subjt:  MNSRDQI----------PHGQQKKQVRRRRQS-RRLYKERPLNMAEARREIATALKLHRAST-----RERQQNQKQH------------PYFEPEGRMKS

Query:  RRNPRIYPG----CSLYLDNRSDFSHVSSPSPKAISLPSCPVPPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVVSNNSHSFCSLSFL-PPSSYLCP
        +RNPRIYP     CS YL+N S F                 V P P  +NLN E PIQT        D  T+DT S       SFCSLSF  PPSSY+CP
Subjt:  RRNPRIYPG----CSLYLDNRSDFSHVSSPSPKAISLPSCPVPPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVVSNNSHSFCSLSFL-PPSSYLCP

Query:  PVSCAAT-RQEVVESVTL-SEGGKLIASSDVDCSNFPSG---KD-TQGAVEEKEAVAE-----KAMEIHVMKALEIDGHHSFDKAVEFPDWLSINDDFLQ
         VSC  T  QE  +SV+L  E G L+AS     +N P+G   KD  Q AV E+EA+A      K+M + V KALEID HHS D A+ FPDW+SINDD LQ
Subjt:  PVSCAAT-RQEVVESVTL-SEGGKLIASSDVDCSNFPSG---KD-TQGAVEEKEAVAE-----KAMEIHVMKALEIDGHHSFDKAVEFPDWLSINDDFLQ

Query:  QHSNHHCA-EDYVQDPDLS
        Q+SN+HC  ED +Q+PDLS
Subjt:  QHSNHHCA-EDYVQDPDLS

A0A6J1BUG5 uncharacterized protein LOC1110054594.0e-14899.64Show/hide
Query:  MNSRDQIPHGQQKKQVRRRRQSRRLYKERPLNMAEARREIATALKLHRASTRERQQNQKQHPYFEPEGRMKSRRNPRIYPGCSLYLDNRSDFSHVSSPSP
        MNSRDQIPHGQQKKQVRRRRQSRRLYKERPLNMAEARREIATALKLHRASTRERQQNQKQHPYFEPEGRMKSRRNPRIYPGCSLYLDNRSDFSHVSSPSP
Subjt:  MNSRDQIPHGQQKKQVRRRRQSRRLYKERPLNMAEARREIATALKLHRASTRERQQNQKQHPYFEPEGRMKSRRNPRIYPGCSLYLDNRSDFSHVSSPSP

Query:  KAISLPSCPVPPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVVSNNSHSFCSLSFLPPSSYLCPPVSCAATRQEVVESVTLSEGGKLIASSDVDCSN
        KAISLPSCPVPPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVVSNNSHSFCSLSFL PSSYLCPPVSCAATRQEVVESVTLSEGGKLIASSDVDCSN
Subjt:  KAISLPSCPVPPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVVSNNSHSFCSLSFLPPSSYLCPPVSCAATRQEVVESVTLSEGGKLIASSDVDCSN

Query:  FPSGKDTQGAVEEKEAVAEKAMEIHVMKALEIDGHHSFDKAVEFPDWLSINDDFLQQHSNHHCAEDYVQDPDLS
        FPSGKDTQGAVEEKEAVAEKAMEIHVMKALEIDGHHSFDKAVEFPDWLSINDDFLQQHSNHHCAEDYVQDPDLS
Subjt:  FPSGKDTQGAVEEKEAVAEKAMEIHVMKALEIDGHHSFDKAVEFPDWLSINDDFLQQHSNHHCAEDYVQDPDLS

A0A6J1FRD8 uncharacterized protein LOC1114462251.7e-6151.85Show/hide
Query:  MNSRDQI--------------PHGQQKKQVRRRRQSRRLYKERPLNMAEARREIATALKLHRASTRE-RQQNQKQH---------------PYFEPEGRM
        MNS DQ+              PHG++KKQVRRRRQSRRLYK+ PLNMAEARREI TALKLHRAST+E ++Q QKQ                P FEPE RM
Subjt:  MNSRDQI--------------PHGQQKKQVRRRRQSRRLYKERPLNMAEARREIATALKLHRASTRE-RQQNQKQH---------------PYFEPEGRM

Query:  KSRRNPRIYPGCSLYLDNRSDFSHVSSPSPKAISLPSCPVPPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVVSNNSHSFCSLSFLPPSSYLCPPVS
        KSRRNPRIYP CS Y +N SDF                 + P P++Q+L+++ PIQTLGLN NF D+++V  ++  +NN+HSF SLSFLPPSSY+CP   
Subjt:  KSRRNPRIYPGCSLYLDNRSDFSHVSSPSPKAISLPSCPVPPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVVSNNSHSFCSLSFLPPSSYLCPPVS

Query:  CAA-TRQEVVESVTLS-EGGKLIASSDVDCSNFPSG---KDTQGAVEEKEAVAEKAMEIHVM----KALEIDG--HHSF--------DKAVEFPDWLSIN
         AA T QEV +S++LS E G+L+AS     +NFP+G   K+  GAVEE+E   E+AM   +     K LEIDG  H +F        ++A+EFPDWLSIN
Subjt:  CAA-TRQEVVESVTLS-EGGKLIASSDVDCSNFPSG---KDTQGAVEEKEAVAEKAMEIHVM----KALEIDG--HHSF--------DKAVEFPDWLSIN

Query:  DDFLQQHSNHHCA-EDYVQDPDLS
        DDFLQ  SN+  + EDY+QDPDLS
Subjt:  DDFLQQHSNHHCA-EDYVQDPDLS

A0A6J1IXC1 uncharacterized protein LOC1114807862.0e-5953.62Show/hide
Query:  PHGQQKKQVRRRRQSRRLYKERPLNMAEARREIATALKLHRASTRE-RQQNQKQH---------------PYFEPEGRMKSRRNPRIYPGCSLYLDNRSD
        PHG++KKQVRRRR++RRLYK+ PLNMAEARREI TALKLHRAST+E ++Q QKQ                P FEPE RMKSRRNPRIYP CS Y  N SD
Subjt:  PHGQQKKQVRRRRQSRRLYKERPLNMAEARREIATALKLHRASTRE-RQQNQKQH---------------PYFEPEGRMKSRRNPRIYPGCSLYLDNRSD

Query:  FSHVSSPSPKAISLPSCPVPPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVV--SNNSHSFCSLSFLPPSSYLCPPVSCAA-TRQEVVESVTLS-EG
        F                 + P P++Q+L+++ PIQTLGLN NF      DT+SVV  +NN+HSF SLSFL PSSY+CP    AA T +EV +S++LS E 
Subjt:  FSHVSSPSPKAISLPSCPVPPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVV--SNNSHSFCSLSFLPPSSYLCPPVSCAA-TRQEVVESVTLS-EG

Query:  GKLIASSDVDCSNFPSG---KDTQGAVEEKEAVAEKAM-EIHVM--KALEIDG--HHSF--------DKAVEFPDWLSINDDFLQQHSNHHCA-EDYVQD
        G+L+AS     +NFP+G   K+  GAVEE+E   E  + EI  M  K LEIDG  H +F        ++A+EFPDWLSINDDFLQ  SN+  + EDY+QD
Subjt:  GKLIASSDVDCSNFPSG---KDTQGAVEEKEAVAEKAM-EIHVM--KALEIDG--HHSF--------DKAVEFPDWLSINDDFLQQHSNHHCA-EDYVQD

Query:  PDLS
        PDLS
Subjt:  PDLS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G21280.1 hydroxyproline-rich glycoprotein family protein4.1e-1231.4Show/hide
Query:  KKQVRRRRQSRRLYKERPLNMAEARREIATALKLHRASTRERQQNQKQHPYFEPEGRMKSRRNPRIYPGCSLYLDNRSDFSHVSSPSPKAISLPSCPVPP
        KKQVRRR  + R Y+ER LNMAEARREI TALK HRAS R+  +     P                                   P P+ ++L S P PP
Subjt:  KKQVRRRRQSRRLYKERPLNMAEARREIATALKLHRASTRERQQNQKQHPYFEPEGRMKSRRNPRIYPGCSLYLDNRSDFSHVSSPSPKAISLPSCPVPP

Query:  TP------ISQNLNIEFPIQTLGLNLNFLDSN----TVDTTSVVSNNSHSFCSLSFLPPSSYLC-----PPVSCAATRQEVVESVTLSEGGKLIASSDVD
         P       + +LN   P Q LGLNLNF D N    T  TTS  S++S S  S S  P + ++      PP    AT     +  + S G   + +S   
Subjt:  TP------ISQNLNIEFPIQTLGLNLNFLDSN----TVDTTSVVSNNSHSFCSLSFLPPSSYLC-----PPVSCAATRQEVVESVTLSEGGKLIASSDVD

Query:  CSNFPSGKDTQGAVEEKEAVAEKAMEIHVMKALEIDGHHSFDKAVEFPDWLSINDDFL
                     +  K    E   E   +  +E D    F   +EFP WL+  ++ L
Subjt:  CSNFPSGKDTQGAVEEKEAVAEKAMEIHVMKALEIDGHHSFDKAVEFPDWLSINDDFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTCTAGAGACCAAATCCCTCATGGACAACAGAAAAAACAGGTCAGAAGGAGACGCCAAAGCCGGCGGCTTTACAAGGAAAGGCCTCTCAATATGGCCGAGGCTAG
GAGAGAGATTGCAACTGCACTCAAGCTCCACAGAGCATCAACCAGAGAACGCCAACAGAATCAGAAACAACACCCATATTTTGAACCTGAAGGAAGAATGAAATCCAGGA
GAAACCCCAGGATATACCCAGGTTGCTCACTTTATTTGGATAACCGATCTGATTTTTCTCATGTATCTTCTCCTTCTCCCAAGGCTATCTCCTTGCCTTCCTGCCCAGTT
CCTCCTACACCTATTTCTCAGAATCTCAATATAGAATTCCCTATACAAACCTTAGGACTCAATCTCAATTTTCTTGATTCTAACACTGTGGATACTACTTCTGTTGTCTC
CAACAACAGCCATTCATTTTGTTCACTTTCCTTCTTACCCCCATCTTCATATCTTTGCCCCCCTGTTTCTTGTGCTGCTACTCGTCAGGAAGTTGTGGAATCAGTTACGT
TATCTGAGGGAGGGAAGTTAATAGCTTCTTCTGATGTGGATTGCTCCAATTTCCCAAGTGGAAAAGACACGCAGGGGGCGGTGGAAGAGAAAGAGGCTGTGGCTGAGAAG
GCCATGGAAATACACGTTATGAAGGCTTTGGAGATTGATGGTCACCATAGTTTTGATAAAGCCGTGGAATTTCCAGATTGGTTGAGTATCAATGACGACTTTCTGCAGCA
GCATTCCAATCATCACTGCGCAGAGGATTACGTCCAAGATCCTGACCTTTCT
mRNA sequenceShow/hide mRNA sequence
ATGAACTCTAGAGACCAAATCCCTCATGGACAACAGAAAAAACAGGTCAGAAGGAGACGCCAAAGCCGGCGGCTTTACAAGGAAAGGCCTCTCAATATGGCCGAGGCTAG
GAGAGAGATTGCAACTGCACTCAAGCTCCACAGAGCATCAACCAGAGAACGCCAACAGAATCAGAAACAACACCCATATTTTGAACCTGAAGGAAGAATGAAATCCAGGA
GAAACCCCAGGATATACCCAGGTTGCTCACTTTATTTGGATAACCGATCTGATTTTTCTCATGTATCTTCTCCTTCTCCCAAGGCTATCTCCTTGCCTTCCTGCCCAGTT
CCTCCTACACCTATTTCTCAGAATCTCAATATAGAATTCCCTATACAAACCTTAGGACTCAATCTCAATTTTCTTGATTCTAACACTGTGGATACTACTTCTGTTGTCTC
CAACAACAGCCATTCATTTTGTTCACTTTCCTTCTTACCCCCATCTTCATATCTTTGCCCCCCTGTTTCTTGTGCTGCTACTCGTCAGGAAGTTGTGGAATCAGTTACGT
TATCTGAGGGAGGGAAGTTAATAGCTTCTTCTGATGTGGATTGCTCCAATTTCCCAAGTGGAAAAGACACGCAGGGGGCGGTGGAAGAGAAAGAGGCTGTGGCTGAGAAG
GCCATGGAAATACACGTTATGAAGGCTTTGGAGATTGATGGTCACCATAGTTTTGATAAAGCCGTGGAATTTCCAGATTGGTTGAGTATCAATGACGACTTTCTGCAGCA
GCATTCCAATCATCACTGCGCAGAGGATTACGTCCAAGATCCTGACCTTTCT
Protein sequenceShow/hide protein sequence
MNSRDQIPHGQQKKQVRRRRQSRRLYKERPLNMAEARREIATALKLHRASTRERQQNQKQHPYFEPEGRMKSRRNPRIYPGCSLYLDNRSDFSHVSSPSPKAISLPSCPV
PPTPISQNLNIEFPIQTLGLNLNFLDSNTVDTTSVVSNNSHSFCSLSFLPPSSYLCPPVSCAATRQEVVESVTLSEGGKLIASSDVDCSNFPSGKDTQGAVEEKEAVAEK
AMEIHVMKALEIDGHHSFDKAVEFPDWLSINDDFLQQHSNHHCAEDYVQDPDLS