CuGenDBv2

MC03g1341 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC03g1341
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: DUF4408 domain protein
Genome location: MC03:19034965..19035834
RNA-Seq Expression: MC03g1341
Synteny: MC03g1341
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR008480 - Protein of unknown function DUF761, plant
                  IPR025520 - Domain of unknown function DUF4408
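
The genome location field can be split into its parts programmatically. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'CHROM:START..END' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(location: str) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


if __name__ == "__main__":
    loc = "MC03:19034965..19035834"
    print(parse_location(loc), span_length(loc))
```

Under that assumption the span is 870 bp.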


Homology
GenBank top hits (e value, % identity, alignment)
KAG6584433.1 hypothetical protein SDJN03_20365, partial [Cucurbita argyrosperma subsp. sororia] (e value: 4.06e-118; % identity: 67.85)
Query:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
        MAD SPKPV   +TLETAISVTKSIL+ VG++ST+ILFKVA+IPK+ +L+ITT PRLWVSF  WLSPPY+F VFNFII AVAASS FRRQKD  + NY  
Subjt:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP

Query:  ISHP------STPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKE--EEGKPVKWDFST---GKVDDPLAEKLCAYSGEKVEEEED------DGES
        ISH       STPYK  HED+NFSIT  S EIW+  I  ED+Q +  +E  EE KPVKWDF T    K  DP + K CAY GEKVEEEED      D +S
Subjt:  ISHP------STPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKE--EEGKPVKWDFST---GKVDDPLAEKLCAYSGEKVEEEED------DGES

Query:  LDATWKAIMERQGKQTAQLKKSQTWDSPP--RLIRAVADGG--EEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQES
        +DATWKAIMER+GKQT QLKKSQTWDSPP  RLIRA  +    EED  AWARNEV+KSE F QTLSFRR++S++SEELKSRAEAFIEMVN++IRLQRQES
Subjt:  LDATWKAIMERQGKQTAQLKKSQTWDSPP--RLIRAVADGG--EEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQES

Query:  EQRFLQALKRS
        EQRFLQA++RS
Subjt:  EQRFLQALKRS

XP_016898842.1 PREDICTED: uncharacterized protein LOC103503737 [Cucumis melo] (e value: 4.34e-119; % identity: 67.97)
Query:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
        M D S KPVF+GLTLETAIS+TKS+LL +G +STVILFKVAIIPK  SL I T PRL+VSF  WLSPPYVF VFNF++VA  ASS FRRQKD  +  +TP
Subjt:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP

Query:  ISH------PSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFST---GKVDDPLAEKLCAYSGEKVEEEEDDGE-----SLDA
        ISH      PS PYK  HED++FSIT RS EIWN GI  ED+Q++  +EEE K +KWDF T    K  DP +EK CAYS EKVEEE+D G+     S++A
Subjt:  ISH------PSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFST---GKVDDPLAEKLCAYSGEKVEEEEDDGE-----SLDA

Query:  TWKAIMERQGKQTAQLKKSQTWDSP--PRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFL
        TWKAIMERQ KQT QLKKSQTWDSP   RLIRA     EE+  AW RNEV+K E F+QTLSFRR+ISM+SEELKSRAEAFIEMVNR+IRLQRQESEQRFL
Subjt:  TWKAIMERQGKQTAQLKKSQTWDSP--PRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFL

Query:  QALKRS
        QA+KRS
Subjt:  QALKRS

XP_022137281.1 uncharacterized protein LOC111008778 [Momordica charantia] (e value: 2.47e-199; % identity: 100)
Query:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
        MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
Subjt:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP

Query:  ISHPSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFSTGKVDDPLAEKLCAYSGEKVEEEEDDGESLDATWKAIMERQGKQTA
        ISHPSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFSTGKVDDPLAEKLCAYSGEKVEEEEDDGESLDATWKAIMERQGKQTA
Subjt:  ISHPSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFSTGKVDDPLAEKLCAYSGEKVEEEEDDGESLDATWKAIMERQGKQTA

Query:  QLKKSQTWDSPPRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFLQALKRS
        QLKKSQTWDSPPRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFLQALKRS
Subjt:  QLKKSQTWDSPPRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFLQALKRS

XP_022923785.1 uncharacterized protein LOC111431395 [Cucurbita moschata] (e value: 1.23e-119; % identity: 68.49)
Query:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
        MAD SPKPV   +TLETAISVTKSIL+ VG++ST+ILFKVA+IPK+ +L+ITT PRLWVSF  WLSPPY+F VFNFII AVAASS FRRQKD  + NY  
Subjt:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP

Query:  ISHP------STPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKE--EEGKPVKWDFST---GKVDDPLAEKLCAYSGEKVEEEED------DGES
        ISH       STPYK  HED+NFSIT  S EIW+  I  ED+Q +  +E  EE KPVKWDF T    K  DP + KLCAY GEKVEEEED      D +S
Subjt:  ISHP------STPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKE--EEGKPVKWDFST---GKVDDPLAEKLCAYSGEKVEEEED------DGES

Query:  LDATWKAIMERQGKQTAQLKKSQTWDSPP--RLIRAVADGG--EEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQES
        +DATWKAIMER+GKQT QLKKSQTWDSPP  RLIRA  D    EED  AWARNEV+KSE F QTLSFRR++S++SEELKSRAEAFIEMVN++IRLQRQES
Subjt:  LDATWKAIMERQGKQTAQLKKSQTWDSPP--RLIRAVADGG--EEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQES

Query:  EQRFLQALKRS
        EQRFLQA++RS
Subjt:  EQRFLQALKRS

XP_023520184.1 uncharacterized protein LOC111783484 [Cucurbita pepo subsp. pepo] (e value: 5.19e-118; % identity: 68.81)
Query:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
        MAD SPKPV   +TLETAISVTKSIL+ VG++ST+ILFKVA+IPK+ SL+ITT P LWVSF  WLSPPY+F VFNFII AVAASS FRRQKD  + NY  
Subjt:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP

Query:  ISHP------STPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKE-EEGKPVKWDFST---GKVDDPLAEKLCAYSGEKVEEEED-------DGES
        ISH       STPYK  HED+NFSIT  S EIW+  I  ED+Q   GKE EE KPVKWDF T    K  DP + KLCAY GEKVEEEED       D +S
Subjt:  ISHP------STPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKE-EEGKPVKWDFST---GKVDDPLAEKLCAYSGEKVEEEED-------DGES

Query:  LDATWKAIMERQGKQTAQLKKSQTWDSPP--RLIRAVADGG--EEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQES
        +DATWKAIMER+GKQT QLKKSQTWDSPP  RLIRA  D    EED  AWAR+EV+KSE F QTLSFRR++S++SEELKSRAEAFIEMVN++IRLQRQES
Subjt:  LDATWKAIMERQGKQTAQLKKSQTWDSPP--RLIRAVADGG--EEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQES

Query:  EQRFLQALKRS
        EQRFLQA++RS
Subjt:  EQRFLQALKRS

TrEMBL top hits (e value, % identity, alignment)
A0A1S4DS64 uncharacterized protein LOC103503737 (e value: 2.10e-119; % identity: 67.97)
Query:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
        M D S KPVF+GLTLETAIS+TKS+LL +G +STVILFKVAIIPK  SL I T PRL+VSF  WLSPPYVF VFNF++VA  ASS FRRQKD  +  +TP
Subjt:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP

Query:  ISH------PSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFST---GKVDDPLAEKLCAYSGEKVEEEEDDGE-----SLDA
        ISH      PS PYK  HED++FSIT RS EIWN GI  ED+Q++  +EEE K +KWDF T    K  DP +EK CAYS EKVEEE+D G+     S++A
Subjt:  ISH------PSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFST---GKVDDPLAEKLCAYSGEKVEEEEDDGE-----SLDA

Query:  TWKAIMERQGKQTAQLKKSQTWDSP--PRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFL
        TWKAIMERQ KQT QLKKSQTWDSP   RLIRA     EE+  AW RNEV+K E F+QTLSFRR+ISM+SEELKSRAEAFIEMVNR+IRLQRQESEQRFL
Subjt:  TWKAIMERQGKQTAQLKKSQTWDSP--PRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFL

Query:  QALKRS
        QA+KRS
Subjt:  QALKRS

A0A5A7T393 DUF4408 domain protein (e value: 2.10e-119; % identity: 67.97)
Query:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
        M D S KPVF+GLTLETAIS+TKS+LL +G +STVILFKVAIIPK  SL I T PRL+VSF  WLSPPYVF VFNF++VA  ASS FRRQKD  +  +TP
Subjt:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP

Query:  ISH------PSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFST---GKVDDPLAEKLCAYSGEKVEEEEDDGE-----SLDA
        ISH      PS PYK  HED++FSIT RS EIWN GI  ED+Q++  +EEE K +KWDF T    K  DP +EK CAYS EKVEEE+D G+     S++A
Subjt:  ISH------PSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFST---GKVDDPLAEKLCAYSGEKVEEEEDDGE-----SLDA

Query:  TWKAIMERQGKQTAQLKKSQTWDSP--PRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFL
        TWKAIMERQ KQT QLKKSQTWDSP   RLIRA     EE+  AW RNEV+K E F+QTLSFRR+ISM+SEELKSRAEAFIEMVNR+IRLQRQESEQRFL
Subjt:  TWKAIMERQGKQTAQLKKSQTWDSP--PRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFL

Query:  QALKRS
        QA+KRS
Subjt:  QALKRS

A0A6J1C651 uncharacterized protein LOC111008778 (e value: 1.20e-199; % identity: 100)
Query:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
        MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
Subjt:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP

Query:  ISHPSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFSTGKVDDPLAEKLCAYSGEKVEEEEDDGESLDATWKAIMERQGKQTA
        ISHPSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFSTGKVDDPLAEKLCAYSGEKVEEEEDDGESLDATWKAIMERQGKQTA
Subjt:  ISHPSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFSTGKVDDPLAEKLCAYSGEKVEEEEDDGESLDATWKAIMERQGKQTA

Query:  QLKKSQTWDSPPRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFLQALKRS
        QLKKSQTWDSPPRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFLQALKRS
Subjt:  QLKKSQTWDSPPRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFLQALKRS

A0A6J1ECW6 uncharacterized protein LOC111431395 (e value: 5.95e-120; % identity: 68.49)
Query:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
        MAD SPKPV   +TLETAISVTKSIL+ VG++ST+ILFKVA+IPK+ +L+ITT PRLWVSF  WLSPPY+F VFNFII AVAASS FRRQKD  + NY  
Subjt:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP

Query:  ISHP------STPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKE--EEGKPVKWDFST---GKVDDPLAEKLCAYSGEKVEEEED------DGES
        ISH       STPYK  HED+NFSIT  S EIW+  I  ED+Q +  +E  EE KPVKWDF T    K  DP + KLCAY GEKVEEEED      D +S
Subjt:  ISHP------STPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKE--EEGKPVKWDFST---GKVDDPLAEKLCAYSGEKVEEEED------DGES

Query:  LDATWKAIMERQGKQTAQLKKSQTWDSPP--RLIRAVADGG--EEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQES
        +DATWKAIMER+GKQT QLKKSQTWDSPP  RLIRA  D    EED  AWARNEV+KSE F QTLSFRR++S++SEELKSRAEAFIEMVN++IRLQRQES
Subjt:  LDATWKAIMERQGKQTAQLKKSQTWDSPP--RLIRAVADGG--EEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQES

Query:  EQRFLQALKRS
        EQRFLQA++RS
Subjt:  EQRFLQALKRS

A0A6J1KH91 uncharacterized protein LOC111495736 (e value: 3.01e-116; % identity: 67.31)
Query:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
        MA  SPKPV   +TLETAISVTKSIL+ VG++ST+ILFKVA+IPK+ +L+I+T PRLWVSF  WLSPPY+F VFNFII AVAASS FRRQKD  +  Y  
Subjt:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP

Query:  ISHP------STPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKE--EEGKPVKWDFST---GKVDDPLAEKLCAYSGEKVEEEEDDG----ESLD
        ISH       STPYK  H+D+NFSIT  S EIW+  I  ED+Q ++ +E  EE KPVKWDF T    K  DP + KLCAY GEKVEEEED G    +S+D
Subjt:  ISHP------STPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKE--EEGKPVKWDFST---GKVDDPLAEKLCAYSGEKVEEEEDDG----ESLD

Query:  ATWKAIMERQGKQTAQLKKSQTWDSPP--RLIRAVADGG--EEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQ
        ATW AIMER+GKQT QLKKSQTWDSPP  RLIRA  D    EED  AWARNEV+KSE F QTLSFRR++S++SEELKSRAEAFIEMVN++IRLQRQESEQ
Subjt:  ATWKAIMERQGKQTAQLKKSQTWDSPP--RLIRAVADGG--EEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQ

Query:  RFLQALKRS
        RFLQA++RS
Subjt:  RFLQALKRS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G61260.1 Protein of unknown function (DUF761) (e value: 6.1e-06; % identity: 24.86)
Query:  TKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRR---QKDPPD-------GNY-----TPI--SHP
        TK++L+  G+ +  +L K++ +P  V   ++  P LW S   WL PPY++ V N II+ + ASS + R    +D  D       G Y      PI   H 
Subjt:  TKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRR---QKDPPD-------GNY-----TPI--SHP

Query:  STP----YKSMHEDQNFSITARSREIWN------NGILHEDDQNER-----------GKEEEGKPVKWDFSTGKVDD---------------PLAEKLCA
        ++P     K +    +F     + E           ++ +D++ E+             EEE K V    ++  V+                P+ + L  
Subjt:  STP----YKSMHEDQNFSITARSREIWN------NGILHEDDQNER-----------GKEEEGKPVKWDFSTGKVDD---------------PLAEKLCA

Query:  --YSGEKVEEEEDDG------------ESLDATWKAIME-RQGKQTAQL-KKSQTWDSPPRLIRAVADGGEEDAAAWARNEVKKSEMFK---------QT
          +   K+ +   +G            E+L+ TWK I E +    T QL ++S T+           D G  D     +   KKS+ F+         +T
Subjt:  --YSGEKVEEEEDDG------------ESLDATWKAIME-RQGKQTAQL-KKSQTWDSPPRLIRAVADGGEEDAAAWARNEVKKSEMFK---------QT

Query:  LSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFLQALKR
           R+  S+S EEL  R EAFI+  N  ++LQR ES +++ +   R
Subjt:  LSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFLQALKR


Sequences
CDS sequence
ATGGCCGATCTGTCGCCAAAACCGGTGTTTAATGGTTTGACGTTAGAGACGGCAATCTCAGTTACAAAATCGATACTGCTGTTTGTTGGGCTTGTATCCACTGTAATTTT
GTTCAAAGTGGCTATAATTCCAAAATTAGTGAGTCTTCTCATCACGACTCCTCCTCGCTTATGGGTCTCCTTCGGATTCTGGCTCTCTCCACCCTACGTCTTCTTCGTCT
TCAATTTCATCATCGTCGCCGTCGCTGCCTCCTCCGCCTTCCGCCGCCAAAAAGACCCACCCGACGGTAATTACACCCCCATTTCTCATCCATCCACTCCCTATAAATCG
ATGCACGAGGACCAAAATTTTTCGATCACTGCTCGCTCCAGAGAAATCTGGAATAATGGGATCTTGCACGAGGACGATCAGAACGAACGTGGGAAAGAAGAAGAAGGGAA
GCCGGTGAAATGGGATTTTTCGACCGGAAAAGTCGATGACCCGTTGGCGGAGAAGTTGTGCGCATATTCCGGCGAGAAGGTGGAGGAGGAGGAGGATGACGGCGAGTCGC
TAGACGCGACGTGGAAGGCCATAATGGAAAGGCAAGGGAAGCAGACGGCGCAGCTGAAGAAGAGCCAGACATGGGACTCCCCACCACGTTTAATTAGAGCCGTGGCGGAC
GGCGGCGAGGAGGACGCCGCGGCGTGGGCCCGGAATGAGGTGAAAAAGTCGGAGATGTTCAAGCAAACACTGTCGTTCCGGAGAAGGATATCGATGAGTTCGGAGGAATT
GAAGAGCCGGGCAGAGGCGTTCATAGAGATGGTAAACAGGAACATTCGCTTACAGAGGCAAGAATCAGAGCAACGCTTCCTGCAGGCACTTAAACGCAGC
mRNA sequence
ATGGCCGATCTGTCGCCAAAACCGGTGTTTAATGGTTTGACGTTAGAGACGGCAATCTCAGTTACAAAATCGATACTGCTGTTTGTTGGGCTTGTATCCACTGTAATTTT
GTTCAAAGTGGCTATAATTCCAAAATTAGTGAGTCTTCTCATCACGACTCCTCCTCGCTTATGGGTCTCCTTCGGATTCTGGCTCTCTCCACCCTACGTCTTCTTCGTCT
TCAATTTCATCATCGTCGCCGTCGCTGCCTCCTCCGCCTTCCGCCGCCAAAAAGACCCACCCGACGGTAATTACACCCCCATTTCTCATCCATCCACTCCCTATAAATCG
ATGCACGAGGACCAAAATTTTTCGATCACTGCTCGCTCCAGAGAAATCTGGAATAATGGGATCTTGCACGAGGACGATCAGAACGAACGTGGGAAAGAAGAAGAAGGGAA
GCCGGTGAAATGGGATTTTTCGACCGGAAAAGTCGATGACCCGTTGGCGGAGAAGTTGTGCGCATATTCCGGCGAGAAGGTGGAGGAGGAGGAGGATGACGGCGAGTCGC
TAGACGCGACGTGGAAGGCCATAATGGAAAGGCAAGGGAAGCAGACGGCGCAGCTGAAGAAGAGCCAGACATGGGACTCCCCACCACGTTTAATTAGAGCCGTGGCGGAC
GGCGGCGAGGAGGACGCCGCGGCGTGGGCCCGGAATGAGGTGAAAAAGTCGGAGATGTTCAAGCAAACACTGTCGTTCCGGAGAAGGATATCGATGAGTTCGGAGGAATT
GAAGAGCCGGGCAGAGGCGTTCATAGAGATGGTAAACAGGAACATTCGCTTACAGAGGCAAGAATCAGAGCAACGCTTCCTGCAGGCACTTAAACGCAGC
Protein sequence
MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTPISHPSTPYKS
MHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFSTGKVDDPLAEKLCAYSGEKVEEEEDDGESLDATWKAIMERQGKQTAQLKKSQTWDSPPRLIRAVAD
GGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFLQALKRS
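
The relationship between the CDS and protein sequences above can be checked with a small translation routine. This is a minimal sketch, not the database's own pipeline: the function name `translate` is hypothetical, and it assumes the standard nuclear genetic code and an uppercase DNA sequence read in frame from its first base.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption:
# the record does not say which translation table was used).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; 'X' marks unknown codons."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped input
    usable = len(cds) - len(cds) % 3    # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


if __name__ == "__main__":
    # First four codons of the CDS listed above.
    print(translate("ATGGCCGATCTG"))
```

Pasting the full CDS (line breaks included) into `translate` and comparing the result with the listed protein sequence is one way to confirm the two fields agree.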