CuGenDBv2

Moc03g33560 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g33560
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: DUF4408 domain protein
Genome location: chr3:23789986..23790861
RNA-Seq Expression: Moc03g33560
Synteny: Moc03g33560
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR008480 - Protein of unknown function DUF761, plant
                  IPR025520 - Domain of unknown function DUF4408


Homology
GenBank top hits (e value, % identity, alignment)
KAG6584433.1 hypothetical protein SDJN03_20365, partial [Cucurbita argyrosperma subsp. sororia]; e value 3.6e-93; identity 67.85%
Query:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
        MAD SPKPV   +TLETAISVTKSIL+ VG++ST+ILFKVA+IPK+ +L+ITT PRLWVSF  WLSPPY+F VFNFII AVAASS FRRQKD  + NY  
Subjt:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP

Query:  ISH------PSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKE--EEGKPVKWDFS---TGKVDDPLAEKLCAYSGEKVEEEED------DGES
        ISH       STPYK  HED+NFSIT  S EIW+  I  ED+Q +  +E  EE KPVKWDF    T K  DP + K CAY GEKVEEEED      D +S
Subjt:  ISH------PSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKE--EEGKPVKWDFS---TGKVDDPLAEKLCAYSGEKVEEEED------DGES

Query:  LDATWKAIMERQGKQTAQLKKSQTWDSPP--RLIRAVADGG--EEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQES
        +DATWKAIMER+GKQT QLKKSQTWDSPP  RLIRA  +    EED  AWARNEV+KSE F QTLSFRR++S++SEELKSRAEAFIEMVN++IRLQRQES
Subjt:  LDATWKAIMERQGKQTAQLKKSQTWDSPP--RLIRAVADGG--EEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQES

Query:  EQRFLQALKRS
        EQRFLQA++RS
Subjt:  EQRFLQALKRS

XP_016898842.1 PREDICTED: uncharacterized protein LOC103503737 [Cucumis melo]; e value 7.3e-94; identity 67.97%
Query:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
        M D S KPVF+GLTLETAIS+TKS+LL +G +STVILFKVAIIPK  SL I T PRL+VSF  WLSPPYVF VFNF++VA  ASS FRRQKD  +  +TP
Subjt:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP

Query:  ISH------PSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFS---TGKVDDPLAEKLCAYSGEKVEEEED-----DGESLDA
        ISH      PS PYK  HED++FSIT RS EIW NGI  ED+Q++  +EEE K +KWDF    T K  DP +EK CAYS EKVEEE+D     D +S++A
Subjt:  ISH------PSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFS---TGKVDDPLAEKLCAYSGEKVEEEED-----DGESLDA

Query:  TWKAIMERQGKQTAQLKKSQTWDS--PPRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFL
        TWKAIMERQ KQT QLKKSQTWDS  P RLIRA     EE+  AW RNEV+K E F+QTLSFRR+ISM+SEELKSRAEAFIEMVNR+IRLQRQESEQRFL
Subjt:  TWKAIMERQGKQTAQLKKSQTWDS--PPRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFL

Query:  QALKRS
        QA+KRS
Subjt:  QALKRS

XP_022137281.1 uncharacterized protein LOC111008778 [Momordica charantia]; e value 2.6e-155; identity 100%
Query:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
        MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
Subjt:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP

Query:  ISHPSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFSTGKVDDPLAEKLCAYSGEKVEEEEDDGESLDATWKAIMERQGKQTA
        ISHPSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFSTGKVDDPLAEKLCAYSGEKVEEEEDDGESLDATWKAIMERQGKQTA
Subjt:  ISHPSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFSTGKVDDPLAEKLCAYSGEKVEEEEDDGESLDATWKAIMERQGKQTA

Query:  QLKKSQTWDSPPRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFLQALKRSV
        QLKKSQTWDSPPRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFLQALKRSV
Subjt:  QLKKSQTWDSPPRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFLQALKRSV

XP_022923785.1 uncharacterized protein LOC111431395 [Cucurbita moschata]; e value 2.5e-94; identity 68.49%
Query:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
        MAD SPKPV   +TLETAISVTKSIL+ VG++ST+ILFKVA+IPK+ +L+ITT PRLWVSF  WLSPPY+F VFNFII AVAASS FRRQKD  + NY  
Subjt:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP

Query:  ISH------PSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKE--EEGKPVKWDFS---TGKVDDPLAEKLCAYSGEKVEEEED------DGES
        ISH       STPYK  HED+NFSIT  S EIW+  I  ED+Q +  +E  EE KPVKWDF    T K  DP + KLCAY GEKVEEEED      D +S
Subjt:  ISH------PSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKE--EEGKPVKWDFS---TGKVDDPLAEKLCAYSGEKVEEEED------DGES

Query:  LDATWKAIMERQGKQTAQLKKSQTWDSPP--RLIRAVADGG--EEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQES
        +DATWKAIMER+GKQT QLKKSQTWDSPP  RLIRA  D    EED  AWARNEV+KSE F QTLSFRR++S++SEELKSRAEAFIEMVN++IRLQRQES
Subjt:  LDATWKAIMERQGKQTAQLKKSQTWDSPP--RLIRAVADGG--EEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQES

Query:  EQRFLQALKRS
        EQRFLQA++RS
Subjt:  EQRFLQALKRS

XP_023520184.1 uncharacterized protein LOC111783484 [Cucurbita pepo subsp. pepo]; e value 3.6e-93; identity 68.81%
Query:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
        MAD SPKPV   +TLETAISVTKSIL+ VG++ST+ILFKVA+IPK+ SL+ITT P LWVSF  WLSPPY+F VFNFII AVAASS FRRQKD  + NY  
Subjt:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP

Query:  ISH------PSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKE-EEGKPVKWDFS---TGKVDDPLAEKLCAYSGEKVEEEED-------DGES
        ISH       STPYK  HED+NFSIT  S EIW+  I  ED+Q   GKE EE KPVKWDF    T K  DP + KLCAY GEKVEEEED       D +S
Subjt:  ISH------PSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKE-EEGKPVKWDFS---TGKVDDPLAEKLCAYSGEKVEEEED-------DGES

Query:  LDATWKAIMERQGKQTAQLKKSQTWDSPP--RLIRAVADGG--EEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQES
        +DATWKAIMER+GKQT QLKKSQTWDSPP  RLIRA  D    EED  AWAR+EV+KSE F QTLSFRR++S++SEELKSRAEAFIEMVN++IRLQRQES
Subjt:  LDATWKAIMERQGKQTAQLKKSQTWDSPP--RLIRAVADGG--EEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQES

Query:  EQRFLQALKRS
        EQRFLQA++RS
Subjt:  EQRFLQALKRS

TrEMBL top hits (e value, % identity, alignment)
A0A1S4DS64 uncharacterized protein LOC103503737; e value 3.5e-94; identity 67.97%
Query:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
        M D S KPVF+GLTLETAIS+TKS+LL +G +STVILFKVAIIPK  SL I T PRL+VSF  WLSPPYVF VFNF++VA  ASS FRRQKD  +  +TP
Subjt:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP

Query:  ISH------PSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFS---TGKVDDPLAEKLCAYSGEKVEEEED-----DGESLDA
        ISH      PS PYK  HED++FSIT RS EIW NGI  ED+Q++  +EEE K +KWDF    T K  DP +EK CAYS EKVEEE+D     D +S++A
Subjt:  ISH------PSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFS---TGKVDDPLAEKLCAYSGEKVEEEED-----DGESLDA

Query:  TWKAIMERQGKQTAQLKKSQTWDS--PPRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFL
        TWKAIMERQ KQT QLKKSQTWDS  P RLIRA     EE+  AW RNEV+K E F+QTLSFRR+ISM+SEELKSRAEAFIEMVNR+IRLQRQESEQRFL
Subjt:  TWKAIMERQGKQTAQLKKSQTWDS--PPRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFL

Query:  QALKRS
        QA+KRS
Subjt:  QALKRS

A0A5A7T393 DUF4408 domain protein; e value 3.5e-94; identity 67.97%
Query:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
        M D S KPVF+GLTLETAIS+TKS+LL +G +STVILFKVAIIPK  SL I T PRL+VSF  WLSPPYVF VFNF++VA  ASS FRRQKD  +  +TP
Subjt:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP

Query:  ISH------PSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFS---TGKVDDPLAEKLCAYSGEKVEEEED-----DGESLDA
        ISH      PS PYK  HED++FSIT RS EIW NGI  ED+Q++  +EEE K +KWDF    T K  DP +EK CAYS EKVEEE+D     D +S++A
Subjt:  ISH------PSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFS---TGKVDDPLAEKLCAYSGEKVEEEED-----DGESLDA

Query:  TWKAIMERQGKQTAQLKKSQTWDS--PPRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFL
        TWKAIMERQ KQT QLKKSQTWDS  P RLIRA     EE+  AW RNEV+K E F+QTLSFRR+ISM+SEELKSRAEAFIEMVNR+IRLQRQESEQRFL
Subjt:  TWKAIMERQGKQTAQLKKSQTWDS--PPRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFL

Query:  QALKRS
        QA+KRS
Subjt:  QALKRS

A0A6J1C651 uncharacterized protein LOC111008778; e value 1.2e-155; identity 100%
Query:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
        MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
Subjt:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP

Query:  ISHPSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFSTGKVDDPLAEKLCAYSGEKVEEEEDDGESLDATWKAIMERQGKQTA
        ISHPSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFSTGKVDDPLAEKLCAYSGEKVEEEEDDGESLDATWKAIMERQGKQTA
Subjt:  ISHPSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFSTGKVDDPLAEKLCAYSGEKVEEEEDDGESLDATWKAIMERQGKQTA

Query:  QLKKSQTWDSPPRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFLQALKRSV
        QLKKSQTWDSPPRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFLQALKRSV
Subjt:  QLKKSQTWDSPPRLIRAVADGGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFLQALKRSV

A0A6J1ECW6 uncharacterized protein LOC111431395; e value 1.2e-94; identity 68.49%
Query:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
        MAD SPKPV   +TLETAISVTKSIL+ VG++ST+ILFKVA+IPK+ +L+ITT PRLWVSF  WLSPPY+F VFNFII AVAASS FRRQKD  + NY  
Subjt:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP

Query:  ISH------PSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKE--EEGKPVKWDFS---TGKVDDPLAEKLCAYSGEKVEEEED------DGES
        ISH       STPYK  HED+NFSIT  S EIW+  I  ED+Q +  +E  EE KPVKWDF    T K  DP + KLCAY GEKVEEEED      D +S
Subjt:  ISH------PSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKE--EEGKPVKWDFS---TGKVDDPLAEKLCAYSGEKVEEEED------DGES

Query:  LDATWKAIMERQGKQTAQLKKSQTWDSPP--RLIRAVADGG--EEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQES
        +DATWKAIMER+GKQT QLKKSQTWDSPP  RLIRA  D    EED  AWARNEV+KSE F QTLSFRR++S++SEELKSRAEAFIEMVN++IRLQRQES
Subjt:  LDATWKAIMERQGKQTAQLKKSQTWDSPP--RLIRAVADGG--EEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQES

Query:  EQRFLQALKRS
        EQRFLQA++RS
Subjt:  EQRFLQALKRS

A0A6J1KH91 uncharacterized protein LOC111495736; e value 7.4e-92; identity 67.31%
Query:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP
        MA  SPKPV   +TLETAISVTKSIL+ VG++ST+ILFKVA+IPK+ +L+I+T PRLWVSF  WLSPPY+F VFNFII AVAASS FRRQKD  +  Y  
Subjt:  MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTP

Query:  ISH------PSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKE--EEGKPVKWDFS---TGKVDDPLAEKLCAYSGEKVEEEEDDG----ESLD
        ISH       STPYK  H+D+NFSIT  S EIW+  I  ED+Q ++ +E  EE KPVKWDF    T K  DP + KLCAY GEKVEEEED G    +S+D
Subjt:  ISH------PSTPYKSMHEDQNFSITARSREIWNNGILHEDDQNERGKE--EEGKPVKWDFS---TGKVDDPLAEKLCAYSGEKVEEEEDDG----ESLD

Query:  ATWKAIMERQGKQTAQLKKSQTWDSPP--RLIRAVADGG--EEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQ
        ATW AIMER+GKQT QLKKSQTWDSPP  RLIRA  D    EED  AWARNEV+KSE F QTLSFRR++S++SEELKSRAEAFIEMVN++IRLQRQESEQ
Subjt:  ATWKAIMERQGKQTAQLKKSQTWDSPP--RLIRAVADGG--EEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQ

Query:  RFLQALKRS
        RFLQA++RS
Subjt:  RFLQALKRS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G61260.1 Protein of unknown function (DUF761); e value 1.6e-06; identity 25%
Query:  TKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRR---QKDPPD-------GNY-----TPI--SHP
        TK++L+  G+ +  +L K++ +P  V   ++  P LW S   WL PPY++ V N II+ + ASS + R    +D  D       G Y      PI   H 
Subjt:  TKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRR---QKDPPD-------GNY-----TPI--SHP

Query:  STP----YKSMHEDQNFSITARSREIWN------NGILHEDDQNER-----------GKEEEGKPVKWDFSTGKVDD---------------PLAEKLCA
        ++P     K +    +F     + E           ++ +D++ E+             EEE K V    ++  V+                P+ + L  
Subjt:  STP----YKSMHEDQNFSITARSREIWN------NGILHEDDQNER-----------GKEEEGKPVKWDFSTGKVDD---------------PLAEKLCA

Query:  --YSGEKVEEEEDDG------------ESLDATWKAIME-RQGKQTAQL-KKSQTWDSPPRLIRAVADGGEEDAAAWARNEVKKSEMFK---------QT
          +   K+ +   +G            E+L+ TWK I E +    T QL ++S T+           D G  D     +   KKS+ F+         +T
Subjt:  --YSGEKVEEEEDDG------------ESLDATWKAIME-RQGKQTAQL-KKSQTWDSPPRLIRAVADGGEEDAAAWARNEVKKSEMFK---------QT

Query:  LSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFLQALKRSV
           R+  S+S EEL  R EAFI+  N  ++LQR ES +++ +   R V
Subjt:  LSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFLQALKRSV


Sequences
CDS sequence
ATGGCCGATCTGTCGCCAAAACCGGTGTTTAATGGTTTGACGTTAGAGACGGCAATCTCAGTTACAAAATCGATACTGCTGTTTGTTGGGCTTGTATCCACTGTAATTTT
GTTCAAAGTGGCTATAATTCCAAAATTAGTGAGTCTTCTCATCACGACTCCTCCTCGCTTATGGGTCTCCTTCGGATTCTGGCTCTCTCCACCCTACGTCTTCTTCGTCT
TCAATTTCATCATCGTCGCCGTCGCTGCCTCCTCCGCCTTCCGCCGCCAAAAAGACCCACCCGACGGTAATTACACCCCCATTTCTCATCCATCCACTCCCTATAAATCG
ATGCACGAGGACCAAAATTTTTCGATCACTGCTCGCTCCAGAGAAATCTGGAATAATGGGATCTTGCACGAGGACGATCAGAACGAACGTGGGAAAGAAGAAGAAGGGAA
GCCGGTGAAATGGGATTTTTCGACCGGAAAAGTCGATGACCCGTTGGCGGAGAAGTTGTGCGCATATTCCGGCGAGAAGGTGGAGGAGGAGGAGGATGACGGCGAGTCGC
TAGACGCGACGTGGAAGGCCATAATGGAAAGGCAAGGGAAGCAGACGGCGCAGCTGAAGAAGAGCCAGACATGGGACTCCCCACCACGTTTAATTAGAGCCGTGGCGGAC
GGCGGCGAGGAGGACGCCGCGGCGTGGGCCCGGAATGAGGTGAAAAAGTCGGAGATGTTCAAGCAAACACTGTCGTTCCGGAGAAGGATATCGATGAGTTCGGAGGAATT
GAAGAGCCGGGCAGAGGCGTTCATAGAGATGGTAAACAGGAACATTCGCTTACAGAGGCAAGAATCAGAGCAACGCTTCCTGCAGGCACTTAAACGCAGCGTTTAG
mRNA sequence
ATGGCCGATCTGTCGCCAAAACCGGTGTTTAATGGTTTGACGTTAGAGACGGCAATCTCAGTTACAAAATCGATACTGCTGTTTGTTGGGCTTGTATCCACTGTAATTTT
GTTCAAAGTGGCTATAATTCCAAAATTAGTGAGTCTTCTCATCACGACTCCTCCTCGCTTATGGGTCTCCTTCGGATTCTGGCTCTCTCCACCCTACGTCTTCTTCGTCT
TCAATTTCATCATCGTCGCCGTCGCTGCCTCCTCCGCCTTCCGCCGCCAAAAAGACCCACCCGACGGTAATTACACCCCCATTTCTCATCCATCCACTCCCTATAAATCG
ATGCACGAGGACCAAAATTTTTCGATCACTGCTCGCTCCAGAGAAATCTGGAATAATGGGATCTTGCACGAGGACGATCAGAACGAACGTGGGAAAGAAGAAGAAGGGAA
GCCGGTGAAATGGGATTTTTCGACCGGAAAAGTCGATGACCCGTTGGCGGAGAAGTTGTGCGCATATTCCGGCGAGAAGGTGGAGGAGGAGGAGGATGACGGCGAGTCGC
TAGACGCGACGTGGAAGGCCATAATGGAAAGGCAAGGGAAGCAGACGGCGCAGCTGAAGAAGAGCCAGACATGGGACTCCCCACCACGTTTAATTAGAGCCGTGGCGGAC
GGCGGCGAGGAGGACGCCGCGGCGTGGGCCCGGAATGAGGTGAAAAAGTCGGAGATGTTCAAGCAAACACTGTCGTTCCGGAGAAGGATATCGATGAGTTCGGAGGAATT
GAAGAGCCGGGCAGAGGCGTTCATAGAGATGGTAAACAGGAACATTCGCTTACAGAGGCAAGAATCAGAGCAACGCTTCCTGCAGGCACTTAAACGCAGCGTTTAG
Protein sequence
MADLSPKPVFNGLTLETAISVTKSILLFVGLVSTVILFKVAIIPKLVSLLITTPPRLWVSFGFWLSPPYVFFVFNFIIVAVAASSAFRRQKDPPDGNYTPISHPSTPYKS
MHEDQNFSITARSREIWNNGILHEDDQNERGKEEEGKPVKWDFSTGKVDDPLAEKLCAYSGEKVEEEEDDGESLDATWKAIMERQGKQTAQLKKSQTWDSPPRLIRAVAD
GGEEDAAAWARNEVKKSEMFKQTLSFRRRISMSSEELKSRAEAFIEMVNRNIRLQRQESEQRFLQALKRSV