; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g16430 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g16430
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionBAHD acyltransferase At3g29680-like
Genome locationchr8:12647048..12648687
RNA-Seq ExpressionMoc08g16430
SyntenyMoc08g16430
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]1.8e-10650.28Show/hide
Query:  SPLKRRKKKKKVTSSSEVRPRGPLPSSHADLVDDPEARMGGTSDVKMWFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFT
        +P KRRKKKK + SSSEV     LP+  AD VDDP ARMGGTSDV   FR+EPSSSGV+DQVSRISAA LDRCLRRASKFVS PGSVL R ID+A EAF 
Subjt:  SPLKRRKKKKKVTSSSEVRPRGPLPSSHADLVDDPEARMGGTSDVKMWFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFT

Query:  ASIHSAIMIKAELHGKEALAAKERENSSAALEAA-TTLKGELLKARSEVDILRAEV--------------------------------------------
        ASI SA+ +KAEL G+E LAA+E+E  SAALEAA +T+K ELLKA SEV+ L+AEV                                            
Subjt:  ASIHSAIMIKAELHGKEALAAKERENSSAALEAA-TTLKGELLKARSEVDILRAEV--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------EAKGELLKREYERHKAHLRAAHAITKGLEKEKFQLLKEKDDMIQALEGKDAAIGRLTAEM
                                                EAK ELLKRE ERHKAHLRAAHAITKGLEKEKFQLLKEKDDM+QALE KDAAIGRL AE+
Subjt:  ----------------------------------------EAKGELLKREYERHKAHLRAAHAITKGLEKEKFQLLKEKDDMIQALEGKDAAIGRLTAEM

Query:  KAEKERLTNGALLEAAFRQHPNFDGFAKDFSDAGFKFLMKGIAADMP------------------------PGPASLVDKYVRDLDSDYSDLDEDEAPSQ
        KAEKERLTNGALLEAAFRQHP+FDGFAKDFSDAGFKFLMKGIAAD+P                         GPASLVDKYVRDLDSDYSDLDEDE PSQ
Subjt:  KAEKERLTNGALLEAAFRQHPNFDGFAKDFSDAGFKFLMKGIAADMP------------------------PGPASLVDKYVRDLDSDYSDLDEDEAPSQ

Query:  EPTEVGTTQEGAPSQQNGSQEVNLLSSQGELSSHLGS
        EPTEVGTTQEG PSQQ+GSQEVNLL SQGELSSHLGS
Subjt:  EPTEVGTTQEGAPSQQNGSQEVNLLSSQGELSSHLGS

XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]2.1e-10775.96Show/hide
Query:  MWFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAIMIKAELHGKEALAAKERENSSAALEAATTLKGELLKARS
        M FRME SSSGVKDQVSRISA CLDRCLRRAS+FVSDPGSVLQRTID+A EAF ASIHSA+M+KAEL G+EAL AKEREN S  LEAATTLKGELLKA+ 
Subjt:  MWFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAIMIKAELHGKEALAAKERENSSAALEAATTLKGELLKARS

Query:  EVDILRAEVEAKGELLKREYERHKAHLRAAHAITKGLEKEKFQLLKEKDDMIQALEGKDAAIGRLTAEMKAEKERLTNGALLEAAFRQHPNFDGFAKDFS
        EVDILRAEV+AK +LLK+E E+HKAHLRAAHAITKGLEKEKFQLLKEKDD+ Q LE KDA+IGRLT E+K  KERLT+GALLE +FRQHPNFDGFAKDFS
Subjt:  EVDILRAEVEAKGELLKREYERHKAHLRAAHAITKGLEKEKFQLLKEKDDMIQALEGKDAAIGRLTAEMKAEKERLTNGALLEAAFRQHPNFDGFAKDFS

Query:  DAGFKFLMKGIAADMP------------------------PGPASLVDKYVRDLDSDYSDLDEDEAPSQEPTEVGTTQEGAPSQQNG
        DAGFKFLMKGIAADMP                        PGP SLVDKYVR+LDSDYSD++E++APSQEPT+VGTTQE APSQ  G
Subjt:  DAGFKFLMKGIAADMP------------------------PGPASLVDKYVRDLDSDYSDLDEDEAPSQEPTEVGTTQEGAPSQQNG

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]7.9e-8666.31Show/hide
Query:  RMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAIMIKAELHGKEALAAKERENSSAALE-AATTLKGELLKARSEV
        R+EPSSSGV+DQVSRISAA LDRCLRRASKFVS PGSVLQRTID+A EAF ASI SA+ +KAEL G+E LAA+E+E  SAALE A++T+K ELLKA SEV
Subjt:  RMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAIMIKAELHGKEALAAKERENSSAALE-AATTLKGELLKARSEV

Query:  DILRAEVEAKGELLKREYERHKAHLRAAHAITKGLEKEKFQLLKEKDDMIQALEGKDAAIGRLTAEMKAEKERLTNGALLEAAFRQHPNFDGFAKDFSDA
        + L+AEVE++ ELLK+E +R +A LRAAHAIT+GLE+EKFQLLKEKDDM+QALE KD  +   TAE++  KERL+NG LLE AFRQHP+FDGFAKDFSDA
Subjt:  DILRAEVEAKGELLKREYERHKAHLRAAHAITKGLEKEKFQLLKEKDDMIQALEGKDAAIGRLTAEMKAEKERLTNGALLEAAFRQHPNFDGFAKDFSDA

Query:  GFKFLMKGIAADMP------------------------PGPASLVDKYVRDLDSDYSDLDEDEAPSQEPTEVGTTQEGA
        GFKFLMKGIA+DMP                        PGP +LVD+YVRDLDSDYSD +ED        +VG+TQEGA
Subjt:  GFKFLMKGIAADMP------------------------PGPASLVDKYVRDLDSDYSDLDEDEAPSQEPTEVGTTQEGA

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]9.8e-12177.39Show/hide
Query:  MGGTSDVKMWFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAIMIKAELHGKEALAAKERENSSAALEAATTLK
        MGGT DV+  FRMEPSSSGVKDQVSRISA CLDRCL+RASKFVSDPGSVLQRTID+A EAF ASIHSAIM+KAEL G+EALAAKERENSSAALEAATTLK
Subjt:  MGGTSDVKMWFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAIMIKAELHGKEALAAKERENSSAALEAATTLK

Query:  GELLKARSEVDILRAEVEAKGELLKREYERHKAHLRAAHAITKGLEKEKFQLLKEKDDMIQALEGKDAAIGRLTAEMKAEKERLTNGALLEAAFRQHPNF
        GELLKA+ EV ILRAEV+AK ELLK+E E+HKAHLRAAHAITKGLEKEKFQLLKEKDD+ Q LEGKD +IGRLTAE+K  KERLTNG+LLE +FRQH +F
Subjt:  GELLKARSEVDILRAEVEAKGELLKREYERHKAHLRAAHAITKGLEKEKFQLLKEKDDMIQALEGKDAAIGRLTAEMKAEKERLTNGALLEAAFRQHPNF

Query:  DGFAKDFSDAGFKFLMKGIAADMP------------------------PGPASLVDKYVRDLDSDYSDLDEDEAPSQEPTEVGTTQEGAPSQQNGSQEVN
        DGFAKDFSDAGFKFLMKGIAADMP                        PGP SLV KYVR+LDSDYSD++E++APSQEP E+GTTQE  PSQQ+GSQEVN
Subjt:  DGFAKDFSDAGFKFLMKGIAADMP------------------------PGPASLVDKYVRDLDSDYSDLDEDEAPSQEPTEVGTTQEGAPSQQNGSQEVN

Query:  LLSSQGELSSHLGS
        LL S+GELSSHLGS
Subjt:  LLSSQGELSSHLGS

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.4e-14374.87Show/hide
Query:  PRVLAQDQAGPSSEVPTPVIELDSTGERSREKRSRSESKALDVSPLCEVREGSPLKRRKKKKKVTSSSEVRPRGPLPSSHADLVDDPEARMGGTSDVKMW
        PR  AQ  +GPSS VPTPVIELD +G RS EKRSR ES+ALDVSPL EVR  SPL+RR+KKKK +SSSE   RG LP+SHADLVDDPEARM GTS+V+M 
Subjt:  PRVLAQDQAGPSSEVPTPVIELDSTGERSREKRSRSESKALDVSPLCEVREGSPLKRRKKKKKVTSSSEVRPRGPLPSSHADLVDDPEARMGGTSDVKMW

Query:  FRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAIMIKAELHGKEALAAKERENSSAALEAATTLKGELLKARSEV
        F MEPSSSGVKDQVSRISA CLDR LRRASKFVSDPGSVLQRTID+  EAF ASIH A+M+KAEL G+EALAAKERENS AALEAATTLKGELLKA+ EV
Subjt:  FRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAIMIKAELHGKEALAAKERENSSAALEAATTLKGELLKARSEV

Query:  DILRAEVEAKGELLKREYERHKAHLRAAHAITKGLEKEKFQLLKEKDDMIQALEGKDAAIGRLTAEMKAEKERLTNGALLEAAFRQHPNFDGFAKDFSDA
        DILRAEV+AK +LLK+E E+HKAHLRAAHAITKGLEKEKFQLLKEKDD+ Q LE KDA+IGRLT E+K  KERLTNG LLE +FRQHP+FDGFAKDFSDA
Subjt:  DILRAEVEAKGELLKREYERHKAHLRAAHAITKGLEKEKFQLLKEKDDMIQALEGKDAAIGRLTAEMKAEKERLTNGALLEAAFRQHPNFDGFAKDFSDA

Query:  GFKFLMKGIAADMP------------------------PGPASLVDKYVRDLDSDYSDLDEDEAPSQEPTEVGTTQEGAPSQQNGS
        GFKFLMKGIAADMP                        P P SLVDKYVR+LDSDYSD++E++APSQEP EVGTTQE  PSQQ GS
Subjt:  GFKFLMKGIAADMP------------------------PGPASLVDKYVRDLDSDYSDLDEDEAPSQEPTEVGTTQEGAPSQQNGS

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124678.7e-10750.28Show/hide
Query:  SPLKRRKKKKKVTSSSEVRPRGPLPSSHADLVDDPEARMGGTSDVKMWFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFT
        +P KRRKKKK + SSSEV     LP+  AD VDDP ARMGGTSDV   FR+EPSSSGV+DQVSRISAA LDRCLRRASKFVS PGSVL R ID+A EAF 
Subjt:  SPLKRRKKKKKVTSSSEVRPRGPLPSSHADLVDDPEARMGGTSDVKMWFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFT

Query:  ASIHSAIMIKAELHGKEALAAKERENSSAALEAA-TTLKGELLKARSEVDILRAEV--------------------------------------------
        ASI SA+ +KAEL G+E LAA+E+E  SAALEAA +T+K ELLKA SEV+ L+AEV                                            
Subjt:  ASIHSAIMIKAELHGKEALAAKERENSSAALEAA-TTLKGELLKARSEVDILRAEV--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------EAKGELLKREYERHKAHLRAAHAITKGLEKEKFQLLKEKDDMIQALEGKDAAIGRLTAEM
                                                EAK ELLKRE ERHKAHLRAAHAITKGLEKEKFQLLKEKDDM+QALE KDAAIGRL AE+
Subjt:  ----------------------------------------EAKGELLKREYERHKAHLRAAHAITKGLEKEKFQLLKEKDDMIQALEGKDAAIGRLTAEM

Query:  KAEKERLTNGALLEAAFRQHPNFDGFAKDFSDAGFKFLMKGIAADMP------------------------PGPASLVDKYVRDLDSDYSDLDEDEAPSQ
        KAEKERLTNGALLEAAFRQHP+FDGFAKDFSDAGFKFLMKGIAAD+P                         GPASLVDKYVRDLDSDYSDLDEDE PSQ
Subjt:  KAEKERLTNGALLEAAFRQHPNFDGFAKDFSDAGFKFLMKGIAADMP------------------------PGPASLVDKYVRDLDSDYSDLDEDEAPSQ

Query:  EPTEVGTTQEGAPSQQNGSQEVNLLSSQGELSSHLGS
        EPTEVGTTQEG PSQQ+GSQEVNLL SQGELSSHLGS
Subjt:  EPTEVGTTQEGAPSQQNGSQEVNLLSSQGELSSHLGS

A0A6J1D1N9 uncharacterized protein LOC1110161931.0e-10775.96Show/hide
Query:  MWFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAIMIKAELHGKEALAAKERENSSAALEAATTLKGELLKARS
        M FRME SSSGVKDQVSRISA CLDRCLRRAS+FVSDPGSVLQRTID+A EAF ASIHSA+M+KAEL G+EAL AKEREN S  LEAATTLKGELLKA+ 
Subjt:  MWFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAIMIKAELHGKEALAAKERENSSAALEAATTLKGELLKARS

Query:  EVDILRAEVEAKGELLKREYERHKAHLRAAHAITKGLEKEKFQLLKEKDDMIQALEGKDAAIGRLTAEMKAEKERLTNGALLEAAFRQHPNFDGFAKDFS
        EVDILRAEV+AK +LLK+E E+HKAHLRAAHAITKGLEKEKFQLLKEKDD+ Q LE KDA+IGRLT E+K  KERLT+GALLE +FRQHPNFDGFAKDFS
Subjt:  EVDILRAEVEAKGELLKREYERHKAHLRAAHAITKGLEKEKFQLLKEKDDMIQALEGKDAAIGRLTAEMKAEKERLTNGALLEAAFRQHPNFDGFAKDFS

Query:  DAGFKFLMKGIAADMP------------------------PGPASLVDKYVRDLDSDYSDLDEDEAPSQEPTEVGTTQEGAPSQQNG
        DAGFKFLMKGIAADMP                        PGP SLVDKYVR+LDSDYSD++E++APSQEPT+VGTTQE APSQ  G
Subjt:  DAGFKFLMKGIAADMP------------------------PGPASLVDKYVRDLDSDYSDLDEDEAPSQEPTEVGTTQEGAPSQQNG

A0A6J1D971 uncharacterized protein LOC1110185383.8e-8666.31Show/hide
Query:  RMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAIMIKAELHGKEALAAKERENSSAALE-AATTLKGELLKARSEV
        R+EPSSSGV+DQVSRISAA LDRCLRRASKFVS PGSVLQRTID+A EAF ASI SA+ +KAEL G+E LAA+E+E  SAALE A++T+K ELLKA SEV
Subjt:  RMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAIMIKAELHGKEALAAKERENSSAALE-AATTLKGELLKARSEV

Query:  DILRAEVEAKGELLKREYERHKAHLRAAHAITKGLEKEKFQLLKEKDDMIQALEGKDAAIGRLTAEMKAEKERLTNGALLEAAFRQHPNFDGFAKDFSDA
        + L+AEVE++ ELLK+E +R +A LRAAHAIT+GLE+EKFQLLKEKDDM+QALE KD  +   TAE++  KERL+NG LLE AFRQHP+FDGFAKDFSDA
Subjt:  DILRAEVEAKGELLKREYERHKAHLRAAHAITKGLEKEKFQLLKEKDDMIQALEGKDAAIGRLTAEMKAEKERLTNGALLEAAFRQHPNFDGFAKDFSDA

Query:  GFKFLMKGIAADMP------------------------PGPASLVDKYVRDLDSDYSDLDEDEAPSQEPTEVGTTQEGA
        GFKFLMKGIA+DMP                        PGP +LVD+YVRDLDSDYSD +ED        +VG+TQEGA
Subjt:  GFKFLMKGIAADMP------------------------PGPASLVDKYVRDLDSDYSDLDEDEAPSQEPTEVGTTQEGA

A0A6J1DF31 uncharacterized protein LOC1110199094.8e-12177.39Show/hide
Query:  MGGTSDVKMWFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAIMIKAELHGKEALAAKERENSSAALEAATTLK
        MGGT DV+  FRMEPSSSGVKDQVSRISA CLDRCL+RASKFVSDPGSVLQRTID+A EAF ASIHSAIM+KAEL G+EALAAKERENSSAALEAATTLK
Subjt:  MGGTSDVKMWFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAIMIKAELHGKEALAAKERENSSAALEAATTLK

Query:  GELLKARSEVDILRAEVEAKGELLKREYERHKAHLRAAHAITKGLEKEKFQLLKEKDDMIQALEGKDAAIGRLTAEMKAEKERLTNGALLEAAFRQHPNF
        GELLKA+ EV ILRAEV+AK ELLK+E E+HKAHLRAAHAITKGLEKEKFQLLKEKDD+ Q LEGKD +IGRLTAE+K  KERLTNG+LLE +FRQH +F
Subjt:  GELLKARSEVDILRAEVEAKGELLKREYERHKAHLRAAHAITKGLEKEKFQLLKEKDDMIQALEGKDAAIGRLTAEMKAEKERLTNGALLEAAFRQHPNF

Query:  DGFAKDFSDAGFKFLMKGIAADMP------------------------PGPASLVDKYVRDLDSDYSDLDEDEAPSQEPTEVGTTQEGAPSQQNGSQEVN
        DGFAKDFSDAGFKFLMKGIAADMP                        PGP SLV KYVR+LDSDYSD++E++APSQEP E+GTTQE  PSQQ+GSQEVN
Subjt:  DGFAKDFSDAGFKFLMKGIAADMP------------------------PGPASLVDKYVRDLDSDYSDLDEDEAPSQEPTEVGTTQEGAPSQQNGSQEVN

Query:  LLSSQGELSSHLGS
        LL S+GELSSHLGS
Subjt:  LLSSQGELSSHLGS

A0A6J1DZB3 uncharacterized protein LOC1110256656.8e-14474.87Show/hide
Query:  PRVLAQDQAGPSSEVPTPVIELDSTGERSREKRSRSESKALDVSPLCEVREGSPLKRRKKKKKVTSSSEVRPRGPLPSSHADLVDDPEARMGGTSDVKMW
        PR  AQ  +GPSS VPTPVIELD +G RS EKRSR ES+ALDVSPL EVR  SPL+RR+KKKK +SSSE   RG LP+SHADLVDDPEARM GTS+V+M 
Subjt:  PRVLAQDQAGPSSEVPTPVIELDSTGERSREKRSRSESKALDVSPLCEVREGSPLKRRKKKKKVTSSSEVRPRGPLPSSHADLVDDPEARMGGTSDVKMW

Query:  FRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAIMIKAELHGKEALAAKERENSSAALEAATTLKGELLKARSEV
        F MEPSSSGVKDQVSRISA CLDR LRRASKFVSDPGSVLQRTID+  EAF ASIH A+M+KAEL G+EALAAKERENS AALEAATTLKGELLKA+ EV
Subjt:  FRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAIMIKAELHGKEALAAKERENSSAALEAATTLKGELLKARSEV

Query:  DILRAEVEAKGELLKREYERHKAHLRAAHAITKGLEKEKFQLLKEKDDMIQALEGKDAAIGRLTAEMKAEKERLTNGALLEAAFRQHPNFDGFAKDFSDA
        DILRAEV+AK +LLK+E E+HKAHLRAAHAITKGLEKEKFQLLKEKDD+ Q LE KDA+IGRLT E+K  KERLTNG LLE +FRQHP+FDGFAKDFSDA
Subjt:  DILRAEVEAKGELLKREYERHKAHLRAAHAITKGLEKEKFQLLKEKDDMIQALEGKDAAIGRLTAEMKAEKERLTNGALLEAAFRQHPNFDGFAKDFSDA

Query:  GFKFLMKGIAADMP------------------------PGPASLVDKYVRDLDSDYSDLDEDEAPSQEPTEVGTTQEGAPSQQNGS
        GFKFLMKGIAADMP                        P P SLVDKYVR+LDSDYSD++E++APSQEP EVGTTQE  PSQQ GS
Subjt:  GFKFLMKGIAADMP------------------------PGPASLVDKYVRDLDSDYSDLDEDEAPSQEPTEVGTTQEGAPSQQNGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGTCCTCTATTTATAGGGCCGAGGTGGACCTAGGCAATCCCTCTTTTCCAGTTTCTTCAAACACCAATAAGGGTCCTCCACGTGTCCTGGCTCAGGACCAGGCTGG
GCCATCTTCTGAAGTTCCAACTCCCGTGATCGAGTTGGATTCTACTGGGGAACGGTCCAGGGAGAAGCGCTCGAGGAGCGAGTCCAAGGCGTTGGACGTGTCACCTCTTT
GTGAGGTAAGAGAGGGCTCTCCTCTGAAGAGAAGGAAGAAAAAGAAGAAAGTCACCTCCTCCTCGGAGGTTCGACCTCGCGGCCCCCTACCCTCAAGCCACGCCGATTTG
GTGGATGACCCCGAAGCGCGGATGGGAGGGACATCCGACGTGAAGATGTGGTTCAGAATGGAACCGTCAAGCTCCGGGGTGAAGGACCAGGTGTCACGCATCTCGGCTGC
CTGCTTGGATCGCTGTCTCAGAAGAGCGTCCAAGTTTGTGAGTGATCCAGGGTCCGTGCTGCAACGGACCATCGACCACGCTGTCGAGGCGTTCACTGCCTCCATCCACT
CAGCAATCATGATCAAGGCCGAGCTGCATGGAAAGGAGGCCCTGGCAGCGAAAGAGAGGGAGAACTCTTCTGCTGCGCTGGAGGCTGCCACTACACTCAAGGGCGAGCTG
CTGAAGGCTCGGAGCGAGGTTGACATTCTGAGGGCCGAGGTAGAAGCCAAGGGCGAGCTGCTGAAGAGGGAATATGAGAGGCATAAGGCCCACCTCCGAGCCGCCCACGC
CATCACTAAAGGGCTGGAGAAGGAAAAGTTCCAACTCCTTAAGGAGAAGGACGACATGATCCAGGCCCTTGAAGGGAAGGACGCTGCAATTGGGCGTCTCACTGCTGAGA
TGAAGGCGGAAAAGGAGCGCCTTACCAACGGAGCTCTTCTTGAAGCAGCCTTCAGGCAACACCCAAATTTTGATGGGTTTGCCAAGGACTTCAGCGATGCGGGCTTCAAA
TTTCTGATGAAGGGCATTGCTGCTGATATGCCTCCAGGTCCTGCATCCCTGGTGGACAAGTACGTCAGAGATCTGGACTCTGACTACTCCGACCTGGACGAAGACGAAGC
TCCTAGTCAGGAACCTACTGAGGTCGGCACTACCCAGGAGGGAGCTCCTTCTCAGCAGAATGGATCTCAGGAGGTCAACCTTCTGAGTTCTCAAGGCGAGCTATCTTCTC
ACCTCGGGAGCGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGTCCTCTATTTATAGGGCCGAGGTGGACCTAGGCAATCCCTCTTTTCCAGTTTCTTCAAACACCAATAAGGGTCCTCCACGTGTCCTGGCTCAGGACCAGGCTGG
GCCATCTTCTGAAGTTCCAACTCCCGTGATCGAGTTGGATTCTACTGGGGAACGGTCCAGGGAGAAGCGCTCGAGGAGCGAGTCCAAGGCGTTGGACGTGTCACCTCTTT
GTGAGGTAAGAGAGGGCTCTCCTCTGAAGAGAAGGAAGAAAAAGAAGAAAGTCACCTCCTCCTCGGAGGTTCGACCTCGCGGCCCCCTACCCTCAAGCCACGCCGATTTG
GTGGATGACCCCGAAGCGCGGATGGGAGGGACATCCGACGTGAAGATGTGGTTCAGAATGGAACCGTCAAGCTCCGGGGTGAAGGACCAGGTGTCACGCATCTCGGCTGC
CTGCTTGGATCGCTGTCTCAGAAGAGCGTCCAAGTTTGTGAGTGATCCAGGGTCCGTGCTGCAACGGACCATCGACCACGCTGTCGAGGCGTTCACTGCCTCCATCCACT
CAGCAATCATGATCAAGGCCGAGCTGCATGGAAAGGAGGCCCTGGCAGCGAAAGAGAGGGAGAACTCTTCTGCTGCGCTGGAGGCTGCCACTACACTCAAGGGCGAGCTG
CTGAAGGCTCGGAGCGAGGTTGACATTCTGAGGGCCGAGGTAGAAGCCAAGGGCGAGCTGCTGAAGAGGGAATATGAGAGGCATAAGGCCCACCTCCGAGCCGCCCACGC
CATCACTAAAGGGCTGGAGAAGGAAAAGTTCCAACTCCTTAAGGAGAAGGACGACATGATCCAGGCCCTTGAAGGGAAGGACGCTGCAATTGGGCGTCTCACTGCTGAGA
TGAAGGCGGAAAAGGAGCGCCTTACCAACGGAGCTCTTCTTGAAGCAGCCTTCAGGCAACACCCAAATTTTGATGGGTTTGCCAAGGACTTCAGCGATGCGGGCTTCAAA
TTTCTGATGAAGGGCATTGCTGCTGATATGCCTCCAGGTCCTGCATCCCTGGTGGACAAGTACGTCAGAGATCTGGACTCTGACTACTCCGACCTGGACGAAGACGAAGC
TCCTAGTCAGGAACCTACTGAGGTCGGCACTACCCAGGAGGGAGCTCCTTCTCAGCAGAATGGATCTCAGGAGGTCAACCTTCTGAGTTCTCAAGGCGAGCTATCTTCTC
ACCTCGGGAGCGGCTGA
Protein sequenceShow/hide protein sequence
MKSSIYRAEVDLGNPSFPVSSNTNKGPPRVLAQDQAGPSSEVPTPVIELDSTGERSREKRSRSESKALDVSPLCEVREGSPLKRRKKKKKVTSSSEVRPRGPLPSSHADL
VDDPEARMGGTSDVKMWFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAIMIKAELHGKEALAAKERENSSAALEAATTLKGEL
LKARSEVDILRAEVEAKGELLKREYERHKAHLRAAHAITKGLEKEKFQLLKEKDDMIQALEGKDAAIGRLTAEMKAEKERLTNGALLEAAFRQHPNFDGFAKDFSDAGFK
FLMKGIAADMPPGPASLVDKYVRDLDSDYSDLDEDEAPSQEPTEVGTTQEGAPSQQNGSQEVNLLSSQGELSSHLGSG