CuGenDBv2

Moc09g03330 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g03330
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: BAHD acyltransferase At3g29680-like
Genome location: chr9:2588604..2591994
RNA-Seq Expression: Moc09g03330
Synteny: Moc09g03330
Gene Ontology terms: GO:0043167 - ion binding (molecular function)
InterPro domains: NA
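The genome location above can be split into a chromosome name and a coordinate pair. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive (the page does not state the convention). `parse_location` and `span_length` are hypothetical helpers for illustration, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

chrom, start, end = parse_location("chr9:2588604..2591994")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the locus spans 3,391 bp, which is longer than the CDS listed further down; the difference would be consistent with introns or untranslated regions, though the page does not show the gene structure.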


Homology
GenBank top hits (e value, % identity, alignment)
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]; e value 3.5e-107; identity 50.09%
Query:  TEAAYVSSLGEE-------VREEAPLKRRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRA
        +E+A+++ L E+       + E  P  +RRKKKK  S  EVGA   LPA FADRVDDP ARMGGTSDVTARFR+EPSSSGVRDQVSRISAASLDRCLRRA
Subjt:  TEAAYVSSLGEE-------VREEAPLKRRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRA

Query:  SKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEV---------------------
        SKFVS PGSVL R IDYAAEAFVASIQSALAVKAELDGRE LAAREKEEFSAALEAASSTMKDELLKAHSEVE LKAEV                     
Subjt:  SKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEV---------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------EAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLK
                                                                       EAKAELLK+E++R KA LRAAHAITKGLEKEKFQLLK
Subjt:  ---------------------------------------------------------------EAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLK

Query:  EKDDMLQALEAKEEELKHATAELEMVKERLSNGALLEESFRQHLDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQAL
        EKDDMLQALE K+  +    AEL+  KERL+NGALLE +FRQH DFDGFAKDFSDAGFKFLMKGIA+D+P L++DLG LKKRYAE+WASGP+GT GP +L
Subjt:  EKDDMLQALEAKEEELKHATAELEMVKERLSNGALLEESFRQHLDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQAL

Query:  VDKYVRDLDSDYSDLEEHQVQRLDHSELFTS
        VDKYVRDLDSDYSDL+E +V   + +E+ T+
Subjt:  VDKYVRDLDSDYSDLEEHQVQRLDHSELFTS

XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]; e value 3.9e-98; identity 72.56%
Query:  RFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHS
        RFR+E SSSGV+DQVSRISA  LDRCLRRAS+FVSDPGSVLQRTID AAEAF+ASI SA+ VKAELDGREAL A+E+E  S  LEAA +T+K ELLKA  
Subjt:  RFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHS

Query:  EVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELEMVKERLSNGALLEESFRQHLDFDGFAKDFS
        EV+IL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE K+  +   T EL+ +KERL++GALLEESFRQH +FDGFAKDFS
Subjt:  EVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELEMVKERLSNGALLEESFRQHLDFDGFAKDFS

Query:  DAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEHQVQRLDHSELFTS
        DAGFKFLMKGIA+DMP LQIDL  LKKRY+E WASGP+GTPGPQ+LVDKYVR+LDSDYSD+EE      + +++ T+
Subjt:  DAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEHQVQRLDHSELFTS

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]; e value 4.2e-121; identity 88.28%
Query:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGRE LAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELEMVKERLSNGALLEESFRQHLDFD
        ELLKAHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAK++EL+HATAELE  KERLSNG LLEE+FRQH DFD
Subjt:  ELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELEMVKERLSNGALLEESFRQHLDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEHQV
        GFAKDFSDAGFKFLMKGIASDMPDLQIDL GLK+RYAE+WASGP GTPGPQALVD+YVRDLDSDYSD EE QV
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEHQV

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]; e value 3.9e-106; identity 75.52%
Query:  MGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTM
        MGGT DV  RFR+EPSSSGV+DQVSRISA  LDRCL+RASKFVSDPGSVLQRTID AAEAFVASI SA+ VKAELDGREALAA+E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELEMVKERLSNGALLEESFRQHLD
        K ELLKA  EV IL+AEV+AKAELLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE K+  +   TAEL+ +KERL+NG+LLEESFRQHLD
Subjt:  KDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELEMVKERLSNGALLEESFRQHLD

Query:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEHQVQRLDHSELFTS
        FDGFAKDFSDAGFKFLMKGIA+DMP LQIDL  LKK+Y+E+WASGP+GTPGPQ+LV KYVR+LDSDYSD+EE      + +E+ T+
Subjt:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEHQVQRLDHSELFTS

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]; e value 2.0e-134; identity 68.46%
Query:  IYGFASNVKRKSKGRAHALETVQSSEPATPAV--------AGPASEDPAPVIELESSEGPSREKCPRSQTEAAYVSSLGEEVREEAPLKRRRKKKKTTSP
        + GF  +VKRKSKGRAHAL+TV  +EP TP V        +GP+S  P PVIEL+ S G S EK  R ++EA  VS L  EVR E+PL+RRRKKKKT+S 
Subjt:  IYGFASNVKRKSKGRAHALETVQSSEPATPAV--------AGPASEDPAPVIELESSEGPSREKCPRSQTEAAYVSSLGEEVREEAPLKRRRKKKKTTSP

Query:  LEVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDG
         E GARG LP S AD VDDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF+ASI  A+ VKAELDG
Subjt:  LEVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDG

Query:  REALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAE
        REALAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE K+  +   T E
Subjt:  REALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAE

Query:  LEMVKERLSNGALLEESFRQHLDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEHQVQR
        L+ +KERL+NG LLEESFRQH DFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL GLKK+Y+E+WASGP+GTP PQ+LVDKYVR+LDSDYSD+EE     
Subjt:  LEMVKERLSNGALLEESFRQHLDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEHQVQR

Query:  LDHSELFTS
         +  E+ T+
Subjt:  LDHSELFTS
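In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below shows one way to tally those marks in Python. `alignment_identity` is a hypothetical helper, and it assumes the `Query:` prefix and the matching leading indent have already been stripped from each line. It is not guaranteed to reproduce the % identity values reported on this page, which come from the database's own BLAST run.

```python
def alignment_identity(blocks):
    """Return (% identity, % positives) over all alignment columns.

    blocks: iterable of (query_line, midline) string pairs with the
    'Query:' label and leading indent removed. The query line sets the
    column count, so trailing spaces missing from a midline do not matter.
    """
    identical = positive = columns = 0
    for query, midline in blocks:
        columns += len(query)
        # Letters on the midline are identical residues.
        identical += sum(ch.isalpha() for ch in midline)
        # Positives are identical residues plus '+' substitutions.
        positive += sum(ch.isalpha() or ch == "+" for ch in midline)
    if columns == 0:
        raise ValueError("empty alignment")
    return 100.0 * identical / columns, 100.0 * positive / columns

# Toy block for illustration only (not taken from this record).
print(alignment_identity([("ACDEF", "AC+ F")]))
```

Gap-only blocks still add to the column count, so long gapped regions lower the computed identity.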

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CLV1 uncharacterized protein LOC111012467; e value 1.7e-107; identity 50.09%
Query:  TEAAYVSSLGEE-------VREEAPLKRRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRA
        +E+A+++ L E+       + E  P  +RRKKKK  S  EVGA   LPA FADRVDDP ARMGGTSDVTARFR+EPSSSGVRDQVSRISAASLDRCLRRA
Subjt:  TEAAYVSSLGEE-------VREEAPLKRRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRA

Query:  SKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEV---------------------
        SKFVS PGSVL R IDYAAEAFVASIQSALAVKAELDGRE LAAREKEEFSAALEAASSTMKDELLKAHSEVE LKAEV                     
Subjt:  SKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEV---------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------EAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLK
                                                                       EAKAELLK+E++R KA LRAAHAITKGLEKEKFQLLK
Subjt:  ---------------------------------------------------------------EAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLK

Query:  EKDDMLQALEAKEEELKHATAELEMVKERLSNGALLEESFRQHLDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQAL
        EKDDMLQALE K+  +    AEL+  KERL+NGALLE +FRQH DFDGFAKDFSDAGFKFLMKGIA+D+P L++DLG LKKRYAE+WASGP+GT GP +L
Subjt:  EKDDMLQALEAKEEELKHATAELEMVKERLSNGALLEESFRQHLDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQAL

Query:  VDKYVRDLDSDYSDLEEHQVQRLDHSELFTS
        VDKYVRDLDSDYSDL+E +V   + +E+ T+
Subjt:  VDKYVRDLDSDYSDLEEHQVQRLDHSELFTS

A0A6J1D1N9 uncharacterized protein LOC111016193; e value 1.9e-98; identity 72.56%
Query:  RFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHS
        RFR+E SSSGV+DQVSRISA  LDRCLRRAS+FVSDPGSVLQRTID AAEAF+ASI SA+ VKAELDGREAL A+E+E  S  LEAA +T+K ELLKA  
Subjt:  RFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHS

Query:  EVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELEMVKERLSNGALLEESFRQHLDFDGFAKDFS
        EV+IL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE K+  +   T EL+ +KERL++GALLEESFRQH +FDGFAKDFS
Subjt:  EVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELEMVKERLSNGALLEESFRQHLDFDGFAKDFS

Query:  DAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEHQVQRLDHSELFTS
        DAGFKFLMKGIA+DMP LQIDL  LKKRY+E WASGP+GTPGPQ+LVDKYVR+LDSDYSD+EE      + +++ T+
Subjt:  DAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEHQVQRLDHSELFTS

A0A6J1D971 uncharacterized protein LOC111018538; e value 2.1e-121; identity 88.28%
Query:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGRE LAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELEMVKERLSNGALLEESFRQHLDFD
        ELLKAHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAK++EL+HATAELE  KERLSNG LLEE+FRQH DFD
Subjt:  ELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELEMVKERLSNGALLEESFRQHLDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEHQV
        GFAKDFSDAGFKFLMKGIASDMPDLQIDL GLK+RYAE+WASGP GTPGPQALVD+YVRDLDSDYSD EE QV
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEHQV

A0A6J1DF31 uncharacterized protein LOC111019909; e value 1.9e-106; identity 75.52%
Query:  MGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTM
        MGGT DV  RFR+EPSSSGV+DQVSRISA  LDRCL+RASKFVSDPGSVLQRTID AAEAFVASI SA+ VKAELDGREALAA+E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELEMVKERLSNGALLEESFRQHLD
        K ELLKA  EV IL+AEV+AKAELLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE K+  +   TAEL+ +KERL+NG+LLEESFRQHLD
Subjt:  KDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELEMVKERLSNGALLEESFRQHLD

Query:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEHQVQRLDHSELFTS
        FDGFAKDFSDAGFKFLMKGIA+DMP LQIDL  LKK+Y+E+WASGP+GTPGPQ+LV KYVR+LDSDYSD+EE      + +E+ T+
Subjt:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEHQVQRLDHSELFTS

A0A6J1DZB3 uncharacterized protein LOC111025665; e value 9.5e-135; identity 68.46%
Query:  IYGFASNVKRKSKGRAHALETVQSSEPATPAV--------AGPASEDPAPVIELESSEGPSREKCPRSQTEAAYVSSLGEEVREEAPLKRRRKKKKTTSP
        + GF  +VKRKSKGRAHAL+TV  +EP TP V        +GP+S  P PVIEL+ S G S EK  R ++EA  VS L  EVR E+PL+RRRKKKKT+S 
Subjt:  IYGFASNVKRKSKGRAHALETVQSSEPATPAV--------AGPASEDPAPVIELESSEGPSREKCPRSQTEAAYVSSLGEEVREEAPLKRRRKKKKTTSP

Query:  LEVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDG
         E GARG LP S AD VDDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF+ASI  A+ VKAELDG
Subjt:  LEVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDG

Query:  REALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAE
        REALAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE K+  +   T E
Subjt:  REALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAE

Query:  LEMVKERLSNGALLEESFRQHLDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEHQVQR
        L+ +KERL+NG LLEESFRQH DFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL GLKK+Y+E+WASGP+GTP PQ+LVDKYVR+LDSDYSD+EE     
Subjt:  LEMVKERLSNGALLEESFRQHLDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEHQVQR

Query:  LDHSELFTS
         +  E+ T+
Subjt:  LDHSELFTS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGTATTCGGGGAACATATTTCGAGTTCCGATTCGGGTCGGACTGCTTCGACCTCGGAGGAGAATTTGTGTGTTGCAATTTACGGATTTGCAAGCAACGTGAAACGCAA
GTCCAAGGGCCGAGCCCATGCTCTTGAGACCGTCCAGAGTTCGGAACCTGCCACTCCTGCCGTGGCAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGT
CTTCTGAGGGTCCTTCGAGGGAGAAGTGCCCTAGGAGTCAGACCGAGGCGGCGTACGTCTCGTCCTTGGGCGAGGAGGTGAGGGAGGAGGCCCCTCTCAAGCGAAGGAGG
AAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGA
CGTGACAGCACGGTTCAGAGTCGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCCAAATTTG
TAAGTGACCCGGGGTCCGTCCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAGGGAA
GCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCCTCTTCCACCATGAAGGATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGC
TGAGGTGGAGGCCAAGGCCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAAC
TCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCAAAGGAGGAGGAGCTGAAGCACGCGACTGCTGAGCTGGAGATGGTGAAGGAGCGTCTCAGCAATGGAGCC
CTATTGGAGGAATCGTTCAGGCAACATCTTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGA
CCTTCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTACGTCAGAGATC
TGGACTCTGACTACTCCGACCTCGAAGAGCATCAGGTTCAAAGGCTTGATCATTCTGAACTTTTCACATCGCCCCCTTGCCTTGAAGGTTTGAATTTTAAGTTCACCAAC
GGTTTTGGCATCGCACCTCGTACCCTTAGATCCATTGAAAACCATTTTGACATTTCAAGGATAATAACGCTTCAGGTGCTCCGCGTTCCACGGGTGCGCGAGGACGTCTC
CTTTCAGATCGGCCAATATGTACGTCCCAGGTCGGACTATTCCCTTGACCTCAAACGGCCCCTCCCAGGTCAGGTCAAGGGCACCCACATGGGTTTGGACCTTCCTTAA
mRNA sequence
ATGGTATTCGGGGAACATATTTCGAGTTCCGATTCGGGTCGGACTGCTTCGACCTCGGAGGAGAATTTGTGTGTTGCAATTTACGGATTTGCAAGCAACGTGAAACGCAA
GTCCAAGGGCCGAGCCCATGCTCTTGAGACCGTCCAGAGTTCGGAACCTGCCACTCCTGCCGTGGCAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGT
CTTCTGAGGGTCCTTCGAGGGAGAAGTGCCCTAGGAGTCAGACCGAGGCGGCGTACGTCTCGTCCTTGGGCGAGGAGGTGAGGGAGGAGGCCCCTCTCAAGCGAAGGAGG
AAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGA
CGTGACAGCACGGTTCAGAGTCGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCCAAATTTG
TAAGTGACCCGGGGTCCGTCCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAGGGAA
GCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCCTCTTCCACCATGAAGGATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGC
TGAGGTGGAGGCCAAGGCCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAAC
TCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCAAAGGAGGAGGAGCTGAAGCACGCGACTGCTGAGCTGGAGATGGTGAAGGAGCGTCTCAGCAATGGAGCC
CTATTGGAGGAATCGTTCAGGCAACATCTTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGA
CCTTCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTACGTCAGAGATC
TGGACTCTGACTACTCCGACCTCGAAGAGCATCAGGTTCAAAGGCTTGATCATTCTGAACTTTTCACATCGCCCCCTTGCCTTGAAGGTTTGAATTTTAAGTTCACCAAC
GGTTTTGGCATCGCACCTCGTACCCTTAGATCCATTGAAAACCATTTTGACATTTCAAGGATAATAACGCTTCAGGTGCTCCGCGTTCCACGGGTGCGCGAGGACGTCTC
CTTTCAGATCGGCCAATATGTACGTCCCAGGTCGGACTATTCCCTTGACCTCAAACGGCCCCTCCCAGGTCAGGTCAAGGGCACCCACATGGGTTTGGACCTTCCTTAA
Protein sequence
MVFGEHISSSDSGRTASTSEENLCVAIYGFASNVKRKSKGRAHALETVQSSEPATPAVAGPASEDPAPVIELESSEGPSREKCPRSQTEAAYVSSLGEEVREEAPLKRRR
KKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGRE
ALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELEMVKERLSNGA
LLEESFRQHLDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEHQVQRLDHSELFTSPPCLEGLNFKFTN
GFGIAPRTLRSIENHFDISRIITLQVLRVPRVREDVSFQIGQYVRPRSDYSLDLKRPLPGQVKGTHMGLDLP
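
The protein sequence above should correspond to a translation of the CDS. As a rough consistency check, the sketch below translates a nucleotide string with the standard genetic code in plain Python. `translate` is a hypothetical helper written for this illustration, and the use of the standard nuclear code is an assumption (the page does not say which translation table was used).

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS listed above.
print(translate("ATGGTATTCGGGGAACATATTTCGAGTTCC"))
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the listed protein sequence would confirm whether the two records agree.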