CuGenDBv2

Moc03g28670 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g28670
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: BAHD acyltransferase At3g29680-like
Genome location: chr3:20545332..20548389
RNA-Seq Expression: Moc03g28670
Synteny: Moc03g28670
Gene Ontology terms: GO:0043167 - ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]; e-value 1.6e-110; identity 51.93%
Query:  RRRKKKKAISPSEVGACRVLPAGWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQ
        +RRKKKKAIS SEVGACRVLPAG+ADRVDDPAARMGGTSDVTARFRIEPSS GVR+QV+RISAASLDRCLRRASKFVS PGSVL R IDYAAEAFVASIQ
Subjt:  RRRKKKKAISPSEVGACRVLPAGWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQ

Query:  SALVIKAELDGREALASREKEEFSAALEAASSTMKDELLKAHSEVETLKGEVESQ---------------------------------------------
        SAL +KAELDGRE LA+REKEEFSAALEAASSTMKDELLKAHSEVETLK EVESQ                                             
Subjt:  SALVIKAELDGREALASREKEEFSAALEAASSTMKDELLKAHSEVETLKGEVESQ---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAK
                                               AELLK+E++R KA LRAAHAIT+GLE+EKFQLLKEKDDMLQALE KD  +    AEL+  K
Subjt:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAK

Query:  ERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSNLEEDQV
        ERL+NG LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++D   LK+RYAEKWASGP+GT GP +LVD+YVRDLDSDYS+L+ED+V
Subjt:  ERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSNLEEDQV

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]; e-value 5.0e-128; identity 91.94%
Query:  GTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALASREKEEFSAALEAASSTMKD
        G   + A+ RIEPSS GVR+QV+RISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSAL +KAELDGRE LA+REKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALASREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKGEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVETLK EVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKGEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSNLEEDQV
        GFAKDFSDAGFKFLMKGIASDMPDLQID SGLKRRYAEKWASGP GTPGPQALVDQYVRDLDSDYS+ EEDQV
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSNLEEDQV

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]; e-value 5.8e-100; identity 73.26%
Query:  MGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALASREKEEFSAALEAASSTM
        MGGT DV  RFR+EPSS GV++QV+RISA  LDRCL+RASKFVS PGSVLQRTID AAEAFVASI SA+++KAELDGREALA++E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALASREKEEFSAALEAASSTM

Query:  KDELLKAHSEVETLKGEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD
        K ELLKA  EV  L+ EV+++AELLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ Q LE KD  +   TAEL+  KERL+NG LLEESFRQH D
Subjt:  KDELLKAHSEVETLKGEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSNLEED
        FDGFAKDFSDAGFKFLMKGIA+DMP LQID S LK++Y+EKWASGP+GTPGPQ+LV +YVR+LDSDYS++EE+
Subjt:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSNLEED

XP_022158203.1 uncharacterized protein LOC111024740 [Momordica charantia]; e-value 6.2e-94; identity 75.93%
Query:  EPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALASREKEEFSAALEAASSTMKDELLKAHSEVET
        EPSS GVR+QV+RISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSAL +KAELDGRE LA+REKEEFSAALEAA  TMKDELLKAHSEVET
Subjt:  EPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALASREKEEFSAALEAASSTMKDELLKAHSEVET

Query:  LKGEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF
        LK EVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALE KDKELEHATAELETAKERLSN                          
Subjt:  LKGEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF

Query:  KFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSNLEEDQVTPLRRALP
                      +ID SGLKRRYAEKWASGP GTPGPQALVDQYVRDLDSDYS+ +EDQV   +   P
Subjt:  KFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSNLEEDQVTPLRRALP

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]; e-value 1.6e-126; identity 64.69%
Query:  AMVCGFASGVKRKSKSRAHALEAAQSSKPPTPAV--------VGPASEDPVPVIELESSGGPSREKRPRGQTEAVDAQTEAADAPPLGEEAREEAPLKRR
        AMVCGF   VKRKSK RAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R ++EA+D         PL  E R E+PL+RR
Subjt:  AMVCGFASGVKRKSKSRAHALEAAQSSKPPTPAV--------VGPASEDPVPVIELESSGGPSREKRPRGQTEAVDAQTEAADAPPLGEEAREEAPLKRR

Query:  RKKKKAISPSEVGACRVLPAGWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSA
        RKKKK  S SE GA   LP   AD VDDP ARM GTS+V  RF +EPSS GV++QV+RISA  LDR LRRASKFVS PGSVLQRTID  AEAF+ASI  A
Subjt:  RKKKKAISPSEVGACRVLPAGWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSA

Query:  LVIKAELDGREALASREKEEFSAALEAASSTMKDELLKAHSEVETLKGEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKD
        +++KAELDGREALA++E+E   AALEAA +T+K ELLKA  EV+ L+ EV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ Q LE KD
Subjt:  LVIKAELDGREALASREKEEFSAALEAASSTMKDELLKAHSEVETLKGEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKD

Query:  KELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYS
          +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQID +GLK++Y+EKWASGP+GTP PQ+LVD+YVR+LDSDYS
Subjt:  KELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYS

Query:  NLEED
        ++EE+
Subjt:  NLEED

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1CLV1 uncharacterized protein LOC111012467; e-value 7.8e-111; identity 51.93%
Query:  RRRKKKKAISPSEVGACRVLPAGWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQ
        +RRKKKKAIS SEVGACRVLPAG+ADRVDDPAARMGGTSDVTARFRIEPSS GVR+QV+RISAASLDRCLRRASKFVS PGSVL R IDYAAEAFVASIQ
Subjt:  RRRKKKKAISPSEVGACRVLPAGWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQ

Query:  SALVIKAELDGREALASREKEEFSAALEAASSTMKDELLKAHSEVETLKGEVESQ---------------------------------------------
        SAL +KAELDGRE LA+REKEEFSAALEAASSTMKDELLKAHSEVETLK EVESQ                                             
Subjt:  SALVIKAELDGREALASREKEEFSAALEAASSTMKDELLKAHSEVETLKGEVESQ---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAK
                                               AELLK+E++R KA LRAAHAIT+GLE+EKFQLLKEKDDMLQALE KD  +    AEL+  K
Subjt:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAK

Query:  ERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSNLEEDQV
        ERL+NG LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++D   LK+RYAEKWASGP+GT GP +LVD+YVRDLDSDYS+L+ED+V
Subjt:  ERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSNLEEDQV

A0A6J1D971 uncharacterized protein LOC111018538; e-value 2.4e-128; identity 91.94%
Query:  GTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALASREKEEFSAALEAASSTMKD
        G   + A+ RIEPSS GVR+QV+RISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSAL +KAELDGRE LA+REKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALASREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKGEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVETLK EVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKGEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSNLEEDQV
        GFAKDFSDAGFKFLMKGIASDMPDLQID SGLKRRYAEKWASGP GTPGPQALVDQYVRDLDSDYS+ EEDQV
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSNLEEDQV

A0A6J1DF31 uncharacterized protein LOC111019909; e-value 2.8e-100; identity 73.26%
Query:  MGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALASREKEEFSAALEAASSTM
        MGGT DV  RFR+EPSS GV++QV+RISA  LDRCL+RASKFVS PGSVLQRTID AAEAFVASI SA+++KAELDGREALA++E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALASREKEEFSAALEAASSTM

Query:  KDELLKAHSEVETLKGEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD
        K ELLKA  EV  L+ EV+++AELLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ Q LE KD  +   TAEL+  KERL+NG LLEESFRQH D
Subjt:  KDELLKAHSEVETLKGEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSNLEED
        FDGFAKDFSDAGFKFLMKGIA+DMP LQID S LK++Y+EKWASGP+GTPGPQ+LV +YVR+LDSDYS++EE+
Subjt:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSNLEED

A0A6J1DVF6 uncharacterized protein LOC111024740; e-value 3.0e-94; identity 75.93%
Query:  EPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALASREKEEFSAALEAASSTMKDELLKAHSEVET
        EPSS GVR+QV+RISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSAL +KAELDGRE LA+REKEEFSAALEAA  TMKDELLKAHSEVET
Subjt:  EPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALASREKEEFSAALEAASSTMKDELLKAHSEVET

Query:  LKGEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF
        LK EVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALE KDKELEHATAELETAKERLSN                          
Subjt:  LKGEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF

Query:  KFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSNLEEDQVTPLRRALP
                      +ID SGLKRRYAEKWASGP GTPGPQALVDQYVRDLDSDYS+ +EDQV   +   P
Subjt:  KFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSNLEEDQVTPLRRALP

A0A6J1DZB3 uncharacterized protein LOC111025665; e-value 7.8e-127; identity 64.69%
Query:  AMVCGFASGVKRKSKSRAHALEAAQSSKPPTPAV--------VGPASEDPVPVIELESSGGPSREKRPRGQTEAVDAQTEAADAPPLGEEAREEAPLKRR
        AMVCGF   VKRKSK RAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R ++EA+D         PL  E R E+PL+RR
Subjt:  AMVCGFASGVKRKSKSRAHALEAAQSSKPPTPAV--------VGPASEDPVPVIELESSGGPSREKRPRGQTEAVDAQTEAADAPPLGEEAREEAPLKRR

Query:  RKKKKAISPSEVGACRVLPAGWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSA
        RKKKK  S SE GA   LP   AD VDDP ARM GTS+V  RF +EPSS GV++QV+RISA  LDR LRRASKFVS PGSVLQRTID  AEAF+ASI  A
Subjt:  RKKKKAISPSEVGACRVLPAGWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSA

Query:  LVIKAELDGREALASREKEEFSAALEAASSTMKDELLKAHSEVETLKGEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKD
        +++KAELDGREALA++E+E   AALEAA +T+K ELLKA  EV+ L+ EV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ Q LE KD
Subjt:  LVIKAELDGREALASREKEEFSAALEAASSTMKDELLKAHSEVETLKGEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKD

Query:  KELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYS
          +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQID +GLK++Y+EKWASGP+GTP PQ+LVD+YVR+LDSDYS
Subjt:  KELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYS

Query:  NLEED
        ++EE+
Subjt:  NLEED

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCCGTCACCGGCATTTGTAGACGAAGAATTTCCTGCATCTGTCGAAGAAGTATCAAGCGGAGGAACAATGGTTTCGTCAACCATGGCGGTGGGAAGTGGAACC
TCTCAGAGAAAATCTGATACCATATTGAAACTGAGAATTGCAGCTCGAACTCGGCCTCCGGACCGATCTGAGTACTTGAGCGGACCTGCACAAAGAGGTGAGCAC
TCCAACGATCAAGTCAGTATAGGTCGGATTCCCAGTTTAGTTCGAGGTCAGAAAATCATCGTACCTGATCGCGGAGTGAGGCGGTCCGAACTGGGCATGACTCAT
GAGTCATCTTGGAGCACCAATAGGGATCTTCTACGTGTCCAGGCCATGGTTTGCGGATTTGCAAGCGGCGTGAAGCGCAAGTCTAAAAGCCGAGCCCATGCTCTT
GAGGCTGCCCAGAGTTCGAAACCTCCCACCCCTGCCGTGGTAGGGCCTGCCTCGGAAGATCCAGTCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGG
GAGAAGCGCCCCAGGGGTCAGACCGAGGCGGTGGACGCCCAGACCGAGGCGGCGGACGCCCCGCCTTTGGGAGAGGAGGCGAGGGAGGAAGCCCCTCTAAAGCGA
AGAAGGAAGAAAAAGAAGGCGATCTCTCCCTCGGAGGTCGGAGCTTGCAGGGTCTTGCCTGCAGGTTGGGCTGATCGGGTGGACGATCCTGCGGCCAGGATGGGC
GGGACGTCCGATGTGACGGCGCGGTTCAGAATTGAGCCGTCAAGTCTCGGGGTGAGGGAGCAGGTGACCCGCATCTCAGCTGCGAGTTTGGACCGCTGCCTAAGG
AGGGCGTCCAAATTTGTGAGCGCTCCTGGGTCCGTTCTGCAGAGGACCATTGACTACGCCGCCGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTCGTTATAAAG
GCCGAGCTGGATGGGAGGGAAGCTTTGGCATCGAGGGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCT
CACTCCGAGGTGGAGACTTTGAAGGGCGAGGTGGAGTCTCAGGCCGAGCTACTGAAGAAGGAGGAGGACAGGCGCAAGGCTCAACTCCGAGCTGCCCACGCCATC
ACCAGGGGCTTGGAGAGGGAGAAGTTCCAGCTCCTGAAAGAGAAGGACGACATGCTCCAGGCGCTCGAAGCGAAGGATAAGGAGTTGGAGCATGCGACTGCCGAG
CTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTGCTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGC
TTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATTTCAGTGGTCTGAAAAGAAGGTATGCCGAGAAGTGGGCGTCTGGGCCTAGC
GGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGGGATCTGGACTCTGACTACTCCAATCTCGAAGAGGACCAGGTCACTCCACTCAGGAGGGCGCTC
CCCCAGCAGGCTCTTAGGCGACCATCCTTCACGAGGCTTTTCGCTGTTCTCCCTCCCTTTTCTTTTTGTTTGTAA
mRNA sequence
ATGCCGTCACCGGCATTTGTAGACGAAGAATTTCCTGCATCTGTCGAAGAAGTATCAAGCGGAGGAACAATGGTTTCGTCAACCATGGCGGTGGGAAGTGGAACC
TCTCAGAGAAAATCTGATACCATATTGAAACTGAGAATTGCAGCTCGAACTCGGCCTCCGGACCGATCTGAGTACTTGAGCGGACCTGCACAAAGAGGTGAGCAC
TCCAACGATCAAGTCAGTATAGGTCGGATTCCCAGTTTAGTTCGAGGTCAGAAAATCATCGTACCTGATCGCGGAGTGAGGCGGTCCGAACTGGGCATGACTCAT
GAGTCATCTTGGAGCACCAATAGGGATCTTCTACGTGTCCAGGCCATGGTTTGCGGATTTGCAAGCGGCGTGAAGCGCAAGTCTAAAAGCCGAGCCCATGCTCTT
GAGGCTGCCCAGAGTTCGAAACCTCCCACCCCTGCCGTGGTAGGGCCTGCCTCGGAAGATCCAGTCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGG
GAGAAGCGCCCCAGGGGTCAGACCGAGGCGGTGGACGCCCAGACCGAGGCGGCGGACGCCCCGCCTTTGGGAGAGGAGGCGAGGGAGGAAGCCCCTCTAAAGCGA
AGAAGGAAGAAAAAGAAGGCGATCTCTCCCTCGGAGGTCGGAGCTTGCAGGGTCTTGCCTGCAGGTTGGGCTGATCGGGTGGACGATCCTGCGGCCAGGATGGGC
GGGACGTCCGATGTGACGGCGCGGTTCAGAATTGAGCCGTCAAGTCTCGGGGTGAGGGAGCAGGTGACCCGCATCTCAGCTGCGAGTTTGGACCGCTGCCTAAGG
AGGGCGTCCAAATTTGTGAGCGCTCCTGGGTCCGTTCTGCAGAGGACCATTGACTACGCCGCCGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTCGTTATAAAG
GCCGAGCTGGATGGGAGGGAAGCTTTGGCATCGAGGGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCT
CACTCCGAGGTGGAGACTTTGAAGGGCGAGGTGGAGTCTCAGGCCGAGCTACTGAAGAAGGAGGAGGACAGGCGCAAGGCTCAACTCCGAGCTGCCCACGCCATC
ACCAGGGGCTTGGAGAGGGAGAAGTTCCAGCTCCTGAAAGAGAAGGACGACATGCTCCAGGCGCTCGAAGCGAAGGATAAGGAGTTGGAGCATGCGACTGCCGAG
CTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTGCTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGC
TTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATTTCAGTGGTCTGAAAAGAAGGTATGCCGAGAAGTGGGCGTCTGGGCCTAGC
GGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGGGATCTGGACTCTGACTACTCCAATCTCGAAGAGGACCAGGTCACTCCACTCAGGAGGGCGCTC
CCCCAGCAGGCTCTTAGGCGACCATCCTTCACGAGGCTTTTCGCTGTTCTCCCTCCCTTTTCTTTTTGTTTGTAA
Protein sequence
MPSPAFVDEEFPASVEEVSSGGTMVSSTMAVGSGTSQRKSDTILKLRIAARTRPPDRSEYLSGPAQRGEHSNDQVSIGRIPSLVRGQKIIVPDRGVRRSELGMTH
ESSWSTNRDLLRVQAMVCGFASGVKRKSKSRAHALEAAQSSKPPTPAVVGPASEDPVPVIELESSGGPSREKRPRGQTEAVDAQTEAADAPPLGEEAREEAPLKR
RRKKKKAISPSEVGACRVLPAGWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIK
AELDGREALASREKEEFSAALEAASSTMKDELLKAHSEVETLKGEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAE
LETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSNLEEDQVTPLRRAL
PQQALRRPSFTRLFAVLPPFSFCL