; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g17440 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g17440
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionBAHD acyltransferase At3g29680-like
Genome locationchr1:11841624..11853428
RNA-Seq ExpressionMoc01g17440
SyntenyMoc01g17440
Gene Ontology termsGO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]2.5e-9949.9Show/hide
Query:  PAPVIELESSGGPSREKR------PGVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSHISAASLDRCLRIASKFVSDPGSVLQRT
        P   I LE+   P R K+        VGA   LPA FADRVDDP ARMGGTSDVTARFR+EPSSSGVRDQVS ISAASLDRCLR ASKFVS PGSVL R 
Subjt:  PAPVIELESSGGPSREKR------PGVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSHISAASLDRCLRIASKFVSDPGSVLQRT

Query:  IDYASEAFVASIQSALAVKAKLDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEV-----------------------------------
        IDYA+EAFVASIQSALAVKA+LDGRE LAAREKEEFSAALEAASSTMKDELLKAHSEVE LKAEV                                   
Subjt:  IDYASEAFVASIQSALAVKAKLDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEV-----------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------EAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALETKEE
                                                         EAKAELLK+E++R KA LRAAHAITKGLEKEKFQLLKEKDDMLQALE K+ 
Subjt:  -------------------------------------------------EAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALETKEE

Query:  ELKHATVELEMVKERLLSNGALLEESFKQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYS
         +     EL+  KER L+NGALLE +F+QHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++DLG LKKRYAE+WASGP+GT GP +LVDKYVRDLDSDYS
Subjt:  ELKHATVELEMVKERLLSNGALLEESFKQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYS

Query:  DLEED
        DL+ED
Subjt:  DLEED

XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]1.7e-9573.96Show/hide
Query:  RFRVEPSSSGVRDQVSHISAASLDRCLRIASKFVSDPGSVLQRTIDYASEAFVASIQSALAVKAKLDGREALAAREKEEFSAALEAASSTMKDELLKAHS
        RFR+E SSSGV+DQVS ISA  LDRCLR AS+FVSDPGSVLQRTID A+EAF+ASI SA+ VKA+LDGREAL A+E+E  S  LEAA +T+K ELLKA  
Subjt:  RFRVEPSSSGVRDQVSHISAASLDRCLRIASKFVSDPGSVLQRTIDYASEAFVASIQSALAVKAKLDGREALAAREKEEFSAALEAASSTMKDELLKAHS

Query:  EVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALETKEEELKHATVELEMVKERLLSNGALLEESFKQHPDFDGFAKDF
        EV+IL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE K+  +   T EL+ +KER L++GALLEESF+QHP+FDGFAKDF
Subjt:  EVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALETKEEELKHATVELEMVKERLLSNGALLEESFKQHPDFDGFAKDF

Query:  SDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED
        SDAGFKFLMKGIA+DMP LQIDL  LKKRY+E WASGP+GTPGPQ+LVDKYVR+LDSDYSD+EE+
Subjt:  SDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]3.5e-11786.03Show/hide
Query:  GTSDVTARFRVEPSSSGVRDQVSHISAASLDRCLRIASKFVSDPGSVLQRTIDYASEAFVASIQSALAVKAKLDGREALAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGVRDQVS ISAASLDRCLR ASKFVS PGSVLQRTIDYA+EAFVASIQSALAVKA+LDGRE LAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGVRDQVSHISAASLDRCLRIASKFVSDPGSVLQRTIDYASEAFVASIQSALAVKAKLDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALETKEEELKHATVELEMVKERLLSNGALLEESFKQHPDF
        ELLKAHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALE K++EL+HAT ELE  KER LSNG LLEE+F+QHPDF
Subjt:  ELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALETKEEELKHATVELEMVKERLLSNGALLEESFKQHPDF

Query:  DGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED
        DGFAKDFSDAGFKFLMKGIASDMPDLQIDL GLK+RYAE+WASGP GTPGPQALVD+YVRDLDSDYSD EED
Subjt:  DGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]3.9e-10075.55Show/hide
Query:  MGGTSDVTARFRVEPSSSGVRDQVSHISAASLDRCLRIASKFVSDPGSVLQRTIDYASEAFVASIQSALAVKAKLDGREALAAREKEEFSAALEAASSTM
        MGGT DV  RFR+EPSSSGV+DQVS ISA  LDRCL+ ASKFVSDPGSVLQRTID A+EAFVASI SA+ VKA+LDGREALAA+E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRVEPSSSGVRDQVSHISAASLDRCLRIASKFVSDPGSVLQRTIDYASEAFVASIQSALAVKAKLDGREALAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALETKEEELKHATVELEMVKERLLSNGALLEESFKQHP
        K ELLKA  EV IL+AEV+AKAELLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE K+  +   T EL+ +KER L+NG+LLEESF+QH 
Subjt:  KDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALETKEEELKHATVELEMVKERLLSNGALLEESFKQHP

Query:  DFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED
        DFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL  LKK+Y+E+WASGP+GTPGPQ+LV KYVR+LDSDYSD+EE+
Subjt:  DFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]4.9e-11964.32Show/hide
Query:  AMVCGFASNVKRKSKGRAHALEAAQSSEPATPAV--------AGPASEDPAPVIELESSGGPSREKR---------------------------------
        AMVCGF  +VKRKSKGRAHAL+    +EP TP V        +GP+S  P PVIEL+ SGG S EKR                                 
Subjt:  AMVCGFASNVKRKSKGRAHALEAAQSSEPATPAV--------AGPASEDPAPVIELESSGGPSREKR---------------------------------

Query:  -PGVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSHISAASLDRCLRIASKFVSDPGSVLQRTIDYASEAFVASIQSALAVKAKLD
            GARG LP S AD VDDPEARM GTS+V  RF +EPSSSGV+DQVS ISA  LDR LR ASKFVSDPGSVLQRTID  +EAF+ASI  A+ VKA+LD
Subjt:  -PGVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSHISAASLDRCLRIASKFVSDPGSVLQRTIDYASEAFVASIQSALAVKAKLD

Query:  GREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALETKEEELKHATV
        GREALAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE K+  +   T 
Subjt:  GREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALETKEEELKHATV

Query:  ELEMVKERLLSNGALLEESFKQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED
        EL+ +KER L+NG LLEESF+QHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL GLKK+Y+E+WASGP+GTP PQ+LVDKYVR+LDSDYSD+EE+
Subjt:  ELEMVKERLLSNGALLEESFKQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124671.2e-9949.9Show/hide
Query:  PAPVIELESSGGPSREKR------PGVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSHISAASLDRCLRIASKFVSDPGSVLQRT
        P   I LE+   P R K+        VGA   LPA FADRVDDP ARMGGTSDVTARFR+EPSSSGVRDQVS ISAASLDRCLR ASKFVS PGSVL R 
Subjt:  PAPVIELESSGGPSREKR------PGVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSHISAASLDRCLRIASKFVSDPGSVLQRT

Query:  IDYASEAFVASIQSALAVKAKLDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEV-----------------------------------
        IDYA+EAFVASIQSALAVKA+LDGRE LAAREKEEFSAALEAASSTMKDELLKAHSEVE LKAEV                                   
Subjt:  IDYASEAFVASIQSALAVKAKLDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEV-----------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------EAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALETKEE
                                                         EAKAELLK+E++R KA LRAAHAITKGLEKEKFQLLKEKDDMLQALE K+ 
Subjt:  -------------------------------------------------EAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALETKEE

Query:  ELKHATVELEMVKERLLSNGALLEESFKQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYS
         +     EL+  KER L+NGALLE +F+QHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++DLG LKKRYAE+WASGP+GT GP +LVDKYVRDLDSDYS
Subjt:  ELKHATVELEMVKERLLSNGALLEESFKQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYS

Query:  DLEED
        DL+ED
Subjt:  DLEED

A0A6J1D1N9 uncharacterized protein LOC1110161938.2e-9673.96Show/hide
Query:  RFRVEPSSSGVRDQVSHISAASLDRCLRIASKFVSDPGSVLQRTIDYASEAFVASIQSALAVKAKLDGREALAAREKEEFSAALEAASSTMKDELLKAHS
        RFR+E SSSGV+DQVS ISA  LDRCLR AS+FVSDPGSVLQRTID A+EAF+ASI SA+ VKA+LDGREAL A+E+E  S  LEAA +T+K ELLKA  
Subjt:  RFRVEPSSSGVRDQVSHISAASLDRCLRIASKFVSDPGSVLQRTIDYASEAFVASIQSALAVKAKLDGREALAAREKEEFSAALEAASSTMKDELLKAHS

Query:  EVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALETKEEELKHATVELEMVKERLLSNGALLEESFKQHPDFDGFAKDF
        EV+IL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE K+  +   T EL+ +KER L++GALLEESF+QHP+FDGFAKDF
Subjt:  EVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALETKEEELKHATVELEMVKERLLSNGALLEESFKQHPDFDGFAKDF

Query:  SDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED
        SDAGFKFLMKGIA+DMP LQIDL  LKKRY+E WASGP+GTPGPQ+LVDKYVR+LDSDYSD+EE+
Subjt:  SDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED

A0A6J1D971 uncharacterized protein LOC1110185381.7e-11786.03Show/hide
Query:  GTSDVTARFRVEPSSSGVRDQVSHISAASLDRCLRIASKFVSDPGSVLQRTIDYASEAFVASIQSALAVKAKLDGREALAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGVRDQVS ISAASLDRCLR ASKFVS PGSVLQRTIDYA+EAFVASIQSALAVKA+LDGRE LAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGVRDQVSHISAASLDRCLRIASKFVSDPGSVLQRTIDYASEAFVASIQSALAVKAKLDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALETKEEELKHATVELEMVKERLLSNGALLEESFKQHPDF
        ELLKAHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALE K++EL+HAT ELE  KER LSNG LLEE+F+QHPDF
Subjt:  ELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALETKEEELKHATVELEMVKERLLSNGALLEESFKQHPDF

Query:  DGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED
        DGFAKDFSDAGFKFLMKGIASDMPDLQIDL GLK+RYAE+WASGP GTPGPQALVD+YVRDLDSDYSD EED
Subjt:  DGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED

A0A6J1DF31 uncharacterized protein LOC1110199091.9e-10075.55Show/hide
Query:  MGGTSDVTARFRVEPSSSGVRDQVSHISAASLDRCLRIASKFVSDPGSVLQRTIDYASEAFVASIQSALAVKAKLDGREALAAREKEEFSAALEAASSTM
        MGGT DV  RFR+EPSSSGV+DQVS ISA  LDRCL+ ASKFVSDPGSVLQRTID A+EAFVASI SA+ VKA+LDGREALAA+E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRVEPSSSGVRDQVSHISAASLDRCLRIASKFVSDPGSVLQRTIDYASEAFVASIQSALAVKAKLDGREALAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALETKEEELKHATVELEMVKERLLSNGALLEESFKQHP
        K ELLKA  EV IL+AEV+AKAELLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE K+  +   T EL+ +KER L+NG+LLEESF+QH 
Subjt:  KDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALETKEEELKHATVELEMVKERLLSNGALLEESFKQHP

Query:  DFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED
        DFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL  LKK+Y+E+WASGP+GTPGPQ+LV KYVR+LDSDYSD+EE+
Subjt:  DFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED

A0A6J1DZB3 uncharacterized protein LOC1110256652.4e-11964.32Show/hide
Query:  AMVCGFASNVKRKSKGRAHALEAAQSSEPATPAV--------AGPASEDPAPVIELESSGGPSREKR---------------------------------
        AMVCGF  +VKRKSKGRAHAL+    +EP TP V        +GP+S  P PVIEL+ SGG S EKR                                 
Subjt:  AMVCGFASNVKRKSKGRAHALEAAQSSEPATPAV--------AGPASEDPAPVIELESSGGPSREKR---------------------------------

Query:  -PGVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSHISAASLDRCLRIASKFVSDPGSVLQRTIDYASEAFVASIQSALAVKAKLD
            GARG LP S AD VDDPEARM GTS+V  RF +EPSSSGV+DQVS ISA  LDR LR ASKFVSDPGSVLQRTID  +EAF+ASI  A+ VKA+LD
Subjt:  -PGVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSHISAASLDRCLRIASKFVSDPGSVLQRTIDYASEAFVASIQSALAVKAKLD

Query:  GREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALETKEEELKHATV
        GREALAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE K+  +   T 
Subjt:  GREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALETKEEELKHATV

Query:  ELEMVKERLLSNGALLEESFKQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED
        EL+ +KER L+NG LLEESF+QHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL GLKK+Y+E+WASGP+GTP PQ+LVDKYVR+LDSDYSD+EE+
Subjt:  ELEMVKERLLSNGALLEESFKQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGAAGATGCGAACATATATAATCTTTCAACAGTTCTAAGCAAGGACAACATCTCTATCGATAGAAAGATCGTAGAGGCTCAAGCTAATAGAAACATGAAGAGACC
TGTTGTAACATCTTCAACAGCTTCCCGACATGGTAGCGGTAGCCGGAAGCTGAGAGGTGGAAAAGGCAAGGAAGATGAGAAGCATCATGATCAAAAGGGGGACGAAGAGC
ACCGCCGTGGGCGGCGGTGGCAGCGGCGGGAGAAGCAAAGGCAGCACCAGGAGGGACACTGTCCAAAGAAACAAGGCCATTTTCGATTGAGAAGCTCTCTGGCTGTGTCT
AATAAGTCGTCTCGTGGTGGCAGCGCCGTCGTCGGAGTGGTTTTGGAGGGCGGAGGGAGGGTATCAGTGAATGTGGTGGCGGTGGCGCCGGTGAGGGAGGGCGAAGGAGG
GGAATTTGTGAGGCTAAGAGAGAAAGATGCAAGTTTTGACATAGGATTCTTGACGAGGTGTAGCATTAGAATCGAGTCGAGGCAGTGCCTCATATTCAATCGGGCTAAAG
ATATGCAGACAGAATCTTTAACTAGTGTGGTACTCTTGTACACTTTGAAGTTTAAGTTAATATTTAATCTAAACTGGTGTTGCGTCGAAGCGTCGAACTTAGTAGAAATT
GCGTTAGATTGCATAACTTTGGGAATGATGGACTACACCTTTTATAGAGGATCATTAGGAAGTATTGAACTGCCATTTCGAAACAATGTACGTACGAAGCAGTTAGCAAA
TTTGTCACAGTGTGGCTTCCATGTTGGTGCCAAGAGTTGGGCTTTCACGAGGTTATACAATGCCATGGTTTGCGGATTTGCAAGCAACGTGAAACGCAAGTCCAAGGGCC
GAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGGAACCTGCCACTCCTGCTGTGGCAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGT
CCTTCGAGGGAGAAGCGCCCAGGGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGACGTGAC
AGCACGGTTCAGAGTCGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCTCACATCTCGGCTGCAAGTTTGGACCGCTGCCTCAGAATAGCGTCCAAATTTGTAAGTG
ACCCGGGGTCTGTCCTGCAGAGGACCATCGACTACGCCTCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCAAGCTGGATGGGAGGGAAGCTCTG
GCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCCTCTTCCACCATGAAGGATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCTGAGGT
GGAGGCCAAGGCCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCA
AGGAGAAGGACGACATGCTCCAGGCGCTTGAAACGAAGGAGGAGGAGCTGAAGCATGCGACTGTTGAGCTGGAGATGGTGAAGGAGCGTCTTCTTAGCAATGGAGCCCTA
TTGGAGGAATCGTTCAAGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCT
TCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCTCAAGCGTTGGTGGATAAGTACGTCAGAGATCTGG
ACTCTGACTACTCCGACCTCGAAGAGGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTGAAGATGCGAACATATATAATCTTTCAACAGTTCTAAGCAAGGACAACATCTCTATCGATAGAAAGATCGTAGAGGCTCAAGCTAATAGAAACATGAAGAGACC
TGTTGTAACATCTTCAACAGCTTCCCGACATGGTAGCGGTAGCCGGAAGCTGAGAGGTGGAAAAGGCAAGGAAGATGAGAAGCATCATGATCAAAAGGGGGACGAAGAGC
ACCGCCGTGGGCGGCGGTGGCAGCGGCGGGAGAAGCAAAGGCAGCACCAGGAGGGACACTGTCCAAAGAAACAAGGCCATTTTCGATTGAGAAGCTCTCTGGCTGTGTCT
AATAAGTCGTCTCGTGGTGGCAGCGCCGTCGTCGGAGTGGTTTTGGAGGGCGGAGGGAGGGTATCAGTGAATGTGGTGGCGGTGGCGCCGGTGAGGGAGGGCGAAGGAGG
GGAATTTGTGAGGCTAAGAGAGAAAGATGCAAGTTTTGACATAGGATTCTTGACGAGGTGTAGCATTAGAATCGAGTCGAGGCAGTGCCTCATATTCAATCGGGCTAAAG
ATATGCAGACAGAATCTTTAACTAGTGTGGTACTCTTGTACACTTTGAAGTTTAAGTTAATATTTAATCTAAACTGGTGTTGCGTCGAAGCGTCGAACTTAGTAGAAATT
GCGTTAGATTGCATAACTTTGGGAATGATGGACTACACCTTTTATAGAGGATCATTAGGAAGTATTGAACTGCCATTTCGAAACAATGTACGTACGAAGCAGTTAGCAAA
TTTGTCACAGTGTGGCTTCCATGTTGGTGCCAAGAGTTGGGCTTTCACGAGGTTATACAATGCCATGGTTTGCGGATTTGCAAGCAACGTGAAACGCAAGTCCAAGGGCC
GAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGGAACCTGCCACTCCTGCTGTGGCAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGT
CCTTCGAGGGAGAAGCGCCCAGGGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGACGTGAC
AGCACGGTTCAGAGTCGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCTCACATCTCGGCTGCAAGTTTGGACCGCTGCCTCAGAATAGCGTCCAAATTTGTAAGTG
ACCCGGGGTCTGTCCTGCAGAGGACCATCGACTACGCCTCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCAAGCTGGATGGGAGGGAAGCTCTG
GCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCCTCTTCCACCATGAAGGATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCTGAGGT
GGAGGCCAAGGCCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCA
AGGAGAAGGACGACATGCTCCAGGCGCTTGAAACGAAGGAGGAGGAGCTGAAGCATGCGACTGTTGAGCTGGAGATGGTGAAGGAGCGTCTTCTTAGCAATGGAGCCCTA
TTGGAGGAATCGTTCAAGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCT
TCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCTCAAGCGTTGGTGGATAAGTACGTCAGAGATCTGG
ACTCTGACTACTCCGACCTCGAAGAGGATTAG
Protein sequenceShow/hide protein sequence
MVEDANIYNLSTVLSKDNISIDRKIVEAQANRNMKRPVVTSSTASRHGSGSRKLRGGKGKEDEKHHDQKGDEEHRRGRRWQRREKQRQHQEGHCPKKQGHFRLRSSLAVS
NKSSRGGSAVVGVVLEGGGRVSVNVVAVAPVREGEGGEFVRLREKDASFDIGFLTRCSIRIESRQCLIFNRAKDMQTESLTSVVLLYTLKFKLIFNLNWCCVEASNLVEI
ALDCITLGMMDYTFYRGSLGSIELPFRNNVRTKQLANLSQCGFHVGAKSWAFTRLYNAMVCGFASNVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGG
PSREKRPGVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSHISAASLDRCLRIASKFVSDPGSVLQRTIDYASEAFVASIQSALAVKAKLDGREAL
AAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALETKEEELKHATVELEMVKERLLSNGAL
LEESFKQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED