CuGenDBv2

Moc09g17280 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g17280
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr9:13723075..13728347
RNA-Seq Expression: Moc09g17280
Synteny: Moc09g17280
Gene Ontology terms: NA
InterPro domains: NA
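
The genome location string above can be split into chromosome, start, and end with a short script. This is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr9:13723075..13728347")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 5273 bp on chr9, which is longer than the CDS listed further down the page.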


Homology
GenBank top hits: e value, % identity, alignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]; e value: 1.1e-54; % identity: 42.42
Query:  EHVKVVSFTDDIAMMYFTTGLNDRNLKIEFRSCPPASLNEMLTRARQYIDGLELWKANGAR------RSSCGKDRDQKSPLSKK-----------RRSDD
        E +KV   +DD AM YF TGL D  L ++     PA+  E+L +A++ IDG EL +    R      R   GKD +   P SK            RR+++
Subjt:  EHVKVVSFTDDIAMMYFTTGLNDRNLKIEFRSCPPASLNEMLTRARQYIDGLELWKANGAR------RSSCGKDRDQKSPLSKK-----------RRSDD

Query:  -----RSSSQFTSLNASITEIYVVVEDTDLE-------------------------AEH-------------IEDLIRKGYLKKYVGSKARVEPEGSARE
             R   +FT     I+EI   +E++ +E                          EH             IE+LI+ GY KK+VG K R        E
Subjt:  -----RSSSQFTSLNASITEIYVVVEDTDLE-------------------------AEH-------------IEDLIRKGYLKKYVGSKARVEPEGSARE

Query:  EKRERSQPLRRKEDRPGIINTIHGGLSGGQSGQKRKALAWEAAHEVCTSYPKESVMPILLDEKDSEGVHMSHNDALVIAPLIDHVKVKRVLVYGGASANI
         KR R+ P  R+ DRP +INTI GG SGGQSG+KRK LA  A  EVC    +    PI  D  D E VH+ HNDALVIAPLIDHV V RVLV GG SANI
Subjt:  EKRERSQPLRRKEDRPGIINTIHGGLSGGQSGQKRKALAWEAAHEVCTSYPKESVMPILLDEKDSEGVHMSHNDALVIAPLIDHVKVKRVLVYGGASANI

Query:  LSFSTYMALRWERRHLKHSPTPLVGFAGESVGAERCISLHVTIGEGEHQVTRVVEFVVIDRSL
        LS  TY+AL W R  LK SPTPLVGF+GESV  E  I L VT+G+ + QVT++ EFV  DR++
Subjt:  LSFSTYMALRWERRHLKHSPTPLVGFAGESVGAERCISLHVTIGEGEHQVTRVVEFVVIDRSL

XP_022149029.1 uncharacterized protein LOC111017548 [Momordica charantia]; e value: 6.5e-55; % identity: 56.12
Query:  NEHVKVVSFTDDIAMMYFTTGLNDRNLKIEFRSCPPASLNEMLTRARQYIDGLELWKANGARRSSCGKDRDQKSPLSKKRRSDDRSSS------------
        +EHVKVVS TDDIAMMYFTTGLNDRNL IEF S PPASLN+ML RARQYIDGLELWKA GARRSS GKDRDQ+S   KKR SDD+SSS            
Subjt:  NEHVKVVSFTDDIAMMYFTTGLNDRNLKIEFRSCPPASLNEMLTRARQYIDGLELWKANGARRSSCGKDRDQKSPLSKKRRSDDRSSS------------

Query:  --------------QFTSLNASITEIYVVVEDTDLEA--------------------------------------EHIEDLIRKGYLKKYVGSKARVEPE
                      +FT LNAS+ EIY  VE+TD++A                                      E ++DLIR+GYLKKYVGS+ R +PE
Subjt:  --------------QFTSLNASITEIYVVVEDTDLEA--------------------------------------EHIEDLIRKGYLKKYVGSKARVEPE

Query:  GSAREEKRERSQPLRRKEDRPGIINTIHGGLSGGQSG
        GS REEKRERSQP  RKEDRP +INTIHGG SG +SG
Subjt:  GSAREEKRERSQPLRRKEDRPGIINTIHGGLSGGQSG

XP_022150385.1 uncharacterized protein LOC111018561 [Momordica charantia]; e value: 1.3e-55; % identity: 48.07
Query:  EHVKVVSFTDDIAMMYFTTGLNDRNLKIEFRSCPPASLNEMLTRARQYIDGLELWKANGARRSSCGKDRDQKSPLSKKRRSDDRSSSQFTSLNASITEIY
        E +KV   +DD AM YF TGL D  L ++     PA+  E+L +A++ IDG EL +    R     K  DQK    +KR+ D +   + +S + S  + +
Subjt:  EHVKVVSFTDDIAMMYFTTGLNDRNLKIEFRSCPPASLNEMLTRARQYIDGLELWKANGARRSSCGKDRDQKSPLSKKRRSDDRSSSQFTSLNASITEIY

Query:  VVVEDTDLEAEHIEDLIRKGYLKKYVGSKARVEPEGSAREEKRERSQPLRRKEDRPGIINTIHGGLSGGQSGQKRKALAWEAAHEVCTSYPKESVMPILL
         +          IEDLI+ GY KK+VG K R        E KR R+ P  R++DRP +INTI GG SGGQSG KRK L  EA  EV     + S   I  
Subjt:  VVVEDTDLEAEHIEDLIRKGYLKKYVGSKARVEPEGSAREEKRERSQPLRRKEDRPGIINTIHGGLSGGQSGQKRKALAWEAAHEVCTSYPKESVMPILL

Query:  DEKDSEGVHMSHNDALVIAPLIDHVKVKRVLVYGGASANILSFSTYMALRWERRHLKHSPTPLVGFAGESVGAERCISLHVTIGE
        D+ D EGVH+ HNDALVI PLI+HV V+RVLV GGASANILS  TY+AL W R  LK SPTPLVGF GESV  E CI L VT+G+
Subjt:  DEKDSEGVHMSHNDALVIAPLIDHVKVKRVLVYGGASANILSFSTYMALRWERRHLKHSPTPLVGFAGESVGAERCISLHVTIGE

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]; e value: 2.9e-55; % identity: 43.71
Query:  MMYFTTGLNDRNLKIEFRSCPPASLNEMLTRARQYIDGLELWKAN-GARRSSCGKDRDQKSPLSKK-----------RRSDD-----RSSSQFTSLNASI
        M YF TGL D  L ++     PA+  E+L +A++ IDG EL +   G  RS  GKD +   P SK            RR+++     R   +FT     I
Subjt:  MMYFTTGLNDRNLKIEFRSCPPASLNEMLTRARQYIDGLELWKAN-GARRSSCGKDRDQKSPLSKK-----------RRSDD-----RSSSQFTSLNASI

Query:  TEIYVVVEDTDLE-------------------------AEH-------------IEDLIRKGYLKKYVGSKARVEPEGSAREEKRERSQPLRRKEDRPGI
        +EI   +E++ +E                          EH             IEDLI+ GY KK+VG K R        E KR R+ P  R+ DRP +
Subjt:  TEIYVVVEDTDLE-------------------------AEH-------------IEDLIRKGYLKKYVGSKARVEPEGSAREEKRERSQPLRRKEDRPGI

Query:  INTIHGGLSGGQSGQKRKALAWEAAHEVCTSYPKESVMPILLDEKDSEGVHMSHNDALVIAPLIDHVKVKRVLVYGGASANILSFSTYMALRWERRHLKH
        INTI GG SGGQSG KRK LA  A  EVC    +    PI  D  D   VH+ HNDALVIAPLIDHV V+RVLV GGASANILS  TY+AL W R  LK 
Subjt:  INTIHGGLSGGQSGQKRKALAWEAAHEVCTSYPKESVMPILLDEKDSEGVHMSHNDALVIAPLIDHVKVKRVLVYGGASANILSFSTYMALRWERRHLKH

Query:  SPTPLVGFAGESVGAERCISLHVTIGEGEHQVTRVVEFVVIDRSLTYSAI
        SPTPLVGF+GESV  E CI L VT+G+ + +VT++ EFVV+D    Y+AI
Subjt:  SPTPLVGFAGESVGAERCISLHVTIGEGEHQVTRVVEFVVIDRSLTYSAI

XP_022158844.1 uncharacterized protein LOC111025310 [Momordica charantia]; e value: 1.4e-118; % identity: 66.49
Query:  NEHVKVVSFTDDIAMMYFTTGLNDRNLKIEFRSCPPASLNEMLTRARQYIDGLELWKANGARRSSCGKDRDQKSPLSKKRRSDDRSSS------------
        +EHVKVVS TDDIAMMYFTTGLNDRNL IEFRS PPASLNEM  RARQYIDGLELWKANGARRSS G+DRD KSP SKKR  DDRSSS            
Subjt:  NEHVKVVSFTDDIAMMYFTTGLNDRNLKIEFRSCPPASLNEMLTRARQYIDGLELWKANGARRSSCGKDRDQKSPLSKKRRSDDRSSS------------

Query:  --------------QFTSLNASITEIYVVVEDTDLEA--------------------------------------EHIEDLIRKGYLKKYVGSKARVEPE
                      +FT LNASI EIY VVEDTD+E                                       E +EDLIR GYLKKYVGS+ + E E
Subjt:  --------------QFTSLNASITEIYVVVEDTDLEA--------------------------------------EHIEDLIRKGYLKKYVGSKARVEPE

Query:  GSAREEKRERSQPLRRKEDRPGIINTIHGGLSGGQSGQKRKALAWEAAHEVCTSYPKESVMPILLDEKDSEGVHMSHNDALVIAPLIDHVKVKRVLVYGG
        GSAREEKRERSQP R KEDRP +INTIHGG SG +SGQKRKALA E AHEVCTSYPK  VMPIL DE+D E VHM HNDALVIAPLIDHVKV+RV V GG
Subjt:  GSAREEKRERSQPLRRKEDRPGIINTIHGGLSGGQSGQKRKALAWEAAHEVCTSYPKESVMPILLDEKDSEGVHMSHNDALVIAPLIDHVKVKRVLVYGG

Query:  ASANILSFSTYMALRWERRHLKHSPTPLVGFAGESVGAERCISLHVTIGEGEHQVTRVVEFVVIDRSLTY
        ASANI SFSTY AL WERRHLKH  T LVGFA ESV  E CISL VTI EGEHQVTRV EFVVIDRS  Y
Subjt:  ASANILSFSTYMALRWERRHLKHSPTPLVGFAGESVGAERCISLHVTIGEGEHQVTRVVEFVVIDRSLTY

TrEMBL top hits: e value, % identity, alignment
A0A6J1C7X5 uncharacterized protein LOC111008813; e value: 5.4e-55; % identity: 42.42
Query:  EHVKVVSFTDDIAMMYFTTGLNDRNLKIEFRSCPPASLNEMLTRARQYIDGLELWKANGAR------RSSCGKDRDQKSPLSKK-----------RRSDD
        E +KV   +DD AM YF TGL D  L ++     PA+  E+L +A++ IDG EL +    R      R   GKD +   P SK            RR+++
Subjt:  EHVKVVSFTDDIAMMYFTTGLNDRNLKIEFRSCPPASLNEMLTRARQYIDGLELWKANGAR------RSSCGKDRDQKSPLSKK-----------RRSDD

Query:  -----RSSSQFTSLNASITEIYVVVEDTDLE-------------------------AEH-------------IEDLIRKGYLKKYVGSKARVEPEGSARE
             R   +FT     I+EI   +E++ +E                          EH             IE+LI+ GY KK+VG K R        E
Subjt:  -----RSSSQFTSLNASITEIYVVVEDTDLE-------------------------AEH-------------IEDLIRKGYLKKYVGSKARVEPEGSARE

Query:  EKRERSQPLRRKEDRPGIINTIHGGLSGGQSGQKRKALAWEAAHEVCTSYPKESVMPILLDEKDSEGVHMSHNDALVIAPLIDHVKVKRVLVYGGASANI
         KR R+ P  R+ DRP +INTI GG SGGQSG+KRK LA  A  EVC    +    PI  D  D E VH+ HNDALVIAPLIDHV V RVLV GG SANI
Subjt:  EKRERSQPLRRKEDRPGIINTIHGGLSGGQSGQKRKALAWEAAHEVCTSYPKESVMPILLDEKDSEGVHMSHNDALVIAPLIDHVKVKRVLVYGGASANI

Query:  LSFSTYMALRWERRHLKHSPTPLVGFAGESVGAERCISLHVTIGEGEHQVTRVVEFVVIDRSL
        LS  TY+AL W R  LK SPTPLVGF+GESV  E  I L VT+G+ + QVT++ EFV  DR++
Subjt:  LSFSTYMALRWERRHLKHSPTPLVGFAGESVGAERCISLHVTIGEGEHQVTRVVEFVVIDRSL

A0A6J1D5T3 uncharacterized protein LOC111017548; e value: 3.2e-55; % identity: 56.12
Query:  NEHVKVVSFTDDIAMMYFTTGLNDRNLKIEFRSCPPASLNEMLTRARQYIDGLELWKANGARRSSCGKDRDQKSPLSKKRRSDDRSSS------------
        +EHVKVVS TDDIAMMYFTTGLNDRNL IEF S PPASLN+ML RARQYIDGLELWKA GARRSS GKDRDQ+S   KKR SDD+SSS            
Subjt:  NEHVKVVSFTDDIAMMYFTTGLNDRNLKIEFRSCPPASLNEMLTRARQYIDGLELWKANGARRSSCGKDRDQKSPLSKKRRSDDRSSS------------

Query:  --------------QFTSLNASITEIYVVVEDTDLEA--------------------------------------EHIEDLIRKGYLKKYVGSKARVEPE
                      +FT LNAS+ EIY  VE+TD++A                                      E ++DLIR+GYLKKYVGS+ R +PE
Subjt:  --------------QFTSLNASITEIYVVVEDTDLEA--------------------------------------EHIEDLIRKGYLKKYVGSKARVEPE

Query:  GSAREEKRERSQPLRRKEDRPGIINTIHGGLSGGQSG
        GS REEKRERSQP  RKEDRP +INTIHGG SG +SG
Subjt:  GSAREEKRERSQPLRRKEDRPGIINTIHGGLSGGQSG

A0A6J1D8C0 uncharacterized protein LOC111018561; e value: 6.4e-56; % identity: 48.07
Query:  EHVKVVSFTDDIAMMYFTTGLNDRNLKIEFRSCPPASLNEMLTRARQYIDGLELWKANGARRSSCGKDRDQKSPLSKKRRSDDRSSSQFTSLNASITEIY
        E +KV   +DD AM YF TGL D  L ++     PA+  E+L +A++ IDG EL +    R     K  DQK    +KR+ D +   + +S + S  + +
Subjt:  EHVKVVSFTDDIAMMYFTTGLNDRNLKIEFRSCPPASLNEMLTRARQYIDGLELWKANGARRSSCGKDRDQKSPLSKKRRSDDRSSSQFTSLNASITEIY

Query:  VVVEDTDLEAEHIEDLIRKGYLKKYVGSKARVEPEGSAREEKRERSQPLRRKEDRPGIINTIHGGLSGGQSGQKRKALAWEAAHEVCTSYPKESVMPILL
         +          IEDLI+ GY KK+VG K R        E KR R+ P  R++DRP +INTI GG SGGQSG KRK L  EA  EV     + S   I  
Subjt:  VVVEDTDLEAEHIEDLIRKGYLKKYVGSKARVEPEGSAREEKRERSQPLRRKEDRPGIINTIHGGLSGGQSGQKRKALAWEAAHEVCTSYPKESVMPILL

Query:  DEKDSEGVHMSHNDALVIAPLIDHVKVKRVLVYGGASANILSFSTYMALRWERRHLKHSPTPLVGFAGESVGAERCISLHVTIGE
        D+ D EGVH+ HNDALVI PLI+HV V+RVLV GGASANILS  TY+AL W R  LK SPTPLVGF GESV  E CI L VT+G+
Subjt:  DEKDSEGVHMSHNDALVIAPLIDHVKVKRVLVYGGASANILSFSTYMALRWERRHLKHSPTPLVGFAGESVGAERCISLHVTIGE

A0A6J1DD03 uncharacterized protein LOC111019899; e value: 1.4e-55; % identity: 43.71
Query:  MMYFTTGLNDRNLKIEFRSCPPASLNEMLTRARQYIDGLELWKAN-GARRSSCGKDRDQKSPLSKK-----------RRSDD-----RSSSQFTSLNASI
        M YF TGL D  L ++     PA+  E+L +A++ IDG EL +   G  RS  GKD +   P SK            RR+++     R   +FT     I
Subjt:  MMYFTTGLNDRNLKIEFRSCPPASLNEMLTRARQYIDGLELWKAN-GARRSSCGKDRDQKSPLSKK-----------RRSDD-----RSSSQFTSLNASI

Query:  TEIYVVVEDTDLE-------------------------AEH-------------IEDLIRKGYLKKYVGSKARVEPEGSAREEKRERSQPLRRKEDRPGI
        +EI   +E++ +E                          EH             IEDLI+ GY KK+VG K R        E KR R+ P  R+ DRP +
Subjt:  TEIYVVVEDTDLE-------------------------AEH-------------IEDLIRKGYLKKYVGSKARVEPEGSAREEKRERSQPLRRKEDRPGI

Query:  INTIHGGLSGGQSGQKRKALAWEAAHEVCTSYPKESVMPILLDEKDSEGVHMSHNDALVIAPLIDHVKVKRVLVYGGASANILSFSTYMALRWERRHLKH
        INTI GG SGGQSG KRK LA  A  EVC    +    PI  D  D   VH+ HNDALVIAPLIDHV V+RVLV GGASANILS  TY+AL W R  LK 
Subjt:  INTIHGGLSGGQSGQKRKALAWEAAHEVCTSYPKESVMPILLDEKDSEGVHMSHNDALVIAPLIDHVKVKRVLVYGGASANILSFSTYMALRWERRHLKH

Query:  SPTPLVGFAGESVGAERCISLHVTIGEGEHQVTRVVEFVVIDRSLTYSAI
        SPTPLVGF+GESV  E CI L VT+G+ + +VT++ EFVV+D    Y+AI
Subjt:  SPTPLVGFAGESVGAERCISLHVTIGEGEHQVTRVVEFVVIDRSLTYSAI

A0A6J1E0L8 uncharacterized protein LOC111025310; e value: 6.9e-119; % identity: 66.49
Query:  NEHVKVVSFTDDIAMMYFTTGLNDRNLKIEFRSCPPASLNEMLTRARQYIDGLELWKANGARRSSCGKDRDQKSPLSKKRRSDDRSSS------------
        +EHVKVVS TDDIAMMYFTTGLNDRNL IEFRS PPASLNEM  RARQYIDGLELWKANGARRSS G+DRD KSP SKKR  DDRSSS            
Subjt:  NEHVKVVSFTDDIAMMYFTTGLNDRNLKIEFRSCPPASLNEMLTRARQYIDGLELWKANGARRSSCGKDRDQKSPLSKKRRSDDRSSS------------

Query:  --------------QFTSLNASITEIYVVVEDTDLEA--------------------------------------EHIEDLIRKGYLKKYVGSKARVEPE
                      +FT LNASI EIY VVEDTD+E                                       E +EDLIR GYLKKYVGS+ + E E
Subjt:  --------------QFTSLNASITEIYVVVEDTDLEA--------------------------------------EHIEDLIRKGYLKKYVGSKARVEPE

Query:  GSAREEKRERSQPLRRKEDRPGIINTIHGGLSGGQSGQKRKALAWEAAHEVCTSYPKESVMPILLDEKDSEGVHMSHNDALVIAPLIDHVKVKRVLVYGG
        GSAREEKRERSQP R KEDRP +INTIHGG SG +SGQKRKALA E AHEVCTSYPK  VMPIL DE+D E VHM HNDALVIAPLIDHVKV+RV V GG
Subjt:  GSAREEKRERSQPLRRKEDRPGIINTIHGGLSGGQSGQKRKALAWEAAHEVCTSYPKESVMPILLDEKDSEGVHMSHNDALVIAPLIDHVKVKRVLVYGG

Query:  ASANILSFSTYMALRWERRHLKHSPTPLVGFAGESVGAERCISLHVTIGEGEHQVTRVVEFVVIDRSLTY
        ASANI SFSTY AL WERRHLKH  T LVGFA ESV  E CISL VTI EGEHQVTRV EFVVIDRS  Y
Subjt:  ASANILSFSTYMALRWERRHLKHSPTPLVGFAGESVGAERCISLHVTIGEGEHQVTRVVEFVVIDRSLTY

SwissProt top hits: e value, % identity, alignment
No hits found
Arabidopsis top hits: e value, % identity, alignment
No hits found
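
The alignment blocks in this section use a BLAST-style middle line, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. As a rough illustration of how a percent identity can be derived from such a block, the sketch below counts the letters in the middle line and divides by the aligned length. The function name `midline_identity` is hypothetical, and the result for a fragment is not expected to reproduce the % identity values reported above, which the page computes over its own alignment length.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent of aligned columns whose midline character is a letter.

    Assumes `query` and `midline` are the same-length rows of one
    alignment block, with leading labels such as 'Query:' removed.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline rows must have equal length")
    if not query:
        raise ValueError("empty alignment")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# First 14 columns of the first GenBank alignment on this page.
print(midline_identity("EHVKVVSFTDDIAM", "E +KV   +DD AM"))
```

For this 14-column fragment, 7 columns are identical, so the function returns 50.0.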

Sequences
CDS sequence
ATGCAAGGTCCCTCAAGTGTTCTAGGAGGCTGGGGTGCCACCACTCTCAATGCCCATCCTCGCCGCACGTATTCTCAGGTGGACGGAACCCCGGTTGCCGAGCGACAGCC
CCAGGCGGGGGTTGTCAAGGAGAATGGAGGTCGGAATGCCACATCTGATCCCATAGTGGCTCGGGACTTCCACCTTGCCTCAGATAAGTTTCCACCGCTCCAGCCTCAGA
GGAACGGGCTGCCGCCCCTTGTTTCTCGTCTTCGTAGTTGGGGGAACACGGGTGCGCATTCTAGGACAAGCGCCAACGTGGGTGTGGATTCTGTCGTAGTAGCTAATGTG
ATCGCCGAGCTTACAAAAGTCAAGGCGTGGCTAGAAACAGTCGAGAGGGGCAGCAAAGTGTTCAACTCTTCCGTCTCCAAGGATCCTGCCAGAGGGAAGGGGTTGCTGCA
TCCGACCAAGAGAATTGAGTATCAGTTCCAACCATGCAGAGAAGCCCGAGCTGGAGCACACTCTCACAAACCTCAATTCTCGGGGTTGGGTGGTACCCTAGGGCACAGGT
GCAAGGCCATCCTCAGCAGGACGACCAGGTGGAGGACCAACGCTAGAGGGTCTGACCAATTCGGACCCATCAGGAATCGATCGATAGCTACAACGCCCACCAGGGCCGAG
GTGAGGGGCCATCGAGACGACGATAAGTGGCACCCGAGGATCAGGAGTACTAGACCAGCTTCAGAAGGAGATAGAAGACCTCAAGCGGCAGTGCAGACCCTTGGATCCAT
ATCGTGCGGTCTAGTAGGAGGAGCCGCCTTTTTCCCGAGTGATCCTCGACGTGCCCATCTCGCCGAGGTTCAAGGCCCTAGTGTGAGCTCTATGATGGGTTCGGGGACCA
GTCTCATATGTAGAAATGTTCTAAGGAAAAGTGGATTTCATGGTTGCAAGCGACGCCATGAAATGCCGAACGTTTCAAATAGTCTTGGAAGGCTCGGCGAGACTGTGGTA
CCGACAGCTAAAGCCCAGGTCAATCGACAATTATCAACAGTTGAGAAGGTTGTTCATCAATCAGTTCTCAACTCGGCAGTTGTTGAAGTTGTCGCCCTCCAACCTCGGAA
CAGTGAAGCAACGAGACAACGAGTCCCTTACGGAGTATATCGCTCGATTCATGAACGAGTCAGTTCGCGAACGTTGGGCGTCGGGCAGCAGCGGTGCGCGGGCGTCAGCA
GCAGCGGTGCGCGGGTGTCGGGCAGCAGCGATGGTGCGCGGGCGCCGGACAGCAGCGGTTGTGCGCGGATTCCGGTAGCAGCGGGTCGTGGCGCGGTGGATGCCTTGGGA
ATAAATGGCAAGGCCGAACATCAGGTTTCTATGGAGAAGTATTGGGGTCTTGGGAATAAATGTCAAGAACCAATACGCCGGGTAGTCATCGGTACCTTGGGGATAAATGT
CAAGGACTGGTGCGCAGTTCAAGGCCTTGGGGATAAATGGCAAGGTGTGAACCGGGCCGACCGTGGTGATGACGTGGAGGAGGGCCGTTACTTCTTTAAACGTGTCAGAA
ATTTGGGTCGTTACAACGAGCATGTCAAGGTGGTGAGCTTCACTGATGATATCGCCATGATGTACTTCACGACGGGATTGAATGACAGGAACTTGAAAATTGAGTTCAGA
AGCTGCCCGCCGGCCTCCCTAAATGAGATGCTTACCCGAGCTCGCCAGTATATTGATGGCCTGGAGCTGTGGAAGGCCAATGGAGCCAGGCGAAGTAGCTGCGGTAAAGA
TCGGGATCAGAAATCTCCCCTTTCCAAGAAACGGCGTAGTGATGATCGAAGTTCGTCTCAGTTTACTTCGCTAAACGCCTCAATCACGGAGATCTACGTGGTAGTCGAAG
ATACCGACTTGGAAGCTGAGCACATCGAGGATTTGATCCGGAAGGGTTATTTGAAGAAATATGTCGGCAGCAAAGCACGAGTTGAGCCAGAGGGATCAGCTCGAGAAGAG
AAGCGAGAGAGATCACAGCCGCTCAGACGAAAAGAAGATCGTCCTGGTATAATAAACACCATTCACGGGGGTTTGAGTGGGGGGCAGTCAGGGCAGAAGAGGAAAGCTCT
AGCCTGGGAAGCAGCGCATGAGGTCTGCACCTCATACCCCAAGGAGTCTGTTATGCCGATCTTATTAGATGAGAAGGACAGTGAAGGGGTGCATATGTCCCATAATGACG
CCTTAGTAATCGCCCCCTTAATAGATCATGTGAAGGTTAAAAGGGTTCTTGTCTACGGCGGAGCATCGGCTAATATATTGTCCTTCTCGACCTACATGGCTCTGAGATGG
GAGAGAAGGCACTTGAAGCACAGCCCAACGCCCTTGGTTGGCTTTGCAGGGGAGTCAGTTGGCGCAGAAAGATGTATCTCGCTCCATGTTACCATCGGAGAAGGAGAGCA
TCAAGTGACCCGAGTTGTCGAGTTTGTTGTAATAGACCGAAGCTTGACATACAGTGCCATACTTTGA
mRNA sequence
ATGCAAGGTCCCTCAAGTGTTCTAGGAGGCTGGGGTGCCACCACTCTCAATGCCCATCCTCGCCGCACGTATTCTCAGGTGGACGGAACCCCGGTTGCCGAGCGACAGCC
CCAGGCGGGGGTTGTCAAGGAGAATGGAGGTCGGAATGCCACATCTGATCCCATAGTGGCTCGGGACTTCCACCTTGCCTCAGATAAGTTTCCACCGCTCCAGCCTCAGA
GGAACGGGCTGCCGCCCCTTGTTTCTCGTCTTCGTAGTTGGGGGAACACGGGTGCGCATTCTAGGACAAGCGCCAACGTGGGTGTGGATTCTGTCGTAGTAGCTAATGTG
ATCGCCGAGCTTACAAAAGTCAAGGCGTGGCTAGAAACAGTCGAGAGGGGCAGCAAAGTGTTCAACTCTTCCGTCTCCAAGGATCCTGCCAGAGGGAAGGGGTTGCTGCA
TCCGACCAAGAGAATTGAGTATCAGTTCCAACCATGCAGAGAAGCCCGAGCTGGAGCACACTCTCACAAACCTCAATTCTCGGGGTTGGGTGGTACCCTAGGGCACAGGT
GCAAGGCCATCCTCAGCAGGACGACCAGGTGGAGGACCAACGCTAGAGGGTCTGACCAATTCGGACCCATCAGGAATCGATCGATAGCTACAACGCCCACCAGGGCCGAG
GTGAGGGGCCATCGAGACGACGATAAGTGGCACCCGAGGATCAGGAGTACTAGACCAGCTTCAGAAGGAGATAGAAGACCTCAAGCGGCAGTGCAGACCCTTGGATCCAT
ATCGTGCGGTCTAGTAGGAGGAGCCGCCTTTTTCCCGAGTGATCCTCGACGTGCCCATCTCGCCGAGGTTCAAGGCCCTAGTGTGAGCTCTATGATGGGTTCGGGGACCA
GTCTCATATGTAGAAATGTTCTAAGGAAAAGTGGATTTCATGGTTGCAAGCGACGCCATGAAATGCCGAACGTTTCAAATAGTCTTGGAAGGCTCGGCGAGACTGTGGTA
CCGACAGCTAAAGCCCAGGTCAATCGACAATTATCAACAGTTGAGAAGGTTGTTCATCAATCAGTTCTCAACTCGGCAGTTGTTGAAGTTGTCGCCCTCCAACCTCGGAA
CAGTGAAGCAACGAGACAACGAGTCCCTTACGGAGTATATCGCTCGATTCATGAACGAGTCAGTTCGCGAACGTTGGGCGTCGGGCAGCAGCGGTGCGCGGGCGTCAGCA
GCAGCGGTGCGCGGGTGTCGGGCAGCAGCGATGGTGCGCGGGCGCCGGACAGCAGCGGTTGTGCGCGGATTCCGGTAGCAGCGGGTCGTGGCGCGGTGGATGCCTTGGGA
ATAAATGGCAAGGCCGAACATCAGGTTTCTATGGAGAAGTATTGGGGTCTTGGGAATAAATGTCAAGAACCAATACGCCGGGTAGTCATCGGTACCTTGGGGATAAATGT
CAAGGACTGGTGCGCAGTTCAAGGCCTTGGGGATAAATGGCAAGGTGTGAACCGGGCCGACCGTGGTGATGACGTGGAGGAGGGCCGTTACTTCTTTAAACGTGTCAGAA
ATTTGGGTCGTTACAACGAGCATGTCAAGGTGGTGAGCTTCACTGATGATATCGCCATGATGTACTTCACGACGGGATTGAATGACAGGAACTTGAAAATTGAGTTCAGA
AGCTGCCCGCCGGCCTCCCTAAATGAGATGCTTACCCGAGCTCGCCAGTATATTGATGGCCTGGAGCTGTGGAAGGCCAATGGAGCCAGGCGAAGTAGCTGCGGTAAAGA
TCGGGATCAGAAATCTCCCCTTTCCAAGAAACGGCGTAGTGATGATCGAAGTTCGTCTCAGTTTACTTCGCTAAACGCCTCAATCACGGAGATCTACGTGGTAGTCGAAG
ATACCGACTTGGAAGCTGAGCACATCGAGGATTTGATCCGGAAGGGTTATTTGAAGAAATATGTCGGCAGCAAAGCACGAGTTGAGCCAGAGGGATCAGCTCGAGAAGAG
AAGCGAGAGAGATCACAGCCGCTCAGACGAAAAGAAGATCGTCCTGGTATAATAAACACCATTCACGGGGGTTTGAGTGGGGGGCAGTCAGGGCAGAAGAGGAAAGCTCT
AGCCTGGGAAGCAGCGCATGAGGTCTGCACCTCATACCCCAAGGAGTCTGTTATGCCGATCTTATTAGATGAGAAGGACAGTGAAGGGGTGCATATGTCCCATAATGACG
CCTTAGTAATCGCCCCCTTAATAGATCATGTGAAGGTTAAAAGGGTTCTTGTCTACGGCGGAGCATCGGCTAATATATTGTCCTTCTCGACCTACATGGCTCTGAGATGG
GAGAGAAGGCACTTGAAGCACAGCCCAACGCCCTTGGTTGGCTTTGCAGGGGAGTCAGTTGGCGCAGAAAGATGTATCTCGCTCCATGTTACCATCGGAGAAGGAGAGCA
TCAAGTGACCCGAGTTGTCGAGTTTGTTGTAATAGACCGAAGCTTGACATACAGTGCCATACTTTGA
Protein sequence
MQGPSSVLGGWGATTLNAHPRRTYSQVDGTPVAERQPQAGVVKENGGRNATSDPIVARDFHLASDKFPPLQPQRNGLPPLVSRLRSWGNTGAHSRTSANVGVDSVVVANV
IAELTKVKAWLETVERGSKVFNSSVSKDPARGKGLLHPTKRIEYQFQPCREARAGAHSHKPQFSGLGGTLGHRCKAILSRTTRWRTNARGSDQFGPIRNRSIATTPTRAE
VRGHRDDDKWHPRIRSTRPASEGDRRPQAAVQTLGSISCGLVGGAAFFPSDPRRAHLAEVQGPSVSSMMGSGTSLICRNVLRKSGFHGCKRRHEMPNVSNSLGRLGETVV
PTAKAQVNRQLSTVEKVVHQSVLNSAVVEVVALQPRNSEATRQRVPYGVYRSIHERVSSRTLGVGQQRCAGVSSSGARVSGSSDGARAPDSSGCARIPVAAGRGAVDALG
INGKAEHQVSMEKYWGLGNKCQEPIRRVVIGTLGINVKDWCAVQGLGDKWQGVNRADRGDDVEEGRYFFKRVRNLGRYNEHVKVVSFTDDIAMMYFTTGLNDRNLKIEFR
SCPPASLNEMLTRARQYIDGLELWKANGARRSSCGKDRDQKSPLSKKRRSDDRSSSQFTSLNASITEIYVVVEDTDLEAEHIEDLIRKGYLKKYVGSKARVEPEGSAREE
KRERSQPLRRKEDRPGIINTIHGGLSGGQSGQKRKALAWEAAHEVCTSYPKESVMPILLDEKDSEGVHMSHNDALVIAPLIDHVKVKRVLVYGGASANILSFSTYMALRW
ERRHLKHSPTPLVGFAGESVGAERCISLHVTIGEGEHQVTRVVEFVVIDRSLTYSAIL
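
The protein sequence corresponds to a codon-by-codon translation of the CDS sequence. The sketch below shows one way to check that relationship. It assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly, and the name `translate` is hypothetical rather than anything provided by the database.

```python
BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS; stop codons are written as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First ten codons and last five codons of the CDS listed above.
print(translate("ATGCAAGGTCCCTCAAGTGTTCTAGGAGGC"))
print(translate("AGTGCCATACTTTGA"))
```

The first call yields `MQGPSSVLGG` and the second `SAIL*`, matching the start and end of the protein sequence, with the trailing `*` standing for the terminal stop codon that the protein listing omits.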