; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g41740 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g41740
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr6:32784550..32790198
RNA-Seq ExpressionMoc06g41740
SyntenyMoc06g41740
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]1.2e-11890.16Show/hide
Query:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPMGRKVGTLVTDKLLLESGLLDYNP
        MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FP GRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPMGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNLELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGVPSREKRLRDQTEAVDVSPLGEEVREEVPLKRRR
        AVRPIESSRPN ELAMVCGFASNVKRKSKG+AHALEAAQSSKP TPAVVGPASEDPAPVIELESS  PSREKR RDQTEAVDVSPLGEEVREEVPLKRRR
Subjt:  AVRPIESSRPNLELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGVPSREKRLRDQTEAVDVSPLGEEVREEVPLKRRR

Query:  KKKKTTSPLEVGAREVLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQ
        KKKKTTSPLEVGAR VLPASFADRVDDPEARMGGT DVTTRFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLEVGAREVLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]2.0e-10590.58Show/hide
Query:  EEVELLDVDQLLACFEAKRIVKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYK
        EE ELLDVDQLLACFEAKRI KKPGRFYMCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYK
Subjt:  EEVELLDVDQLLACFEAKRIVKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYK

Query:  ERFPMGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNLELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGVPSREK
        ERFP GRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPN  LAMVC FAS VKRKSKGRAHALEAAQSSKP TPAVVGPASEDPAPVIELESSG PSREK
Subjt:  ERFPMGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNLELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGVPSREK

Query:  RLRDQTEAV-------DVSPLGE
        R RDQTEAV       DV PLGE
Subjt:  RLRDQTEAV-------DVSPLGE

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.5e-14077.75Show/hide
Query:  MSSSFSSDLRSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPD-------------
        MSSS SS+L S  DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPP+             
Subjt:  MSSSFSSDLRSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPD-------------

Query:  ---------------------------------------------EEVELLDVDQLLACFEAKRIVKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYA
                                                     EE EL DVDQLLACFEAKRI KKPGRFYMCARKGA GIVKGPTSIKGWVRKWFYA
Subjt:  ---------------------------------------------EEVELLDVDQLLACFEAKRIVKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPMGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNLELAMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFP GRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPN ELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPMGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNLELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGVPSREKRLRDQTEAVD
        SKGRAHALEAAQSSKPATPAVVGPASEDPA VIELESSG PSREKR RDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGVPSREKRLRDQTEAVD

XP_022159185.1 uncharacterized protein LOC111025606 [Momordica charantia]2.6e-8180.95Show/hide
Query:  MVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGVPSREKRLRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVGARE
        MVCGFAS+VKRKSKGRAHA EAAQSSKPATPAV GPASEDPAPVIELESSG PSREKR RDQTEAVD  PLGEEVREEVPLKRRRKKKKT SPLEVGA  
Subjt:  MVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGVPSREKRLRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVGARE

Query:  VLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLESVLQRTIDYAAE-------------AELDGREALAAR
        VLPASFADRVDDPEARMGGTSDVT RFRV+PSS+GVRDQVSRISAASLDRCLRRASKFVSD  SVLQRTIDYAAE             AELDGRE LAAR
Subjt:  VLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLESVLQRTIDYAAE-------------AELDGREALAAR

Query:  EKEEFSAALEAASSTMKDELLKAHSEVEILK
        EKEEFS ALEA       EL  A +E+E  K
Subjt:  EKEEFSAALEAASSTMKDELLKAHSEVEILK

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]6.8e-12268.63Show/hide
Query:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPMGRKVGTLVTDKLLLESGLLDYNP
        MCARKG  GIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FP  RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPMGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNLELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGVPSREKRLRDQTEAVDVSPLGEEVRE
         VR IE+SRPN ELAMVCGF  +VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SG  S EKR R+++EA+DVSPL  EVR 
Subjt:  AVRPIESSRPNLELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGVPSREKRLRDQTEAVDVSPLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGAREVLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLESVLQRTIDYAAE--
        E PL+RRRKKKKT+S  E GAR  LP S AD VDDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSD  SVLQRTID  AE  
Subjt:  EVPLKRRRKKKKTTSPLEVGAREVLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLESVLQRTIDYAAE--

Query:  -----------AELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKA
                   AELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+AK +LLKKE ++ KA
Subjt:  -----------AELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKA

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092985.8e-11990.16Show/hide
Query:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPMGRKVGTLVTDKLLLESGLLDYNP
        MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FP GRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPMGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNLELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGVPSREKRLRDQTEAVDVSPLGEEVREEVPLKRRR
        AVRPIESSRPN ELAMVCGFASNVKRKSKG+AHALEAAQSSKP TPAVVGPASEDPAPVIELESS  PSREKR RDQTEAVDVSPLGEEVREEVPLKRRR
Subjt:  AVRPIESSRPNLELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGVPSREKRLRDQTEAVDVSPLGEEVREEVPLKRRR

Query:  KKKKTTSPLEVGAREVLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQ
        KKKKTTSPLEVGAR VLPASFADRVDDPEARMGGT DVTTRFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLEVGAREVLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC1110138269.6e-10690.58Show/hide
Query:  EEVELLDVDQLLACFEAKRIVKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYK
        EE ELLDVDQLLACFEAKRI KKPGRFYMCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYK
Subjt:  EEVELLDVDQLLACFEAKRIVKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYK

Query:  ERFPMGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNLELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGVPSREK
        ERFP GRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPN  LAMVC FAS VKRKSKGRAHALEAAQSSKP TPAVVGPASEDPAPVIELESSG PSREK
Subjt:  ERFPMGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNLELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGVPSREK

Query:  RLRDQTEAV-------DVSPLGE
        R RDQTEAV       DV PLGE
Subjt:  RLRDQTEAV-------DVSPLGE

A0A6J1DXS5 uncharacterized protein LOC1110255027.0e-14177.75Show/hide
Query:  MSSSFSSDLRSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPD-------------
        MSSS SS+L S  DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPP+             
Subjt:  MSSSFSSDLRSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPD-------------

Query:  ---------------------------------------------EEVELLDVDQLLACFEAKRIVKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYA
                                                     EE EL DVDQLLACFEAKRI KKPGRFYMCARKGA GIVKGPTSIKGWVRKWFYA
Subjt:  ---------------------------------------------EEVELLDVDQLLACFEAKRIVKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPMGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNLELAMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFP GRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPN ELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPMGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNLELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGVPSREKRLRDQTEAVD
        SKGRAHALEAAQSSKPATPAVVGPASEDPA VIELESSG PSREKR RDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGVPSREKRLRDQTEAVD

A0A6J1DXZ1 uncharacterized protein LOC1110256061.3e-8180.95Show/hide
Query:  MVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGVPSREKRLRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVGARE
        MVCGFAS+VKRKSKGRAHA EAAQSSKPATPAV GPASEDPAPVIELESSG PSREKR RDQTEAVD  PLGEEVREEVPLKRRRKKKKT SPLEVGA  
Subjt:  MVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGVPSREKRLRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVGARE

Query:  VLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLESVLQRTIDYAAE-------------AELDGREALAAR
        VLPASFADRVDDPEARMGGTSDVT RFRV+PSS+GVRDQVSRISAASLDRCLRRASKFVSD  SVLQRTIDYAAE             AELDGRE LAAR
Subjt:  VLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLESVLQRTIDYAAE-------------AELDGREALAAR

Query:  EKEEFSAALEAASSTMKDELLKAHSEVEILK
        EKEEFS ALEA       EL  A +E+E  K
Subjt:  EKEEFSAALEAASSTMKDELLKAHSEVEILK

A0A6J1DZB3 uncharacterized protein LOC1110256653.3e-12268.63Show/hide
Query:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPMGRKVGTLVTDKLLLESGLLDYNP
        MCARKG  GIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FP  RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPMGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNLELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGVPSREKRLRDQTEAVDVSPLGEEVRE
         VR IE+SRPN ELAMVCGF  +VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SG  S EKR R+++EA+DVSPL  EVR 
Subjt:  AVRPIESSRPNLELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGVPSREKRLRDQTEAVDVSPLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGAREVLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLESVLQRTIDYAAE--
        E PL+RRRKKKKT+S  E GAR  LP S AD VDDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSD  SVLQRTID  AE  
Subjt:  EVPLKRRRKKKKTTSPLEVGAREVLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLESVLQRTIDYAAE--

Query:  -----------AELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKA
                   AELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+AK +LLKKE ++ KA
Subjt:  -----------AELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCCTGCATGTCAGAATTGGGAGCAAATTCTCGAAGAAGTAGCGCCACAGCGCCAGTACACAGCGCCATGGCGCCTACAGACAAAGCTTAGGCCTCTTTCCGTGAC
AACAACGTCACAGCTGTCTGTAGCGCCATGGCGCTACGAACTTGCTCGTCCATCACAATTTTGGGACGGCGCCATGGCGCTTGTATTTCTGATTGCAGCTCGAACTCGGC
CTCCGGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCGGACCTCGGCCAGGTTCACCTCGGCCCTCATACTTAGCATCTGTC
AACGCTAGTGGTGGTGATCTCGACGGTCCGAGCTGGGGCATGACTCATGGGCCATCTTGGAGCACCAATAGGGGTCCTCCACGTGTCCAGGGTATTCTTTTCCCCAAACA
TTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCATTCGACTTGTTTTGGACGCGTGGCCACTTCCTATTCGTGGGAAAATACAACCGTTGTCGGAA
TATTCAAATATTCTGACGCTTCGGATCTTCGGGAGCATCCTAGCCGCTCGTTGATTACACGTGTACGGCAACTCGAACCATTGGTAGGTCGGTCTCTTCCTTCACTTTCT
CTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCGACTTAAGGTCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTT
CTCCGATGACGGGGAGGATAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCCGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCC
CTGAGAACATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGATGAAGAGGTCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCG
AAAAGGATAGTTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCATGCGGTATAGTTAAGGGGCCGACCTCCATCAAAGGATGGGTGAGGAAGTGGTTCTA
CGCTTCCGGGGAATGGCTTGCAAAGGACGAGTCGGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAG
CCTCCTTCGACACGCTGAAATATTACAAGGAGCGTTTTCCGATGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGATTGCTAGATTACAAC
CCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTTGGAACTTGCCATGGTTTGCGGATTTGCAAGCAACGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGA
GGCCGCCCAAAGTTCGAAACCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGTTCCTTCGAGGGAGAAGC
GCCTCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTG
GAGGTCGGAGCTCGTGAGGTCTTGCCTGCGAGCTTCGCAGATCGGGTGGATGATCCTGAAGCTAGGATGGGCGGGACGTCCGACGTGACGACACGATTCAGAGTTGAGCC
GTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCTGGAGTCCGTCCTGCAGA
GGACCATCGACTACGCCGCTGAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGAGGGAGAAAGAAGAGTTCTCTGCTGCCTTGGAGGCTGCTTCTTCCACCATGAAG
GATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTGGAGGCGAAGGCCGAGCTGCTGAAGAAAGAAGAGGACAGACGAAAGGCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGCCTGCATGTCAGAATTGGGAGCAAATTCTCGAAGAAGTAGCGCCACAGCGCCAGTACACAGCGCCATGGCGCCTACAGACAAAGCTTAGGCCTCTTTCCGTGAC
AACAACGTCACAGCTGTCTGTAGCGCCATGGCGCTACGAACTTGCTCGTCCATCACAATTTTGGGACGGCGCCATGGCGCTTGTATTTCTGATTGCAGCTCGAACTCGGC
CTCCGGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCGGACCTCGGCCAGGTTCACCTCGGCCCTCATACTTAGCATCTGTC
AACGCTAGTGGTGGTGATCTCGACGGTCCGAGCTGGGGCATGACTCATGGGCCATCTTGGAGCACCAATAGGGGTCCTCCACGTGTCCAGGGTATTCTTTTCCCCAAACA
TTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCATTCGACTTGTTTTGGACGCGTGGCCACTTCCTATTCGTGGGAAAATACAACCGTTGTCGGAA
TATTCAAATATTCTGACGCTTCGGATCTTCGGGAGCATCCTAGCCGCTCGTTGATTACACGTGTACGGCAACTCGAACCATTGGTAGGTCGGTCTCTTCCTTCACTTTCT
CTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCGACTTAAGGTCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTT
CTCCGATGACGGGGAGGATAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCCGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCC
CTGAGAACATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGATGAAGAGGTCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCG
AAAAGGATAGTTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCATGCGGTATAGTTAAGGGGCCGACCTCCATCAAAGGATGGGTGAGGAAGTGGTTCTA
CGCTTCCGGGGAATGGCTTGCAAAGGACGAGTCGGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAG
CCTCCTTCGACACGCTGAAATATTACAAGGAGCGTTTTCCGATGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGATTGCTAGATTACAAC
CCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTTGGAACTTGCCATGGTTTGCGGATTTGCAAGCAACGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGA
GGCCGCCCAAAGTTCGAAACCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGTTCCTTCGAGGGAGAAGC
GCCTCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTG
GAGGTCGGAGCTCGTGAGGTCTTGCCTGCGAGCTTCGCAGATCGGGTGGATGATCCTGAAGCTAGGATGGGCGGGACGTCCGACGTGACGACACGATTCAGAGTTGAGCC
GTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCTGGAGTCCGTCCTGCAGA
GGACCATCGACTACGCCGCTGAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGAGGGAGAAAGAAGAGTTCTCTGCTGCCTTGGAGGCTGCTTCTTCCACCATGAAG
GATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTGGAGGCGAAGGCCGAGCTGCTGAAGAAAGAAGAGGACAGACGAAAGGCCTAG
Protein sequenceShow/hide protein sequence
MAPACQNWEQILEEVAPQRQYTAPWRLQTKLRPLSVTTTSQLSVAPWRYELARPSQFWDGAMALVFLIAARTRPPDRSEYLGGPAQKGEHSDDQVGPRPGSPRPSYLASV
NASGGDLDGPSWGMTHGPSWSTNRGPPRVQGILFPKHWPPLCLVRSRPGREVHSTCFGRVATSYSWENTTVVGIFKYSDASDLREHPSRSLITRVRQLEPLVGRSLPSLS
LSNVVAMSSSFSSDLRSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPDEEVELLDVDQLLACFEA
KRIVKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPMGRKVGTLVTDKLLLESGLLDYN
PAVRPIESSRPNLELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGVPSREKRLRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPL
EVGAREVLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLESVLQRTIDYAAEAELDGREALAAREKEEFSAALEAASSTMK
DELLKAHSEVEILKAEVEAKAELLKKEEDRRKA