; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g17650 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g17650
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr8:13379281..13386837
RNA-Seq ExpressionMoc08g17650
SyntenyMoc08g17650
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]1.4e-11285.43Show/hide
Query:  MCARKGAGGIVKGPTSIKAWVRKWFYASGEWLAKDESGRSFFDVPTRFENLISIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIK WVRKWFYASGEWLAKDES              ++IRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKAWVRKWFYASGEWLAKDESGRSFFDVPTRFENLISIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIETSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATLAVAGPASEDPTPVIELESSGGPSREKRPRGQTEAVDVSPLGKEVGEAAPLKRRK
        AVRPIE+SRPNSELAMVCGFASNVKRKSKG+AHALEAAQSS+P T AV GPASEDP PVIELESS GPSREKRPR QTEAVDVSPLG+EV E  PLKRR+
Subjt:  AVRPIETSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATLAVAGPASEDPTPVIELESSGGPSREKRPRGQTEAVDVSPLGKEVGEAAPLKRRK

Query:  KKKKTTSPLEVGACGVLSASFADRVDDPEAKMGGTSDVTARFRVEPSSSGVRDQ
        KKKKTTSPLEVGA GVL ASFADRVDDPEA+MGGT DVT RFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLEVGACGVLSASFADRVDDPEAKMGGTSDVTARFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]2.0e-11179.49Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRIGLAP---------------------------VELLDDDQLLACFEAKRIAKKPGWYYMCARKGAGGIVKGPTSIKAWVR
        MFEYGLRLPLHPFVQEFLFR GLAP                            ELLD DQLLACFEAKRIAKKPG +YMCARKGAGGIVKGPTSIK WVR
Subjt:  MFEYGLRLPLHPFVQEFLFRIGLAP---------------------------VELLDDDQLLACFEAKRIAKKPGWYYMCARKGAGGIVKGPTSIKAWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFENLISIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIETSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRF NL+SIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFENLISIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIETSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSEPATLAVAGPASEDPTPVIELESSGGPSREKRPRGQTEAV-------DVSPLGK
         VKRKSKGRAHALEAAQSS+P T AV GPASEDP PVIELESSGGPSREKRPR QTEAV       DV PLG+
Subjt:  NVKRKSKGRAHALEAAQSSEPATLAVAGPASEDPTPVIELESSGGPSREKRPRGQTEAV-------DVSPLGK

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]1.3e-11081.68Show/hide
Query:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDSCLRRASKFVSDPGSVLQRTIDYAAE-------------AELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGVRDQVSRISAASLD CLRRASKFVS PGSVLQRTIDYAAE             AELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDSCLRRASKFVSDPGSVLQRTIDYAAE-------------AELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLRAHSEVDILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEKLKRATTELETAKERLSNGALLEESFRQHPDFD
        ELL+AHSEV+ LKAEVE++AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAKD++L+ AT ELETAKERLSNG LLEE+FRQHPDFD
Subjt:  ELLRAHSEVDILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEKLKRATTELETAKERLSNGALLEESFRQHPDFD

Query:  GFAKDFFDAGFKFLMKGITSDMPDLQIDLSGLKKRYAKQWASGPNSTPGPQALVNKYVRDLDSDYSDLEEDQI
        GFAKDF DAGFKFLMKGI SDMPDLQIDLSGLK+RYA++WASGP  TPGPQALV++YVRDLDSDYSD EEDQ+
Subjt:  GFAKDFFDAGFKFLMKGITSDMPDLQIDLSGLKKRYAKQWASGPNSTPGPQALVNKYVRDLDSDYSDLEEDQI

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]9.0e-15784.01Show/hide
Query:  EEDLARRLESELEEIENFRFSDDGEDSDTSTSGQGLEYLSRIPEHYLRPLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQE
        E DLARRLES+LEEIEN R SDDGEDSD STSGQGLEY SRIPEHYL  LRRGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQE
Subjt:  EEDLARRLESELEEIENFRFSDDGEDSDTSTSGQGLEYLSRIPEHYLRPLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQE

Query:  FLFRIGLAP---------------------------VELLDDDQLLACFEAKRIAKKPGWYYMCARKGAGGIVKGPTSIKAWVRKWFYASGEWLAKDESG
        FLFR GLAP                            EL D DQLLACFEAKRIAKKPG +YMCARKGAGGIVKGPTSIK WVRKWFYASGEWLAKDESG
Subjt:  FLFRIGLAP---------------------------VELLDDDQLLACFEAKRIAKKPGWYYMCARKGAGGIVKGPTSIKAWVRKWFYASGEWLAKDESG

Query:  RSFFDVPTRFENLISIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIETSRPNSELAMVCGFASNVKRKSKGRAHALEAA
        RSFFDVPTRF NL+SIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE+SRPNSELAMVCGFAS VKRKSKGRAHALEAA
Subjt:  RSFFDVPTRFENLISIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIETSRPNSELAMVCGFASNVKRKSKGRAHALEAA

Query:  QSSEPATLAVAGPASEDPTPVIELESSGGPSREKRPRGQTEAVD
        QSS+PAT AV GPASEDP  VIELESSGGPSREKRPR QTEAVD
Subjt:  QSSEPATLAVAGPASEDPTPVIELESSGGPSREKRPRGQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]4.0e-18969.33Show/hide
Query:  MCARKGAGGIVKGPTSIKAWVRKWFYASGEWLAKDESGRSFFDVPTRFENLISIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIK WV KWF+ASGEWLAKDESGR+FFDVPTRF NL+SI+ +PEL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKAWVRKWFYASGEWLAKDESGRSFFDVPTRFENLISIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIETSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATLAV--------AGPASEDPTPVIELESSGGPSREKRPRGQTEAVDVSPLGKEVGE
         VR IE SRPNSELAMVCGF  +VKRKSKGRAHAL+    +EP T  V        +GP+S  PTPVIEL+ SGG S EKR R ++EA+DVSPL +  GE
Subjt:  AVRPIETSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATLAV--------AGPASEDPTPVIELESSGGPSREKRPRGQTEAVDVSPLGKEVGE

Query:  AAPLKRRKKKKKTTSPLEVGACGVLSASFADRVDDPEAKMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDSCLRRASKFVSDPGSVLQRTIDYAAE--
         +PL+RR+KKKKT+S  E GA G L  S AD VDDPEA+M GTS+V  RF +EPSSSGV+DQVSRISA  LD  LRRASKFVSDPGSVLQRTID  AE  
Subjt:  AAPLKRRKKKKKTTSPLEVGACGVLSASFADRVDDPEAKMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDSCLRRASKFVSDPGSVLQRTIDYAAE--

Query:  -----------AELDGREVLAAREKEEFSAALEAASSTMKDELLRAHSEVDILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML
                   AELDGRE LAA+E+E   AALEAA +T+K ELL+A  EVDIL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ 
Subjt:  -----------AELDGREVLAAREKEEFSAALEAASSTMKDELLRAHSEVDILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML

Query:  QALEAKDEKLKRATTELETAKERLSNGALLEESFRQHPDFDGFAKDFFDAGFKFLMKGITSDMPDLQIDLSGLKKRYAKQWASGPNSTPGPQALVNKYVR
        Q LE KD  + R TTEL+  KERL+NG LLEESFRQHPDFDGFAKDF DAGFKFLMKGI +DMP LQIDL+GLKK+Y+++WASGPN TP PQ+LV+KYVR
Subjt:  QALEAKDEKLKRATTELETAKERLSNGALLEESFRQHPDFDGFAKDFFDAGFKFLMKGITSDMPDLQIDLSGLKKRYAKQWASGPNSTPGPQALVNKYVR

Query:  DLDSDYSDLEEDQIQRLDHSELFTS
        +LDSDYSD+EE+     +  E+ T+
Subjt:  DLDSDYSDLEEDQIQRLDHSELFTS

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092986.6e-11385.43Show/hide
Query:  MCARKGAGGIVKGPTSIKAWVRKWFYASGEWLAKDESGRSFFDVPTRFENLISIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIK WVRKWFYASGEWLAKDES              ++IRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKAWVRKWFYASGEWLAKDESGRSFFDVPTRFENLISIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIETSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATLAVAGPASEDPTPVIELESSGGPSREKRPRGQTEAVDVSPLGKEVGEAAPLKRRK
        AVRPIE+SRPNSELAMVCGFASNVKRKSKG+AHALEAAQSS+P T AV GPASEDP PVIELESS GPSREKRPR QTEAVDVSPLG+EV E  PLKRR+
Subjt:  AVRPIETSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATLAVAGPASEDPTPVIELESSGGPSREKRPRGQTEAVDVSPLGKEVGEAAPLKRRK

Query:  KKKKTTSPLEVGACGVLSASFADRVDDPEAKMGGTSDVTARFRVEPSSSGVRDQ
        KKKKTTSPLEVGA GVL ASFADRVDDPEA+MGGT DVT RFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLEVGACGVLSASFADRVDDPEAKMGGTSDVTARFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC1110138269.5e-11279.49Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRIGLAP---------------------------VELLDDDQLLACFEAKRIAKKPGWYYMCARKGAGGIVKGPTSIKAWVR
        MFEYGLRLPLHPFVQEFLFR GLAP                            ELLD DQLLACFEAKRIAKKPG +YMCARKGAGGIVKGPTSIK WVR
Subjt:  MFEYGLRLPLHPFVQEFLFRIGLAP---------------------------VELLDDDQLLACFEAKRIAKKPGWYYMCARKGAGGIVKGPTSIKAWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFENLISIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIETSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRF NL+SIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFENLISIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIETSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSEPATLAVAGPASEDPTPVIELESSGGPSREKRPRGQTEAV-------DVSPLGK
         VKRKSKGRAHALEAAQSS+P T AV GPASEDP PVIELESSGGPSREKRPR QTEAV       DV PLG+
Subjt:  NVKRKSKGRAHALEAAQSSEPATLAVAGPASEDPTPVIELESSGGPSREKRPRGQTEAV-------DVSPLGK

A0A6J1D971 uncharacterized protein LOC1110185386.1e-11181.68Show/hide
Query:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDSCLRRASKFVSDPGSVLQRTIDYAAE-------------AELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGVRDQVSRISAASLD CLRRASKFVS PGSVLQRTIDYAAE             AELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDSCLRRASKFVSDPGSVLQRTIDYAAE-------------AELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLRAHSEVDILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEKLKRATTELETAKERLSNGALLEESFRQHPDFD
        ELL+AHSEV+ LKAEVE++AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAKD++L+ AT ELETAKERLSNG LLEE+FRQHPDFD
Subjt:  ELLRAHSEVDILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEKLKRATTELETAKERLSNGALLEESFRQHPDFD

Query:  GFAKDFFDAGFKFLMKGITSDMPDLQIDLSGLKKRYAKQWASGPNSTPGPQALVNKYVRDLDSDYSDLEEDQI
        GFAKDF DAGFKFLMKGI SDMPDLQIDLSGLK+RYA++WASGP  TPGPQALV++YVRDLDSDYSD EEDQ+
Subjt:  GFAKDFFDAGFKFLMKGITSDMPDLQIDLSGLKKRYAKQWASGPNSTPGPQALVNKYVRDLDSDYSDLEEDQI

A0A6J1DXS5 uncharacterized protein LOC1110255024.3e-15784.01Show/hide
Query:  EEDLARRLESELEEIENFRFSDDGEDSDTSTSGQGLEYLSRIPEHYLRPLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQE
        E DLARRLES+LEEIEN R SDDGEDSD STSGQGLEY SRIPEHYL  LRRGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQE
Subjt:  EEDLARRLESELEEIENFRFSDDGEDSDTSTSGQGLEYLSRIPEHYLRPLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQE

Query:  FLFRIGLAP---------------------------VELLDDDQLLACFEAKRIAKKPGWYYMCARKGAGGIVKGPTSIKAWVRKWFYASGEWLAKDESG
        FLFR GLAP                            EL D DQLLACFEAKRIAKKPG +YMCARKGAGGIVKGPTSIK WVRKWFYASGEWLAKDESG
Subjt:  FLFRIGLAP---------------------------VELLDDDQLLACFEAKRIAKKPGWYYMCARKGAGGIVKGPTSIKAWVRKWFYASGEWLAKDESG

Query:  RSFFDVPTRFENLISIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIETSRPNSELAMVCGFASNVKRKSKGRAHALEAA
        RSFFDVPTRF NL+SIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE+SRPNSELAMVCGFAS VKRKSKGRAHALEAA
Subjt:  RSFFDVPTRFENLISIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIETSRPNSELAMVCGFASNVKRKSKGRAHALEAA

Query:  QSSEPATLAVAGPASEDPTPVIELESSGGPSREKRPRGQTEAVD
        QSS+PAT AV GPASEDP  VIELESSGGPSREKRPR QTEAVD
Subjt:  QSSEPATLAVAGPASEDPTPVIELESSGGPSREKRPRGQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256651.9e-18969.33Show/hide
Query:  MCARKGAGGIVKGPTSIKAWVRKWFYASGEWLAKDESGRSFFDVPTRFENLISIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIK WV KWF+ASGEWLAKDESGR+FFDVPTRF NL+SI+ +PEL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKAWVRKWFYASGEWLAKDESGRSFFDVPTRFENLISIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIETSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATLAV--------AGPASEDPTPVIELESSGGPSREKRPRGQTEAVDVSPLGKEVGE
         VR IE SRPNSELAMVCGF  +VKRKSKGRAHAL+    +EP T  V        +GP+S  PTPVIEL+ SGG S EKR R ++EA+DVSPL +  GE
Subjt:  AVRPIETSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATLAV--------AGPASEDPTPVIELESSGGPSREKRPRGQTEAVDVSPLGKEVGE

Query:  AAPLKRRKKKKKTTSPLEVGACGVLSASFADRVDDPEAKMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDSCLRRASKFVSDPGSVLQRTIDYAAE--
         +PL+RR+KKKKT+S  E GA G L  S AD VDDPEA+M GTS+V  RF +EPSSSGV+DQVSRISA  LD  LRRASKFVSDPGSVLQRTID  AE  
Subjt:  AAPLKRRKKKKKTTSPLEVGACGVLSASFADRVDDPEAKMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDSCLRRASKFVSDPGSVLQRTIDYAAE--

Query:  -----------AELDGREVLAAREKEEFSAALEAASSTMKDELLRAHSEVDILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML
                   AELDGRE LAA+E+E   AALEAA +T+K ELL+A  EVDIL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ 
Subjt:  -----------AELDGREVLAAREKEEFSAALEAASSTMKDELLRAHSEVDILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML

Query:  QALEAKDEKLKRATTELETAKERLSNGALLEESFRQHPDFDGFAKDFFDAGFKFLMKGITSDMPDLQIDLSGLKKRYAKQWASGPNSTPGPQALVNKYVR
        Q LE KD  + R TTEL+  KERL+NG LLEESFRQHPDFDGFAKDF DAGFKFLMKGI +DMP LQIDL+GLKK+Y+++WASGPN TP PQ+LV+KYVR
Subjt:  QALEAKDEKLKRATTELETAKERLSNGALLEESFRQHPDFDGFAKDFFDAGFKFLMKGITSDMPDLQIDLSGLKKRYAKQWASGPNSTPGPQALVNKYVR

Query:  DLDSDYSDLEEDQIQRLDHSELFTS
        +LDSDYSD+EE+     +  E+ T+
Subjt:  DLDSDYSDLEEDQIQRLDHSELFTS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGGTAGTCCCGTTAATCTCGCGACGGTTACACCCGGTGATCTCGGGACTGACGGTTACACCCGGGTATTTTCTCCCCCAAACATTGGCCCCCTCTCTGTCCGATT
CGACCTCGACCTGGCAGAGAAGTTCATTCGATTCGCTTTGGACACGTGGCGACTTCCTATTCGTGGAAAAATACAACTGTTGCGGTCATACCTTACGCTTCCTGAATTCT
TGGAGTTCGATCTGAAGGCAGCTCGAACCCTTGGATCCGAAGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGACGACGGG
GAGGATAGTGATACTTCCACCTCGGGTCAGGGTTTGGAATACCTTTCTAGGATACCCGAGCACTACCTCAGACCCCTTCGTAGGGGGTTCGCTATTCCTGAAAACATCCT
CCTTAGGATTCCAGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAAGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTCG
TCCAAGAGTTTCTTTTCCGAATTGGGCTGGCTCCAGTCGAGCTATTGGATGATGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTTGGTAC
TATATGTGCGCAAGGAAAGGCGCAGGAGGTATAGTTAAGGGGCCGACCTCCATCAAAGCATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGA
GTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGAGAACCTAATATCAATCCGGCCAGTTCCCGAGCTTACTCAAGCCTCCTTCGACACGCTGAAATATTACAAGG
AGCACTTTCCGAGGGGCAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTCGAGTCCGGGCTGTTAGATTACAACCCTGCAGTTCGTCCCATCGAAACTTCAAGG
CCGAACTCCGAACTAGCCATGGTTTGCGGATTTGCGAGTAACGTAAAGCGCAAGTCCAAGGGCAGAGCCCATGCTCTCGAGGCCGCCCAGAGTTCGGAACCTGCAACTCT
TGCTGTGGCAGGGCCAGCCTCAGAAGATCCAACCCCAGTAATCGAGCTGGAGTCTTCTGGAGGTCCTTCGCGGGAGAAGCGCCCAAGGGGTCAGACCGAGGCGGTGGACG
TCTCGCCCTTGGGCAAGGAGGTGGGGGAGGCGGCCCCTCTGAAGCGGAGGAAGAAGAAGAAGAAAACCACCTCCCCCTTGGAGGTCGGAGCTTGTGGGGTCCTGTCCGCG
AGCTTCGCAGACCGGGTGGACGATCCTGAAGCCAAGATGGGCGGGACGTCCGACGTGACAGCACGGTTCAGAGTTGAACCGTCAAGTTCTGGGGTGAGGGACCAGGTGTC
CCGCATCTCCGCTGCAAGTTTGGACAGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCCGGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCCGAGC
TGGATGGGAGGGAAGTTCTGGCAGCGAGAGAGAAAGAGGAATTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACGATGAAGGATGAGCTGCTAAGGGCTCACTCTGAGGTG
GACATTCTGAAGGCCGAGGTGGAGGCCAAGGCCGAGCTGTTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCCATCACCAAGGGCCTGGAGAA
GGAGAAGTTCCAACTCCTCAAAGAGAAGGACGACATGCTTCAGGCGCTTGAAGCGAAGGACGAGAAGCTGAAGCGTGCGACTACCGAGCTAGAGACGGCGAAGGAGCGTC
TCAGCAACGGAGCCCTGCTGGAGGAGTCTTTCAGGCAACATCCTGACTTCGACGGATTTGCCAAAGACTTTTTTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTACT
TCCGACATGCCTGACCTTCAGATCGATCTCAGTGGTCTGAAGAAGAGGTATGCCAAGCAATGGGCTTCTGGGCCTAACAGCACCCCTGGCCCCCAAGCGTTGGTGAATAA
GTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGATTCAGAGGCTTGATCATTCTGAACTTTTCACATCGCCCCTTTACCTTGAAGGTTTGAATT
TTAAGTTCATCAGTGGTTTTGGCATCGCACCTCGTACCCTTAGGTTTTGGAGGATTGAATTTTCTGAACTTTTCACATCGCCCCTTTACCTTGAAGGTTTGAATTTTAAG
TTCATCAGTGGTTTTGGAATCGCACCTCGTACCCTTAGGCCCTCTTCAGGGGTTAGGCATCTCAGTAGAGGTAGAGAAAAACCGCGTCGGTACAACGTTCCGCCTCGGAC
TATGAACCGAGCTGCCTTCCTCGCTAACTTTCTGCGCTCCTTGGGGTCTTGTGGTGAATTGCCCCTAATGAAGTCCATAATCGGGTCCATCCATGAGGGCTCTGGAGCGC
CAATCTCCATCAGATCTGGCTCCAAGATCGAGGGATTATCCAAGATCTCAACGGGGACCGACTTGGCCAGGTCGGCCTCTGACACCTGCAGCTTGCCACGATTGATATGT
TGGGAGGGACAAGATGTTGGCAGATGCGCCTCCATCGACCAACACTCTTCGGACCAGGACGTGATCAATCAGAGGGGCGATCACCAGTGCATCGTTGTGAGGCAAGTGGA
TCCCCTCCAGGTCGGCGTCGCGAAAAGTGATGGAGCAAGTGGGCTTCTGCTCTTCGATGATGCATACCTCGAGCTAGCTCTTTCCTCTTGTTTCCAGACTGGCCCCCGCT
CGGACCCCCGAAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTGGTAGTCCCGTTAATCTCGCGACGGTTACACCCGGTGATCTCGGGACTGACGGTTACACCCGGGTATTTTCTCCCCCAAACATTGGCCCCCTCTCTGTCCGATT
CGACCTCGACCTGGCAGAGAAGTTCATTCGATTCGCTTTGGACACGTGGCGACTTCCTATTCGTGGAAAAATACAACTGTTGCGGTCATACCTTACGCTTCCTGAATTCT
TGGAGTTCGATCTGAAGGCAGCTCGAACCCTTGGATCCGAAGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGACGACGGG
GAGGATAGTGATACTTCCACCTCGGGTCAGGGTTTGGAATACCTTTCTAGGATACCCGAGCACTACCTCAGACCCCTTCGTAGGGGGTTCGCTATTCCTGAAAACATCCT
CCTTAGGATTCCAGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAAGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTCG
TCCAAGAGTTTCTTTTCCGAATTGGGCTGGCTCCAGTCGAGCTATTGGATGATGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTTGGTAC
TATATGTGCGCAAGGAAAGGCGCAGGAGGTATAGTTAAGGGGCCGACCTCCATCAAAGCATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGA
GTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGAGAACCTAATATCAATCCGGCCAGTTCCCGAGCTTACTCAAGCCTCCTTCGACACGCTGAAATATTACAAGG
AGCACTTTCCGAGGGGCAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTCGAGTCCGGGCTGTTAGATTACAACCCTGCAGTTCGTCCCATCGAAACTTCAAGG
CCGAACTCCGAACTAGCCATGGTTTGCGGATTTGCGAGTAACGTAAAGCGCAAGTCCAAGGGCAGAGCCCATGCTCTCGAGGCCGCCCAGAGTTCGGAACCTGCAACTCT
TGCTGTGGCAGGGCCAGCCTCAGAAGATCCAACCCCAGTAATCGAGCTGGAGTCTTCTGGAGGTCCTTCGCGGGAGAAGCGCCCAAGGGGTCAGACCGAGGCGGTGGACG
TCTCGCCCTTGGGCAAGGAGGTGGGGGAGGCGGCCCCTCTGAAGCGGAGGAAGAAGAAGAAGAAAACCACCTCCCCCTTGGAGGTCGGAGCTTGTGGGGTCCTGTCCGCG
AGCTTCGCAGACCGGGTGGACGATCCTGAAGCCAAGATGGGCGGGACGTCCGACGTGACAGCACGGTTCAGAGTTGAACCGTCAAGTTCTGGGGTGAGGGACCAGGTGTC
CCGCATCTCCGCTGCAAGTTTGGACAGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCCGGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCCGAGC
TGGATGGGAGGGAAGTTCTGGCAGCGAGAGAGAAAGAGGAATTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACGATGAAGGATGAGCTGCTAAGGGCTCACTCTGAGGTG
GACATTCTGAAGGCCGAGGTGGAGGCCAAGGCCGAGCTGTTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCCATCACCAAGGGCCTGGAGAA
GGAGAAGTTCCAACTCCTCAAAGAGAAGGACGACATGCTTCAGGCGCTTGAAGCGAAGGACGAGAAGCTGAAGCGTGCGACTACCGAGCTAGAGACGGCGAAGGAGCGTC
TCAGCAACGGAGCCCTGCTGGAGGAGTCTTTCAGGCAACATCCTGACTTCGACGGATTTGCCAAAGACTTTTTTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTACT
TCCGACATGCCTGACCTTCAGATCGATCTCAGTGGTCTGAAGAAGAGGTATGCCAAGCAATGGGCTTCTGGGCCTAACAGCACCCCTGGCCCCCAAGCGTTGGTGAATAA
GTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGATTCAGAGGCTTGATCATTCTGAACTTTTCACATCGCCCCTTTACCTTGAAGGTTTGAATT
TTAAGTTCATCAGTGGTTTTGGCATCGCACCTCGTACCCTTAGGTTTTGGAGGATTGAATTTTCTGAACTTTTCACATCGCCCCTTTACCTTGAAGGTTTGAATTTTAAG
TTCATCAGTGGTTTTGGAATCGCACCTCGTACCCTTAGGCCCTCTTCAGGGGTTAGGCATCTCAGTAGAGGTAGAGAAAAACCGCGTCGGTACAACGTTCCGCCTCGGAC
TATGAACCGAGCTGCCTTCCTCGCTAACTTTCTGCGCTCCTTGGGGTCTTGTGGTGAATTGCCCCTAATGAAGTCCATAATCGGGTCCATCCATGAGGGCTCTGGAGCGC
CAATCTCCATCAGATCTGGCTCCAAGATCGAGGGATTATCCAAGATCTCAACGGGGACCGACTTGGCCAGGTCGGCCTCTGACACCTGCAGCTTGCCACGATTGATATGT
TGGGAGGGACAAGATGTTGGCAGATGCGCCTCCATCGACCAACACTCTTCGGACCAGGACGTGATCAATCAGAGGGGCGATCACCAGTGCATCGTTGTGAGGCAAGTGGA
TCCCCTCCAGGTCGGCGTCGCGAAAAGTGATGGAGCAAGTGGGCTTCTGCTCTTCGATGATGCATACCTCGAGCTAGCTCTTTCCTCTTGTTTCCAGACTGGCCCCCGCT
CGGACCCCCGAAAATAG
Protein sequenceShow/hide protein sequence
MGGSPVNLATVTPGDLGTDGYTRVFSPPNIGPLSVRFDLDLAEKFIRFALDTWRLPIRGKIQLLRSYLTLPEFLEFDLKAARTLGSEEDLARRLESELEEIENFRFSDDG
EDSDTSTSGQGLEYLSRIPEHYLRPLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRIGLAPVELLDDDQLLACFEAKRIAKKPGWY
YMCARKGAGGIVKGPTSIKAWVRKWFYASGEWLAKDESGRSFFDVPTRFENLISIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIETSR
PNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATLAVAGPASEDPTPVIELESSGGPSREKRPRGQTEAVDVSPLGKEVGEAAPLKRRKKKKKTTSPLEVGACGVLSA
SFADRVDDPEAKMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDSCLRRASKFVSDPGSVLQRTIDYAAEAELDGREVLAAREKEEFSAALEAASSTMKDELLRAHSEV
DILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEKLKRATTELETAKERLSNGALLEESFRQHPDFDGFAKDFFDAGFKFLMKGIT
SDMPDLQIDLSGLKKRYAKQWASGPNSTPGPQALVNKYVRDLDSDYSDLEEDQIQRLDHSELFTSPLYLEGLNFKFISGFGIAPRTLRFWRIEFSELFTSPLYLEGLNFK
FISGFGIAPRTLRPSSGVRHLSRGREKPRRYNVPPRTMNRAAFLANFLRSLGSCGELPLMKSIIGSIHEGSGAPISIRSGSKIEGLSKISTGTDLARSASDTCSLPRLIC
WEGQDVGRCASIDQHSSDQDVINQRGDHQCIVVRQVDPLQVGVAKSDGASGLLLFDDAYLELALSSCFQTGPRSDPRK