; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g01040 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g01040
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:793522..802610
RNA-Seq ExpressionMoc07g01040
SyntenyMoc07g01040
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]2.6e-8972.98Show/hide
Query:  RRPAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSVLAVKAELDVREVLAAREKKEFSAVLEAASSTMKDELLRAHSEVDILKAEVEAKAEML
        R  A  LDRCLRRAS+FVSDPGSVLQRTID AAEAF+ASI S + VKAELD RE L A+E++  S  LEAA +T+K ELL+A  EVDIL+AEV+AK ++L
Subjt:  RRPAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSVLAVKAELDVREVLAAREKKEFSAVLEAASSTMKDELLRAHSEVDILKAEVEAKAEML

Query:  KKEEDRRKAQLRAAHAITKGLEKEKFQLLKENDDMLQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFDGFVKDFSDAGFKFLMKGITSDMP
        KKE ++ KA LRAAHAITKGLEKEKFQLLKE DD+ Q LE KD  + R T EL+  KERL +GALLEESFRQHP+FDGF KDFSDAGFKFLMKGI +DMP
Subjt:  KKEEDRRKAQLRAAHAITKGLEKEKFQLLKENDDMLQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFDGFVKDFSDAGFKFLMKGITSDMP

Query:  DLQIDLSGLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEED
         LQIDLS LKKRY+E WASGPNGTPGPQ+LV+KYVR+LDSDYSD+EE+
Subjt:  DLQIDLSGLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEED

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]1.6e-11087.2Show/hide
Query:  RRPAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSVLAVKAELDVREVLAAREKKEFSAVLEAASSTMKDELLRAHSEVDILKAEVEAKAEML
        R  AASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQS LAVKAELD REVLAAREK+EFSA LE ASSTMKDELL+AHSEV+ LKAEVE++AE+L
Subjt:  RRPAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSVLAVKAELDVREVLAAREKKEFSAVLEAASSTMKDELLRAHSEVDILKAEVEAKAEML

Query:  KKEEDRRKAQLRAAHAITKGLEKEKFQLLKENDDMLQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFDGFVKDFSDAGFKFLMKGITSDMP
        KKEEDRR+AQLRAAHAIT+GLE+EKFQLLKE DDMLQALEAKD+EL+ ATAELETAKERL NG LLEE+FRQHPDFDGF KDFSDAGFKFLMKGI SDMP
Subjt:  KKEEDRRKAQLRAAHAITKGLEKEKFQLLKENDDMLQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFDGFVKDFSDAGFKFLMKGITSDMP

Query:  DLQIDLSGLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEEDQV
        DLQIDLSGLK+RYAE+WASGP GTPGPQALV++YVRDLDSDYSD EEDQV
Subjt:  DLQIDLSGLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEEDQV

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]2.3e-9074.6Show/hide
Query:  RRPAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSVLAVKAELDVREVLAAREKKEFSAVLEAASSTMKDELLRAHSEVDILKAEVEAKAEML
        R  A  LDRCL+RASKFVSDPGSVLQRTID AAEAFVASI S + VKAELD RE LAA+E++  SA LEAA +T+K ELL+A  EV IL+AEV+AKAE+L
Subjt:  RRPAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSVLAVKAELDVREVLAAREKKEFSAVLEAASSTMKDELLRAHSEVDILKAEVEAKAEML

Query:  KKEEDRRKAQLRAAHAITKGLEKEKFQLLKENDDMLQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFDGFVKDFSDAGFKFLMKGITSDMP
        KKE ++ KA LRAAHAITKGLEKEKFQLLKE DD+ Q LE KD  + R TAEL+  KERL NG+LLEESFRQH DFDGF KDFSDAGFKFLMKGI +DMP
Subjt:  KKEEDRRKAQLRAAHAITKGLEKEKFQLLKENDDMLQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFDGFVKDFSDAGFKFLMKGITSDMP

Query:  DLQIDLSGLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEED
         LQIDLS LKK+Y+E+WASGPNGTPGPQ+LV KYVR+LDSDYSD+EE+
Subjt:  DLQIDLSGLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEED

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]4.1e-11174.65Show/hide
Query:  MSSSFSSDLGSDEDLARRLVSELEEIENFRFFDDGEDSDASTSGQGLEYPSRLPEHYFGPLRRGFDIPENILLRIPKEGERADNPPEGWVTLYFKMFE--
        MSSS SS+L  + DLARRL S+LEEIEN R  DDGEDSDASTSGQGLEYPSR+PEHY G LRRGF IPENILLR+P+EGERADNPPEGWVTLYFKMFE  
Subjt:  MSSSFSSDLGSDEDLARRLVSELEEIENFRFFDDGEDSDASTSGQGLEYPSRLPEHYFGPLRRGFDIPENILLRIPKEGERADNPPEGWVTLYFKMFE--

Query:  -RLQTSP--------SPFRPRVSFPN------------WAGSGSSGPQWLLDVDQLLACFEAKRIAKKPGRYYMCVRKGAGGIVKGSTSIKGWVRKWFYA
         RL   P        +   P    PN            W  +  S    L DVDQLLACFEAKRIAKKPGR+YMC RKGAGGIVKG TSIKGWVRKWFYA
Subjt:  -RLQTSP--------SPFRPRVSFPN------------WAGSGSSGPQWLLDVDQLLACFEAKRIAKKPGRYYMCVRKGAGGIVKGSTSIKGWVRKWFYA

Query:  SGEWLAKDESSRSFFDVPTRFGNLISIRPVLELTQASFDTLKYYKEHFPRGKKVEILVTDKLLLESGLLDYNPAVRPIEASRPNSELA
        SGEWLAKDES RSFFDVPTRFGNL+SIRPV ELTQASFDTLKYYKE FPRG+KV  LVTD+LLLESGLLDYNPAVRPIE+SRPNSELA
Subjt:  SGEWLAKDESSRSFFDVPTRFGNLISIRPVLELTQASFDTLKYYKEHFPRGKKVEILVTDKLLLESGLLDYNPAVRPIEASRPNSELA

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]2.6e-13457.7Show/hide
Query:  MCVRKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLISIRPVLELTQASFDTLKYYKEHFPRGKKVEILVTDKLLLESGLLDYNP
        MC RKG GGIVKG TSIKGWV KWF+ASGEWLAKDES R+FFDVPTRFGNL+SI+ + EL QA+FDTLK+YK+HFPR +K+  LVTDKLLLESGLLDYNP
Subjt:  MCVRKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLISIRPVLELTQASFDTLKYYKEHFPRGKKVEILVTDKLLLESGLLDYNP

Query:  AVRPIEASRPNSELAK------GPSPCSRGRQEFGTCNSCCGRASLRRSNPSDRA-GVFWRSFVGEAPK-GLDRGGGRLALGRGS--------------G
         VR IEASRPNSELA            S+GR       +  G   +  + P   A G    S     P   LD  GGR    R                 
Subjt:  AVRPIEASRPNSELAK------GPSPCSRGRQEFGTCNSCCGRASLRRSNPSDRA-GVFWRSFVGEAPK-GLDRGGGRLALGRGS--------------G

Query:  GGSP----------SEAEEEEEENHLPLGGWSLW-GPARELR-----------RPAAS-------------LDRCLRRASKFVSDPGSVLQRTIDYAAEA
        G SP          S + E      LP     L   P   +R            P++S             LDR LRRASKFVSDPGSVLQRTID  AEA
Subjt:  GGSP----------SEAEEEEEENHLPLGGWSLW-GPARELR-----------RPAAS-------------LDRCLRRASKFVSDPGSVLQRTIDYAAEA

Query:  FVASIQSVLAVKAELDVREVLAAREKKEFSAVLEAASSTMKDELLRAHSEVDILKAEVEAKAEMLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKENDDM
        F+ASI   + VKAELD RE LAA+E++   A LEAA +T+K ELL+A  EVDIL+AEV+AK ++LKKE ++ KA LRAAHAITKGLEKEKFQLLKE DD+
Subjt:  FVASIQSVLAVKAELDVREVLAAREKKEFSAVLEAASSTMKDELLRAHSEVDILKAEVEAKAEMLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKENDDM

Query:  LQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFDGFVKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWASGPNGTPGPQALVNKYV
         Q LE KD  + R T EL+  KERL NG LLEESFRQHPDFDGF KDFSDAGFKFLMKGI +DMP LQIDL+GLKK+Y+E+WASGPNGTP PQ+LV+KYV
Subjt:  LQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFDGFVKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWASGPNGTPGPQALVNKYV

Query:  RDLDSDYSDLEED
        R+LDSDYSD+EE+
Subjt:  RDLDSDYSDLEED

TrEMBL top hitse value%identityAlignment
A0A6J1D1N9 uncharacterized protein LOC1110161931.3e-8972.98Show/hide
Query:  RRPAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSVLAVKAELDVREVLAAREKKEFSAVLEAASSTMKDELLRAHSEVDILKAEVEAKAEML
        R  A  LDRCLRRAS+FVSDPGSVLQRTID AAEAF+ASI S + VKAELD RE L A+E++  S  LEAA +T+K ELL+A  EVDIL+AEV+AK ++L
Subjt:  RRPAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSVLAVKAELDVREVLAAREKKEFSAVLEAASSTMKDELLRAHSEVDILKAEVEAKAEML

Query:  KKEEDRRKAQLRAAHAITKGLEKEKFQLLKENDDMLQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFDGFVKDFSDAGFKFLMKGITSDMP
        KKE ++ KA LRAAHAITKGLEKEKFQLLKE DD+ Q LE KD  + R T EL+  KERL +GALLEESFRQHP+FDGF KDFSDAGFKFLMKGI +DMP
Subjt:  KKEEDRRKAQLRAAHAITKGLEKEKFQLLKENDDMLQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFDGFVKDFSDAGFKFLMKGITSDMP

Query:  DLQIDLSGLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEED
         LQIDLS LKKRY+E WASGPNGTPGPQ+LV+KYVR+LDSDYSD+EE+
Subjt:  DLQIDLSGLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEED

A0A6J1D971 uncharacterized protein LOC1110185387.6e-11187.2Show/hide
Query:  RRPAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSVLAVKAELDVREVLAAREKKEFSAVLEAASSTMKDELLRAHSEVDILKAEVEAKAEML
        R  AASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQS LAVKAELD REVLAAREK+EFSA LE ASSTMKDELL+AHSEV+ LKAEVE++AE+L
Subjt:  RRPAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSVLAVKAELDVREVLAAREKKEFSAVLEAASSTMKDELLRAHSEVDILKAEVEAKAEML

Query:  KKEEDRRKAQLRAAHAITKGLEKEKFQLLKENDDMLQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFDGFVKDFSDAGFKFLMKGITSDMP
        KKEEDRR+AQLRAAHAIT+GLE+EKFQLLKE DDMLQALEAKD+EL+ ATAELETAKERL NG LLEE+FRQHPDFDGF KDFSDAGFKFLMKGI SDMP
Subjt:  KKEEDRRKAQLRAAHAITKGLEKEKFQLLKENDDMLQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFDGFVKDFSDAGFKFLMKGITSDMP

Query:  DLQIDLSGLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEEDQV
        DLQIDLSGLK+RYAE+WASGP GTPGPQALV++YVRDLDSDYSD EEDQV
Subjt:  DLQIDLSGLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEEDQV

A0A6J1DF31 uncharacterized protein LOC1110199091.1e-9074.6Show/hide
Query:  RRPAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSVLAVKAELDVREVLAAREKKEFSAVLEAASSTMKDELLRAHSEVDILKAEVEAKAEML
        R  A  LDRCL+RASKFVSDPGSVLQRTID AAEAFVASI S + VKAELD RE LAA+E++  SA LEAA +T+K ELL+A  EV IL+AEV+AKAE+L
Subjt:  RRPAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSVLAVKAELDVREVLAAREKKEFSAVLEAASSTMKDELLRAHSEVDILKAEVEAKAEML

Query:  KKEEDRRKAQLRAAHAITKGLEKEKFQLLKENDDMLQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFDGFVKDFSDAGFKFLMKGITSDMP
        KKE ++ KA LRAAHAITKGLEKEKFQLLKE DD+ Q LE KD  + R TAEL+  KERL NG+LLEESFRQH DFDGF KDFSDAGFKFLMKGI +DMP
Subjt:  KKEEDRRKAQLRAAHAITKGLEKEKFQLLKENDDMLQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFDGFVKDFSDAGFKFLMKGITSDMP

Query:  DLQIDLSGLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEED
         LQIDLS LKK+Y+E+WASGPNGTPGPQ+LV KYVR+LDSDYSD+EE+
Subjt:  DLQIDLSGLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEED

A0A6J1DXS5 uncharacterized protein LOC1110255022.0e-11174.65Show/hide
Query:  MSSSFSSDLGSDEDLARRLVSELEEIENFRFFDDGEDSDASTSGQGLEYPSRLPEHYFGPLRRGFDIPENILLRIPKEGERADNPPEGWVTLYFKMFE--
        MSSS SS+L  + DLARRL S+LEEIEN R  DDGEDSDASTSGQGLEYPSR+PEHY G LRRGF IPENILLR+P+EGERADNPPEGWVTLYFKMFE  
Subjt:  MSSSFSSDLGSDEDLARRLVSELEEIENFRFFDDGEDSDASTSGQGLEYPSRLPEHYFGPLRRGFDIPENILLRIPKEGERADNPPEGWVTLYFKMFE--

Query:  -RLQTSP--------SPFRPRVSFPN------------WAGSGSSGPQWLLDVDQLLACFEAKRIAKKPGRYYMCVRKGAGGIVKGSTSIKGWVRKWFYA
         RL   P        +   P    PN            W  +  S    L DVDQLLACFEAKRIAKKPGR+YMC RKGAGGIVKG TSIKGWVRKWFYA
Subjt:  -RLQTSP--------SPFRPRVSFPN------------WAGSGSSGPQWLLDVDQLLACFEAKRIAKKPGRYYMCVRKGAGGIVKGSTSIKGWVRKWFYA

Query:  SGEWLAKDESSRSFFDVPTRFGNLISIRPVLELTQASFDTLKYYKEHFPRGKKVEILVTDKLLLESGLLDYNPAVRPIEASRPNSELA
        SGEWLAKDES RSFFDVPTRFGNL+SIRPV ELTQASFDTLKYYKE FPRG+KV  LVTD+LLLESGLLDYNPAVRPIE+SRPNSELA
Subjt:  SGEWLAKDESSRSFFDVPTRFGNLISIRPVLELTQASFDTLKYYKEHFPRGKKVEILVTDKLLLESGLLDYNPAVRPIEASRPNSELA

A0A6J1DZB3 uncharacterized protein LOC1110256651.3e-13457.7Show/hide
Query:  MCVRKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLISIRPVLELTQASFDTLKYYKEHFPRGKKVEILVTDKLLLESGLLDYNP
        MC RKG GGIVKG TSIKGWV KWF+ASGEWLAKDES R+FFDVPTRFGNL+SI+ + EL QA+FDTLK+YK+HFPR +K+  LVTDKLLLESGLLDYNP
Subjt:  MCVRKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLISIRPVLELTQASFDTLKYYKEHFPRGKKVEILVTDKLLLESGLLDYNP

Query:  AVRPIEASRPNSELAK------GPSPCSRGRQEFGTCNSCCGRASLRRSNPSDRA-GVFWRSFVGEAPK-GLDRGGGRLALGRGS--------------G
         VR IEASRPNSELA            S+GR       +  G   +  + P   A G    S     P   LD  GGR    R                 
Subjt:  AVRPIEASRPNSELAK------GPSPCSRGRQEFGTCNSCCGRASLRRSNPSDRA-GVFWRSFVGEAPK-GLDRGGGRLALGRGS--------------G

Query:  GGSP----------SEAEEEEEENHLPLGGWSLW-GPARELR-----------RPAAS-------------LDRCLRRASKFVSDPGSVLQRTIDYAAEA
        G SP          S + E      LP     L   P   +R            P++S             LDR LRRASKFVSDPGSVLQRTID  AEA
Subjt:  GGSP----------SEAEEEEEENHLPLGGWSLW-GPARELR-----------RPAAS-------------LDRCLRRASKFVSDPGSVLQRTIDYAAEA

Query:  FVASIQSVLAVKAELDVREVLAAREKKEFSAVLEAASSTMKDELLRAHSEVDILKAEVEAKAEMLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKENDDM
        F+ASI   + VKAELD RE LAA+E++   A LEAA +T+K ELL+A  EVDIL+AEV+AK ++LKKE ++ KA LRAAHAITKGLEKEKFQLLKE DD+
Subjt:  FVASIQSVLAVKAELDVREVLAAREKKEFSAVLEAASSTMKDELLRAHSEVDILKAEVEAKAEMLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKENDDM

Query:  LQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFDGFVKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWASGPNGTPGPQALVNKYV
         Q LE KD  + R T EL+  KERL NG LLEESFRQHPDFDGF KDFSDAGFKFLMKGI +DMP LQIDL+GLKK+Y+E+WASGPNGTP PQ+LV+KYV
Subjt:  LQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFDGFVKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWASGPNGTPGPQALVNKYV

Query:  RDLDSDYSDLEED
        R+LDSDYSD+EE+
Subjt:  RDLDSDYSDLEED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCAGTGAGATGACTTTCCTACTGACGGAATCGCTTGAATTAGTTGAAGGAAAGGAGGTGGCTGGTCTCACTGGAGAGGAGAGTTTTGACAGCGAGGGAGGTATGTG
TGCGTCTGCTGGTATCGTATATAAGAGAAAGGCTGCATTTAGGAGTGATCTTTCCCTCTTCTTCAAGCCAGTGGTGATCTCACTTTCGAAAGAGAGTCTCGACGTGCTTA
GTAGCGCGTCATACCTTACGCTTCCTGAATTTTTGGAGTTCGATCTGAAGGCAGCTCGAACCCTTGGTAGGTCGATCTCTTCCCTCTCACTTTTTCTTTCGAACGTAGTT
TCCATGTCGTCCTCTTTTAGTAGCGACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGTGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTTCGACGACGGGGA
GGATAGTGATGCTTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGTTACCCGAGCACTACTTCGGACCCCTTCGTAGGGGGTTCGATATCCCTGAAAACATCCTCC
TTAGGATCCCGAAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGCGGCTTCAGACTTCCCCTTCACCCTTTCGTCCA
AGAGTTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGCTATTGGATGTTGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCCAAGAAGCC
TGGTCGGTACTATATGTGCGTAAGGAAAGGTGCAGGAGGTATAGTTAAGGGGTCGACCTCCATCAAAGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTAG
CAAAGGACGAGTCAAGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAATATCAATCAGGCCAGTTCTCGAGCTTACTCAAGCCTCCTTCGACACGCTGAAA
TATTACAAGGAGCACTTTCCGAGGGGCAAGAAGGTCGAAATCTTGGTGACCGACAAGCTGCTGCTCGAGTCCGGGCTGTTAGATTACAACCCTGCAGTTCGTCCCATTGA
AGCTTCAAGGCCGAACTCCGAACTAGCCAAAGGGCCGAGCCCATGCTCTCGAGGCCGCCAAGAGTTCGGAACCTGCAACTCCTGCTGTGGCAGGGCCAGCCTTAGAAGAT
CCAATCCCAGTGATCGAGCTGGAGTCTTCTGGAGGTCCTTCGTGGGAGAAGCGCCCAAGGGGTTAGACCGAGGCGGTGGACGCCTCGCCCTTGGGCGAGGAAGTGGGGGA
GGGAGCCCCTCTGAAGCGGAGGAAGAAGAAGAAGAAAACCACCTCCCCCTTGGAGGTTGGAGCTTGTGGGGTCCTGCCCGCGAGCTTCGCAGACCGGCTGCAAGCTTGGA
CCGCTGCCTCAGAAGAGCGTCCAAGTTTGTAAGTGACCCGGGGTCCGTTCTGCAGAGGACCATTGACTACGCCGCTGAGGCTTTTGTTGCTTCCATTCAATCGGTCCTAG
CTGTAAAGGCCGAGCTGGATGTGAGGGAAGTTCTGGCAGCGAGGGAGAAAAAGGAATTCTCTGCTGTCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTAAGG
GCTCACTCTGAGGTGGACATTCTGAAGGCCGAGGTGGAGGCCAAGGCCGAGATGTTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCAC
CAAGGGCCTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAATGACGACATGTTGCAGGCGCTTGAAGCGAAGGACGAGGAGCTGAAGCGTGCGACTGCCGAGCTGGAGA
CGGCAAAGGAGCGTCTCGGCAATGGAGCCCTACTAGAGGAGTCTTTCAGGCAACATCCTGACTTCGACGGATTTGTCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTC
ATGAAGGGCATTACTTCTGACATGCCTGACCTTCAGATCGATCTCAGTGGTCTGAAGAAGAGGTATGCCGAGCAGTGGGCTTCTGGGCCTAACGGCACCCCTGGCCCCCA
AGCGTTGGTGAATAAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTGCTCCGCGTTCCATGGGTGCGCCAAGACGTCTCCTTTCAGATCG
GCCAATATGTATGTCCCAGGCCCTCTTCAGGGGTTAGGCATCTCAGTAGAGGCAGGGAAAAACCGCGTCGGTACAACGCTCCATCTCGGACTACGAACCGAGCTGCCTTC
CTCGCCAACTTTCTGCGCTCCTTCGGGTCTTGTGGTGAATTGCCCCTAATGAAGTCCATAATCGGGTCCATCCATGAGGGCTCTGGAGCGCCAATCTCCATCAGATCTGG
CTCCAAGATCGAGGGATTATCCAAGATCTCAACGGGGACCGACCTGGCCAGGTCGGCCTCTGACTCCTGCAGCTTGCCACGATTGATATGTTGGGAGGGACAAGATGTTG
GCAGATACACCTCCATCGACCAACACTCTTCGGACCAGGACGTGATTAATCAGAGAGGCGATCACCAATGCATCGTTGTGAGGCAAGTGGACCCCCTCCAGGTCGGCGTT
GCCAAAAGTGATGGAGCAAGTGGGCTTCTGCTCTCTGATGATGCATACCTCGCGCCTGGCCTCGCGAGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCAGTGAGATGACTTTCCTACTGACGGAATCGCTTGAATTAGTTGAAGGAAAGGAGGTGGCTGGTCTCACTGGAGAGGAGAGTTTTGACAGCGAGGGAGGTATGTG
TGCGTCTGCTGGTATCGTATATAAGAGAAAGGCTGCATTTAGGAGTGATCTTTCCCTCTTCTTCAAGCCAGTGGTGATCTCACTTTCGAAAGAGAGTCTCGACGTGCTTA
GTAGCGCGTCATACCTTACGCTTCCTGAATTTTTGGAGTTCGATCTGAAGGCAGCTCGAACCCTTGGTAGGTCGATCTCTTCCCTCTCACTTTTTCTTTCGAACGTAGTT
TCCATGTCGTCCTCTTTTAGTAGCGACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGTGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTTCGACGACGGGGA
GGATAGTGATGCTTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGTTACCCGAGCACTACTTCGGACCCCTTCGTAGGGGGTTCGATATCCCTGAAAACATCCTCC
TTAGGATCCCGAAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGCGGCTTCAGACTTCCCCTTCACCCTTTCGTCCA
AGAGTTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGCTATTGGATGTTGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCCAAGAAGCC
TGGTCGGTACTATATGTGCGTAAGGAAAGGTGCAGGAGGTATAGTTAAGGGGTCGACCTCCATCAAAGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTAG
CAAAGGACGAGTCAAGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAATATCAATCAGGCCAGTTCTCGAGCTTACTCAAGCCTCCTTCGACACGCTGAAA
TATTACAAGGAGCACTTTCCGAGGGGCAAGAAGGTCGAAATCTTGGTGACCGACAAGCTGCTGCTCGAGTCCGGGCTGTTAGATTACAACCCTGCAGTTCGTCCCATTGA
AGCTTCAAGGCCGAACTCCGAACTAGCCAAAGGGCCGAGCCCATGCTCTCGAGGCCGCCAAGAGTTCGGAACCTGCAACTCCTGCTGTGGCAGGGCCAGCCTTAGAAGAT
CCAATCCCAGTGATCGAGCTGGAGTCTTCTGGAGGTCCTTCGTGGGAGAAGCGCCCAAGGGGTTAGACCGAGGCGGTGGACGCCTCGCCCTTGGGCGAGGAAGTGGGGGA
GGGAGCCCCTCTGAAGCGGAGGAAGAAGAAGAAGAAAACCACCTCCCCCTTGGAGGTTGGAGCTTGTGGGGTCCTGCCCGCGAGCTTCGCAGACCGGCTGCAAGCTTGGA
CCGCTGCCTCAGAAGAGCGTCCAAGTTTGTAAGTGACCCGGGGTCCGTTCTGCAGAGGACCATTGACTACGCCGCTGAGGCTTTTGTTGCTTCCATTCAATCGGTCCTAG
CTGTAAAGGCCGAGCTGGATGTGAGGGAAGTTCTGGCAGCGAGGGAGAAAAAGGAATTCTCTGCTGTCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTAAGG
GCTCACTCTGAGGTGGACATTCTGAAGGCCGAGGTGGAGGCCAAGGCCGAGATGTTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCAC
CAAGGGCCTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAATGACGACATGTTGCAGGCGCTTGAAGCGAAGGACGAGGAGCTGAAGCGTGCGACTGCCGAGCTGGAGA
CGGCAAAGGAGCGTCTCGGCAATGGAGCCCTACTAGAGGAGTCTTTCAGGCAACATCCTGACTTCGACGGATTTGTCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTC
ATGAAGGGCATTACTTCTGACATGCCTGACCTTCAGATCGATCTCAGTGGTCTGAAGAAGAGGTATGCCGAGCAGTGGGCTTCTGGGCCTAACGGCACCCCTGGCCCCCA
AGCGTTGGTGAATAAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTGCTCCGCGTTCCATGGGTGCGCCAAGACGTCTCCTTTCAGATCG
GCCAATATGTATGTCCCAGGCCCTCTTCAGGGGTTAGGCATCTCAGTAGAGGCAGGGAAAAACCGCGTCGGTACAACGCTCCATCTCGGACTACGAACCGAGCTGCCTTC
CTCGCCAACTTTCTGCGCTCCTTCGGGTCTTGTGGTGAATTGCCCCTAATGAAGTCCATAATCGGGTCCATCCATGAGGGCTCTGGAGCGCCAATCTCCATCAGATCTGG
CTCCAAGATCGAGGGATTATCCAAGATCTCAACGGGGACCGACCTGGCCAGGTCGGCCTCTGACTCCTGCAGCTTGCCACGATTGATATGTTGGGAGGGACAAGATGTTG
GCAGATACACCTCCATCGACCAACACTCTTCGGACCAGGACGTGATTAATCAGAGAGGCGATCACCAATGCATCGTTGTGAGGCAAGTGGACCCCCTCCAGGTCGGCGTT
GCCAAAAGTGATGGAGCAAGTGGGCTTCTGCTCTCTGATGATGCATACCTCGCGCCTGGCCTCGCGAGCTAG
Protein sequenceShow/hide protein sequence
MFSEMTFLLTESLELVEGKEVAGLTGEESFDSEGGMCASAGIVYKRKAAFRSDLSLFFKPVVISLSKESLDVLSSASYLTLPEFLEFDLKAARTLGRSISSLSLFLSNVV
SMSSSFSSDLGSDEDLARRLVSELEEIENFRFFDDGEDSDASTSGQGLEYPSRLPEHYFGPLRRGFDIPENILLRIPKEGERADNPPEGWVTLYFKMFERLQTSPSPFRP
RVSFPNWAGSGSSGPQWLLDVDQLLACFEAKRIAKKPGRYYMCVRKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLISIRPVLELTQASFDTLK
YYKEHFPRGKKVEILVTDKLLLESGLLDYNPAVRPIEASRPNSELAKGPSPCSRGRQEFGTCNSCCGRASLRRSNPSDRAGVFWRSFVGEAPKGLDRGGGRLALGRGSGG
GSPSEAEEEEEENHLPLGGWSLWGPARELRRPAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSVLAVKAELDVREVLAAREKKEFSAVLEAASSTMKDELLR
AHSEVDILKAEVEAKAEMLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKENDDMLQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFDGFVKDFSDAGFKFL
MKGITSDMPDLQIDLSGLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEEDQVLRVPWVRQDVSFQIGQYVCPRPSSGVRHLSRGREKPRRYNAPSRTTNRAAF
LANFLRSFGSCGELPLMKSIIGSIHEGSGAPISIRSGSKIEGLSKISTGTDLARSASDSCSLPRLICWEGQDVGRYTSIDQHSSDQDVINQRGDHQCIVVRQVDPLQVGV
AKSDGASGLLLSDDAYLAPGLAS