CuGenDBv2

Moc06g15980 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g15980
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr6:12632371..12636263
RNA-Seq Expression: Moc06g15980
Synteny: Moc06g15980
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
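The genome location field can be split into a chromosome name and a coordinate pair for downstream use. The sketch below is only an illustration: the `Locus` type and `parse_location` helper are hypothetical names, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A parsed 'chrom:start..end' location (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a location string such as 'chr6:12632371..12636263'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end = match.groups()
    return Locus(chrom, int(start), int(end))


locus = parse_location("chr6:12632371..12636263")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption the gene spans 3893 bp on chr6.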


Homology
GenBank top hits (accession, description, e value, % identity)
XP_022138041.1  uncharacterized protein LOC111009298 [Momordica charantia]  e value: 8.1e-113  % identity: 89.2
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVETLVTDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKV TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVETLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPA-EDPAPVIELESSEGPSREKRPRDQTEAVDVSPLGEKVREEVPLKRRR
        AVRPIESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSSKP TPAVVGPA EDPAPVIELESS GPSREKRPRDQTEAVDVSPLGE+VREEVPLKRRR
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPA-EDPAPVIELESSEGPSREKRPRDQTEAVDVSPLGEKVREEVPLKRRR

Query:  KKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVESSSSG
        KKKKTTSPLEV ARGVLPASFADRVDDPEARMGGT DVT RFRVE SSSG
Subjt:  KKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVESSSSG

XP_022144034.1  uncharacterized protein LOC111013826 [Momordica charantia]  e value: 5.2e-120  % identity: 90.76
Query:  PPVIAPNGWGVIFALAILFWLRARDSEEVQLLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPT
        P  +APNGWGVIFALAILFWLRARDSEE +LL+VDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPT
Subjt:  PPVIAPNGWGVIFALAILFWLRARDSEEVQLLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPT

Query:  RFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVETLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATP
        RFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKV TLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC FAS VKRKSKGRAHALEAAQSSKP TP
Subjt:  RFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVETLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATP

Query:  AVVGPA-EDPAPVIELESSEGPSREKRPRDQTEAV-------DVSPLGE
        AVVGPA EDPAPVIELESS GPSREKRPRDQTEAV       DV PLGE
Subjt:  AVVGPA-EDPAPVIELESSEGPSREKRPRDQTEAV-------DVSPLGE

XP_022150343.1  uncharacterized protein LOC111018538 [Momordica charantia]  e value: 6.9e-88  % identity: 76.86
Query:  PGSVLQRTIDYAAEAFVASIQAALAVKTELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKVKVETKAELLKKEEDRR-------------
        PGSVLQRTIDYAAEAFVASIQ+ALAVK ELDGREVLAAREKEEFSAALE ASSTMKDELLKAHSEVE LK +VE++AELLKKEEDRR             
Subjt:  PGSVLQRTIDYAAEAFVASIQAALAVKTELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKVKVETKAELLKKEEDRR-------------

Query:  ----------------KALEAKEEELKHATAELETVKERLSNGALLEESFRQHPNFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASG
                        +ALEAK++EL+HATAELET KERLSNG LLEE+FRQHP+FDGFAKDFSDAGFKFLMKGIASDMPDLQIDL GLK+RYAE+WASG
Subjt:  ----------------KALEAKEEELKHATAELETVKERLSNGALLEESFRQHPNFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASG

Query:  PSGTPGPQALVDKYVRDLNSDYSDLEEDQVGTTQEGAPQAGS
        P GTPGPQALVD+YVRDL+SDYSD EEDQVG+TQEGA   GS
Subjt:  PSGTPGPQALVDKYVRDLNSDYSDLEEDQVGTTQEGAPQAGS

XP_022159063.1  uncharacterized protein LOC111025502, partial [Momordica charantia]  e value: 2.0e-127  % identity: 61.8
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYLLGYLSTTSDPFVGGSLSLRTSSLGFRRRGRELTILQRDGSLSTSKCLSTV
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLE    Y S   + ++G   SLR    GF      L  L  +G           
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYLLGYLSTTSDPFVGGSLSLRTSSLGFRRRGRELTILQRDGSLSTSKCLSTV

Query:  SDFPFTLSSKSFFSELGWLRLKWPPMGGVSFSLVYMRKKKDHLGVQINQKKQKDEKSFMGGAKRLGSLQKNWFSSNFALNE--TRLPMRFGGSNRCVRVE
                           R   PP G                                            W +  F + E   RLP+            
Subjt:  SDFPFTLSSKSFFSELGWLRLKWPPMGGVSFSLVYMRKKKDHLGVQINQKKQKDEKSFMGGAKRLGSLQKNWFSSNFALNE--TRLPMRFGGSNRCVRVE

Query:  EVFHYQFEHDLVCACIYIIVTTLSACVQHKPPVIAPNGWGVIFALAILFWLRARDSEEVQLLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTS
                H  V    ++  T L+      P  +APNGWGVIFALAILFWLRARDSEE +L +VDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTS
Subjt:  EVFHYQFEHDLVCACIYIIVTTLSACVQHKPPVIAPNGWGVIFALAILFWLRARDSEEVQLLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTS

Query:  IKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVETLVTDKLLLESGLLDYNPAVRPIESSRPNSELAM
        IKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKV TLVTD+LLLESGLLDYNPAVRPIESSRPNSELAM
Subjt:  IKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVETLVTDKLLLESGLLDYNPAVRPIESSRPNSELAM

Query:  VCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPA-EDPAPVIELESSEGPSREKRPRDQTEAVD
        VCGFAS VKRKSKGRAHALEAAQSSKPATPAVVGPA EDPA VIELESS GPSREKRPRDQTEAVD
Subjt:  VCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPA-EDPAPVIELESSEGPSREKRPRDQTEAVD

XP_022159252.1  uncharacterized protein LOC111025665 [Momordica charantia]  e value: 1.2e-161  % identity: 62.13
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVETLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVETLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPAED-PAPVIELESSEGPSREKRPRDQTEAVDVSPLGEKVRE
         VR IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TP V         GP+   P PVIEL+ S G S EKR R+++EA+DVSPL E VR 
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPAED-PAPVIELESSEGPSREKRPRDQTEAVDVSPLGEKVRE

Query:  EVPLKRRRKKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVESSSSG-------------------------DPGSVLQRTIDYAAEAF
        E PL+RRRKKKKT+S  E  ARG LP S AD VDDPEARM GTS+V  RF +E SSSG                         DPGSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVESSSSG-------------------------DPGSVLQRTIDYAAEAF

Query:  VASIQAALAVKTELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKVKVETKAELLKKEEDRRKA---------------------------
        +ASI  A+ VK ELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+ +V+ K +LLKKE ++ KA                           
Subjt:  VASIQAALAVKTELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKVKVETKAELLKKEEDRRKA---------------------------

Query:  --LEAKEEELKHATAELETVKERLSNGALLEESFRQHPNFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVR
          LE K+  +   T EL+ +KERL+NG LLEESFRQHP+FDGFAKDFSDAGFKFLMKGIA+DMP LQIDL GLKK+Y+E+WASGP+GTP PQ+LVDKYVR
Subjt:  --LEAKEEELKHATAELETVKERLSNGALLEESFRQHPNFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVR

Query:  DLNSDYSDLEED--------QVGTTQEGAP--QAGS
        +L+SDYSD+EE+        +VGTTQE  P  Q GS
Subjt:  DLNSDYSDLEED--------QVGTTQEGAP--QAGS

TrEMBL top hits (accession, description, e value, % identity)
A0A6J1C8K9  uncharacterized protein LOC111009298  e value: 3.9e-113  % identity: 89.2
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVETLVTDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKV TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVETLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPA-EDPAPVIELESSEGPSREKRPRDQTEAVDVSPLGEKVREEVPLKRRR
        AVRPIESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSSKP TPAVVGPA EDPAPVIELESS GPSREKRPRDQTEAVDVSPLGE+VREEVPLKRRR
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPA-EDPAPVIELESSEGPSREKRPRDQTEAVDVSPLGEKVREEVPLKRRR

Query:  KKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVESSSSG
        KKKKTTSPLEV ARGVLPASFADRVDDPEARMGGT DVT RFRVE SSSG
Subjt:  KKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVESSSSG

A0A6J1CR42  uncharacterized protein LOC111013826  e value: 2.5e-120  % identity: 90.76
Query:  PPVIAPNGWGVIFALAILFWLRARDSEEVQLLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPT
        P  +APNGWGVIFALAILFWLRARDSEE +LL+VDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPT
Subjt:  PPVIAPNGWGVIFALAILFWLRARDSEEVQLLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPT

Query:  RFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVETLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATP
        RFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKV TLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC FAS VKRKSKGRAHALEAAQSSKP TP
Subjt:  RFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVETLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATP

Query:  AVVGPA-EDPAPVIELESSEGPSREKRPRDQTEAV-------DVSPLGE
        AVVGPA EDPAPVIELESS GPSREKRPRDQTEAV       DV PLGE
Subjt:  AVVGPA-EDPAPVIELESSEGPSREKRPRDQTEAV-------DVSPLGE

A0A6J1D971  uncharacterized protein LOC111018538  e value: 3.3e-88  % identity: 76.86
Query:  PGSVLQRTIDYAAEAFVASIQAALAVKTELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKVKVETKAELLKKEEDRR-------------
        PGSVLQRTIDYAAEAFVASIQ+ALAVK ELDGREVLAAREKEEFSAALE ASSTMKDELLKAHSEVE LK +VE++AELLKKEEDRR             
Subjt:  PGSVLQRTIDYAAEAFVASIQAALAVKTELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKVKVETKAELLKKEEDRR-------------

Query:  ----------------KALEAKEEELKHATAELETVKERLSNGALLEESFRQHPNFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASG
                        +ALEAK++EL+HATAELET KERLSNG LLEE+FRQHP+FDGFAKDFSDAGFKFLMKGIASDMPDLQIDL GLK+RYAE+WASG
Subjt:  ----------------KALEAKEEELKHATAELETVKERLSNGALLEESFRQHPNFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASG

Query:  PSGTPGPQALVDKYVRDLNSDYSDLEEDQVGTTQEGAPQAGS
        P GTPGPQALVD+YVRDL+SDYSD EEDQVG+TQEGA   GS
Subjt:  PSGTPGPQALVDKYVRDLNSDYSDLEEDQVGTTQEGAPQAGS

A0A6J1DXS5  uncharacterized protein LOC111025502  e value: 9.6e-128  % identity: 61.8
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYLLGYLSTTSDPFVGGSLSLRTSSLGFRRRGRELTILQRDGSLSTSKCLSTV
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLE    Y S   + ++G   SLR    GF      L  L  +G           
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYLLGYLSTTSDPFVGGSLSLRTSSLGFRRRGRELTILQRDGSLSTSKCLSTV

Query:  SDFPFTLSSKSFFSELGWLRLKWPPMGGVSFSLVYMRKKKDHLGVQINQKKQKDEKSFMGGAKRLGSLQKNWFSSNFALNE--TRLPMRFGGSNRCVRVE
                           R   PP G                                            W +  F + E   RLP+            
Subjt:  SDFPFTLSSKSFFSELGWLRLKWPPMGGVSFSLVYMRKKKDHLGVQINQKKQKDEKSFMGGAKRLGSLQKNWFSSNFALNE--TRLPMRFGGSNRCVRVE

Query:  EVFHYQFEHDLVCACIYIIVTTLSACVQHKPPVIAPNGWGVIFALAILFWLRARDSEEVQLLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTS
                H  V    ++  T L+      P  +APNGWGVIFALAILFWLRARDSEE +L +VDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTS
Subjt:  EVFHYQFEHDLVCACIYIIVTTLSACVQHKPPVIAPNGWGVIFALAILFWLRARDSEEVQLLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTS

Query:  IKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVETLVTDKLLLESGLLDYNPAVRPIESSRPNSELAM
        IKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKV TLVTD+LLLESGLLDYNPAVRPIESSRPNSELAM
Subjt:  IKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVETLVTDKLLLESGLLDYNPAVRPIESSRPNSELAM

Query:  VCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPA-EDPAPVIELESSEGPSREKRPRDQTEAVD
        VCGFAS VKRKSKGRAHALEAAQSSKPATPAVVGPA EDPA VIELESS GPSREKRPRDQTEAVD
Subjt:  VCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPA-EDPAPVIELESSEGPSREKRPRDQTEAVD

A0A6J1DZB3  uncharacterized protein LOC111025665  e value: 5.9e-162  % identity: 62.13
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVETLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVETLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPAED-PAPVIELESSEGPSREKRPRDQTEAVDVSPLGEKVRE
         VR IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TP V         GP+   P PVIEL+ S G S EKR R+++EA+DVSPL E VR 
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPAED-PAPVIELESSEGPSREKRPRDQTEAVDVSPLGEKVRE

Query:  EVPLKRRRKKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVESSSSG-------------------------DPGSVLQRTIDYAAEAF
        E PL+RRRKKKKT+S  E  ARG LP S AD VDDPEARM GTS+V  RF +E SSSG                         DPGSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVESSSSG-------------------------DPGSVLQRTIDYAAEAF

Query:  VASIQAALAVKTELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKVKVETKAELLKKEEDRRKA---------------------------
        +ASI  A+ VK ELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+ +V+ K +LLKKE ++ KA                           
Subjt:  VASIQAALAVKTELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKVKVETKAELLKKEEDRRKA---------------------------

Query:  --LEAKEEELKHATAELETVKERLSNGALLEESFRQHPNFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVR
          LE K+  +   T EL+ +KERL+NG LLEESFRQHP+FDGFAKDFSDAGFKFLMKGIA+DMP LQIDL GLKK+Y+E+WASGP+GTP PQ+LVDKYVR
Subjt:  --LEAKEEELKHATAELETVKERLSNGALLEESFRQHPNFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVR

Query:  DLNSDYSDLEED--------QVGTTQEGAP--QAGS
        +L+SDYSD+EE+        +VGTTQE  P  Q GS
Subjt:  DLNSDYSDLEED--------QVGTTQEGAP--QAGS

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found
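The alignment blocks above use the usual BLAST-style pairwise layout: the middle line repeats a residue where Query and Subjt agree, shows `+` for a conservative substitution, and is blank for a mismatch or gap. As a rough sketch under that assumption, identity for a single alignment segment can be counted from the middle line alone. The function name `midline_stats` is hypothetical, and a per-segment figure need not equal the % identity listed for a hit, which covers the whole alignment.

```python
def midline_stats(midline: str, aligned_length: int) -> dict:
    """Count identities and positives in one BLAST-style alignment segment.

    midline        -- the line between Query and Subjt (letters, '+', spaces)
    aligned_length -- number of alignment columns in the segment
    """
    # Trailing blanks are often stripped from the middle line; restore them.
    padded = midline.ljust(aligned_length)[:aligned_length]
    identities = sum(ch.isalpha() for ch in padded)
    positives = identities + padded.count("+")
    return {
        "identities": identities,
        "positives": positives,
        "length": aligned_length,
        "pct_identity": round(100.0 * identities / aligned_length, 2),
    }


# Last segment of the first GenBank hit (XP_022138041.1), retyped from above.
query = "KKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVESSSSG"
middle = "KKKKTTSPLEV ARGVLPASFADRVDDPEARMGGT DVT RFRVE SSSG"
print(midline_stats(middle, len(query)))
```

For this 50-column segment the middle line has four blank positions, so the sketch reports 46 identities.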

Sequences
CDS sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTAG
GATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGTCTCAGACTTCCCCTTCACCCTTTCGTCCAAG
AGTTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTGTTTACATGCGTAAAAAGAAGGACCATTTGGGAGTGCAAAT
TAATCAAAAGAAGCAAAAAGATGAAAAAAGCTTCATGGGAGGCGCCAAGCGCCTGGGAAGCCTGCAGAAAAACTGGTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTC
TTCCAATGCGTTTTGGTGGTTCCAACCGATGCGTACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGTGTGCGCTTGCATCTATATAATTGTTACCACG
CTTAGTGCGTGCGTTCAGCATAAACCCCCAGTCATTGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGT
CCAGCTGTTAAACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTA
AGGGACCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTT
GGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATATTACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTCGAAACCTTGGT
GACCGACAAGCTGCTGCTTGAGTCCGGACTGCTAGATTACAACCCCGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGCCATGGTTTGCGGATTTGCAA
GCAACGTGAAACGCAAGTCCAAGGGCCGAGCTCATGCTCTTGAGGCCGCCCAAAGTTCGAAACCTGCCACCCCTGCTGTGGTAGGGCCAGCTGAAGATCCAGCCCCAGTG
ATCGAGCTGGAGTCTTCTGAGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCAGTGGACGTCTCGCCCTTGGGCGAGAAGGTGAGGGAGGAAGTCCCTCT
GAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCAGAGCTCGTGGGGTCCTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAAGCCAGGATGG
GCGGGACGTCCGACGTGACGGCACGGTTCAGAGTTGAGTCGTCAAGTTCTGGTGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTCGTTGCT
TCCATTCAAGCGGCTCTGGCCGTGAAGACCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCAT
GAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATCTTGAAGGTCAAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAAGACAGACGCAAGGCGCTTGAAG
CGAAGGAGGAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGTGAAGGAGCGTCTCAGCAATGGAGCCCTATTGGAGGAATCGTTTAGGCAACATCCTAATTTCGAT
GGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGC
TGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGTCCCCAAGCGTTGGTGGATAAGTACGTCAGAGATCTGAACTCTGACTACTCCGACCTCGAAGAGGATCAGGTCG
GCACCACTCAAGAGGGCGCTCCTCAAGCAGGCTCTTAG
mRNA sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTAG
GATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGTCTCAGACTTCCCCTTCACCCTTTCGTCCAAG
AGTTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTGTTTACATGCGTAAAAAGAAGGACCATTTGGGAGTGCAAAT
TAATCAAAAGAAGCAAAAAGATGAAAAAAGCTTCATGGGAGGCGCCAAGCGCCTGGGAAGCCTGCAGAAAAACTGGTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTC
TTCCAATGCGTTTTGGTGGTTCCAACCGATGCGTACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGTGTGCGCTTGCATCTATATAATTGTTACCACG
CTTAGTGCGTGCGTTCAGCATAAACCCCCAGTCATTGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGT
CCAGCTGTTAAACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTA
AGGGACCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTT
GGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATATTACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTCGAAACCTTGGT
GACCGACAAGCTGCTGCTTGAGTCCGGACTGCTAGATTACAACCCCGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGCCATGGTTTGCGGATTTGCAA
GCAACGTGAAACGCAAGTCCAAGGGCCGAGCTCATGCTCTTGAGGCCGCCCAAAGTTCGAAACCTGCCACCCCTGCTGTGGTAGGGCCAGCTGAAGATCCAGCCCCAGTG
ATCGAGCTGGAGTCTTCTGAGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCAGTGGACGTCTCGCCCTTGGGCGAGAAGGTGAGGGAGGAAGTCCCTCT
GAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCAGAGCTCGTGGGGTCCTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAAGCCAGGATGG
GCGGGACGTCCGACGTGACGGCACGGTTCAGAGTTGAGTCGTCAAGTTCTGGTGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTCGTTGCT
TCCATTCAAGCGGCTCTGGCCGTGAAGACCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCAT
GAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATCTTGAAGGTCAAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAAGACAGACGCAAGGCGCTTGAAG
CGAAGGAGGAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGTGAAGGAGCGTCTCAGCAATGGAGCCCTATTGGAGGAATCGTTTAGGCAACATCCTAATTTCGAT
GGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGC
TGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGTCCCCAAGCGTTGGTGGATAAGTACGTCAGAGATCTGAACTCTGACTACTCCGACCTCGAAGAGGATCAGGTCG
GCACCACTCAAGAGGGCGCTCCTCAAGCAGGCTCTTAG
Protein sequence
MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYLLGYLSTTSDPFVGGSLSLRTSSLGFRRRGRELTILQRDGSLSTSKCLSTVSDFPFTLSSK
SFFSELGWLRLKWPPMGGVSFSLVYMRKKKDHLGVQINQKKQKDEKSFMGGAKRLGSLQKNWFSSNFALNETRLPMRFGGSNRCVRVEEVFHYQFEHDLVCACIYIIVTT
LSACVQHKPPVIAPNGWGVIFALAILFWLRARDSEEVQLLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRF
GNLVSIRPVPELTQASFDTLKYYKERFPRGRKVETLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPAEDPAPV
IELESSEGPSREKRPRDQTEAVDVSPLGEKVREEVPLKRRRKKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVESSSSGDPGSVLQRTIDYAAEAFVA
SIQAALAVKTELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKVKVETKAELLKKEEDRRKALEAKEEELKHATAELETVKERLSNGALLEESFRQHPNFD
GFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLNSDYSDLEEDQVGTTQEGAPQAGS
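The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and reading frame 1; `translate` is a hypothetical helper, not a CuGenDB tool. Only the first 18 bases of the CDS are shown; the full sequence, with line breaks removed, can be passed the same way.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the CDS listed above.
print(translate("ATGTCGTCCTCTTTTAGC"))
```

The first six codons translate to MSSSFS, matching the start of the protein sequence.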