; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g12090 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g12090
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr4:9120299..9123915
RNA-Seq ExpressionMoc04g12090
SyntenyMoc04g12090
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]1.6e-10188.11Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRLIESSRPNSELAMVCGFTSNVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLSEEVREEVPLKRRR
        AVR IESSRPNSELAMVCGF SNVKRKSKG+AHALEAAQ+SKP TPAVVGPASEDPAPVIELESS GPSREKRPRDQTEAVD  PL EEVREEVPLKRRR
Subjt:  AVRLIESSRPNSELAMVCGFTSNVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLSEEVREEVPLKRRR

Query:  KKKKTTSPLEVGARGVLPASSADRVDD
        KKKKTTSPLEVGARGVLPAS ADRVDD
Subjt:  KKKKTTSPLEVGARGVLPASSADRVDD

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]4.9e-11989.2Show/hide
Query:  PFRPRVSFPNWAGSGSSGPQWVGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLA
        PF     F          P   GVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLA
Subjt:  PFRPRVSFPNWAGSGSSGPQWVGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLA

Query:  KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFTSNVKRKSKGRAH
        KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVR IE SRPNS LAMVC F S VKRKSKGRAH
Subjt:  KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFTSNVKRKSKGRAH

Query:  ALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA
        ALEAAQ+SKP TPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA
Subjt:  ALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]7.8e-8588.95Show/hide
Query:  PFRPRVSFPNWAGSGSSGPQWVGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLA
        PF     F          P   GVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLA
Subjt:  PFRPRVSFPNWAGSGSSGPQWVGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLA

Query:  KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSEL
        KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVR IESSRPNSEL
Subjt:  KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSEL

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.8e-13777.46Show/hide
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRI---YLSTTSDPFV------------GVRPQTSPS-------------
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRI   YL +    F             G R    P              
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRI---YLSTTSDPFV------------GVRPQTSPS-------------

Query:  ------PFRPRVSFPNWAGSGSSGPQWVGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
              PF     F          P   GVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  ------PFRPRVSFPNWAGSGSSGPQWVGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFTSNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVR IESSRPNSELAMVCGF S VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFTSNVKRK

Query:  SKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQ+SKPATPAVVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.5e-9961.43Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRLIESSRPNSELAMVCGFTSNVKRKSKGRAHALEAAQNSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLSEEVRE
         VRLIE+SRPNSELAMVCGFT +VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL+ EVR 
Subjt:  AVRLIESSRPNSELAMVCGFTSNVKRKSKGRAHALEAAQNSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLSEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGARGVLPASSADRVD-------------------------------------------------DLGSVLQRTIDYAAEAF
        E PL+RRRKKKKT+S  E GARG LP S AD VD                                                 D GSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKTTSPLEVGARGVLPASSADRVD-------------------------------------------------DLGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEL
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  E+
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEL

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092987.6e-10288.11Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRLIESSRPNSELAMVCGFTSNVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLSEEVREEVPLKRRR
        AVR IESSRPNSELAMVCGF SNVKRKSKG+AHALEAAQ+SKP TPAVVGPASEDPAPVIELESS GPSREKRPRDQTEAVD  PL EEVREEVPLKRRR
Subjt:  AVRLIESSRPNSELAMVCGFTSNVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLSEEVREEVPLKRRR

Query:  KKKKTTSPLEVGARGVLPASSADRVDD
        KKKKTTSPLEVGARGVLPAS ADRVDD
Subjt:  KKKKTTSPLEVGARGVLPASSADRVDD

A0A6J1CR42 uncharacterized protein LOC1110138262.4e-11989.2Show/hide
Query:  PFRPRVSFPNWAGSGSSGPQWVGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLA
        PF     F          P   GVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLA
Subjt:  PFRPRVSFPNWAGSGSSGPQWVGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLA

Query:  KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFTSNVKRKSKGRAH
        KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVR IE SRPNS LAMVC F S VKRKSKGRAH
Subjt:  KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFTSNVKRKSKGRAH

Query:  ALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA
        ALEAAQ+SKP TPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA
Subjt:  ALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA

A0A6J1DWD2 uncharacterized protein LOC1110246803.8e-8588.95Show/hide
Query:  PFRPRVSFPNWAGSGSSGPQWVGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLA
        PF     F          P   GVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLA
Subjt:  PFRPRVSFPNWAGSGSSGPQWVGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLA

Query:  KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSEL
        KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVR IESSRPNSEL
Subjt:  KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSEL

A0A6J1DXS5 uncharacterized protein LOC1110255028.6e-13877.46Show/hide
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRI---YLSTTSDPFV------------GVRPQTSPS-------------
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRI   YL +    F             G R    P              
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRI---YLSTTSDPFV------------GVRPQTSPS-------------

Query:  ------PFRPRVSFPNWAGSGSSGPQWVGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
              PF     F          P   GVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  ------PFRPRVSFPNWAGSGSSGPQWVGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFTSNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVR IESSRPNSELAMVCGF S VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFTSNVKRK

Query:  SKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQ+SKPATPAVVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256657.1e-10061.43Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRLIESSRPNSELAMVCGFTSNVKRKSKGRAHALEAAQNSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLSEEVRE
         VRLIE+SRPNSELAMVCGFT +VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL+ EVR 
Subjt:  AVRLIESSRPNSELAMVCGFTSNVKRKSKGRAHALEAAQNSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLSEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGARGVLPASSADRVD-------------------------------------------------DLGSVLQRTIDYAAEAF
        E PL+RRRKKKKT+S  E GARG LP S AD VD                                                 D GSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKTTSPLEVGARGVLPASSADRVD-------------------------------------------------DLGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEL
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  E+
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGAGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGTACGGCCTCAGACTTCCCCTTCACCCT
TTCGTCCAAGAGTTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGAT
AGTGAAGAGGCCGAGCTGTTAGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGG
TGGTATAGTAAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCC
CCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTC
GGAACCCTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCTCATTGAATCCTCAAGGCCGAACTCCGAATTAGCCATGGTTTG
CGGGTTTACGAGTAACGTGAAACGCAAGTCCAAGGGTCGAGCCCATGCTCTTGAGGCCGCCCAGAATTCGAAACCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGGAAG
ATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCTTGCCCTTAAGCGAGGAGGTGAGA
GAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACGACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTCCGCAGATCGGGTGGACGATCT
AGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAG
CGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGCTTGGTTCCTATGCTTACTTGCTTCTC
GGGACTAAGCAGAGGAACAAGCTCGAGCTCCTCAGTGGGTGCGGCAAACTCCCTCCTCGGCAGGTCGGCCTCGAACTCGAGCGTCCCATCCCTACCGGCGAGAGTTTCGA
GGGCGCAGACCGATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGAGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGTACGGCCTCAGACTTCCCCTTCACCCT
TTCGTCCAAGAGTTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGAT
AGTGAAGAGGCCGAGCTGTTAGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGG
TGGTATAGTAAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCC
CCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTC
GGAACCCTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCTCATTGAATCCTCAAGGCCGAACTCCGAATTAGCCATGGTTTG
CGGGTTTACGAGTAACGTGAAACGCAAGTCCAAGGGTCGAGCCCATGCTCTTGAGGCCGCCCAGAATTCGAAACCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGGAAG
ATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCTTGCCCTTAAGCGAGGAGGTGAGA
GAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACGACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTCCGCAGATCGGGTGGACGATCT
AGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAG
CGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGCTTGGTTCCTATGCTTACTTGCTTCTC
GGGACTAAGCAGAGGAACAAGCTCGAGCTCCTCAGTGGGTGCGGCAAACTCCCTCCTCGGCAGGTCGGCCTCGAACTCGAGCGTCCCATCCCTACCGGCGAGAGTTTCGA
GGGCGCAGACCGATGA
Protein sequenceShow/hide protein sequence
MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIYLSTTSDPFVGVRPQTSPSPFRPRVSFPNWAGSGSSGPQWVGVIFALAILFWLRARD
SEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKV
GTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFTSNVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLSEEVR
EEVPLKRRRKKKKTTSPLEVGARGVLPASSADRVDDLGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSELGSYAYLLL
GTKQRNKLELLSGCGKLPPRQVGLELERPIPTGESFEGADR