; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g0443 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g0443
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionProtein of unknown function (DUF1068)
Genome locationMC02:3719468..3722305
RNA-Seq ExpressionMC02g0443
SyntenyMC02g0443
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR010471 - Protein of unknown function DUF1068


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8652279.1 hypothetical protein Csa_022101 [Cucumis sativus]7.87e-10180.53Show/hide
Query:  EMAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELL
        +MAVK VG CSPG TKVGL  M L +AAYI+ PPLYWHF+E L A SSSS STCPPCFCDCSS TDFA +EELENTTFRDCVKHD GMN ETEK+F ELL
Subjt:  EMAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELL

Query:  SEELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGT
        SEELKLREAEALE+HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARE+AEA+L SQ+RLTALWETRA QRGWR +IV SR   +G+
Subjt:  SEELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGT

XP_004151898.1 uncharacterized protein LOC101219040 [Cucumis sativus]2.94e-10180.95Show/hide
Query:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLS
        MAVK VG CSPG TKVGL  M L +AAYI+ PPLYWHF+E L A SSSS STCPPCFCDCSS TDFA +EELENTTFRDCVKHD GMN ETEK+F ELLS
Subjt:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLS

Query:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGT
        EELKLREAEALE+HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARE+AEA+L SQ+RLTALWETRA QRGWR +IV SR   +G+
Subjt:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGT

XP_022140315.1 uncharacterized protein LOC111011015 [Momordica charantia]8.15e-134100Show/hide
Query:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLS
        MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLS
Subjt:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLS

Query:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGTVQTT
        EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGTVQTT
Subjt:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGTVQTT

XP_022947173.1 uncharacterized protein LOC111451120 isoform X1 [Cucurbita moschata]1.14e-10179.27Show/hide
Query:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLS
        MAVK  G CSPG TKVGLGF+ L +AAYI+ PPLYWHF+E LA +SSSSSSTCPPCFCDCSS TDFA ++E ENTTFRDCVKHD GMN ETE++F ELLS
Subjt:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLS

Query:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGTVQTT
        EELKLREAEA+E HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARE+AEA+L SQ++LTALWE RA QRGWR DIV S A AR  VQT+
Subjt:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGTVQTT

XP_023534557.1 uncharacterized protein LOC111796098 isoform X1 [Cucurbita pepo subsp. pepo]1.14e-10179.79Show/hide
Query:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLS
        MAVK  G CSPG TKVGLGF+ L +AAYI+ PPLYWHF+E LA +SSSSSSTCPPCFCDCSS TDFA ++E ENTTFRDCVKHD GMN ETE+SF ELLS
Subjt:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLS

Query:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGTVQTT
        EELKLREAEA+E HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARE+AEA+L SQ++LTALWE RA QRGWR DIV S A AR  VQT+
Subjt:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGTVQTT

TrEMBL top hitse value%identityAlignment
A0A0A0LQC2 Uncharacterized protein1.42e-10180.95Show/hide
Query:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLS
        MAVK VG CSPG TKVGL  M L +AAYI+ PPLYWHF+E L A SSSS STCPPCFCDCSS TDFA +EELENTTFRDCVKHD GMN ETEK+F ELLS
Subjt:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLS

Query:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGT
        EELKLREAEALE+HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARE+AEA+L SQ+RLTALWETRA QRGWR +IV SR   +G+
Subjt:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGT

A0A1S3C1Z9 uncharacterized protein LOC103495987 isoform X11.17e-10080.31Show/hide
Query:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLS
        MA K VG  SPG TKVGL FM + +AAYI+ PPLYWHF E LAA SSSS STCPPCFCDCSS TDFA +EEL+NTTFRDCVKHD GMN ETEK+F ELLS
Subjt:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLS

Query:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGTVQTT
        EELKLREAEALE+HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARE+AEA+L SQ+RLT LWETRA QRGWR DIV SR    GTVQT+
Subjt:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGTVQTT

A0A6J1CER0 uncharacterized protein LOC1110110153.95e-134100Show/hide
Query:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLS
        MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLS
Subjt:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLS

Query:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGTVQTT
        EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGTVQTT
Subjt:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGTVQTT

A0A6J1G5W1 uncharacterized protein LOC111451120 isoform X15.51e-10279.27Show/hide
Query:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLS
        MAVK  G CSPG TKVGLGF+ L +AAYI+ PPLYWHF+E LA +SSSSSSTCPPCFCDCSS TDFA ++E ENTTFRDCVKHD GMN ETE++F ELLS
Subjt:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLS

Query:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGTVQTT
        EELKLREAEA+E HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARE+AEA+L SQ++LTALWE RA QRGWR DIV S A AR  VQT+
Subjt:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGTVQTT

A0A6J1I131 uncharacterized protein LOC111469880 isoform X13.71e-10078.24Show/hide
Query:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLS
        MAVK  G CSPG TKVGLGF+ L +AAYI+ PPLYWHF+E LA +SSS SSTCPPCFCDCSS TDFA ++E ENTTFRDCVKHD GMN ETE+SF ELLS
Subjt:  MAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLS

Query:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGTVQTT
        E+LKLREA+A+E HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARE+AEA+L SQ++LTALWE RA QRGWR DIV S A AR  VQT+
Subjt:  EELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGTVQTT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05070.1 Protein of unknown function (DUF1068)5.6e-5659.78Show/hide
Query:  ATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLSEELKLREAEALE
        A K+GL  +GL +A YI+ PPLYWH  E LAAVS+SS   CP C C+CS+Y+   I +EL N +F DC KHDP +N +TEK++ ELL+EELKLREAE+LE
Subjt:  ATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLSEELKLREAEALE

Query:  SHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGTVQ
         H+RAD+ LLEAKK+TS YQKEADKCNSGMETCE AREKAE +L  Q++LT+ WE RA Q+GWR    +    ++  VQ
Subjt:  SHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGTVQ

AT2G32580.1 Protein of unknown function (DUF1068)8.1e-4755.29Show/hide
Query:  ATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLSEELKLREAEALE
        A KVGL  + L +  YI+ PPLYWH  E LA     S+++C  C CDCSS     I   L N +F DC K DP +N +TEK++ ELL+EELK REA ++E
Subjt:  ATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLSEELKLREAEALE

Query:  SHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRS
         H+R D  LLEAKK+TS YQKEADKCNSGMETCE AREKAE +LV Q++LT++WE RA Q+G++    +S
Subjt:  SHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRS

AT2G32580.2 Protein of unknown function (DUF1068)2.8e-3161.54Show/hide
Query:  DCVKHDPGMNRETEKSFVELLSEELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGD
        +C K DP +N +TEK++ ELL+EELK REA ++E H+R D  LLEAKK+TS YQKEADKCNSGMETCE AREKAE +LV Q++LT++WE RA Q+G++  
Subjt:  DCVKHDPGMNRETEKSFVELLSEELKLREAEALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGD

Query:  IVRS
          +S
Subjt:  IVRS

AT4G04360.1 Protein of unknown function (DUF1068)6.0e-4255.36Show/hide
Query:  KVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLSEELKLREAEALESH
        KV    MGL + AYI  P LYWH  E +A    S  S+CPPC CDCSS    +I + L N +F DC++H+ G + E+E SF E+++EELKLREA+A E  
Subjt:  KVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLSEELKLREAEALESH

Query:  RRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRS
         RAD  LL+AKK  SQYQKEADKC+ GMETCE AREKAEA+L  Q+RL+ +WE RA Q GW+   V S
Subjt:  RRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRS

AT4G30996.1 Protein of unknown function (DUF1068)6.7e-3347.4Show/hide
Query:  AAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDC-SSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLSEELKLREAEALESHRRADISLLEA
        A  +  P LYW F +     S+ ++S CPPC CDC    +   I+  L N +  DC   DP + +E EK FV+LL+EELKL+EA A E  R  +++L EA
Subjt:  AAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDC-SSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLSEELKLREAEALESHRRADISLLEA

Query:  KKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGD
        K++ SQYQKEA+KCN+  E CE+ARE+AEA L+ ++++T+LWE RA Q GW G+
Subjt:  KKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GAAATGGCAGTGAAGGGTGTGGGCTGGTGCTCTCCAGGAGCGACGAAGGTGGGATTGGGTTTTATGGGTCTTTTTGTAGCAGCTTATATTGTTGCTCCCCCTCTCTACTG
GCACTTCATCGAGTGCTTGGCCGCCGTCTCTTCTTCCTCTTCCTCCACTTGCCCTCCTTGTTTCTGTGACTGTTCTTCTTACACTGACTTCGCCATTTCTGAAGAGCTCG
AAAACACGACTTTTAGAGATTGTGTGAAACATGACCCTGGTATGAATCGGGAAACAGAAAAGAGTTTTGTGGAGTTGTTGTCGGAGGAACTGAAACTGAGGGAAGCTGAA
GCTTTGGAAAGTCACCGTCGCGCCGACATATCTCTGCTGGAAGCAAAGAAGATGACATCTCAATATCAGAAAGAAGCAGACAAGTGCAATTCAGGTATGGAAACATGTGA
AGCAGCAAGGGAAAAAGCTGAAGCTTCATTAGTTTCACAGCAGAGGCTAACAGCATTATGGGAGACAAGGGCTCATCAAAGAGGATGGAGAGGCGACATTGTCAGATCCC
GTGCTCTGGCTCGTGGTACAGTTCAAACCACATAA
mRNA sequenceShow/hide mRNA sequence
GAAATGGCAGTGAAGGGTGTGGGCTGGTGCTCTCCAGGAGCGACGAAGGTGGGATTGGGTTTTATGGGTCTTTTTGTAGCAGCTTATATTGTTGCTCCCCCTCTCTACTG
GCACTTCATCGAGTGCTTGGCCGCCGTCTCTTCTTCCTCTTCCTCCACTTGCCCTCCTTGTTTCTGTGACTGTTCTTCTTACACTGACTTCGCCATTTCTGAAGAGCTCG
AAAACACGACTTTTAGAGATTGTGTGAAACATGACCCTGGTATGAATCGGGAAACAGAAAAGAGTTTTGTGGAGTTGTTGTCGGAGGAACTGAAACTGAGGGAAGCTGAA
GCTTTGGAAAGTCACCGTCGCGCCGACATATCTCTGCTGGAAGCAAAGAAGATGACATCTCAATATCAGAAAGAAGCAGACAAGTGCAATTCAGGTATGGAAACATGTGA
AGCAGCAAGGGAAAAAGCTGAAGCTTCATTAGTTTCACAGCAGAGGCTAACAGCATTATGGGAGACAAGGGCTCATCAAAGAGGATGGAGAGGCGACATTGTCAGATCCC
GTGCTCTGGCTCGTGGTACAGTTCAAACCACATAAACGAAACCCGATGCTCATTCAAGTGGCTGCTTTTACAGTAGCTTGGAATTCATGGAGCCAGAAGCCAGTACTGAG
CAAAATTTGATTAGTCTACGATGGAGCCCAAGATTTTCTTACTTATCCCACTTCCTTTTCTTCTTGAAAGTTCAGCTTGTGTGAACATCAAACATAGAAATATATTATGG
AAAAGATACTAACCAGATGACTCCCTCCCAGAGATCATGACCTGAAACCTTTGACAAGGTTGAGCAGCAATTTTTCCTGTAATGCTGATTGGACTGTTTCAATATTAAGA
GATCAAAGACAATACAAAGAAATTGCATGGCGTATGAATTCAGCCATTTGTTGAACCCACATTTTTGTTCACTAATACATAAGGCTTGAACTAAAAAGGAATAAAAGTCA
CAAACAACAGTCAATCTAAGCATACTTCAACTAGATAAGATATATATCTATGTCGAAGATTTGAATCTCAACTCCCTATTGAAAAAAGAAAAAAATACAATATTTAACAA
AAGGTAAGATTGGGAGTGGAATGGAAGAAATCAAATTTCTCAGCAATATATTCTTGTTAAATGGCACCAACAATAGCAACAGCTAGAGAAAATGTGAAGTCGGTACAAAA
TACAGTGAGAAATTACAACTTTAGGAAAACTAAGGGGTAAGGAATTTCACAGGTATTATACATCAATATTAACGAGCAAAGTGAATGGTGAGATCATCCAAGTTGTCAGA
GCCATTTATACTTAACCCATGCTCGTTAGCTAAATGTAATAGGCATATAAAACACAAATGGGGCGAGATATCATTGATGGTTTGAGCAGCTCTGCATTCATTTG
Protein sequenceShow/hide protein sequence
EMAVKGVGWCSPGATKVGLGFMGLFVAAYIVAPPLYWHFIECLAAVSSSSSSTCPPCFCDCSSYTDFAISEELENTTFRDCVKHDPGMNRETEKSFVELLSEELKLREAE
ALESHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAAREKAEASLVSQQRLTALWETRAHQRGWRGDIVRSRALARGTVQTT