; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh08G004840 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh08G004840
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionPolyketide cyclase/dehydrase and lipid transporter
Genome locationCmo_Chr08:2987153..2990633
RNA-Seq ExpressionCmoCh08G004840
SyntenyCmoCh08G004840
Gene Ontology termsNA
InterPro domainsIPR023393 - START-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022148300.1 uncharacterized protein LOC111016984 isoform X1 [Momordica charantia]9.1e-8576.79Show/hide
Query:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS AA+TCNSNPN+ NLSH SI+R NGVL MAIPTRRSINS+S SLP+L+FRIPR S K   NP  PRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

XP_022148316.1 uncharacterized protein LOC111016984 isoform X3 [Momordica charantia]9.4e-9089.58Show/hide
Query:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS AA+TCNSNPN+ NLSH SI+R NGVL MAIPTRRSINS+S SLP+L+FRIPR S K   NP  PRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQTA
        REAIPKWMPFISSVKPTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQTA
Subjt:  REAIPKWMPFISSVKPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQTA

XP_022960550.1 uncharacterized protein LOC111461253 [Cucurbita moschata]9.4e-9885.71Show/hide
Query:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

XP_023004647.1 uncharacterized protein LOC111497878 [Cucurbita maxima]4.7e-9785.27Show/hide
Query:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSIN RSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

XP_023514604.1 uncharacterized protein LOC111778852 [Cucurbita pepo subsp. pepo]6.1e-9784.82Show/hide
Query:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRR+INSRSLSLPKLLFRIPR SSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

TrEMBL top hitse value%identityAlignment
A0A1S3CEM7 uncharacterized protein LOC103499992 isoform X17.5e-8575.89Show/hide
Query:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS+A LT NS PN+L L HRSIRR NG+LFMAIPT RSINSRS SLP L+F+IPR+SSK  RNPI PR+KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PT NQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLL+RGL+SFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

A0A6J1D2J2 uncharacterized protein LOC111016984 isoform X14.4e-8576.79Show/hide
Query:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS AA+TCNSNPN+ NLSH SI+R NGVL MAIPTRRSINS+S SLP+L+FRIPR S K   NP  PRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

A0A6J1D507 uncharacterized protein LOC111016984 isoform X34.6e-9089.58Show/hide
Query:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS AA+TCNSNPN+ NLSH SI+R NGVL MAIPTRRSINS+S SLP+L+FRIPR S K   NP  PRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQTA
        REAIPKWMPFISSVKPTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQTA
Subjt:  REAIPKWMPFISSVKPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQTA

A0A6J1H9D8 uncharacterized protein LOC1114612534.5e-9885.71Show/hide
Query:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

A0A6J1KR05 uncharacterized protein LOC1114978782.3e-9785.27Show/hide
Query:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSIN RSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02470.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein6.4e-3640.89Show/hide
Query:  SLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRR-------NPISPRVKFVSPVMEWQNCTAKMEVDIPASVA
        S+A+ T ++  +T  L  R I +S  VL M       ++  S S P++L  +  SS+   +         IS   K   PVM+WQ+ T KM VD PASVA
Subjt:  SLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRR-------NPISPRVKFVSPVMEWQNCTAKMEVDIPASVA

Query:  YKCYSDREAIPKWMPFISSV--------------------------------KPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSP
        YK Y+DRE  PKWMPF+SSV                                +P  ++KIHWRS+EG  NRG VRF+P+GPSSCLVE++ SYEVP   +P
Subjt:  YKCYSDREAIPKWMPFISSV--------------------------------KPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSP

Query:  VASALQPLLERLLKRGLESFATFAK
        VA A++P +E++++ GLE FA F K
Subjt:  VASALQPLLERLLKRGLESFATFAK

AT1G02470.2 Polyketide cyclase/dehydrase and lipid transport superfamily protein1.6e-3440.71Show/hide
Query:  SLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRR-------NPISPRVKFVSPVMEWQNCT-AKMEVDIPASV
        S+A+ T ++  +T  L  R I +S  VL M       ++  S S P++L  +  SS+   +         IS   K   PVM+WQ+ T  KM VD PASV
Subjt:  SLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRR-------NPISPRVKFVSPVMEWQNCT-AKMEVDIPASV

Query:  AYKCYSDREAIPKWMPFISSV--------------------------------KPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLS
        AYK Y+DRE  PKWMPF+SSV                                +P  ++KIHWRS+EG  NRG VRF+P+GPSSCLVE++ SYEVP   +
Subjt:  AYKCYSDREAIPKWMPFISSV--------------------------------KPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLS

Query:  PVASALQPLLERLLKRGLESFATFAK
        PVA A++P +E++++ GLE FA F K
Subjt:  PVASALQPLLERLLKRGLESFATFAK

AT1G02475.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein1.3e-4146.61Show/hide
Query:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLL--FRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCY
        MS+ A+    N NT+N   ++ +  N    +    +    S     PK L  F    SS   R   I P+ +  S  MEWQ+C+ KMEVD+P SVAY  Y
Subjt:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLL--FRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCY

Query:  SDREAIPKWMPFISSV--------------------------------KPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASA
         DRE+ PKWMPFISSV                                +PT NQKIHWRSLEGLPN+G VRF+PKGPSSC+VELTVSYEVP LL+PVAS 
Subjt:  SDREAIPKWMPFISSV--------------------------------KPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASA

Query:  LQPLLERLLKRGLESFATFAK
        L+P +E LL+ GLE FA  AK
Subjt:  LQPLLERLLKRGLESFATFAK

AT4G01883.1 Polyketide cyclase / dehydrase and lipid transport protein4.9e-3652.35Show/hide
Query:  VMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKG
        +MEWQ C  KM+V++P SVAY  YS+RE+IPKWM FISSVK                                P  NQKIHW SLEGLPN+G VRF+P G
Subjt:  VMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKG

Query:  PSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQT
        PSSC VELT +YEVP LL P A+ALQPL++ L+K  LE FA  AK  +T
Subjt:  PSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCACTTGCAGCGCTCACCTGTAATTCGAATCCAAACACTCTGAATCTCAGTCATCGATCGATAAGAAGGAGTAATGGCGTCTTATTCATGGCGATTCCCACTCGCAG
AAGCATCAATTCGAGGTCACTATCTCTTCCCAAGCTCCTCTTCAGAATCCCACGGAGTTCTTCGAAGTGCCGCAGAAACCCCATTTCCCCTCGAGTTAAATTCGTCTCCC
CTGTGATGGAATGGCAGAATTGCACGGCCAAGATGGAAGTAGACATTCCTGCCTCGGTTGCTTATAAATGCTACTCAGATCGTGAAGCCATTCCCAAATGGATGCCCTTC
ATTTCATCTGTGAAGCCGACCCTCAATCAAAAAATCCATTGGCGGTCTCTCGAAGGTCTTCCAAACAGAGGTGTTGTTCGATTTTATCCAAAAGGCCCATCATCTTGCCT
CGTAGAACTGACTGTCTCTTATGAAGTTCCACCTCTTTTATCTCCAGTGGCATCTGCATTGCAACCTTTGCTTGAGAGATTACTTAAACGAGGTCTCGAAAGCTTTGCCA
CATTTGCTAAGAAATACCAAACGGCTTGA
mRNA sequenceShow/hide mRNA sequence
TTCGTTCGTCTGCCTCTCTGACGGCCTTGATATTGTTGTGGCTCTGAAAAGACGGAGAGATTTTCAGAGGCTGTTCTTGCATTTGGTGCCCTCAACTCTCTGCTCACCAC
TCCCTTTGTGTCCAGTCGCTCTCTTCTTCTTCACGAATTTTAAATCCCAGTATGTCACTTGCAGCGCTCACCTGTAATTCGAATCCAAACACTCTGAATCTCAGTCATCG
ATCGATAAGAAGGAGTAATGGCGTCTTATTCATGGCGATTCCCACTCGCAGAAGCATCAATTCGAGGTCACTATCTCTTCCCAAGCTCCTCTTCAGAATCCCACGGAGTT
CTTCGAAGTGCCGCAGAAACCCCATTTCCCCTCGAGTTAAATTCGTCTCCCCTGTGATGGAATGGCAGAATTGCACGGCCAAGATGGAAGTAGACATTCCTGCCTCGGTT
GCTTATAAATGCTACTCAGATCGTGAAGCCATTCCCAAATGGATGCCCTTCATTTCATCTGTGAAGCCGACCCTCAATCAAAAAATCCATTGGCGGTCTCTCGAAGGTCT
TCCAAACAGAGGTGTTGTTCGATTTTATCCAAAAGGCCCATCATCTTGCCTCGTAGAACTGACTGTCTCTTATGAAGTTCCACCTCTTTTATCTCCAGTGGCATCTGCAT
TGCAACCTTTGCTTGAGAGATTACTTAAACGAGGTCTCGAAAGCTTTGCCACATTTGCTAAGAAATACCAAACGGCTTGAAGACGGGTATTTGTGTCATATGGTATGGAT
ATCACCATTTTTTACTAGATACTCTTATTGACAAATGTAAAATCTTGAACACAATACTTGAATACAATCATCTGTTTCAATAAGAAATTAATGTACTTTACAAAAATATT
CTGAGTCCAGCTGATTTATTGGGTTATACATATGTTCTTAATCTC
Protein sequenceShow/hide protein sequence
MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINSRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPF
ISSVKPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQTA