; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004860 (gene) of Snake gourd v1 genome

Gene IDTan0004860
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPolyketide_cyc domain-containing protein
Genome locationLG06:5241498..5245320
RNA-Seq ExpressionTan0004860
SyntenyTan0004860
Gene Ontology termsNA
InterPro domainsIPR005031 - Coenzyme Q-binding protein COQ10, START domain
IPR023393 - START-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008461396.1 PREDICTED: uncharacterized protein LOC103499992 isoform X1 [Cucumis melo]7.9e-10989.73Show/hide
Query:  MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNI----LSLPELVFRIPRGSSKRSRNPICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSIA LT NS PNSL L HRSIRRRNG+LFMAIPT R+I     SLP+LVF+IPR SSKRSRNPICPR+KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNI----LSLPELVFRIPRGSSKRSRNPICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNP+LSRWSLKYNAFGQDIE+SWLARNLQPT NQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLL+RGL+SFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

XP_022148300.1 uncharacterized protein LOC111016984 isoform X1 [Momordica charantia]7.9e-10990.18Show/hide
Query:  MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNI----LSLPELVFRIPRGSSKRSRNPICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS AA+TCNSNPNS NL+H SI+RRNGVL MAIPTRR+I     SLPELVFRIPRGS KRS NP  PRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNI----LSLPELVFRIPRGSSKRSRNPICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKV+EDNP+LSRWSLKY AFGQDIE+SWLA+NLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

XP_022960550.1 uncharacterized protein LOC111461253 [Cucurbita moschata]1.5e-11292.41Show/hide
Query:  MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNI----LSLPELVFRIPRGSSKRSRNPICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS+AALTCNSNPN+LNL+HRSIRR NGVLFMAIPTRR+I    LSLP+L+FRIPR SSK  RNPI PRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNI----LSLPELVFRIPRGSSKRSRNPICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIE+SWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

XP_023004647.1 uncharacterized protein LOC111497878 [Cucurbita maxima]1.5e-11292.41Show/hide
Query:  MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNI----LSLPELVFRIPRGSSKRSRNPICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS+AALTCNSNPN+LNL+HRSIRR NGVLFMAIPTRR+I    LSLP+L+FRIPR SSK  RNPI PRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNI----LSLPELVFRIPRGSSKRSRNPICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIE+SWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

XP_023514604.1 uncharacterized protein LOC111778852 [Cucurbita pepo subsp. pepo]4.1e-11392.86Show/hide
Query:  MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNI----LSLPELVFRIPRGSSKRSRNPICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS+AALTCNSNPN+LNL+HRSIRR NGVLFMAIPTRR I    LSLP+L+FRIPRGSSK  RNPI PRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNI----LSLPELVFRIPRGSSKRSRNPICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIE+SWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

TrEMBL top hitse value%identityAlignment
A0A0A0K6F1 Polyketide_cyc domain-containing protein5.7e-10587.95Show/hide
Query:  MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNI----LSLPELVFRIPRGSSKRSRNPICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSIA LT NS+PNSL L HRSIRRRNG++FMAIPT R+I     SLP+LVF+IPR SSKRSRNP+    KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNI----LSLPELVFRIPRGSSKRSRNPICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIE+SWLARNLQPT NQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLL+RGL+SFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

A0A1S3CEM7 uncharacterized protein LOC103499992 isoform X13.8e-10989.73Show/hide
Query:  MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNI----LSLPELVFRIPRGSSKRSRNPICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSIA LT NS PNSL L HRSIRRRNG+LFMAIPT R+I     SLP+LVF+IPR SSKRSRNPICPR+KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNI----LSLPELVFRIPRGSSKRSRNPICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNP+LSRWSLKYNAFGQDIE+SWLARNLQPT NQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLL+RGL+SFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

A0A6J1D2J2 uncharacterized protein LOC111016984 isoform X13.8e-10990.18Show/hide
Query:  MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNI----LSLPELVFRIPRGSSKRSRNPICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS AA+TCNSNPNS NL+H SI+RRNGVL MAIPTRR+I     SLPELVFRIPRGS KRS NP  PRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNI----LSLPELVFRIPRGSSKRSRNPICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKV+EDNP+LSRWSLKY AFGQDIE+SWLA+NLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

A0A6J1H9D8 uncharacterized protein LOC1114612537.5e-11392.41Show/hide
Query:  MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNI----LSLPELVFRIPRGSSKRSRNPICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS+AALTCNSNPN+LNL+HRSIRR NGVLFMAIPTRR+I    LSLP+L+FRIPR SSK  RNPI PRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNI----LSLPELVFRIPRGSSKRSRNPICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIE+SWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

A0A6J1KR05 uncharacterized protein LOC1114978787.5e-11392.41Show/hide
Query:  MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNI----LSLPELVFRIPRGSSKRSRNPICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS+AALTCNSNPN+LNL+HRSIRR NGVLFMAIPTRR+I    LSLP+L+FRIPR SSK  RNPI PRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNI----LSLPELVFRIPRGSSKRSRNPICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIE+SWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02470.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein3.1e-5060.27Show/hide
Query:  PVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPK
        PVM+WQ+ T KM VD PASVAYK Y+DRE  PKWMPF+SSV+ +E +P LSR+ +K  +FGQ+IEY +LA+NLQP  ++KIHWRS+EG  NRG VRF+P+
Subjt:  PVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPK

Query:  GSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAK
        G SSCLVE++ SYEVP   +PVA A++P +E++++ GLE FA F K
Subjt:  GSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAK

AT1G02470.2 Polyketide cyclase/dehydrase and lipid transport superfamily protein7.5e-4959.86Show/hide
Query:  PVMEWQNCT-AKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYP
        PVM+WQ+ T  KM VD PASVAYK Y+DRE  PKWMPF+SSV+ +E +P LSR+ +K  +FGQ+IEY +LA+NLQP  ++KIHWRS+EG  NRG VRF+P
Subjt:  PVMEWQNCT-AKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYP

Query:  KGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAK
        +G SSCLVE++ SYEVP   +PVA A++P +E++++ GLE FA F K
Subjt:  KGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAK

AT1G02475.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein3.4e-6559.17Show/hide
Query:  MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNILSL-PELVFRIPRGSSKRSRNP--ICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDR
        MS+ A+  N+  N L+ T ++   ++ + F   P   +++ L P+ +      SS  SR    I P+ +  S  MEWQ+C+ KMEVD+P SVAY  Y DR
Subjt:  MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNILSL-PELVFRIPRGSSKRSRNP--ICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDR

Query:  EAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQP
        E+ PKWMPFISSV+VL+D P LSRWSLKYNAFGQDI+YSWLARNLQPT NQKIHWRSLEGLPN+G VRF+PKG SSC+VELTVSYEVP LL+PVAS L+P
Subjt:  EAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQP

Query:  LLERLLKRGLESFATFAK
         +E LL+ GLE FA  AK
Subjt:  LLERLLKRGLESFATFAK

AT4G01883.1 Polyketide cyclase / dehydrase and lipid transport protein6.4e-5667.11Show/hide
Query:  VMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKG
        +MEWQ C  KM+V++P SVAY  YS+RE+IPKWM FISSVKVL+D P LSRW+LKY AFGQ++EY+WLA+NLQP  NQKIHW SLEGLPN+G VRF+P G
Subjt:  VMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKG

Query:  SSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQT
         SSC VELT +YEVP LL P A+ALQPL++ L+K  LE FA  AK  +T
Subjt:  SSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAATTGCAGCACTCACCTGTAATTCGAATCCAAACTCCCTGAATCTCACTCATCGATCGATCAGAAGGAGAAATGGCGTCTTATTCATGGCGATTCCCACTCGCAG
AAACATTCTGTCTCTTCCCGAGCTCGTCTTCAGAATCCCACGCGGTTCTTCGAAGCGCAGCAGAAACCCCATTTGCCCGCGAGTTAAATTCGTATCCCCTGTGATGGAAT
GGCAGAATTGCACGGCCAAGATGGAAGTTGACATTCCTGCCTCGGTTGCTTATAAATGCTACTCGGATCGTGAAGCCATCCCCAAATGGATGCCCTTCATTTCATCTGTG
AAGGTACTGGAAGATAATCCTACTTTATCAAGGTGGTCGTTAAAATATAATGCTTTTGGACAAGATATTGAATACTCGTGGCTTGCTCGAAACCTGCAGCCGACCCTCAA
TCAGAAAATCCATTGGCGGTCTCTCGAAGGTCTTCCCAACAGAGGTGTGGTACGGTTCTATCCAAAAGGCTCCTCATCTTGCCTTGTAGAACTGACAGTCTCTTATGAAG
TTCCTCCTCTTTTGTCTCCAGTGGCATCTGCACTACAACCTTTGCTTGAGAGATTACTTAAACGGGGTCTTGAAAGCTTCGCCACATTTGCTAAGAAATACCAAACGGCT
TGA
mRNA sequenceShow/hide mRNA sequence
ATGTCAATTGCAGCACTCACCTGTAATTCGAATCCAAACTCCCTGAATCTCACTCATCGATCGATCAGAAGGAGAAATGGCGTCTTATTCATGGCGATTCCCACTCGCAG
AAACATTCTGTCTCTTCCCGAGCTCGTCTTCAGAATCCCACGCGGTTCTTCGAAGCGCAGCAGAAACCCCATTTGCCCGCGAGTTAAATTCGTATCCCCTGTGATGGAAT
GGCAGAATTGCACGGCCAAGATGGAAGTTGACATTCCTGCCTCGGTTGCTTATAAATGCTACTCGGATCGTGAAGCCATCCCCAAATGGATGCCCTTCATTTCATCTGTG
AAGGTACTGGAAGATAATCCTACTTTATCAAGGTGGTCGTTAAAATATAATGCTTTTGGACAAGATATTGAATACTCGTGGCTTGCTCGAAACCTGCAGCCGACCCTCAA
TCAGAAAATCCATTGGCGGTCTCTCGAAGGTCTTCCCAACAGAGGTGTGGTACGGTTCTATCCAAAAGGCTCCTCATCTTGCCTTGTAGAACTGACAGTCTCTTATGAAG
TTCCTCCTCTTTTGTCTCCAGTGGCATCTGCACTACAACCTTTGCTTGAGAGATTACTTAAACGGGGTCTTGAAAGCTTCGCCACATTTGCTAAGAAATACCAAACGGCT
TGA
Protein sequenceShow/hide protein sequence
MSIAALTCNSNPNSLNLTHRSIRRRNGVLFMAIPTRRNILSLPELVFRIPRGSSKRSRNPICPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSV
KVLEDNPTLSRWSLKYNAFGQDIEYSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQTA