; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G025070 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G025070
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPolyketide_cyc domain-containing protein
Genome locationchr02:31530035..31535840
RNA-Seq ExpressionLsi02G025070
SyntenyLsi02G025070
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7025614.1 hypothetical protein SDJN02_12111, partial [Cucurbita argyrosperma subsp. argyrosperma]4.4e-6874.74Show/hide
Query:  SSSLIPTPTMSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPA
        SSS I  P+MS+AALTCNS PN LN SHRSIRR NG+LFMAIPT RSINSRSLSLP+ +F+IP  SSK  RNPI PR+KFVSPVMEWQNCTAKMEVDIPA
Subjt:  SSSLIPTPTMSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPA

Query:  SVAYKCYSDREAIPKWMPFISSVKECSSSSSVLTMSSDDIAADPKSKNSLALTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAKKYETA
        SVAYKCYSDREAIPKWMPFISSVK C    S+L+     +           LTVSYEVPPLLSPVASALQPLLERLL++GL+SFATFAKKY+TA
Subjt:  SVAYKCYSDREAIPKWMPFISSVKECSSSSSVLTMSSDDIAADPKSKNSLALTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAKKYETA

XP_008461396.1 PREDICTED: uncharacterized protein LOC103499992 isoform X1 [Cucumis melo]9.1e-6667.86Show/hide
Query:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSIA LT NSTPN L   HRSIRRRNGILFMAIPT RSINSRS SLP+ VFKIPR SSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKECSSSSSVLTMS------SDDIAAD-------------------------------PKSKNS--LALTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK    + S+   S        DI                                  PK  +S  + LTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKECSSSSSVLTMS------SDDIAAD-------------------------------PKSKNS--LALTVSYEVPPLLSPVASALQ

Query:  PLLERLLQQGLKSFATFAKKYETA
        PLLERLLQ+GLKSFATFAKKY+TA
Subjt:  PLLERLLQQGLKSFATFAKKYETA

XP_022960550.1 uncharacterized protein LOC111461253 [Cucurbita moschata]5.5e-6364.29Show/hide
Query:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS+AALTCNS PN LN SHRSIRR NG+LFMAIPT RSINSRSLSLP+ +F+IPR SSK  RNPI PR+KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKECSSSSSVLTMS------SDDIAAD-------------------------------PKSKNS--LALTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK    + ++   S        DI                                  PK  +S  + LTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKECSSSSSVLTMS------SDDIAAD-------------------------------PKSKNS--LALTVSYEVPPLLSPVASALQ

Query:  PLLERLLQQGLKSFATFAKKYETA
        PLLERLL++GL+SFATFAKKY+TA
Subjt:  PLLERLLQQGLKSFATFAKKYETA

XP_023514604.1 uncharacterized protein LOC111778852 [Cucurbita pepo subsp. pepo]2.5e-6364.29Show/hide
Query:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS+AALTCNS PN LN SHRSIRR NG+LFMAIPT R+INSRSLSLP+ +F+IPRGSSK  RNPI PR+KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKECSSSSSVLTMS------SDDIAAD-------------------------------PKSKNS--LALTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK    + ++   S        DI                                  PK  +S  + LTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKECSSSSSVLTMS------SDDIAAD-------------------------------PKSKNS--LALTVSYEVPPLLSPVASALQ

Query:  PLLERLLQQGLKSFATFAKKYETA
        PLLERLL++GL+SFATFAKKY+TA
Subjt:  PLLERLLQQGLKSFATFAKKYETA

XP_038898800.1 uncharacterized protein LOC120086302 [Benincasa hispida]2.2e-6466.52Show/hide
Query:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        M+IAALTCNSTPN L  +HRSIRRRNGILFMAIPTCRSI+SRSLSLPE VFKIPR SSKR RNPICP LK VSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKECSSSSSVLTMS------SDDIAAD-------------------------------PKSKNS--LALTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK    + ++   S        DI                                  PK  +S  + LTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKECSSSSSVLTMS------SDDIAAD-------------------------------PKSKNS--LALTVSYEVPPLLSPVASALQ

Query:  PLLERLLQQGLKSFATFAKKYETA
        PLLERLLQ+GLKSFA FAKKY+T+
Subjt:  PLLERLLQQGLKSFATFAKKYETA

TrEMBL top hitse value%identityAlignment
A0A1S3CEM7 uncharacterized protein LOC103499992 isoform X14.4e-6667.86Show/hide
Query:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSIA LT NSTPN L   HRSIRRRNGILFMAIPT RSINSRS SLP+ VFKIPR SSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKECSSSSSVLTMS------SDDIAAD-------------------------------PKSKNS--LALTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK    + S+   S        DI                                  PK  +S  + LTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKECSSSSSVLTMS------SDDIAAD-------------------------------PKSKNS--LALTVSYEVPPLLSPVASALQ

Query:  PLLERLLQQGLKSFATFAKKYETA
        PLLERLLQ+GLKSFATFAKKY+TA
Subjt:  PLLERLLQQGLKSFATFAKKYETA

A0A6J1D2J2 uncharacterized protein LOC111016984 isoform X11.2e-6062.95Show/hide
Query:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS AA+TCNS PN  N SH SI+RRNG+L MAIPT RSINS+S SLPE VF+IPRGS KRS NP  PR+KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKECSSSSSVLTMS------SDDIAAD-------------------------------PKSKNS--LALTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK    + S+   S        DI                                  PK  +S  + LTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKECSSSSSVLTMS------SDDIAAD-------------------------------PKSKNS--LALTVSYEVPPLLSPVASALQ

Query:  PLLERLLQQGLKSFATFAKKYETA
        PLLERLL++GL+SFATFAKKY+TA
Subjt:  PLLERLLQQGLKSFATFAKKYETA

A0A6J1D507 uncharacterized protein LOC111016984 isoform X31.0e-6271.88Show/hide
Query:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS AA+TCNS PN  N SH SI+RRNG+L MAIPT RSINS+S SLPE VF+IPRGS KRS NP  PR+KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKECSSSS----SVLTMSSDDIAA-DPKSKNS--LALTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAKKYETA
        REAIPKWMPFISSVK   +      S+  + +  +    PK  +S  + LTVSYEVPPLLSPVASALQPLLERLL++GL+SFATFAKKY+TA
Subjt:  REAIPKWMPFISSVKECSSSS----SVLTMSSDDIAA-DPKSKNS--LALTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAKKYETA

A0A6J1H9D8 uncharacterized protein LOC1114612532.7e-6364.29Show/hide
Query:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS+AALTCNS PN LN SHRSIRR NG+LFMAIPT RSINSRSLSLP+ +F+IPR SSK  RNPI PR+KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKECSSSSSVLTMS------SDDIAAD-------------------------------PKSKNS--LALTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK    + ++   S        DI                                  PK  +S  + LTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKECSSSSSVLTMS------SDDIAAD-------------------------------PKSKNS--LALTVSYEVPPLLSPVASALQ

Query:  PLLERLLQQGLKSFATFAKKYETA
        PLLERLL++GL+SFATFAKKY+TA
Subjt:  PLLERLLQQGLKSFATFAKKYETA

A0A6J1KR05 uncharacterized protein LOC1114978781.3e-6263.84Show/hide
Query:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS+AALTCNS PN LN SHRSIRR NG+LFMAIPT RSIN RSLSLP+ +F+IPR SSK  RNPI PR+KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKECSSSSSVLTMS------SDDIAAD-------------------------------PKSKNS--LALTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK    + ++   S        DI                                  PK  +S  + LTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKECSSSSSVLTMS------SDDIAAD-------------------------------PKSKNS--LALTVSYEVPPLLSPVASALQ

Query:  PLLERLLQQGLKSFATFAKKYETA
        PLLERLL++GL+SFATFAKKY+TA
Subjt:  PLLERLLQQGLKSFATFAKKYETA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02470.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein2.7e-1535.62Show/hide
Query:  PVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKECSSSSSV------LTMSSDDIAADPKSKN----------------------------
        PVM+WQ+ T KM VD PASVAYK Y+DRE  PKWMPF+SSV+    S  +      L     +I     +KN                            
Subjt:  PVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKECSSSSSV------LTMSSDDIAADPKSKN----------------------------

Query:  -----SLALTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAK
              + ++ SYEVP   +PVA A++P +E++++ GL+ FA F K
Subjt:  -----SLALTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAK

AT1G02470.2 Polyketide cyclase/dehydrase and lipid transport superfamily protein6.6e-1435.37Show/hide
Query:  PVMEWQNCT-AKMEVDIPASVAYKCYSDREAIPKWMPFISSVKECSSSSSV------LTMSSDDIAADPKSKN---------------------------
        PVM+WQ+ T  KM VD PASVAYK Y+DRE  PKWMPF+SSV+    S  +      L     +I     +KN                           
Subjt:  PVMEWQNCT-AKMEVDIPASVAYKCYSDREAIPKWMPFISSVKECSSSSSV------LTMSSDDIAADPKSKN---------------------------

Query:  ------SLALTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAK
               + ++ SYEVP   +PVA A++P +E++++ GL+ FA F K
Subjt:  ------SLALTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAK

AT1G02475.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein6.2e-2040.12Show/hide
Query:  SKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKECSSSSSVLTMS------------------------------
        S+RSR  I P+ +  S  MEWQ+C+ KMEVD+P SVAY  Y DRE+ PKWMPFISSV+       +   S                              
Subjt:  SKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKECSSSSSVLTMS------------------------------

Query:  -------SDDIAADPKSKNS--LALTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAK
                  +   PK  +S  + LTVSYEVP LL+PVAS L+P +E LL+ GL+ FA  AK
Subjt:  -------SDDIAADPKSKNS--LALTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAK

AT4G01883.1 Polyketide cyclase / dehydrase and lipid transport protein1.3e-1434.9Show/hide
Query:  VMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKECSSSSSVLTMS---------------SDDIAADPKSK---------------------
        +MEWQ C  KM+V++P SVAY  YS+RE+IPKWM FISSVK       +   +               + ++   P  K                     
Subjt:  VMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKECSSSSSVLTMS---------------SDDIAADPKSK---------------------

Query:  ---NSLALTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAKKYET
             + LT +YEVP LL P A+ALQPL++ L++  L+ FA  AK  +T
Subjt:  ---NSLALTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAKKYET


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTGTGCAAGAAGAACAGCCTCCGAAAATTTCTTTTTCCAGAGCCACAGTAATATCAAAGGGCTGTTCTTGCAGTTTGGTGCAAGACTGCAAGTCAAATCTCTGCT
CGCAACTCCCTTTAATGTCCCTTTGCCGTTTTCTTCTTCACTAATCCCAACACCCACTATGTCAATTGCAGCACTCACCTGCAATTCAACTCCAAACCCTCTGAATCCCA
GTCATCGATCGATCAGAAGGAGAAATGGCATCTTATTCATGGCGATTCCCACTTGCAGAAGCATCAATTCCAGGTCACTGTCTCTACCCGAGTTCGTCTTCAAGATCCCA
CGCGGTTCTTCGAAGCGCAGCAGAAACCCCATTTGCCCTCGACTCAAATTCGTCTCCCCTGTGATGGAATGGCAGAACTGCACGGCTAAGATGGAAGTTGACATACCTGC
TTCGGTTGCCTATAAATGCTACTCAGATCGTGAAGCTATTCCCAAATGGATGCCATTCATTTCATCTGTGAAGGAATGCTCTTCTAGTTCTTCTGTTCTTACAATGTCAT
CTGATGATATTGCAGCCGACCCTAAATCAAAAAATTCATTGGCGCTGACAGTCTCCTATGAAGTTCCTCCTCTTTTGTCTCCAGTGGCATCCGCACTGCAACCTTTGCTT
GAGAGATTACTTCAACAAGGTCTTAAAAGCTTTGCCACGTTTGCCAAGAAATACGAAACGGCTTGA
mRNA sequenceShow/hide mRNA sequence
GAAATTCCTTCCTACACAAACTACAAGAGAGGGAGAGAGAGACCTCTCTCCAAGTTTTAGGGTTAATTTTTTTTTTTCTTCTAAAAATGAAAATTTCAAATCCACTCAAC
ACTAGAAAATATCCTAAGTCAATTTTTGAACAAAAAAAAAAATTATGAAATAATTTTTAACTAATTATGGTTTGTGCAAGAAGAACAGCCTCCGAAAATTTCTTTTTCCA
GAGCCACAGTAATATCAAAGGGCTGTTCTTGCAGTTTGGTGCAAGACTGCAAGTCAAATCTCTGCTCGCAACTCCCTTTAATGTCCCTTTGCCGTTTTCTTCTTCACTAA
TCCCAACACCCACTATGTCAATTGCAGCACTCACCTGCAATTCAACTCCAAACCCTCTGAATCCCAGTCATCGATCGATCAGAAGGAGAAATGGCATCTTATTCATGGCG
ATTCCCACTTGCAGAAGCATCAATTCCAGGTCACTGTCTCTACCCGAGTTCGTCTTCAAGATCCCACGCGGTTCTTCGAAGCGCAGCAGAAACCCCATTTGCCCTCGACT
CAAATTCGTCTCCCCTGTGATGGAATGGCAGAACTGCACGGCTAAGATGGAAGTTGACATACCTGCTTCGGTTGCCTATAAATGCTACTCAGATCGTGAAGCTATTCCCA
AATGGATGCCATTCATTTCATCTGTGAAGGAATGCTCTTCTAGTTCTTCTGTTCTTACAATGTCATCTGATGATATTGCAGCCGACCCTAAATCAAAAAATTCATTGGCG
CTGACAGTCTCCTATGAAGTTCCTCCTCTTTTGTCTCCAGTGGCATCCGCACTGCAACCTTTGCTTGAGAGATTACTTCAACAAGGTCTTAAAAGCTTTGCCACGTTTGC
CAAGAAATACGAAACGGCTTGAAGACTCAATTATTGTTTTATGGGTATCATACATTACCCCATACATTACCCCATACATTACCATTTTTACCAAATACTCTTATTGACAA
ATGTAAAATCTTGAACAGAATACTTTAACACAATCATCTGTTTTACTAAAGAATTTCATCTTCTTTACAAAAATATGCTGAGCCCAGCTGATTTATTGATATATGTTCAT
AATCTGCAATCTGATAAAGAGTTGATATCAGAAAATTATTTACATCAAAATAAATGGGATTGCAGTTGGCTGTACTCTTCAATCTGCTGTTACAAATGAGAGTTTTTACA
GGAAGCAATTACACTCATTGGCATCCACTGTTATCCAACCATTTTTGCAAGCTACAAAGAGCCTGTGCAGAGAGACTGCAGTATCTTAACTTCCGCTGCTGTGGTAGTGG
AGAATTGCCAGGATGGAAACTGGAGAAGCAAACAACAACAACAGTGAGGTTATCAAATGAGTCTAATCGCAAAGCCTGCAGGACAAGGTCTCGAGCACAACGCTCGGGGT
CATCGTGACGTTGAAGCCCCTGTCGAACTACATTGACTGCTTGTTGGCTCGACATCACGTCCCAAATCCCGTCACAGGCTATGATCAAGAACTCATCCTCTTCTGTAAGA
ACCACCTGCCTGCATTCTGGTTCTGCAATGAGAGGTGAAGGAGAACCATCGGGGAGCTTCATGTCCCAATCCCCTAAGGCCCTGGAAACTGACAAAACGCCATTGAGATA
GCCACCATCAACAAAACCACCCAACTCTTCTACCCGTTGCTTCTCCAGAGAATAAACCGGCCGGTGGTCTTGAGACATATCTAATGCCTCTCCATTCCGGCTGAGAACAG
CTCGGCAATCTCCAGCATTCGCCACCATTAAATGCCTGAAGAACAGTAACACACAGTCAGCCATATGACAATGCAGTGCCATTCAACTACGTTAAGACGTGAAAACGAGG
AAGAAGACAGCAAAAGTGAATCGATAAGTTAATGGAAACCAGTAGCAGCAACTACAAACACAGAGATTGCTATCAAAAGGTATGAAGGATCAATACATTGGGGAGGAAAA
ACATACCTTCCAAGCACAAGAGCTGTCAGGGCCGTTGTTCCGGACGAACTACTAACACTGGAATCATCAGCCAGAGCTCGATCAGCTAGAAGAAATGCTTTTTGGAGACA
ATTTTCTATCTCCTCCGGCAAGACTTCATCGATATCTGGAATTTGAGGAAAACTAACATCCTCAAAAAAGAGCCTAAGAACATTCTTTCTGATATAAGCTGCTGCTTCAG
GACCTCCATGTCCATCAAACACCTGATCAACAGATTCAACACATCAAACTCCTACTCAGATCACAACAATCATCCATATACTATACAAAGAAGACAACCCAAAAACAAAA
GATTCAACAACTCACCCCATAGAAAGCACTTGGCTTGGGAAACTTAAAGAGCGATCCTAAATGTGAAGATAAATCATCTATCCTTATATGTTCATCTTCCATGTATCTCC
TAGGTCCAATATCAGCAAAGCTCCCCGACCGGAGGCTCGGAAATTCCCTTCCATTCGCCGTATCGACGTCGGAATCTGGAATTTTCTTCATGGGTTTAATATCCTGAAGA
AAGAAATTTCATAAAACTCAAACGCACACAACAAAAGAACTCAACTCAAAATCATCATTCCAAGACACTAAACCCCAAAATTCCCTAAATCGAACCCGATCACTACAGAA
ACAAGTTCATAACGATCACTGTTTCTTACCCTAAATCAAAGGCTGAATTTCAGAAGAACAAAAACCCATAACAAGAAGAAGAAGAACATACCGATTGAGAACTGGAACAA
GCAGATTCGGAAGCACGAACACGGTTGAGAGTCGGATCAATCGAAATCCCACCATCGATGGGCTCAATTTCATCAACAAGACGAGTCTCTTTACCGAAATACGGGACCTC
CAAAACCGGGAGGCTCTTCGGACAAACAACAACCTCAGCTTCAGCCACCATTTTTACATTTACCAAAGAATCAGATTCTTCAACAGAGAGATCAAAATCGTTGTAATCAG
AAGAAGGGCAAGTTGAAATTTGAGTTGAAACTGGAATTGGAATTGCGACTGGGATTGGAATTTCTATTCTTTCTTTGTTCAGGAAATTTTAGAACGTTGATCAGAAATCT
TTTTGATCAAGTCGGCGATGAAGAAGAATATATAGTGGAGCCGCCGCGGGTGAGAGTAAATTCGTGGAAGGAGACGGAGCACGTAACGCCGTTACCGAGACTTGTCCGTG
AAGACACGTTAACGTTTCCACGTTGACGCGTGGCAGGTTTTTTTTAATTGGTGAAATTACTGAAATGCCTTTCCTTTGAATTGCGTTGACTTTTGCTTTCAGTCTTTTTT
TATTTTATATTTTAAGGGGAGCGGGTTATTGGGTTGGAGTGACGTAGGATCATGGGTCTAGTAGCTGTAGAATTTTCTAGCGACTTAATCGAAGCTTCTCTCTCTCTCTT
TTTTTTCCAACCCTACCGACGTGGAAATTAAGCTATTTTATGATTTTAAAAAAAAAGTTTGTTGGTTATTTGTTCATACTATTAAATTATTGAGAAATGAGAGTAGGAAT
TTGAGATGTGGAAGGGAAGGGAAGGGAATATATTGATATAAATTGGAATTGCG
Protein sequenceShow/hide protein sequence
MVCARRTASENFFFQSHSNIKGLFLQFGARLQVKSLLATPFNVPLPFSSSLIPTPTMSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIP
RGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKECSSSSSVLTMSSDDIAADPKSKNSLALTVSYEVPPLLSPVASALQPLL
ERLLQQGLKSFATFAKKYETA