CuGenDBv2

CcUC09G167500 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC09G167500
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Polyketide_cyc domain-containing protein
Genome location: CicolChr09:4090921..4097311
RNA-Seq Expression: CcUC09G167500
Synteny: CcUC09G167500
Gene Ontology terms: NA
InterPro domains: IPR023393 - START-like domain superfamily
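
The genome location field above follows a `seqid:start..end` pattern. As a minimal sketch for working with it, the following Python splits such a string into its parts and computes the span length. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Number of bases covered, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("CicolChr09:4090921..4097311")
print(seqid, start, end, span_length(start, end))
# prints: CicolChr09 4090921 4097311 6391
```

Under that coordinate assumption the gene spans 6391 bases on CicolChr09.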


Homology
GenBank top hits (e value, % identity, alignment)
XP_008461396.1 PREDICTED: uncharacterized protein LOC103499992 isoform X1 [Cucumis melo] (e value 5.6e-87, 78.57% identity)
Query:  MSIAALACNSTPNSLNLTHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD
        MSIA L  NSTPNSL L H SIRRRNGILFMAIPT RSINSRS SLP+LVFKIPR SS RSRNPI PRLKFVSPVMEWQ+CTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALACNSTPNSLNLTHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PT NQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLRSFAKFAKKYQTA
        PLLERLLQRGL+SFA FAKKYQTA
Subjt:  PLLERLLQRGLRSFAKFAKKYQTA

XP_022148316.1 uncharacterized protein LOC111016984 isoform X3 [Momordica charantia] (e value 4.5e-89, 87.5% identity)
Query:  MSIAALACNSTPNSLNLTHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD
        MS AA+ CNS PNS NL+H SI+RRNG+L MAIPT RSINS+S SLPELVF+IPRGS  RS NP FPR+KFVSPVMEWQ+CTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALACNSTPNSLNLTHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGLRSFAKFAKKYQTA
        REAIPKWMPFISSVKPTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQPLLERLL+RGL SFA FAKKYQTA
Subjt:  REAIPKWMPFISSVKPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGLRSFAKFAKKYQTA

XP_022960550.1 uncharacterized protein LOC111461253 [Cucurbita moschata] (e value 6.1e-86, 75.89% identity)
Query:  MSIAALACNSTPNSLNLTHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD
        MS+AAL CNS PN+LNL+H SIRR NG+LFMAIPT RSINSRSLSLP+L+F+IPR SS   RNPI PR+KFVSPVMEWQ+CTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALACNSTPNSLNLTHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLRSFAKFAKKYQTA
        PLLERLL+RGL SFA FAKKYQTA
Subjt:  PLLERLLQRGLRSFAKFAKKYQTA

XP_023514604.1 uncharacterized protein LOC111778852 [Cucurbita pepo subsp. pepo] (e value 2.8e-86, 75.89% identity)
Query:  MSIAALACNSTPNSLNLTHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD
        MS+AAL CNS PN+LNL+H SIRR NG+LFMAIPT R+INSRSLSLP+L+F+IPRGSS   RNPI PR+KFVSPVMEWQ+CTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALACNSTPNSLNLTHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLRSFAKFAKKYQTA
        PLLERLL+RGL SFA FAKKYQTA
Subjt:  PLLERLLQRGLRSFAKFAKKYQTA

XP_038898800.1 uncharacterized protein LOC120086302 [Benincasa hispida] (e value 3.6e-86, 78.12% identity)
Query:  MSIAALACNSTPNSLNLTHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD
        M+IAAL CNSTPNS  LTH SIRRRNGILFMAIPTCRSI+SRSLSLPELVFKIPR SS R RNPI P LK VSPVMEWQ+CTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALACNSTPNSLNLTHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PT NQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLRSFAKFAKKYQTA
        PLLERLLQRGL+SFA FAKKYQT+
Subjt:  PLLERLLQRGLRSFAKFAKKYQTA

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CEM7 uncharacterized protein LOC103499992 isoform X1 (e value 2.7e-87, 78.57% identity)
Query:  MSIAALACNSTPNSLNLTHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD
        MSIA L  NSTPNSL L H SIRRRNGILFMAIPT RSINSRS SLP+LVFKIPR SS RSRNPI PRLKFVSPVMEWQ+CTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALACNSTPNSLNLTHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PT NQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLRSFAKFAKKYQTA
        PLLERLLQRGL+SFA FAKKYQTA
Subjt:  PLLERLLQRGLRSFAKFAKKYQTA

A0A6J1D2J2 uncharacterized protein LOC111016984 isoform X1 (e value 2.1e-84, 75% identity)
Query:  MSIAALACNSTPNSLNLTHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD
        MS AA+ CNS PNS NL+H SI+RRNG+L MAIPT RSINS+S SLPELVF+IPRGS  RS NP FPR+KFVSPVMEWQ+CTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALACNSTPNSLNLTHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLRSFAKFAKKYQTA
        PLLERLL+RGL SFA FAKKYQTA
Subjt:  PLLERLLQRGLRSFAKFAKKYQTA

A0A6J1D507 uncharacterized protein LOC111016984 isoform X3 (e value 2.2e-89, 87.5% identity)
Query:  MSIAALACNSTPNSLNLTHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD
        MS AA+ CNS PNS NL+H SI+RRNG+L MAIPT RSINS+S SLPELVF+IPRGS  RS NP FPR+KFVSPVMEWQ+CTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALACNSTPNSLNLTHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGLRSFAKFAKKYQTA
        REAIPKWMPFISSVKPTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQPLLERLL+RGL SFA FAKKYQTA
Subjt:  REAIPKWMPFISSVKPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGLRSFAKFAKKYQTA

A0A6J1H9D8 uncharacterized protein LOC111461253 (e value 3.0e-86, 75.89% identity)
Query:  MSIAALACNSTPNSLNLTHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD
        MS+AAL CNS PN+LNL+H SIRR NG+LFMAIPT RSINSRSLSLP+L+F+IPR SS   RNPI PR+KFVSPVMEWQ+CTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALACNSTPNSLNLTHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLRSFAKFAKKYQTA
        PLLERLL+RGL SFA FAKKYQTA
Subjt:  PLLERLLQRGLRSFAKFAKKYQTA

A0A6J1KR05 uncharacterized protein LOC111497878 (e value 1.5e-85, 75.45% identity)
Query:  MSIAALACNSTPNSLNLTHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD
        MS+AAL CNS PN+LNL+H SIRR NG+LFMAIPT RSIN RSLSLP+L+F+IPR SS   RNPI PR+KFVSPVMEWQ+CTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALACNSTPNSLNLTHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLRSFAKFAKKYQTA
        PLLERLL+RGL SFA FAKKYQTA
Subjt:  PLLERLLQRGLRSFAKFAKKYQTA

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G02470.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein (e value 2.4e-35, 41.75% identity)
Query:  RNGILFMAIPTCRSINS----RSLSLPELVFKIPRGSSN-----RSRNP--IFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSDREAIPKWMPFISS
        R+ + F  IP   ++ S     S S P ++  +   S+N      S N   I    K   PVM+WQD T KM VD PASVAYK Y+DRE  PKWMPF+SS
Subjt:  RNGILFMAIPTCRSINS----RSLSLPELVFKIPRGSSN-----RSRNP--IFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSDREAIPKWMPFISS

Query:  V--------------------------------KPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGLRS
        V                                +P  ++KIHWRS+EG  NRG VRF+P+GPSSCLVE++ SYEVP   +PVA A++P +E++++ GL  
Subjt:  V--------------------------------KPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGLRS

Query:  FAKFAK
        FA F K
Subjt:  FAKFAK

AT1G02470.2 Polyketide cyclase/dehydrase and lipid transport superfamily protein (e value 5.8e-34, 41.55% identity)
Query:  RNGILFMAIPTCRSINS----RSLSLPELVFKIPRGSSN-----RSRNP--IFPRLKFVSPVMEWQDCT-AKMEVDIPASVAYKCYSDREAIPKWMPFIS
        R+ + F  IP   ++ S     S S P ++  +   S+N      S N   I    K   PVM+WQD T  KM VD PASVAYK Y+DRE  PKWMPF+S
Subjt:  RNGILFMAIPTCRSINS----RSLSLPELVFKIPRGSSN-----RSRNP--IFPRLKFVSPVMEWQDCT-AKMEVDIPASVAYKCYSDREAIPKWMPFIS

Query:  SV--------------------------------KPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGLR
        SV                                +P  ++KIHWRS+EG  NRG VRF+P+GPSSCLVE++ SYEVP   +PVA A++P +E++++ GL 
Subjt:  SV--------------------------------KPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGLR

Query:  SFAKFAK
         FA F K
Subjt:  SFAKFAK

AT1G02475.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein (e value 3.4e-42, 48.02% identity)
Query:  MSIAALACNSTPNSLNLT--HPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRG----SSNRSRNP--IFPRLKFVSPVMEWQDCTAKMEVDIPAS
        MS+ A+  N T N L+ T   P+I+          P C      S SL  L  K   G    SS+ SR    I P+ +  S  MEWQDC+ KMEVD+P S
Subjt:  MSIAALACNSTPNSLNLT--HPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRG----SSNRSRNP--IFPRLKFVSPVMEWQDCTAKMEVDIPAS

Query:  VAYKCYSDREAIPKWMPFISSV--------------------------------KPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLL
        VAY  Y DRE+ PKWMPFISSV                                +PT NQKIHWRSLEGLPN+G VRF+PKGPSSC+VELTVSYEVP LL
Subjt:  VAYKCYSDREAIPKWMPFISSV--------------------------------KPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLL

Query:  SPVASALQPLLERLLQRGLRSFAKFAK
        +PVAS L+P +E LL+ GL  FA  AK
Subjt:  SPVASALQPLLERLLQRGLRSFAKFAK

AT4G01883.1 Polyketide cyclase / dehydrase and lipid transport protein (e value 4.8e-36, 48.54% identity)
Query:  PRGSSNRS-RNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVK--------------------------------PTLNQ
        P  S+NRS ++ +F R      +MEWQ+C  KM+V++P SVAY  YS+RE+IPKWM FISSVK                                P  NQ
Subjt:  PRGSSNRS-RNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVK--------------------------------PTLNQ

Query:  KIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGLRSFAKFAKKYQT
        KIHW SLEGLPN+G VRF+P GPSSC VELT +YEVP LL P A+ALQPL++ L++  L  FA+ AK  +T
Subjt:  KIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGLRSFAKFAKKYQT


Sequences
CDS sequence
ATGGTTTGTGCAAGAACAGCCTCCGAAAATCTCTCTTTCCAGAGCCACAATATTATCAAAGGGCTGTTCTTGCAGTTTGGTGCAACTCAAATCTCCGCTCCAACT
CTCTTTAGTGTCCCCTTGCTGTTTTCTTCTTCACTAATCCAAAATCCCACTATGTCAATTGCAGCACTCGCCTGCAATTCCACTCCAAACTCTCTCAATCTCACT
CATCCATCGATCAGAAGGAGAAATGGCATCTTGTTCATGGCGATTCCCACTTGCAGAAGCATCAATTCGAGGTCACTCTCTCTACCCGAGCTCGTCTTCAAGATC
CCACGTGGTTCTTCGAACCGCAGCAGAAACCCCATTTTCCCTCGACTTAAGTTCGTCTCCCCTGTGATGGAATGGCAGGATTGCACGGCTAAGATGGAAGTTGAC
ATACCTGCTTCGGTTGCCTATAAATGCTACTCAGATCGTGAAGCTATTCCCAAATGGATGCCATTCATTTCATCTGTGAAGCCGACCCTAAATCAAAAAATTCAT
TGGCGGTCACTTGAAGGTCTTCCAAACAGAGGTGTTGTACGATTTTACCCAAAGGGCCCCTCATCTTGCCTCGTAGAATTGACAGTCTCCTATGAAGTTCCTCCT
CTTTTGTCTCCAGTGGCCTCTGCACTGCAACCTCTGCTTGAGAGACTACTTCAACGAGGTCTTAGAAGCTTTGCCAAGTTTGCCAAGAAATACCAAACGGCTTGA
mRNA sequence
AAATAATAAAAAAAATCCAAAGTCAATTATTGAACAAAAGATTTAACTAATTATGGTTTGTGCAAGAACAGCCTCCGAAAATCTCTCTTTCCAGAGCCACAATAT
TATCAAAGGGCTGTTCTTGCAGTTTGGTGCAACTCAAATCTCCGCTCCAACTCTCTTTAGTGTCCCCTTGCTGTTTTCTTCTTCACTAATCCAAAATCCCACTAT
GTCAATTGCAGCACTCGCCTGCAATTCCACTCCAAACTCTCTCAATCTCACTCATCCATCGATCAGAAGGAGAAATGGCATCTTGTTCATGGCGATTCCCACTTG
CAGAAGCATCAATTCGAGGTCACTCTCTCTACCCGAGCTCGTCTTCAAGATCCCACGTGGTTCTTCGAACCGCAGCAGAAACCCCATTTTCCCTCGACTTAAGTT
CGTCTCCCCTGTGATGGAATGGCAGGATTGCACGGCTAAGATGGAAGTTGACATACCTGCTTCGGTTGCCTATAAATGCTACTCAGATCGTGAAGCTATTCCCAA
ATGGATGCCATTCATTTCATCTGTGAAGCCGACCCTAAATCAAAAAATTCATTGGCGGTCACTTGAAGGTCTTCCAAACAGAGGTGTTGTACGATTTTACCCAAA
GGGCCCCTCATCTTGCCTCGTAGAATTGACAGTCTCCTATGAAGTTCCTCCTCTTTTGTCTCCAGTGGCCTCTGCACTGCAACCTCTGCTTGAGAGACTACTTCA
ACGAGGTCTTAGAAGCTTTGCCAAGTTTGCCAAGAAATACCAAACGGCTTGAAGACTCGATTACTGTTTTATGGTTGCTGAACTCTTCAATCTGTTGTTACAAAT
GAGAGATTTTAGAGCAAGCAATCACGCTCATTGGCATCCACTGTTATCCAGCCATCTTTGCAAGCTACAAAGAGCCTGTGCAGAGAGACTGCAGTATCTTAAGTT
CCGCTGCTGTGGTCGTGGAGAATTGCCAGGATGGAAACTGGAGAAGCAAACAACAACAACAGTGAGGTTGTCGAATGAGTCTAGTCGCAAAGCCTGAAGGACGAG
GTCTCGAGCACACCGCTCAGGGTCATCATGACGTTGAAGCCCCTGTCGAACTACATTGACTGCTTGTTGGCTCGACATCACGTCCCAAATCCCATCACAGGCTAT
GATCAAGAACTCATCCTCTTCTGTAAGCACCACCTGCCTGCATTCTGGTTCTGCAATGAGAGGCGAAGGAGAACCATCAGGGAGCTTCATGTCCCAATCCCCCAA
TGCCCTGGAAACCGATAAAACGCCGTTGAGATAGCCACCATCAACAAAGCCACCCAACTCTTCTACCCGTTGCTTCTCTGGAGAATAAACTGGCCGGTGGTCTTG
AGACATATCTACTGCCTCCCCATTCCGGCTGAGAACAGCTCGGCAATCTCCAGCATTCGCCACCATTAAATGCCTTCCAAGCACAAGAGCTGTCAGGGCCGTCGT
CCCGGATGAACTACTGACACTGGAATCATCAGCCAGAGCTCGATCAGCTAGAAGAAATGCTTTTCGGAGACAATTTTCTATCTCCTCCGGCAAGACTTCACCAAT
ATCTGGAATTTGAGGAAAACTAACATCCTCAAAAAAGAGTCTAAGAACATTCTTTCTGATATAAGCTGCTGCTTCAGGACCTCCATGTCCATCGAATACCTGATC
AACAAATTCAACAGATCAAACTCCCACTCAGATCACAACAACAATCATCCACCATACTATACAAAGAAGACAACTCAAAAACAAAGGACTCAGAATCTCACCCCA
TAGAAAGCACTTGGCTTGGGAAACTTAAAGAGCGATCCTAAATGTGAAGACAAATCATCTATCCTTATATGTTCATCTTCCATGTATCTCCTAGGGCCAATATCA
GCAAAGCTCCCCGACCGGAGGCTCGGAAATTCCCTTCCATTCTCCGTATCGACGTCGGAATCTGGAATTTTCTGCATGGGTTTAATATCCTGAAGAAAGAAATAT
CATAAAACTCAGACGCACACAACAAAAGAACTCAAAATCATCCTTCCAAGAGACTAAACCCCAAAATTCCCTAAATCGAACCCGATCACTACAGTAACAAGTTCA
TAACGATCACTGTTTCTTACCCTAAATCAAAGGCTGAATTTCAGAAGAACAAAAGCCCATAAGAAGAAGAAGAAGAAGAACATACCGATTGAGAACTGGAACAAA
CAGATTCGGAAGCACGAACACGATTGAGAGTCGGATCAATCGAAATCCCACCATCGATGGACTCAATTTCATCCACGAGATGAGTCTCTTTACCGAAATACGGGA
CCTCCAAAACCGGGAGGCTCTTCGAACAAACAACAACCTCAGCTTCAGCCACCATTTTTACATTTACCAAAGAATCAGATTCTTCAACAGAGAGATCAAAATCGT
TATAATCAGAAGAAGGGCAAGTCGGAATTTGAATTGGAATTGCAATTGCGATTGGGTTTGGAAACGTTTTTTCTTTCTTTGTTCAGGAAATTTAGAATGTTGATC
AGAAAAATTTCTGATCAAGTCGGCGATGAAGAAGAATATATAGTGGAGCCGCCGCGGGTGAGAGTGAATTCGTGGAAGGAGACGGAGCACGTAACGCCGTTATCG
GGACTTGTCCGCGAACACACGTTAACGTTTTCACGTCGACGCGTGGCAGGTTTTTTTTCCTTCTTTTTTAATGTTGCTGAAATTACCGAAATAGCCTTCCTTTGA
ACTGCGTTGACTTTTTTTGGGGTCTTTTTCTATTTTTTTTTATTTTTTTATTTTTTCTTTTTATTTGAAGGGGAGCAAGTTATTGGGTTGGAGTGACGTAGGATC
ATGGGTCTAGTAGCTGTAGACTTTTCTAGCGACTTAATAGAAGCTTCTCTCTCTTTTTTTTTTTTTTTTTTTCGACCCTTACAGACGTGGAAATTAAGCTATTTT
ATGATTTGTTTTTGTTTTTTAAAAAAAGTTTGTTGGTTGTTTGTTCATACCATTAAATTATTGAGAAGTGAGGGTAGGAATTTGAGATGTGGAAGGGAAGGGAAT
ATATTGATATAAATTAAGGAATTGTGTGAAAAATATTATTATTATTATTTATATTATTTGGAGAAATATAGGGTTTAAGTTGTTATAGAGTATTTCACAAGTTAG
AGAAATTGAGATGGGTTAGAGAAAAGAGGATAAGCTAAAGATAGAATGAAGTGAAGAAGGAGATTGAAAAAATAAAGTTATGTGAAATTATAAGTTAATTTTGTG
TTTAATAGGACTGTAAAGTTTATACTTTCAATTTTTTATACAATATATTTATGATGTTTTTTTTTTTCTTTTTCTAAATGTTAAATGAGTCAAAGATTTATTAAA
CACAAAATTAAAAATTTAGGATGTCATATTTAATAAATATTTGTTTTTAAAAAATATGGTTATAAAGCAGGAAAAGGAAAAAATGTGATAAAGAATCTGTAAATT
GAAATAAAAATGTATGGATATAATTAGTCTTAGAAGTTAGAAGATATGTTACAAAAAATGAGGTCTTTTTATAAGAGAAAATCAATATAGATTAAGATCACATAA
GCAATTATTTAAGGAGTGTGGTGGAATTACGTGTCAAAGTTGGCCGACGATTGGATGGCTTCAGACGCGGTTCCAATAAAATTGTGGGATTATTGCCACGTGTCA
CGAAGACATTGGTTTCTTATTGTACTGTACCTTTAGTCCAATTTTTGAAACAGTGTATCCATTATTATTGAG
Protein sequence
MVCARTASENLSFQSHNIIKGLFLQFGATQISAPTLFSVPLLFSSSLIQNPTMSIAALACNSTPNSLNLTHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKI
PRGSSNRSRNPIFPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPP
LLSPVASALQPLLERLLQRGLRSFAKFAKKYQTA
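
The protein sequence above is expected to be the translation of the CDS listed earlier in this record. As a rough sketch of how that relationship could be checked, the following Python translates a coding sequence codon by codon and stops at the first stop codon. It assumes the standard genetic code (the record does not say which table was used), and the `translate` helper and `CODON_TABLE` names are illustrative rather than part of CuGenDB.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G codon order.
# ASSUMPTION: the standard table applies to this nuclear gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS in this record.
print(translate("ATGGTTTGTGCAAGAACA"))  # prints: MVCART
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the protein sequence joined into one string would confirm whether the two agree; the first six codons already match the protein's leading `MVCART`.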