CuGenDBv2

Sed0011743 (gene) of Chayote v1 genome

Gene ID: Sed0011743
Organism: Sechium edule (Chayote v1)
Description: Protein of unknown function, DUF538
Genome location: LG09:33514893..33518238
RNA-Seq Expression: Sed0011743
Synteny: Sed0011743
Gene Ontology terms: NA
InterPro domains: IPR007493 - Protein of unknown function DUF538; IPR036758 - At5g01610-like superfamily
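The genome location string above can be split into its parts to get the length of the genomic span. The following is a minimal sketch, not part of the database: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location):
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG09:33514893..33518238")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 3346 bp on LG09; with 0-based half-open coordinates the result would be one less.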


Homology
GenBank top hits (e value, % identity, alignment)
XP_022959707.1 uncharacterized protein At5g01610-like [Cucurbita moschata] (e value: 6.8e-68, identity: 78.7%)
Query:  MLFIWFFFLYFAPSMADPYPLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQ
        MLF+ FFFLYF PSMA    +   S S SIYDVL AHGLPKGLLPKG+RDYE  E SGRF+ FLDRECNAMFENQLHYETNV+GTLSYGQIGAL+GISAQ
Subjt:  MLFIWFFFLYFAPSMADPYPLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQ

Query:  DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVAVPPKNHQHRKIGKQFVEMGLVEMA
        DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPP CV+     +QHRKIG +FVE G V++A
Subjt:  DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVAVPPKNHQHRKIGKQFVEMGLVEMA

XP_022991665.1 uncharacterized protein At5g01610-like [Cucurbita maxima] (e value: 3.1e-68, identity: 77.51%)
Query:  MLFIWFFFLYFAPSMADPYPLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQ
        MLF+WF FLYF  SMA PY L   + S SIYDVLLAHGLPKGLLPKGVRDY+ D+ +GRF+ FLD+ECNAMFENQLHYETNVSGTLSYGQIGAL+GISA+
Subjt:  MLFIWFFFLYFAPSMADPYPLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQ

Query:  DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVAVPPKNHQHRKIGKQFVEMGLVEMA
        DLFLWFPVKGIRVD+PSSGVIYFDVGVVFKQFSQSLFETPP C  VP    QHRKIG  FV+   V++A
Subjt:  DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVAVPPKNHQHRKIGKQFVEMGLVEMA

XP_023004236.1 uncharacterized protein At5g01610-like [Cucurbita maxima] (e value: 8.9e-68, identity: 78.7%)
Query:  MLFIWFFFLYFAPSMADPYPLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQ
        MLF+ FFFLYF PSMA    +   S   SIYDVL AHGLPKGLLPKG+RDYE  E SGRF+ FLDRECNAMFENQLHYETNV+GTLSYGQIGAL+GISAQ
Subjt:  MLFIWFFFLYFAPSMADPYPLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQ

Query:  DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVAVPPKNHQHRKIGKQFVEMGLVEMA
        DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPP CV+     +QHRKIG QFVE G V++A
Subjt:  DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVAVPPKNHQHRKIGKQFVEMGLVEMA

XP_023550146.1 uncharacterized protein At5g01610-like [Cucurbita pepo subsp. pepo] (e value: 2.0e-67, identity: 78.7%)
Query:  MLFIWFFFLYFAPSMADPYPLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQ
        MLF+ FFFLY  PSMA    +   S S SIYDVL AHGLPKGLLPKG+RDYE  E SGRF+ FLDRECNAMFENQLHYETNV+GTLSYGQIGAL+GISAQ
Subjt:  MLFIWFFFLYFAPSMADPYPLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQ

Query:  DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVAVPPKNHQHRKIGKQFVEMGLVEMA
        DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPP CV+     +QHRKIG QFVE G V++A
Subjt:  DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVAVPPKNHQHRKIGKQFVEMGLVEMA

XP_038899817.1 uncharacterized protein LOC120087046 [Benincasa hispida] (e value: 3.6e-69, identity: 78.95%)
Query:  MLFIWFFFLYFAPSMADPYPLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQ
        MLF+WF FLYF  SMADP+ L   S S SIYDVLLAHGLPKGLLPKGVRDYE +  +GRF+ FLDRECNAMFENQLHYETNVSGTLSYG+I AL+GISAQ
Subjt:  MLFIWFFFLYFAPSMADPYPLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQ

Query:  DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVAVPPKNHQHRKI--GKQFVEMGLVEMA
        DLFLWFPVKGIRVD+PSSGVIYFDVGVV+KQFSQSLFETPPVC  VP   +QHRKI  G QFVE+  V+MA
Subjt:  DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVAVPPKNHQHRKI--GKQFVEMGLVEMA

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TKT3 DUF538 domain-containing protein (e value: 2.8e-67, identity: 80.72%)
Query:  MLFIWFFFLYFAPSMADPYPLKP-ESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISA
        MLF+WF FLYF  S+ADP+ L    S S SIYDVLLAHGLPKGLLPKGVRDYE D  +GRF+ FLDRECNAMFENQLHYETNVSGTLSYG+IGAL+GISA
Subjt:  MLFIWFFFLYFAPSMADPYPLKP-ESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISA

Query:  QDLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVAVPPKNHQHRKI--GKQFVEM
        QDLFLWFPVKGIRVD+PSSGVIYFDVGVVFKQFSQSLFETPPVC  VP    QHRKI  G QFV+M
Subjt:  QDLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVAVPPKNHQHRKI--GKQFVEM

A0A6J1GS21 uncharacterized protein At5g01610-like (e value: 1.6e-67, identity: 79.01%)
Query:  MLFIWFFFLYFAPSMADPYPLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQ
        +LF+WF FLYF  SMA PY L   + S SIYDVLLAHGLPKGLLPKGVRDY+ D+ +GRF+ FLD+ECNAMFENQLHYETNVSGTLSYGQIGAL+GISA+
Subjt:  MLFIWFFFLYFAPSMADPYPLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQ

Query:  DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVAVPPKNHQHRKIGKQFVE
        DLFLWFPVKGIRVD+PSSGVIYFDVGVVFKQFSQSLFETPP C  VP    QHRKIG  FV+
Subjt:  DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVAVPPKNHQHRKIGKQFVE

A0A6J1H5A7 uncharacterized protein At5g01610-like (e value: 3.3e-68, identity: 78.7%)
Query:  MLFIWFFFLYFAPSMADPYPLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQ
        MLF+ FFFLYF PSMA    +   S S SIYDVL AHGLPKGLLPKG+RDYE  E SGRF+ FLDRECNAMFENQLHYETNV+GTLSYGQIGAL+GISAQ
Subjt:  MLFIWFFFLYFAPSMADPYPLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQ

Query:  DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVAVPPKNHQHRKIGKQFVEMGLVEMA
        DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPP CV+     +QHRKIG +FVE G V++A
Subjt:  DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVAVPPKNHQHRKIGKQFVEMGLVEMA

A0A6J1JTK6 uncharacterized protein At5g01610-like (e value: 1.5e-68, identity: 77.51%)
Query:  MLFIWFFFLYFAPSMADPYPLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQ
        MLF+WF FLYF  SMA PY L   + S SIYDVLLAHGLPKGLLPKGVRDY+ D+ +GRF+ FLD+ECNAMFENQLHYETNVSGTLSYGQIGAL+GISA+
Subjt:  MLFIWFFFLYFAPSMADPYPLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQ

Query:  DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVAVPPKNHQHRKIGKQFVEMGLVEMA
        DLFLWFPVKGIRVD+PSSGVIYFDVGVVFKQFSQSLFETPP C  VP    QHRKIG  FV+   V++A
Subjt:  DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVAVPPKNHQHRKIGKQFVEMGLVEMA

A0A6J1KRK2 uncharacterized protein At5g01610-like (e value: 4.3e-68, identity: 78.7%)
Query:  MLFIWFFFLYFAPSMADPYPLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQ
        MLF+ FFFLYF PSMA    +   S   SIYDVL AHGLPKGLLPKG+RDYE  E SGRF+ FLDRECNAMFENQLHYETNV+GTLSYGQIGAL+GISAQ
Subjt:  MLFIWFFFLYFAPSMADPYPLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQ

Query:  DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVAVPPKNHQHRKIGKQFVEMGLVEMA
        DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPP CV+     +QHRKIG QFVE G V++A
Subjt:  DLFLWFPVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVAVPPKNHQHRKIGKQFVEMGLVEMA

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G61667.1 Protein of unknown function, DUF538 (e value: 3.9e-29, identity: 45.97%)
Query:  PLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQDLFLWFPVKGIRVDIPSSG
        P    S   SI ++L A GLP GL P  V  Y +D+ +G     L   C A FEN+++++  +   LSYG +  L G++ ++LFLW PVKGI V+ PSSG
Subjt:  PLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQDLFLWFPVKGIRVDIPSSG

Query:  VIYFDVGVVFKQFSQSLFETPPVC
        ++ FD+GV  KQ S+SLFE PPVC
Subjt:  VIYFDVGVVFKQFSQSLFETPPVC

AT3G07460.1 Protein of unknown function, DUF538 (e value: 1.0e-37, identity: 60.17%)
Query:  SIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQDLFLWFPVKGIRVDIPSSGVIYFDVGVV
        SI ++LLA+GLP GL PKGV+ + ++  +GRF  +L++ C A +E +LHY+  VSGT+ Y QI  L+GISAQ+LFLW  VKGIRVD+PSSG+I+FDVGV+
Subjt:  SIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQDLFLWFPVKGIRVDIPSSGVIYFDVGVV

Query:  FKQFSQSLFETPPVCVAV
         KQ+S SLFETP  CVAV
Subjt:  FKQFSQSLFETPPVCVAV

AT3G07460.2 Protein of unknown function, DUF538 (e value: 1.0e-37, identity: 60.17%)
Query:  SIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQDLFLWFPVKGIRVDIPSSGVIYFDVGVV
        SI ++LLA+GLP GL PKGV+ + ++  +GRF  +L++ C A +E +LHY+  VSGT+ Y QI  L+GISAQ+LFLW  VKGIRVD+PSSG+I+FDVGV+
Subjt:  SIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQDLFLWFPVKGIRVDIPSSGVIYFDVGVV

Query:  FKQFSQSLFETPPVCVAV
         KQ+S SLFETP  CVAV
Subjt:  FKQFSQSLFETPPVCVAV

AT3G07470.1 Protein of unknown function, DUF538 (e value: 4.4e-41, identity: 59.84%)
Query:  SGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQDLFLWFPVKGIRVDIPSSGVIYFD
        S + +IY++LLA+GLP G+ PKGVR++  D  +GRF  +L++ C A +E ++HY+ N++GT+   QI  L+GISAQ+LFLWFPVKGIRVD+PSSG+IYFD
Subjt:  SGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQDLFLWFPVKGIRVDIPSSGVIYFD

Query:  VGVVFKQFSQSLFETPPVCVAV
        VGVV KQ+S SLFETP  CV V
Subjt:  VGVVFKQFSQSLFETPPVCVAV

AT5G16380.1 Protein of unknown function, DUF538 (e value: 3.8e-32, identity: 48.92%)
Query:  FFLYFAPSMADPYPLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQDLFLWF
        FFL      +  +P        S YD L    LP G++PKGV ++ ID  +GRF   L   C+A FENQ H++ N+SG LS G+IG L+G++ ++LFLWF
Subjt:  FFLYFAPSMADPYPLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQDLFLWF

Query:  PVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVA
         VKGI VD  SSG+I+FDVGV  KQ S SLFE+P  C A
Subjt:  PVKGIRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVA
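In the alignments above, the unlabeled middle line of each block marks identical residues with the residue letter, conservative substitutions with "+", and other columns with a space. The sketch below counts these for one block; it is an illustration under that reading of the middle line, and the function name is hypothetical. The % identity values listed with each hit are computed over the whole alignment, so a single block will generally not reproduce them.

```python
def midline_stats(query, midline):
    """Return (identities, positives, aligned_length) for one alignment block.

    Letters in the midline are identical residues; '+' marks a
    conservative substitution; spaces are mismatches or gaps.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must cover the same columns")
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Last block of the AT1G61667.1 alignment, with the label and
# leading padding removed.
query = "VIYFDVGVVFKQFSQSLFETPPVC"
midline = "++ FD+GV  KQ S+SLFE PPVC"
identities, positives, length = midline_stats(query, midline)
print(f"{identities}/{length} identical, {positives}/{length} positive")
```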


Sequences
CDS sequence
ATGCTCTTCATCTGGTTCTTCTTCCTCTACTTCGCCCCTTCAATGGCGGATCCTTACCCTCTCAAGCCCGAATCAGGTTCCATGTCGATTTACGATGTGCTTTTGGCCCA
CGGCCTGCCGAAAGGGCTGCTCCCGAAGGGCGTTAGGGATTACGAAATCGACGAGGCGAGTGGGAGATTCATGGCGTTCTTGGATCGGGAGTGTAATGCGATGTTCGAGA
ATCAATTGCATTACGAGACGAATGTTTCCGGGACTTTGAGTTACGGTCAGATCGGCGCCTTGAATGGGATTTCGGCGCAGGATCTGTTCTTGTGGTTTCCTGTTAAGGGC
ATTCGCGTCGATATTCCCAGCTCCGGCGTAATCTATTTCGACGTTGGCGTTGTTTTCAAGCAGTTCTCTCAATCGCTCTTCGAAACCCCTCCCGTATGCGTTGCCGTTCC
GCCTAAGAATCATCAACACAGGAAAATTGGGAAACAATTTGTTGAGATGGGTTTAGTTGAAATGGCTTGA
mRNA sequence
AAAATATTGTGTACAATTTGGAGTTTTATAGAATATATATTCATAATTGAAGAATTCTTGTAATGAAAGGCCCCACTCTTTTTCTCTTTCACACTCATCACCAATAATTG
ATTTCCATCTTCATCTTCTTCTTCTTCTTCATCTTCCATTTCTTCCACCCTGCATTTTCCCTTCCTTCCATTTCCAAACCTCCATTGCCAAATCCCAAAACCCTACCTCC
AAATTTCAACAATATTCATCTCAGATTCCTCAAATCTCCCTTCCATCAACCCACCATGCTCTTCATCTGGTTCTTCTTCCTCTACTTCGCCCCTTCAATGGCGGATCCTT
ACCCTCTCAAGCCCGAATCAGGTTCCATGTCGATTTACGATGTGCTTTTGGCCCACGGCCTGCCGAAAGGGCTGCTCCCGAAGGGCGTTAGGGATTACGAAATCGACGAG
GCGAGTGGGAGATTCATGGCGTTCTTGGATCGGGAGTGTAATGCGATGTTCGAGAATCAATTGCATTACGAGACGAATGTTTCCGGGACTTTGAGTTACGGTCAGATCGG
CGCCTTGAATGGGATTTCGGCGCAGGATCTGTTCTTGTGGTTTCCTGTTAAGGGCATTCGCGTCGATATTCCCAGCTCCGGCGTAATCTATTTCGACGTTGGCGTTGTTT
TCAAGCAGTTCTCTCAATCGCTCTTCGAAACCCCTCCCGTATGCGTTGCCGTTCCGCCTAAGAATCATCAACACAGGAAAATTGGGAAACAATTTGTTGAGATGGGTTTA
GTTGAAATGGCTTGAAATTTTCTGTTGAAACAAATTGGGGGAAAATAGGCTCAAAGGAGGTTTTGTGAATCTTCCATTGTACATGTCATCCAAAAGCATATATTAGCATC
TGCTATATTGGTTCTTTACACAGTTTGTGCTTGAACTTTCAATCATTAAGAGCAGAGTCAGAGTCTGGGAATTTTGTTGAGCAACTTCTGGTTTGTGTGAGGAAAGGTAA
GAAGAACATTTGGTTGGTTCAGAGGAATTTTTGTTTGGGGGCAATTGCAGAAAAGCTAGGTGGAGTTTCAGGTGAAAAATCTACACATTTCTATTCAATAATCTTCAGCA
ATTTTGGTGTACTGTTGTGAATATTATCATTATCATTATTACTATTGTTATTATTATTATTGTTATTACAGTGGCTATAATAATTTTATTGCTTTTTTGTCTAGAATGCA
AGTTCTCTGCAATAAACATGTCAAGAATAGAGTGTTTTCTTTGTTTTCTGCCCATGTTTTGTTTGCCTCTGAGTTCTTTAACCTCATACTATTGTTGCAATTGGGACAGA
ACAGTGGGGATTCTCATTCTTGTCAGCATTAGTAAATATAGCCTGTGATGTAGAAGTATCAAACTTGGTAGAAACAGTAGTTTTTGTGCTCTTTTTCTAACTATAACCTT
TTTTTCTTGTTTGCATGAAAAGTGATATTCAATGGCTTGCTTGACTATGAAAATGGGATTTTAGCTTTGGTGGACTAGATCACTTAGAGCTGTTGAAACTTGATATACAA
TCTTGTTGGAAGATGAGATCTGTTGGATAATGCATTTACTATTGCTTTTTGAAAAAGTTGTCTTTGGTTTTGATATGGAGATTTTACCATTTGATGAGACATTAGAGAAG
TTTAAGTTAGATAGAGGTTTGACAAGAAACTATTCAATGTGGTCACTACATTGTGGCTAACCTTATTAACAGTTGGATGTAGGCTGAAACTAGTATAAAAGTATCAGCCA
CATCATATCAACTTCCTTGGAGAATTTTCCTGTATGCTAGCATTGCAAGACTGTGTTCATTAAAATAGAGGTGTGTTTGGCATTATTTTTTAAATAGTGGTTGAAAGCAC
ATTTTATGATGTTTATAGCCTTCTTAAAAAGCGTTTTTAGTGACTTGATTACTCTTTCTTTTGAAACTTATCGTTCAACTTTGAAGGTAGATTGATTTGAATCCGTTTCT
CAAGTAAACATTCGTTTCTGGGCAGCTCTGGGTTTTATTTGTTTTTACATGTTTCTTAGGTGACTTTACCTTTACAGATAGGAATCTGAAGATGTGAAATTTACCAACAC
TTTTCCAGAATTCTTGGTCCAAACAGTCCTTGAGCCATTGAAGACTTGGGCTTTGAAGGTATGCTCCCTTCAAGGTTTTAGGTTCGAGACTCACCTGTGACATTAATCCT
TCGATGTCTCCCGATATTTGGCGTAGAGACGGATGTGGTTATCCGGTTCAAACAATCTTTTGGTCTTAGATCTATTTGGTGTTTTACTTTCATGCGTATGAAATTTTAAT
TGTTTTAACAATTATGAATATCTGTTACAGTTTTGACAAATGCATAGTAGGCAATTAATTATATTTATTTAATACTTACATGGTAAGTTAGATTTTTGACCTGTTCCAAA
GCTTTATTAAAGTCTTTTTAAAGCATAGAAGTTACTGAATAAAATAGCTGGAAGTTCAAAACTCATATAAAAAGCTTAGAGTTCTAGACCTATAGAAAACTCTAACATGT
GTCATGAATAGAAGTAGTTCAAAAGGATGAGAACTAAAGAAATGTAAATTCATGCCCAAAAAACATGGAAGTGTTTTTTTTTTCTTTTCTTTTTTTGATTGTGAAGTCTA
TATCAAGACTTATGTGCTGTTGATCTCTGTGTGGCTAAAAAGCTTGACTGTTCAACAACTCCACCAAATATGAAGTGATGAAAGCTCTCCCCCTTTTATTCTCTTATGTC
ATGATGTTATTGTTGTAATATTGTTAAACATGATGGGCTTTAGATTATTGGGGGTTGGATAATTGCACATTGCATCATGTTTATGTGTTTTATGGATGAGACAAAGTTTA
GACAAAATGATGATAATAATGCATAAAGTTGGATTGGATGCAAGTATGAATATGTCCAGGGTGTGTATGATGGAAGCCAAATGCTCTGTTTTCATGTGCCTTTTTCTTTT
TCTTTGGAGGGCTTAAAAATGAATTTTATTTGCATTGATTTGGTTGGATTGATAATGGAACTTGGCTTTCATGGTCAGACAAATAATAATATTGCAATTTCTTGTTGGGT
TTGGTTGAAATGAGATGGATTGTTTGAAAGTTTGTATTATGAAATAGGGATTTTTTTTATTGTTCTTCTTTGAGTAATTATTAATTATCATGGGTATGGCTTCAAAAAGT
TGAAGATGATTGTGGTATTTGAAATTATTGTTCATGAACAGGG
Protein sequence
MLFIWFFFLYFAPSMADPYPLKPESGSMSIYDVLLAHGLPKGLLPKGVRDYEIDEASGRFMAFLDRECNAMFENQLHYETNVSGTLSYGQIGALNGISAQDLFLWFPVKG
IRVDIPSSGVIYFDVGVVFKQFSQSLFETPPVCVAVPPKNHQHRKIGKQFVEMGLVEMA