; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy6G032502 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy6G032502
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionTPD1 protein homolog 1-like
Genome locationGy14Chr6:29674114..29677466
RNA-Seq ExpressionCsGy6G032502
SyntenyCsGy6G032502
Gene Ontology termsGO:0001709 - cell fate determination (biological process)
GO:0009408 - response to heat (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR040361 - Tapetum determinant 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK12989.1 TPD1 protein-like protein 1-like isoform X2 [Cucumis melo var. makuwa]2.75e-10695.71Show/hide
Query:  MKGGGAFHRFCLWFLLTIFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSI
        MKGGGAFHRFCL FLL IFL  EFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGS C+K+DIIIFQGPATPLPGGIPTYIVQILNSCASDCSI
Subjt:  MKGGGAFHRFCLWFLLTIFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSI

Query:  SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCS
        SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCS
Subjt:  SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCS

XP_004134986.1 TPD1 protein homolog 1 isoform X2 [Cucumis sativus]4.21e-115100Show/hide
Query:  MKGGGAFHRFCLWFLLTIFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSI
        MKGGGAFHRFCLWFLLTIFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSI
Subjt:  MKGGGAFHRFCLWFLLTIFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSI

Query:  SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCSS
        SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCSS
Subjt:  SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCSS

XP_008440085.1 PREDICTED: TPD1 protein homolog 1-like isoform X1 [Cucumis melo]1.30e-10996.93Show/hide
Query:  MKGGGAFHRFCLWFLLTIFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSI
        MKGGGAFHRFCL FLL IFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGS C+K DIIIFQGPATPLPGGIPTYIVQILNSCASDCSI
Subjt:  MKGGGAFHRFCLWFLLTIFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSI

Query:  SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCS
        SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCS
Subjt:  SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCS

XP_011658115.1 TPD1 protein homolog 1 isoform X3 [Cucumis sativus]2.55e-11198.78Show/hide
Query:  MKGGGAFHRFCLWFLLTIFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSI
        MKGGGAFHRFCLWFLLTIFL  EFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSI
Subjt:  MKGGGAFHRFCLWFLLTIFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSI

Query:  SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCSS
        SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCSS
Subjt:  SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCSS

XP_031742589.1 TPD1 protein homolog 1 isoform X1 [Cucumis sativus]2.19e-10679.61Show/hide
Query:  MKGGGAFHRFCLWFLLTIFLGN------------------------------------------EFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMN
        MKGGGAFHRFCLWFLLTIFLGN                                          EFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMN
Subjt:  MKGGGAFHRFCLWFLLTIFLGN------------------------------------------EFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMN

Query:  RIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSISNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVS
        RIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSISNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVS
Subjt:  RIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSISNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVS

Query:  SATCSS
        SATCSS
Subjt:  SATCSS

TrEMBL top hitse value%identityAlignment
A0A0A0KL27 Uncharacterized protein2.04e-115100Show/hide
Query:  MKGGGAFHRFCLWFLLTIFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSI
        MKGGGAFHRFCLWFLLTIFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSI
Subjt:  MKGGGAFHRFCLWFLLTIFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSI

Query:  SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCSS
        SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCSS
Subjt:  SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCSS

A0A1S3AZV4 TPD1 protein homolog 1-like isoform X16.31e-11096.93Show/hide
Query:  MKGGGAFHRFCLWFLLTIFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSI
        MKGGGAFHRFCL FLL IFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGS C+K DIIIFQGPATPLPGGIPTYIVQILNSCASDCSI
Subjt:  MKGGGAFHRFCLWFLLTIFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSI

Query:  SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCS
        SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCS
Subjt:  SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCS

A0A1S3B0B6 TPD1 protein homolog 1-like isoform X23.82e-10695.71Show/hide
Query:  MKGGGAFHRFCLWFLLTIFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSI
        MKGGGAFHRFCL FLL IFL  EFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGS C+K DIIIFQGPATPLPGGIPTYIVQILNSCASDCSI
Subjt:  MKGGGAFHRFCLWFLLTIFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSI

Query:  SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCS
        SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCS
Subjt:  SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCS

A0A5D3CRM1 TPD1 protein-like protein 1-like isoform X21.33e-10695.71Show/hide
Query:  MKGGGAFHRFCLWFLLTIFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSI
        MKGGGAFHRFCL FLL IFL  EFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGS C+K+DIIIFQGPATPLPGGIPTYIVQILNSCASDCSI
Subjt:  MKGGGAFHRFCLWFLLTIFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSI

Query:  SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCS
        SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCS
Subjt:  SNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCS

A0A6J1GDB1 TPD1 protein homolog 1-like1.80e-7672.5Show/hide
Query:  GAFHRFCLWFLLTIFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSISNIH
        GAF  F L F  ++FL   FKLV L V+QAAARE  +GIS MSLQSS +G+ MNR+GS C+KD++++FQG   PLP GIP++IVQILNSC+S CSISNIH
Subjt:  GAFHRFCLWFLLTIFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSISNIH

Query:  VKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCSS
        +KCGWFSSARLVNPRIF+RV YDDCL+NDG+ALGPG+TLSFQYANTFPYPLSVSS TCSS
Subjt:  VKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCSS

SwissProt top hitse value%identityAlignment
A8MS78 Uncharacterized protein At1g058352.0e-0640Show/hide
Query:  VQILNSCASDCSISNIHVKCGWFSSARLVNPRIFKRVSYD--DCLVNDGRALGPGRTLSFQYANTFPYPL
        V+++N C   C I N+ +KC  F  + LV+P   + +S    +C+VNDG  L P +TLSF Y+NT  + L
Subjt:  VQILNSCASDCSISNIHVKCGWFSSARLVNPRIFKRVSYD--DCLVNDGRALGPGRTLSFQYANTFPYPL

Q1G3T1 TPD1 protein homolog 18.2e-4563.2Show/hide
Query:  ENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSISNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRAL
        ENR  +S   L S + G   NRIG  C+KDDI++FQG   PLP G+P+Y V+I NSC SDC+I+ IHV CGWFSS RLVNPR+F+R+ YDDCLVNDG+ L
Subjt:  ENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSISNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRAL

Query:  GPGRTLSFQYANTFPYPLSVSSATC
        GPG++LSFQYAN+F YPLSV+S +C
Subjt:  GPGRTLSFQYANTFPYPLSVSSATC

Q2QR54 TPD1 protein homolog 1A7.2e-3355.17Show/hide
Query:  GKEMNRIGSRCA-KDDIIIFQGPATPLPGGIPTYIVQILNSCA------SDCSISNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQ
        G    R+   CA  DDI I+QG ATPLP G+P Y V ++N CA       +C+I+ IHV+CGWFSS  LV+PR+F+R+ +DDCL+NDGR L  G T+SF+
Subjt:  GKEMNRIGSRCA-KDDIIIFQGPATPLPGGIPTYIVQILNSCA------SDCSISNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQ

Query:  YANTFPYPLSVSSATC
        Y N+FPY LSVS ATC
Subjt:  YANTFPYPLSVSSATC

Q6TLJ2 Protein TAPETUM DETERMINANT 13.5e-3556.1Show/hide
Query:  MSLQSSNSGK-----EMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSISNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPG
        M L S  +GK     E  RIG +C   DI++ Q    P+P GIP Y+V+I N C S C IS IH+ CGWFSSA+L+NPR+FKR+ YDDCLVN+G+ L  G
Subjt:  MSLQSSNSGK-----EMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSISNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPG

Query:  RTLSFQYANTFPYPLSVSSATCS
         TLSF YANTFPY LSV+  TC+
Subjt:  RTLSFQYANTFPYPLSVSSATCS

Q8S6P9 TPD1 protein homolog 1B2.8e-2443.48Show/hide
Query:  SSNSGKE-MNRI-GSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSISNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQY
        S N G+  M R+    C++ +++++Q  A  LP GIPTY V+I+N C + C++ ++H+ CG F+SA LV+P  F+R+ ++DCLV  G  LGP   +SFQY
Subjt:  SSNSGKE-MNRI-GSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSISNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQY

Query:  ANTFPYPLSVSSATC
        +N+F YPL+V++  C
Subjt:  ANTFPYPLSVSSATC

Arabidopsis top hitse value%identityAlignment
AT1G05835.1 PHD finger protein1.4e-0740Show/hide
Query:  VQILNSCASDCSISNIHVKCGWFSSARLVNPRIFKRVSYD--DCLVNDGRALGPGRTLSFQYANTFPYPL
        V+++N C   C I N+ +KC  F  + LV+P   + +S    +C+VNDG  L P +TLSF Y+NT  + L
Subjt:  VQILNSCASDCSISNIHVKCGWFSSARLVNPRIFKRVSYD--DCLVNDGRALGPGRTLSFQYANTFPYPL

AT1G32583.1 FUNCTIONS IN: molecular_function unknown5.8e-4663.2Show/hide
Query:  ENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSISNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRAL
        ENR  +S   L S + G   NRIG  C+KDDI++FQG   PLP G+P+Y V+I NSC SDC+I+ IHV CGWFSS RLVNPR+F+R+ YDDCLVNDG+ L
Subjt:  ENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSISNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRAL

Query:  GPGRTLSFQYANTFPYPLSVSSATC
        GPG++LSFQYAN+F YPLSV+S +C
Subjt:  GPGRTLSFQYANTFPYPLSVSSATC

AT4G24972.1 tapetum determinant 12.5e-3656.1Show/hide
Query:  MSLQSSNSGK-----EMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSISNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPG
        M L S  +GK     E  RIG +C   DI++ Q    P+P GIP Y+V+I N C S C IS IH+ CGWFSSA+L+NPR+FKR+ YDDCLVN+G+ L  G
Subjt:  MSLQSSNSGK-----EMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSISNIHVKCGWFSSARLVNPRIFKRVSYDDCLVNDGRALGPG

Query:  RTLSFQYANTFPYPLSVSSATCS
         TLSF YANTFPY LSV+  TC+
Subjt:  RTLSFQYANTFPYPLSVSSATCS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGGTGGTGGTGCTTTTCACAGATTCTGCCTCTGGTTTCTCCTTACAATCTTTCTTGGTAATGAATTCAAGTTGGTTCATTTGAGAGTTCTTCAGGCAGCTGCAAG
AGAAAACAGAAATGGGATTTCTTCGATGTCGCTTCAATCTTCTAACAGTGGTAAAGAGATGAATCGTATTGGTTCGAGATGTGCAAAAGACGATATCATTATATTTCAAG
GCCCAGCAACTCCACTTCCGGGTGGGATCCCAACATACATAGTTCAGATCTTGAACTCATGTGCTTCAGACTGCAGCATCTCCAACATTCATGTGAAATGTGGATGGTTC
AGCTCAGCCAGATTGGTGAATCCAAGAATTTTTAAGCGAGTCAGTTATGATGACTGCCTTGTCAATGATGGTCGGGCTCTTGGTCCTGGTCGAACTCTTTCTTTTCAATA
TGCTAATACTTTCCCTTACCCACTTTCTGTATCTTCTGCAACTTGTTCTTCTTAA
mRNA sequenceShow/hide mRNA sequence
CTGCCTCTATTCTATTGGACTTAATTTCAAATGGGAGAAAGAACAAAAACTTGTCACCACACCTATTTCTCTTTTTCTCTTTCTATTTACTTTAATATTTAAAAGTAACT
AAGTTTTCTAATTATTCGTAATAGCAATAGCTGCCAAGTTCATGAATTGGTTCTGTATCAAGTGTTTCTTTAATTTATGTTTTTATTTAATTGCTGTCGTTAACTATCAT
TAACCTTACAAATAGGGTTTAAAAAAGATGGAGTTTGGGATGAACATCAGAGCTTTTCTTGTAAGAGAGCTTTTGGCGAGTCCACGTGGATGCTGATGTGGATGCTGATG
TGGAGGGTAAAACTCGTGTGCGTACCCACAACCACTCTACAACGTATTTCTCCCTTCGAAACCACATTATTTTTTGTCTTAATCCCTTCAACATCTTCTCTTGTACTGAT
TTCAAATTTCTTCTTTTTTTTTTCTCCAACGATTTAGTAGGTAACAATTAAATTCATTTATAATCATAAAGTAGACATAATTTAAAAAAACTGACTTTTGGTGTGTGGGT
AGGTATAAGAGTTTCCTTTCCTCCGACCAAATAAAGAGGAATTGGTAAGTCCAATTACTCTTCTCCTACCAACGACTATGAAACGAAGATTCAATCTGAACCCACAGACA
AGTAAATAAATAACGCCGCACATCAAAAGCCGACATCCCCTGTTTTTGGCGGCGTCTCTGTATTTTGCACCAAAAAGGATAAACCTTTCCCCTTCCCCACCCTCCCTCTT
TCTCATCTTGATCCACCCAGTTTGAGCAATCCATATCCTCTCTTACACCTTACACCCTCCCCTTTAATGGCGTCCATCCACACTTCACTCTCCTTTTACATCTCTACCCC
TCTCCATCTTTTCTTACTCCTCTTCTTCCATCGAATTTTGCACCCACTTCATACAATATCCCCTAACTTAGTCAACCCGCTACCAGACTAGAAGAAGACACTAACTATTC
TTTGACTCATATAAACTATTTTGTTCACACGTGTGCTCCCTCCATTCCTTTTAGATTCCTTGTAGCCTCTTCGTAAATGTCTTTTATATACTCCCCTTTTTAAGCTAATT
CTGAAACAAAGTGAACTCAGACATGAAAGGTGGTGGTGCTTTTCACAGATTCTGCCTCTGGTTTCTCCTTACAATCTTTCTTGGTAATGAATTCAAGTTGGTTCATTTGA
GAGTTCTTCAGGCAGCTGCAAGAGAAAACAGAAATGGGATTTCTTCGATGTCGCTTCAATCTTCTAACAGTGGTAAAGAGATGAATCGTATTGGTTCGAGATGTGCAAAA
GACGATATCATTATATTTCAAGGCCCAGCAACTCCACTTCCGGGTGGGATCCCAACATACATAGTTCAGATCTTGAACTCATGTGCTTCAGACTGCAGCATCTCCAACAT
TCATGTGAAATGTGGATGGTTCAGCTCAGCCAGATTGGTGAATCCAAGAATTTTTAAGCGAGTCAGTTATGATGACTGCCTTGTCAATGATGGTCGGGCTCTTGGTCCTG
GTCGAACTCTTTCTTTTCAATATGCTAATACTTTCCCTTACCCACTTTCTGTATCTTCTGCAACTTGTTCTTCTTAAATTAACAAGTCTTTAAACAAACAGTCGATGGAG
AAACCTTAAGAATTGTCTATGATCAATCAGTATTCTAGGAGATCCATCTTTTATATATATATACATATATTATGGTTGAGATATGAATCCATATTCTTCCCACAAATATC
GTTTGATCATCATCAGAAGTTCAATGTCCAAAAGATCAAACCTTCGATGTAGAATCTCTTATTCTCTGTCAATTCTCAGATATCTTAGTAAGGAACGCTTATTTTAGTCC
ATTATGACAATTGACAAGTTTATCACAAGTAACTAATAATAGTTATCTGTTGTAATAATTTATTGTTAGAGAATCAGTTTGAAACATTTCTGTCCATTTCGTATCCACGA
AGGATCAGCTTGCTCATTCATTCACAAAAACAGTCTTCTAAGTTATCTCCAACGGTACAAAACCGTAGTCACATTACAGCAACCTCCAATATTTACTCCAACTCAAGTCT
CGACCCTAAGTCTGGAGGGGTTGTTATTTGGGCAGTAACAGAGTTGTTTTGTTACTTATTTTGGCAAAGCTTTATTAAACCAAAAAAAAATCCGCAAAGCCTTCGTTGTG
TTCAATAAATAAAGAAATCACAGATATTTGCTCATTCTGTTCCTCAGTTTCTTTAGTAACAAGCAAAACATAACCTTCCAACTTTCTACATGTAGAATCTTGGAGTGCAA
GAGAAATAATTCTAATCTTCCAACGTTCTAACATATGGTTCAGTGTATGGCGGCTACATTCAATCAACCATGAATTATTAGATTTAAGATCAAAGAAAGAAACGTAGCAT
TGCAGACACAATGGGATAAAATCTTAAACTTGTAATTTGTATGGAGGCAATGGCAAGAAAACAAGATTGAACTTCATTTGATGGGTCAATAAGTATAATCGTAAGAAAAC
TCAGTGACACCAACCATAGCAGAAAGAACTTGAGAGATAATGAAAATGGCACCCCCAAAAATAACCAAAACAAGAACAGCAGTAATGGGGTTCTTCAGCAGTTCCAAAAT
AGGGAGAAGAATAGTATAAATAGCACCAGTACTCACAGTAAGGGCGTAAAGTGCATATCTTCTAACATTGTCCCAAAATTCCTCCACCAAAGGGCCTTCAGGACCCAAAG
CCAAAGCCCTTTCGCCGTCGCAACCGATCAGAAACAGGGCAACGCCCACCGAGACGGCTCCAATGATCAGGAACCTCGATTTCCCACTTGGGGTTGCTTCGATTGTCTTG
AAATCCGTCGGGTGAGGGATTTTTAGGGGTTTGGCGGGAGATTGCGGCGGCGCAGAGGAAGCGAGAGGCCGGAAAGGGGTTCGGATAGGGGAAATGGGCTTGATTGGAAA
TGAAGGAAGAGAAATGAGTGGCGTTTTGAAAGTGGGGAAGAATGGAGTTACGAAAGAAGTGGCCATTGTTGTTGAGTTTGTGGATGGAGGTGGAGCTCAGGCCATGGTAG
TTATGAAGTCGGAGGTGAGGGCTGTGTTATCCAGAGTTTTGAAGAGAGGATTAAATTAACAAAAATGTGA
Protein sequenceShow/hide protein sequence
MKGGGAFHRFCLWFLLTIFLGNEFKLVHLRVLQAAARENRNGISSMSLQSSNSGKEMNRIGSRCAKDDIIIFQGPATPLPGGIPTYIVQILNSCASDCSISNIHVKCGWF
SSARLVNPRIFKRVSYDDCLVNDGRALGPGRTLSFQYANTFPYPLSVSSATCSS