; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0024221 (gene) of Chayote v1 genome

Gene IDSed0024221
OrganismSechium edule (Chayote v1)
Descriptionprotein TAPETUM DETERMINANT 1
Genome locationLG01:11207977..11210893
RNA-Seq ExpressionSed0024221
SyntenySed0024221
Gene Ontology termsGO:0001709 - cell fate determination (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019756.1 Protein TAPETUM DETERMINANT 1 [Cucurbita argyrosperma subsp. argyrosperma]8.8e-7987.13Show/hide
Query:  RRVLVILASVFFVFFSVSALLADLDIFCSFMNSTPPSFLRLGSKSAMITSHRK-LLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVV
        RRV VI ASVFF+FF VSA+L DLDIF SFM+S+PPSFLRLG  S +   HRK LLTREA  EEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVV
Subjt:  RRVLVILASVFFVFFSVSALLADLDIFCSFMNSTPPSFLRLGSKSAMITSHRK-LLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVV

Query:  NACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVLC
        NACVTGCDIYGIHFKCGWFSSAHLINPRVFKRL YDDCLVNDGKPLVYGGTLSFQYANTYPY LSVSSV+C
Subjt:  NACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVLC

XP_022923940.1 protein TAPETUM DETERMINANT 1-like [Cucurbita moschata]8.8e-7987.13Show/hide
Query:  RRVLVILASVFFVFFSVSALLADLDIFCSFMNSTPPSFLRLGSKSAMITSHRK-LLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVV
        RRV VI ASVFF+FF VSA+L DLDIF SFM+S+PPSFLRLG  S +   HRK LLTREA  EEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVV
Subjt:  RRVLVILASVFFVFFSVSALLADLDIFCSFMNSTPPSFLRLGSKSAMITSHRK-LLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVV

Query:  NACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVLC
        NACVTGCDIYGIHFKCGWFSSAHLINPRVFKRL YDDCLVNDGKPLVYGGTLSFQYANTYPY LSVSSV+C
Subjt:  NACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVLC

XP_022955262.1 protein TAPETUM DETERMINANT 1-like [Cucurbita moschata]8.8e-7984.8Show/hide
Query:  RRVLVILASVFFVFFSVSALLADLDIFCSFMNSTPPSFLRLGSKSAMITSHRKLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVN
        RRVLVI AS  F+F SVSA L DL+I  SFM+S+PPSFLR G +S  +T  RKLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVN
Subjt:  RRVLVILASVFFVFFSVSALLADLDIFCSFMNSTPPSFLRLGSKSAMITSHRKLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVN

Query:  ACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVLCN
        AC TGCDI+GIHFKCGWFSSAHLINPR+FKRLRYDDCLVNDGKPLVYGGTLSFQYANT+PY LSVSSV+CN
Subjt:  ACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVLCN

XP_022994477.1 protein TAPETUM DETERMINANT 1 [Cucurbita maxima]3.0e-7985.38Show/hide
Query:  RRVLVILASVFFVFFSVSALLADLDIFCSFMNSTPPSFLRLGSKSAMITSHRKLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVN
        RRVLVI ASV F+F SVSA L DL+I  SFM+S+PPSFLR G +S  +T  RKLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVN
Subjt:  RRVLVILASVFFVFFSVSALLADLDIFCSFMNSTPPSFLRLGSKSAMITSHRKLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVN

Query:  ACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVLCN
        AC TGCDI+GIHFKCGWFSSAHLINPR+FKRLRYDDCLVNDGKPLVYGGTLSFQYANT+PY LSVSSV+CN
Subjt:  ACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVLCN

XP_023542781.1 protein TAPETUM DETERMINANT 1 [Cucurbita pepo subsp. pepo]3.0e-7985.38Show/hide
Query:  RRVLVILASVFFVFFSVSALLADLDIFCSFMNSTPPSFLRLGSKSAMITSHRKLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVN
        RRVLVI ASV F+F SVSA L DL+I  SFM+S+PPSFLR G +S  +T  RKLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVN
Subjt:  RRVLVILASVFFVFFSVSALLADLDIFCSFMNSTPPSFLRLGSKSAMITSHRKLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVN

Query:  ACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVLCN
        AC TGCDI+GIHFKCGWFSSAHLINPR+FKRLRYDDCLVNDGKPLVYGGTLSFQYANT+PY LSVSSV+CN
Subjt:  ACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVLCN

TrEMBL top hitse value%identityAlignment
A0A6J1C7I8 protein TAPETUM DETERMINANT 1-like isoform X21.2e-7884.21Show/hide
Query:  RRVLVILASVFFVFFSVSALLADLDIFCSFMNSTPPSFLRLGSKSAMITSHRKLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVN
        RRV VI ASVFF+FFS+SALL DLDI  SFMNS+PPS+L LG  + ++  HRKLL  +A  EEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVN
Subjt:  RRVLVILASVFFVFFSVSALLADLDIFCSFMNSTPPSFLRLGSKSAMITSHRKLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVN

Query:  ACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVLCN
        ACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANT+PY LSVSSV+C+
Subjt:  ACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVLCN

A0A6J1C840 protein TAPETUM DETERMINANT 1-like isoform X19.5e-7984.88Show/hide
Query:  RRVLVILASVFFVFFSVSALLADLDIFCSFMNSTPPSFLRLGSKSAMITSHRK-LLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVV
        RRV VI ASVFF+FFS+SALL DLDI  SFMNS+PPS+L LG  + ++  HRK LLT+EA  EEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVV
Subjt:  RRVLVILASVFFVFFSVSALLADLDIFCSFMNSTPPSFLRLGSKSAMITSHRK-LLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVV

Query:  NACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVLCN
        NACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANT+PY LSVSSV+C+
Subjt:  NACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVLCN

A0A6J1EDD3 protein TAPETUM DETERMINANT 1-like4.3e-7987.13Show/hide
Query:  RRVLVILASVFFVFFSVSALLADLDIFCSFMNSTPPSFLRLGSKSAMITSHRK-LLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVV
        RRV VI ASVFF+FF VSA+L DLDIF SFM+S+PPSFLRLG  S +   HRK LLTREA  EEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVV
Subjt:  RRVLVILASVFFVFFSVSALLADLDIFCSFMNSTPPSFLRLGSKSAMITSHRK-LLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVV

Query:  NACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVLC
        NACVTGCDIYGIHFKCGWFSSAHLINPRVFKRL YDDCLVNDGKPLVYGGTLSFQYANTYPY LSVSSV+C
Subjt:  NACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVLC

A0A6J1GTG1 protein TAPETUM DETERMINANT 1-like4.3e-7984.8Show/hide
Query:  RRVLVILASVFFVFFSVSALLADLDIFCSFMNSTPPSFLRLGSKSAMITSHRKLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVN
        RRVLVI AS  F+F SVSA L DL+I  SFM+S+PPSFLR G +S  +T  RKLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVN
Subjt:  RRVLVILASVFFVFFSVSALLADLDIFCSFMNSTPPSFLRLGSKSAMITSHRKLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVN

Query:  ACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVLCN
        AC TGCDI+GIHFKCGWFSSAHLINPR+FKRLRYDDCLVNDGKPLVYGGTLSFQYANT+PY LSVSSV+CN
Subjt:  ACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVLCN

A0A6J1JVY2 protein TAPETUM DETERMINANT 11.5e-7985.38Show/hide
Query:  RRVLVILASVFFVFFSVSALLADLDIFCSFMNSTPPSFLRLGSKSAMITSHRKLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVN
        RRVLVI ASV F+F SVSA L DL+I  SFM+S+PPSFLR G +S  +T  RKLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVN
Subjt:  RRVLVILASVFFVFFSVSALLADLDIFCSFMNSTPPSFLRLGSKSAMITSHRKLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVN

Query:  ACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVLCN
        AC TGCDI+GIHFKCGWFSSAHLINPR+FKRLRYDDCLVNDGKPLVYGGTLSFQYANT+PY LSVSSV+CN
Subjt:  ACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVLCN

SwissProt top hitse value%identityAlignment
A8MS78 Uncharacterized protein At1g058353.9e-0538.89Show/hide
Query:  YTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYD--DCLVNDGKPLVYGGTLSFQYANTYPYTL
        + VEV+N C   C I  +  KC  F  + L++P   + L     +C+VNDG PL    TLSF Y+NT+ + L
Subjt:  YTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYD--DCLVNDGKPLVYGGTLSFQYANTYPYTL

Q1G3T1 TPD1 protein homolog 19.6e-3659.66Show/hide
Query:  RKLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTL
        RKLL      +   RI G+ C+K DIV+ QG T PLP+G+P+YTVE+ N+CV+ C+I  IH  CGWFSS  L+NPRVF+RL YDDCLVNDG+PL  G +L
Subjt:  RKLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTL

Query:  SFQYANTYPYTLSVSSVLC
        SFQYAN++ Y LSV+SV C
Subjt:  SFQYANTYPYTLSVSSVLC

Q2QR54 TPD1 protein homolog 1A3.9e-2956.44Show/hide
Query:  DIVINQGPTAPLPTGIPTYTVEVVNACVTG------CDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVL
        DI I QG   PLP+G+P YTV+V+N C  G      C I GIH +CGWFSS  L++PRVF+RL +DDCL+NDG+PL+ G T+SF+Y N++PY LSVS   
Subjt:  DIVINQGPTAPLPTGIPTYTVEVVNACVTG------CDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVL

Query:  C
        C
Subjt:  C

Q6TLJ2 Protein TAPETUM DETERMINANT 11.6e-3860.63Show/hide
Query:  SHRKLLT------REAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGK
        SHRK+L       +     EP RI GEKC  +DIV+NQ  T P+P GIP Y VE+ N C++GC I  IH  CGWFSSA LINPRVFKR+ YDDCLVN+GK
Subjt:  SHRKLLT------REAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGK

Query:  PLVYGGTLSFQYANTYPYTLSVSSVLC
        PL +G TLSF YANT+PY LSV+ V C
Subjt:  PLVYGGTLSFQYANTYPYTLSVSSVLC

Q8S6P9 TPD1 protein homolog 1B7.9e-2241.9Show/hide
Query:  RIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSV
        R+  + C++ ++V+ Q     LP+GIPTY+VE++N C T C +Y +H  CG F+SA L++P  F+R+ ++DCLV  G  L     +SFQY+N++ Y L+V
Subjt:  RIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSV

Query:  SSVLC
        ++V C
Subjt:  SSVLC

Arabidopsis top hitse value%identityAlignment
AT1G05835.1 PHD finger protein2.8e-0638.89Show/hide
Query:  YTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYD--DCLVNDGKPLVYGGTLSFQYANTYPYTL
        + VEV+N C   C I  +  KC  F  + L++P   + L     +C+VNDG PL    TLSF Y+NT+ + L
Subjt:  YTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYD--DCLVNDGKPLVYGGTLSFQYANTYPYTL

AT1G32583.1 FUNCTIONS IN: molecular_function unknown6.8e-3759.66Show/hide
Query:  RKLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTL
        RKLL      +   RI G+ C+K DIV+ QG T PLP+G+P+YTVE+ N+CV+ C+I  IH  CGWFSS  L+NPRVF+RL YDDCLVNDG+PL  G +L
Subjt:  RKLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTL

Query:  SFQYANTYPYTLSVSSVLC
        SFQYAN++ Y LSV+SV C
Subjt:  SFQYANTYPYTLSVSSVLC

AT4G24972.1 tapetum determinant 11.1e-3960.63Show/hide
Query:  SHRKLLT------REAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGK
        SHRK+L       +     EP RI GEKC  +DIV+NQ  T P+P GIP Y VE+ N C++GC I  IH  CGWFSSA LINPRVFKR+ YDDCLVN+GK
Subjt:  SHRKLLT------REAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGK

Query:  PLVYGGTLSFQYANTYPYTLSVSSVLC
        PL +G TLSF YANT+PY LSV+ V C
Subjt:  PLVYGGTLSFQYANTYPYTLSVSSVLC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGGTCGGCGAGTTTTGGTGATTCTTGCTTCGGTTTTCTTCGTCTTCTTTTCTGTATCTGCTTTACTCGCAGATCTGGATATTTTTTGTTCCTTTATGAACTCGAC
TCCACCGAGTTTTCTTCGATTGGGTTCAAAGAGCGCAATGATTACTTCTCATCGCAAACTACTTACTCGAGAAGCAGCATTTGAGGAACCAACCAGAATTTGGGGGGAAA
AGTGTACCAAATCAGACATTGTGATTAACCAAGGCCCCACAGCTCCACTTCCAACCGGTATTCCCACCTACACCGTGGAGGTGGTAAACGCTTGTGTCACTGGCTGTGAC
ATCTATGGCATCCACTTCAAATGTGGCTGGTTCAGCTCGGCCCACCTCATTAATCCGAGAGTCTTCAAGCGGCTTCGCTACGACGACTGTCTTGTGAACGACGGCAAGCC
TCTGGTCTATGGCGGCACACTCTCTTTTCAATATGCAAACACTTATCCGTATACACTTTCGGTCTCTTCAGTTTTGTGCAATTGA
mRNA sequenceShow/hide mRNA sequence
CCTTCATATTTTTGAACCCCTTCTTTTCCCACTCTCGAATTCCTCCGACAATTCGCCGGTCGCCGCCGCCGCTAGGGTTTCTTCATTCGTCCGCCACCGTCTGCAGCATT
GTGTTTCCACGATCTCCGTCTGCCATTCCTGTCTCATCTTTCTGCGCCATTTTTGCTCTGTTTTTGAGTTTTTAACCTTCCGACTCTGATCTCTGTTGAACAGTTCTTCG
CTGAAATCTCTTTAAGATTCCTGTTTAAATCATGAAAGGTCGGCGAGTTTTGGTGATTCTTGCTTCGGTTTTCTTCGTCTTCTTTTCTGTATCTGCTTTACTCGCAGATC
TGGATATTTTTTGTTCCTTTATGAACTCGACTCCACCGAGTTTTCTTCGATTGGGTTCAAAGAGCGCAATGATTACTTCTCATCGCAAACTACTTACTCGAGAAGCAGCA
TTTGAGGAACCAACCAGAATTTGGGGGGAAAAGTGTACCAAATCAGACATTGTGATTAACCAAGGCCCCACAGCTCCACTTCCAACCGGTATTCCCACCTACACCGTGGA
GGTGGTAAACGCTTGTGTCACTGGCTGTGACATCTATGGCATCCACTTCAAATGTGGCTGGTTCAGCTCGGCCCACCTCATTAATCCGAGAGTCTTCAAGCGGCTTCGCT
ACGACGACTGTCTTGTGAACGACGGCAAGCCTCTGGTCTATGGCGGCACACTCTCTTTTCAATATGCAAACACTTATCCGTATACACTTTCGGTCTCTTCAGTTTTGTGC
AATTGAATCGACTCGAGCCAAACCATTTAGGAGATGTTCATATGAAGCATTAGAAGCAGCAACATGTACTTCAGTCCATTACTGGTTGACTCTGGCCCCCACCTGCCTTT
TGTTATTACTTTCATTTCACTTCTTGGTTCTGCTTTTGCTTTATATGTGTGTGACCTTGTGGGCATTGGCCATTTTCTGTTCCTGTTCCTACAAGTTTGTTTTTAGAGAG
TTCTGGTTTTCCTGAGATTGGAGGGATTGTGGCTTAGTGTAATTGTCATGATCCCAACTTTGGATATACATAGATGGTGAGATAGAGATAGAACCTTACATTTTCATTTC
TTTTTGGGGTGTTATGAGATACAGATAGGCATCGTGGTAGGAAGGTCA
Protein sequenceShow/hide protein sequence
MKGRRVLVILASVFFVFFSVSALLADLDIFCSFMNSTPPSFLRLGSKSAMITSHRKLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCD
IYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYTLSVSSVLCN