; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10020986 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10020986
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein TAPETUM DETERMINANT 1
Genome locationChr05:4240956..4242713
RNA-Seq ExpressionHG10020986
SyntenyHG10020986
Gene Ontology termsGO:0001709 - cell fate determination (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR040361 - Tapetum determinant 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ99275.1 protein TAPETUM DETERMINANT 1 [Cucumis melo var. makuwa]2.1e-8685.79Show/hide
Query:  MMKDLPTTTTTTTTRRVSVIFALVFFIFLLVSAFLTDLDIMNSSPPSFLRLGLNSTIVGPHRKLLLTK---------EATIEEPTRIWGEKCTKSDIVIN
        MMK+L TTTTTTTTRRVSVIFALV F+F  +SA LTD +IM+SSPPSFL L LN+T V PHRKLLLT+         EATIEEPTRIWGEKCTKSDIVIN
Subjt:  MMKDLPTTTTTTTTRRVSVIFALVFFIFLLVSAFLTDLDIMNSSPPSFLRLGLNSTIVGPHRKLLLTK---------EATIEEPTRIWGEKCTKSDIVIN

Query:  QGPTAPLPTGIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC
        QGPTAPLPTGIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSV+C
Subjt:  QGPTAPLPTGIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC

XP_004153503.1 protein TAPETUM DETERMINANT 1 [Cucumis sativus]1.3e-8890.61Show/hide
Query:  MMKDLPTTTTTTTTRRVSVIFALVFFIFLLVSAFLTDLDIMNSSPPSFLRLGLNSTIVGPHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTAPLPT
        MMK+L TTTTTTTTRRVSVIFALV FIF  VSA LTD +IM+S PPSFL L LN+T V PHRKLLLT+EATIEEPTRIWGEKCTKSDIVINQGPTAPLPT
Subjt:  MMKDLPTTTTTTTTRRVSVIFALVFFIFLLVSAFLTDLDIMNSSPPSFLRLGLNSTIVGPHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTAPLPT

Query:  GIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC
        GIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSV+C
Subjt:  GIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC

XP_008439976.1 PREDICTED: protein TAPETUM DETERMINANT 1 [Cucumis melo]9.9e-8990.06Show/hide
Query:  MMKDLPTTTTTTTTRRVSVIFALVFFIFLLVSAFLTDLDIMNSSPPSFLRLGLNSTIVGPHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTAPLPT
        MMK+L TTTTTTTTRRVSVIFALV F+F  +SA LTD +IM+SSPPSFL L LN+T V PHRKLLLT+EATIEEPTRIWGEKCTKSDIVINQGPTAPLPT
Subjt:  MMKDLPTTTTTTTTRRVSVIFALVFFIFLLVSAFLTDLDIMNSSPPSFLRLGLNSTIVGPHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTAPLPT

Query:  GIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC
        GIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSV+C
Subjt:  GIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC

XP_022137731.1 protein TAPETUM DETERMINANT 1-like isoform X1 [Momordica charantia]9.2e-8788.71Show/hide
Query:  MMKDLPTTTTTTTTRRVSVIFALVFFIFLLVSAFLTDLDI----MNSSPPSFLRLGLNSTIVGPHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTA
        MMKD   +TTTT TRRVSVIFA VFFIF  +SA LTDLDI    MNSSPPS+L LGLN+TIV PHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTA
Subjt:  MMKDLPTTTTTTTTRRVSVIFALVFFIFLLVSAFLTDLDI----MNSSPPSFLRLGLNSTIVGPHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTA

Query:  PLPTGIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN
        PLPTGIPTYTVEVVNACVTGC+IYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANT+PYPLSVSSVVC+
Subjt:  PLPTGIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN

XP_038893959.1 protein TAPETUM DETERMINANT 1-like [Benincasa hispida]1.9e-8794.61Show/hide
Query:  RVSVIFALVFFIFLLVSAFLTDLDIMNSSPPSFLRLGLNSTIVGPHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVT
        RV VIFALVFFIF LVSAFLTDLDIMNSSPPSFLRLGLN+TIV PHRKLLLT++ATIEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVT
Subjt:  RVSVIFALVFFIFLLVSAFLTDLDIMNSSPPSFLRLGLNSTIVGPHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVT

Query:  GCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN
        GCEIYGIHFKCGWFSSAHLINPR+FKRLRYDDCLVNDGKPLVYGGTLSFQYANT+PYPLSVSSV CN
Subjt:  GCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN

TrEMBL top hitse value%identityAlignment
A0A0A0LUG9 Uncharacterized protein6.2e-8990.61Show/hide
Query:  MMKDLPTTTTTTTTRRVSVIFALVFFIFLLVSAFLTDLDIMNSSPPSFLRLGLNSTIVGPHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTAPLPT
        MMK+L TTTTTTTTRRVSVIFALV FIF  VSA LTD +IM+S PPSFL L LN+T V PHRKLLLT+EATIEEPTRIWGEKCTKSDIVINQGPTAPLPT
Subjt:  MMKDLPTTTTTTTTRRVSVIFALVFFIFLLVSAFLTDLDIMNSSPPSFLRLGLNSTIVGPHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTAPLPT

Query:  GIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC
        GIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSV+C
Subjt:  GIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC

A0A1S3B0P8 protein TAPETUM DETERMINANT 14.8e-8990.06Show/hide
Query:  MMKDLPTTTTTTTTRRVSVIFALVFFIFLLVSAFLTDLDIMNSSPPSFLRLGLNSTIVGPHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTAPLPT
        MMK+L TTTTTTTTRRVSVIFALV F+F  +SA LTD +IM+SSPPSFL L LN+T V PHRKLLLT+EATIEEPTRIWGEKCTKSDIVINQGPTAPLPT
Subjt:  MMKDLPTTTTTTTTRRVSVIFALVFFIFLLVSAFLTDLDIMNSSPPSFLRLGLNSTIVGPHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTAPLPT

Query:  GIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC
        GIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSV+C
Subjt:  GIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC

A0A5A7UPF3 Protein TAPETUM DETERMINANT 14.8e-8990.06Show/hide
Query:  MMKDLPTTTTTTTTRRVSVIFALVFFIFLLVSAFLTDLDIMNSSPPSFLRLGLNSTIVGPHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTAPLPT
        MMK+L TTTTTTTTRRVSVIFALV F+F  +SA LTD +IM+SSPPSFL L LN+T V PHRKLLLT+EATIEEPTRIWGEKCTKSDIVINQGPTAPLPT
Subjt:  MMKDLPTTTTTTTTRRVSVIFALVFFIFLLVSAFLTDLDIMNSSPPSFLRLGLNSTIVGPHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTAPLPT

Query:  GIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC
        GIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSV+C
Subjt:  GIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC

A0A5D3BLS2 Protein TAPETUM DETERMINANT 11.0e-8685.79Show/hide
Query:  MMKDLPTTTTTTTTRRVSVIFALVFFIFLLVSAFLTDLDIMNSSPPSFLRLGLNSTIVGPHRKLLLTK---------EATIEEPTRIWGEKCTKSDIVIN
        MMK+L TTTTTTTTRRVSVIFALV F+F  +SA LTD +IM+SSPPSFL L LN+T V PHRKLLLT+         EATIEEPTRIWGEKCTKSDIVIN
Subjt:  MMKDLPTTTTTTTTRRVSVIFALVFFIFLLVSAFLTDLDIMNSSPPSFLRLGLNSTIVGPHRKLLLTK---------EATIEEPTRIWGEKCTKSDIVIN

Query:  QGPTAPLPTGIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC
        QGPTAPLPTGIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSV+C
Subjt:  QGPTAPLPTGIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC

A0A6J1C840 protein TAPETUM DETERMINANT 1-like isoform X14.5e-8788.71Show/hide
Query:  MMKDLPTTTTTTTTRRVSVIFALVFFIFLLVSAFLTDLDI----MNSSPPSFLRLGLNSTIVGPHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTA
        MMKD   +TTTT TRRVSVIFA VFFIF  +SA LTDLDI    MNSSPPS+L LGLN+TIV PHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTA
Subjt:  MMKDLPTTTTTTTTRRVSVIFALVFFIFLLVSAFLTDLDI----MNSSPPSFLRLGLNSTIVGPHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTA

Query:  PLPTGIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN
        PLPTGIPTYTVEVVNACVTGC+IYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANT+PYPLSVSSVVC+
Subjt:  PLPTGIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN

SwissProt top hitse value%identityAlignment
A8MS78 Uncharacterized protein At1g058355.4e-0538.89Show/hide
Query:  YTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYD--DCLVNDGKPLVYGGTLSFQYANTYPYPL
        + VEV+N C   C I  +  KC  F  + L++P   + L     +C+VNDG PL    TLSF Y+NT+ + L
Subjt:  YTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYD--DCLVNDGKPLVYGGTLSFQYANTYPYPL

Q1G3T1 TPD1 protein homolog 13.7e-3857.04Show/hide
Query:  SFLRLGLNSTIVGPHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYD
        SF+ +  N T +   RKLLL+ +  I + T   G+ C+K DIV+ QG T PLP+G+P+YTVE+ N+CV+ C I  IH  CGWFSS  L+NPRVF+RL YD
Subjt:  SFLRLGLNSTIVGPHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYD

Query:  DCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC
        DCLVNDG+PL  G +LSFQYAN++ YPLSV+SV C
Subjt:  DCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC

Q2QR54 TPD1 protein homolog 1A2.4e-2956.44Show/hide
Query:  DIVINQGPTAPLPTGIPTYTVEVVNACVTG------CEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVV
        DI I QG   PLP+G+P YTV+V+N C  G      C I GIH +CGWFSS  L++PRVF+RL +DDCL+NDG+PL+ G T+SF+Y N++PY LSVS   
Subjt:  DIVINQGPTAPLPTGIPTYTVEVVNACVTG------CEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVV

Query:  C
        C
Subjt:  C

Q6TLJ2 Protein TAPETUM DETERMINANT 11.5e-3961.9Show/hide
Query:  HRKLLLTKEATIE-----EPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKP
        HRK+LL    T +     EP RI GEKC  +DIV+NQ  T P+P GIP Y VE+ N C++GC I  IH  CGWFSSA LINPRVFKR+ YDDCLVN+GKP
Subjt:  HRKLLLTKEATIE-----EPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKP

Query:  LVYGGTLSFQYANTYPYPLSVSSVVC
        L +G TLSF YANT+PY LSV+ V C
Subjt:  LVYGGTLSFQYANTYPYPLSVSSVVC

Q8S6P9 TPD1 protein homolog 1B5.7e-2342.86Show/hide
Query:  RIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSV
        R+  + C++ ++V+ Q     LP+GIPTY+VE++N C T C +Y +H  CG F+SA L++P  F+R+ ++DCLV  G  L     +SFQY+N++ YPL+V
Subjt:  RIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSV

Query:  SSVVC
        ++V C
Subjt:  SSVVC

Arabidopsis top hitse value%identityAlignment
AT1G05835.1 PHD finger protein3.8e-0638.89Show/hide
Query:  YTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYD--DCLVNDGKPLVYGGTLSFQYANTYPYPL
        + VEV+N C   C I  +  KC  F  + L++P   + L     +C+VNDG PL    TLSF Y+NT+ + L
Subjt:  YTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYD--DCLVNDGKPLVYGGTLSFQYANTYPYPL

AT1G32583.1 FUNCTIONS IN: molecular_function unknown2.6e-3957.04Show/hide
Query:  SFLRLGLNSTIVGPHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYD
        SF+ +  N T +   RKLLL+ +  I + T   G+ C+K DIV+ QG T PLP+G+P+YTVE+ N+CV+ C I  IH  CGWFSS  L+NPRVF+RL YD
Subjt:  SFLRLGLNSTIVGPHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYD

Query:  DCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC
        DCLVNDG+PL  G +LSFQYAN++ YPLSV+SV C
Subjt:  DCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC

AT4G24972.1 tapetum determinant 11.1e-4061.9Show/hide
Query:  HRKLLLTKEATIE-----EPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKP
        HRK+LL    T +     EP RI GEKC  +DIV+NQ  T P+P GIP Y VE+ N C++GC I  IH  CGWFSSA LINPRVFKR+ YDDCLVN+GKP
Subjt:  HRKLLLTKEATIE-----EPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKP

Query:  LVYGGTLSFQYANTYPYPLSVSSVVC
        L +G TLSF YANT+PY LSV+ V C
Subjt:  LVYGGTLSFQYANTYPYPLSVSSVVC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAAAGATCTACCTACTACTACTACTACTACTACTACTCGGCGAGTATCGGTGATTTTCGCTTTGGTTTTCTTCATCTTCCTTTTGGTATCTGCTTTTCTTACAGA
TCTGGATATTATGAACTCGAGTCCACCGAGTTTCCTTCGGTTAGGTTTGAACAGCACAATAGTTGGCCCTCATCGGAAGCTTCTCCTCACTAAAGAAGCAACAATTGAGG
AACCAACCAGAATTTGGGGTGAAAAGTGTACTAAATCAGACATTGTGATTAACCAAGGCCCCACAGCTCCACTTCCAACTGGTATCCCCACCTACACTGTGGAAGTGGTG
AACGCTTGTGTCACTGGATGTGAGATCTATGGTATACACTTCAAATGTGGTTGGTTTAGCTCAGCCCACCTCATCAATCCCAGAGTCTTCAAGCGTCTACGTTATGACGA
CTGCCTCGTAAACGATGGCAAGCCTCTTGTCTACGGCGGGACACTCTCATTCCAATATGCAAACACTTATCCATACCCGCTTTCAGTCTCCTCAGTTGTCTGCAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGATGAAAGATCTACCTACTACTACTACTACTACTACTACTCGGCGAGTATCGGTGATTTTCGCTTTGGTTTTCTTCATCTTCCTTTTGGTATCTGCTTTTCTTACAGA
TCTGGATATTATGAACTCGAGTCCACCGAGTTTCCTTCGGTTAGGTTTGAACAGCACAATAGTTGGCCCTCATCGGAAGCTTCTCCTCACTAAAGAAGCAACAATTGAGG
AACCAACCAGAATTTGGGGTGAAAAGTGTACTAAATCAGACATTGTGATTAACCAAGGCCCCACAGCTCCACTTCCAACTGGTATCCCCACCTACACTGTGGAAGTGGTG
AACGCTTGTGTCACTGGATGTGAGATCTATGGTATACACTTCAAATGTGGTTGGTTTAGCTCAGCCCACCTCATCAATCCCAGAGTCTTCAAGCGTCTACGTTATGACGA
CTGCCTCGTAAACGATGGCAAGCCTCTTGTCTACGGCGGGACACTCTCATTCCAATATGCAAACACTTATCCATACCCGCTTTCAGTCTCCTCAGTTGTCTGCAACTGA
Protein sequenceShow/hide protein sequence
MMKDLPTTTTTTTTRRVSVIFALVFFIFLLVSAFLTDLDIMNSSPPSFLRLGLNSTIVGPHRKLLLTKEATIEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVV
NACVTGCEIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN