CuGenDBv2

Tan0001889 (gene) of Snake gourd v1 genome

Gene ID: Tan0001889
Organism: Trichosanthes anguina (Snake gourd v1)
Description: protein TAPETUM DETERMINANT 1
Genome location: LG05:81918863..81920637
RNA-Seq Expression: Tan0001889
Synteny: Tan0001889
Gene Ontology terms:
  GO:0001709 - cell fate determination (biological process)
  GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR040361 - Tapetum determinant 1
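The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the string can be split into its parts and the gene span computed as follows. The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end, span).

    Assumption: start and end are 1-based inclusive coordinates,
    so the span in base pairs is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return seqid, start, end, end - start + 1

seqid, start, end, span = parse_location("LG05:81918863..81920637")
print(seqid, start, end, span)
```

Under that assumption the gene spans 1775 bp on LG05, which is longer than the 549-nt CDS listed in the Sequences section.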


Homology
GenBank top hits (e value, % identity, alignment)
KAG6573145.1 Protein TAPETUM DETERMINANT 1, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.6e-86; % identity: 88.59)
Query:  MMK--DPSTSTTRRVLVIFASVFFIFFSVSAFLTDLNIIGSFMDPTPPSFLRLGLKSTMVVAHRKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPL
        MMK    ST+TTRRVLVIFASV FIF SVSAFLTDLNIIGSFMD +PPSFLR GL+ST V   RK LLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPL
Subjt:  MMK--DPSTSTTRRVLVIFASVFFIFFSVSAFLTDLNIIGSFMDPTPPSFLRLGLKSTMVVAHRKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPL

Query:  PTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN
        PTGIPTYTVEV NAC TGCDI+GIHFKCGWFSSAHLINPR+FKRLRYDDCLVNDGKPLVYGGTLSFQYANT+PYPLSVSSVVCN
Subjt:  PTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN

XP_022137731.1 protein TAPETUM DETERMINANT 1-like isoform X1 [Momordica charantia] (e value: 2.1e-86; % identity: 85.95)
Query:  MMKD---PSTSTTRRVLVIFASVFFIFFSVSAFLTDLNIIGSFMDPTPPSFLRLGLKSTMVVAHRKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAP
        MMKD    +T+ TRRV VIFASVFFIFFS+SA LTDL+IIGSFM+ +PPS+L LGL +T+V  HRKLLLT+EA  EEPTRIWGEKCTKSDIVINQGPTAP
Subjt:  MMKD---PSTSTTRRVLVIFASVFFIFFSVSAFLTDLNIIGSFMDPTPPSFLRLGLKSTMVVAHRKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAP

Query:  LPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN
        LPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANT+PYPLSVSSVVC+
Subjt:  LPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN

XP_022955262.1 protein TAPETUM DETERMINANT 1-like [Cucurbita moschata] (e value: 9.2e-87; % identity: 88.59)
Query:  MMK--DPSTSTTRRVLVIFASVFFIFFSVSAFLTDLNIIGSFMDPTPPSFLRLGLKSTMVVAHRKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPL
        MMK    ST+TTRRVLVIFAS  FIF SVSAFLTDLNIIGSFMD +PPSFLR GL+ST V   RK LLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPL
Subjt:  MMK--DPSTSTTRRVLVIFASVFFIFFSVSAFLTDLNIIGSFMDPTPPSFLRLGLKSTMVVAHRKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPL

Query:  PTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN
        PTGIPTYTVEVVNAC TGCDI+GIHFKCGWFSSAHLINPR+FKRLRYDDCLVNDGKPLVYGGTLSFQYANT+PYPLSVSSVVCN
Subjt:  PTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN

XP_022994477.1 protein TAPETUM DETERMINANT 1 [Cucurbita maxima] (e value: 8.3e-88; % identity: 90.16)
Query:  MMKDPSTSTT-RRVLVIFASVFFIFFSVSAFLTDLNIIGSFMDPTPPSFLRLGLKSTMVVAHRKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLP
        MMK PS STT RRVLVIFASV FIF SVSAFLTDLNIIGSFMD +PPSFLR GL+ST V   RK LLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLP
Subjt:  MMKDPSTSTT-RRVLVIFASVFFIFFSVSAFLTDLNIIGSFMDPTPPSFLRLGLKSTMVVAHRKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLP

Query:  TGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN
        TGIPTYTVEVVNAC TGCDI+GIHFKCGWFSSAHLINPR+FKRLRYDDCLVNDGKPLVYGGTLSFQYANT+PYPLSVSSVVCN
Subjt:  TGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN

XP_023542781.1 protein TAPETUM DETERMINANT 1 [Cucurbita pepo subsp. pepo] (e value: 2.4e-87; % identity: 90.16)
Query:  MMKDPSTS-TTRRVLVIFASVFFIFFSVSAFLTDLNIIGSFMDPTPPSFLRLGLKSTMVVAHRKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLP
        MMK  STS TTRRVLVIFASV FIF SVSAFLTDLNIIGSFMD +PPSFLR GL+ST V   RK LLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLP
Subjt:  MMKDPSTS-TTRRVLVIFASVFFIFFSVSAFLTDLNIIGSFMDPTPPSFLRLGLKSTMVVAHRKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLP

Query:  TGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN
        TGIPTYTVEVVNAC TGCDI+GIHFKCGWFSSAHLINPR+FKRLRYDDCLVNDGKPLVYGGTLSFQYANT+PYPLSVSSVVCN
Subjt:  TGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C840 protein TAPETUM DETERMINANT 1-like isoform X1 (e value: 1.0e-86; % identity: 85.95)
Query:  MMKD---PSTSTTRRVLVIFASVFFIFFSVSAFLTDLNIIGSFMDPTPPSFLRLGLKSTMVVAHRKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAP
        MMKD    +T+ TRRV VIFASVFFIFFS+SA LTDL+IIGSFM+ +PPS+L LGL +T+V  HRKLLLT+EA  EEPTRIWGEKCTKSDIVINQGPTAP
Subjt:  MMKD---PSTSTTRRVLVIFASVFFIFFSVSAFLTDLNIIGSFMDPTPPSFLRLGLKSTMVVAHRKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAP

Query:  LPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN
        LPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANT+PYPLSVSSVVC+
Subjt:  LPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN

A0A6J1EDD3 protein TAPETUM DETERMINANT 1-like (e value: 1.3e-86; % identity: 89.5)
Query:  MMKDPSTSTTRRVLVIFASVFFIFFSVSAFLTDLNIIGSFMDPTPPSFLRLGLKSTMVVAHRKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPT
        MMKD   STTRRV VIFASVFFIFF VSA LTDL+I  SFMD +PPSFLRLGL ST+   HRKLLLTREA  EEPTRIWGEKCTKSDIVINQGPTAPLPT
Subjt:  MMKDPSTSTTRRVLVIFASVFFIFFSVSAFLTDLNIIGSFMDPTPPSFLRLGLKSTMVVAHRKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPT

Query:  GIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC
        GIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRL YDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC
Subjt:  GIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC

A0A6J1GTG1 protein TAPETUM DETERMINANT 1-like (e value: 4.5e-87; % identity: 88.59)
Query:  MMK--DPSTSTTRRVLVIFASVFFIFFSVSAFLTDLNIIGSFMDPTPPSFLRLGLKSTMVVAHRKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPL
        MMK    ST+TTRRVLVIFAS  FIF SVSAFLTDLNIIGSFMD +PPSFLR GL+ST V   RK LLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPL
Subjt:  MMK--DPSTSTTRRVLVIFASVFFIFFSVSAFLTDLNIIGSFMDPTPPSFLRLGLKSTMVVAHRKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPL

Query:  PTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN
        PTGIPTYTVEVVNAC TGCDI+GIHFKCGWFSSAHLINPR+FKRLRYDDCLVNDGKPLVYGGTLSFQYANT+PYPLSVSSVVCN
Subjt:  PTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN

A0A6J1JVY2 protein TAPETUM DETERMINANT 1 (e value: 4.0e-88; % identity: 90.16)
Query:  MMKDPSTSTT-RRVLVIFASVFFIFFSVSAFLTDLNIIGSFMDPTPPSFLRLGLKSTMVVAHRKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLP
        MMK PS STT RRVLVIFASV FIF SVSAFLTDLNIIGSFMD +PPSFLR GL+ST V   RK LLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLP
Subjt:  MMKDPSTSTT-RRVLVIFASVFFIFFSVSAFLTDLNIIGSFMDPTPPSFLRLGLKSTMVVAHRKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLP

Query:  TGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN
        TGIPTYTVEVVNAC TGCDI+GIHFKCGWFSSAHLINPR+FKRLRYDDCLVNDGKPLVYGGTLSFQYANT+PYPLSVSSVVCN
Subjt:  TGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN

A0A6J1KH78 protein TAPETUM DETERMINANT 1-like (e value: 4.9e-86; % identity: 88.4)
Query:  MMKDPSTSTTRRVLVIFASVFFIFFSVSAFLTDLNIIGSFMDPTPPSFLRLGLKSTMVVAHRKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPT
        MMKD   STTRRV VIF SVFFIFF VSA LTDL+II SFM+P+PPSFLR+ LKS +   HRKLLLTREA  EEPTRIWGEKCTKSDIVINQGPTAPLPT
Subjt:  MMKDPSTSTTRRVLVIFASVFFIFFSVSAFLTDLNIIGSFMDPTPPSFLRLGLKSTMVVAHRKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPT

Query:  GIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC
        GIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRL YDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC
Subjt:  GIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC

SwissProt top hits (e value, % identity, alignment)
A8MS78 Uncharacterized protein At1g05835 (e value: 5.4e-05; % identity: 38.89)
Query:  YTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYD--DCLVNDGKPLVYGGTLSFQYANTYPYPL
        + VEV+N C   C I  +  KC  F  + L++P   + L     +C+VNDG PL    TLSF Y+NT+ + L
Subjt:  YTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYD--DCLVNDGKPLVYGGTLSFQYANTYPYPL

Q1G3T1 TPD1 protein homolog 1 (e value: 4.1e-37; % identity: 60)
Query:  RKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGT
        RKLLL+ +    + T   G+ C+K DIV+ QG T PLP+G+P+YTVE+ N+CV+ C+I  IH  CGWFSS  L+NPRVF+RL YDDCLVNDG+PL  G +
Subjt:  RKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGT

Query:  LSFQYANTYPYPLSVSSVVC
        LSFQYAN++ YPLSV+SV C
Subjt:  LSFQYANTYPYPLSVSSVVC

Q2QR54 TPD1 protein homolog 1A (e value: 2.4e-29; % identity: 56.44)
Query:  DIVINQGPTAPLPTGIPTYTVEVVNACVTG------CDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVV
        DI I QG   PLP+G+P YTV+V+N C  G      C I GIH +CGWFSS  L++PRVF+RL +DDCL+NDG+PL+ G T+SF+Y N++PY LSVS   
Subjt:  DIVINQGPTAPLPTGIPTYTVEVVNACVTG------CDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVV

Query:  C
        C
Subjt:  C

Q6TLJ2 Protein TAPETUM DETERMINANT 1 (e value: 1.1e-39; % identity: 59.09)
Query:  STMVVAHRKLLLTREAAFE-----EPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCL
        S+  ++HRK+LL      +     EP RI GEKC  +DIV+NQ  T P+P GIP Y VE+ N C++GC I  IH  CGWFSSA LINPRVFKR+ YDDCL
Subjt:  STMVVAHRKLLLTREAAFE-----EPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCL

Query:  VNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC
        VN+GKPL +G TLSF YANT+PY LSV+ V C
Subjt:  VNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC

Q8S6P9 TPD1 protein homolog 1B (e value: 5.7e-23; % identity: 42.86)
Query:  RIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSV
        R+  + C++ ++V+ Q     LP+GIPTY+VE++N C T C +Y +H  CG F+SA L++P  F+R+ ++DCLV  G  L     +SFQY+N++ YPL+V
Subjt:  RIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSV

Query:  SSVVC
        ++V C
Subjt:  SSVVC

Arabidopsis top hits (e value, % identity, alignment)
AT1G05835.1 PHD finger protein (e value: 3.8e-06; % identity: 38.89)
Query:  YTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYD--DCLVNDGKPLVYGGTLSFQYANTYPYPL
        + VEV+N C   C I  +  KC  F  + L++P   + L     +C+VNDG PL    TLSF Y+NT+ + L
Subjt:  YTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYD--DCLVNDGKPLVYGGTLSFQYANTYPYPL

AT1G32583.1 FUNCTIONS IN: molecular_function unknown (e value: 2.9e-38; % identity: 60)
Query:  RKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGT
        RKLLL+ +    + T   G+ C+K DIV+ QG T PLP+G+P+YTVE+ N+CV+ C+I  IH  CGWFSS  L+NPRVF+RL YDDCLVNDG+PL  G +
Subjt:  RKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGT

Query:  LSFQYANTYPYPLSVSSVVC
        LSFQYAN++ YPLSV+SV C
Subjt:  LSFQYANTYPYPLSVSSVVC

AT4G24972.1 tapetum determinant 1 (e value: 8.2e-41; % identity: 59.09)
Query:  STMVVAHRKLLLTREAAFE-----EPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCL
        S+  ++HRK+LL      +     EP RI GEKC  +DIV+NQ  T P+P GIP Y VE+ N C++GC I  IH  CGWFSSA LINPRVFKR+ YDDCL
Subjt:  STMVVAHRKLLLTREAAFE-----EPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCL

Query:  VNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC
        VN+GKPL +G TLSF YANT+PY LSV+ V C
Subjt:  VNDGKPLVYGGTLSFQYANTYPYPLSVSSVVC
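The % identity values in the hit tables can be cross-checked against the alignment text. The sketch below assumes the usual BLAST-style convention, which this page does not state explicitly: letters on the middle line mark identical residues, `+` marks conservative substitutions, and identity is the number of identical columns divided by the alignment length including gap columns. The function name `percent_identity` is hypothetical. The strings are the Query and middle lines of the AT1G05835.1 alignment above.

```python
def percent_identity(query_aln, midline):
    """Percent identity from a BLAST-style pairwise alignment.

    Assumption: every letter on the midline is an identical column;
    '+' and spaces are non-identical. The alignment length is the
    length of the aligned query line, gap characters included.
    """
    identical = sum(ch.isalpha() for ch in midline)
    return round(100.0 * identical / len(query_aln), 2)

# Query and middle lines of the AT1G05835.1 hit, copied from the record.
QUERY = "YTVEVVNACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYD--DCLVNDGKPLVYGGTLSFQYANTYPYPL"
MIDLINE = "+ VEV+N C   C I  +  KC  F  + L++P   + L     +C+VNDG PL    TLSF Y+NT+ + L"

print(percent_identity(QUERY, MIDLINE))
```

For this hit the middle line carries 28 letters over 72 alignment columns, which gives the 38.89 listed in the table.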


Sequences
CDS sequence
ATGATGAAAGATCCATCTACTTCTACCACTCGGCGAGTTTTGGTGATTTTCGCTTCGGTTTTCTTCATCTTCTTTTCGGTATCTGCCTTTCTTACAGATCTGAATATTAT
TGGTTCCTTTATGGACCCGACTCCACCGAGTTTTCTTCGATTGGGTTTAAAGAGCACCATGGTTGTCGCTCATCGCAAACTTCTCCTTACCAGAGAAGCAGCATTTGAGG
AACCAACCAGAATTTGGGGTGAAAAGTGTACTAAATCAGACATTGTGATTAATCAAGGCCCCACAGCTCCACTTCCAACTGGTATTCCCACCTACACTGTGGAAGTGGTG
AACGCTTGTGTCACTGGCTGTGACATTTATGGCATTCACTTCAAATGTGGCTGGTTCAGCTCTGCCCACCTCATCAATCCCAGAGTCTTCAAGCGCCTTCGCTACGACGA
CTGCCTTGTGAACGACGGCAAGCCTCTGGTCTATGGCGGCACGCTCTCTTTCCAGTATGCAAACACTTATCCTTACCCACTTTCCGTCTCTTCAGTTGTCTGCAACTGA
mRNA sequence
ATGATGAAAGATCCATCTACTTCTACCACTCGGCGAGTTTTGGTGATTTTCGCTTCGGTTTTCTTCATCTTCTTTTCGGTATCTGCCTTTCTTACAGATCTGAATATTAT
TGGTTCCTTTATGGACCCGACTCCACCGAGTTTTCTTCGATTGGGTTTAAAGAGCACCATGGTTGTCGCTCATCGCAAACTTCTCCTTACCAGAGAAGCAGCATTTGAGG
AACCAACCAGAATTTGGGGTGAAAAGTGTACTAAATCAGACATTGTGATTAATCAAGGCCCCACAGCTCCACTTCCAACTGGTATTCCCACCTACACTGTGGAAGTGGTG
AACGCTTGTGTCACTGGCTGTGACATTTATGGCATTCACTTCAAATGTGGCTGGTTCAGCTCTGCCCACCTCATCAATCCCAGAGTCTTCAAGCGCCTTCGCTACGACGA
CTGCCTTGTGAACGACGGCAAGCCTCTGGTCTATGGCGGCACGCTCTCTTTCCAGTATGCAAACACTTATCCTTACCCACTTTCCGTCTCTTCAGTTGTCTGCAACTGA
Protein sequence
MMKDPSTSTTRRVLVIFASVFFIFFSVSAFLTDLNIIGSFMDPTPPSFLRLGLKSTMVVAHRKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVV
NACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN
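As a consistency check on the sequences above, the CDS can be translated and compared with the listed protein. This is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1); the record does not name the table used. The helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS codon by codon; stop codons are emitted as '*'."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# CDS and protein copied from the record, one string per displayed line.
CDS = "".join([
    "ATGATGAAAGATCCATCTACTTCTACCACTCGGCGAGTTTTGGTGATTTTCGCTTCGGTTTTCTTCATCTTCTTTTCGGTATCTGCCTTTCTTACAGATCTGAATATTAT",
    "TGGTTCCTTTATGGACCCGACTCCACCGAGTTTTCTTCGATTGGGTTTAAAGAGCACCATGGTTGTCGCTCATCGCAAACTTCTCCTTACCAGAGAAGCAGCATTTGAGG",
    "AACCAACCAGAATTTGGGGTGAAAAGTGTACTAAATCAGACATTGTGATTAATCAAGGCCCCACAGCTCCACTTCCAACTGGTATTCCCACCTACACTGTGGAAGTGGTG",
    "AACGCTTGTGTCACTGGCTGTGACATTTATGGCATTCACTTCAAATGTGGCTGGTTCAGCTCTGCCCACCTCATCAATCCCAGAGTCTTCAAGCGCCTTCGCTACGACGA",
    "CTGCCTTGTGAACGACGGCAAGCCTCTGGTCTATGGCGGCACGCTCTCTTTCCAGTATGCAAACACTTATCCTTACCCACTTTCCGTCTCTTCAGTTGTCTGCAACTGA",
])
PROTEIN = "".join([
    "MMKDPSTSTTRRVLVIFASVFFIFFSVSAFLTDLNIIGSFMDPTPPSFLRLGLKSTMVVAHRKLLLTREAAFEEPTRIWGEKCTKSDIVINQGPTAPLPTGIPTYTVEVV",
    "NACVTGCDIYGIHFKCGWFSSAHLINPRVFKRLRYDDCLVNDGKPLVYGGTLSFQYANTYPYPLSVSSVVCN",
])

translated = translate(CDS)
print(len(CDS), len(PROTEIN))
print(translated.rstrip("*") == PROTEIN)
```

The CDS is 549 nt, that is 182 codons plus a terminal TGA stop, and the protein is 182 residues, so the two listings agree in length as well as in sequence under this assumption.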