; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007481 (gene) of Snake gourd v1 genome

Gene IDTan0007481
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG01:16197098..16198670
RNA-Seq ExpressionTan0007481
SyntenyTan0007481
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7015466.1 hypothetical protein SDJN02_23102 [Cucurbita argyrosperma subsp. argyrosperma]2.6e-10584.23Show/hide
Query:  MTSFIRFSTLYILHDHFCKKPTRFNPLPTSKLASCDRNGSRLRAYGGRLFFL--GRGNEWILGRDGGFKGKRRLIVARFNQGLGFNG---GGGGGDDGAT
        MTSFIRFS+  +LHD+F  KPTRFNPLP  K+ASC RNG RLR   G+L FL  G GN  + G+ G FKGKR LIVARFNQG GFNG   GGGGGDDGAT
Subjt:  MTSFIRFSTLYILHDHFCKKPTRFNPLPTSKLASCDRNGSRLRAYGGRLFFL--GRGNEWILGRDGGFKGKRRLIVARFNQGLGFNG---GGGGGDDGAT

Query:  ARLLGNIALAAGLTYLSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKFVRDSV
        ARLLGNIALAAGLTYLSVTGQLGW+LDAIVSIWLVAVL+PIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFC+QPFSVVDDKFVRDSV
Subjt:  ARLLGNIALAAGLTYLSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKFVRDSV

Query:  QFSNKTSSTFGQAFSNFTSPRKGKETTGAVVDIEAEVKDVD
         FSNKTSSTFGQAFS+FTSPRKGKET+GAVVDIEAEVKDVD
Subjt:  QFSNKTSSTFGQAFSNFTSPRKGKETTGAVVDIEAEVKDVD

XP_022142897.1 uncharacterized protein LOC111012900 [Momordica charantia]6.5e-10985.42Show/hide
Query:  MTSFIRFSTLYILHDHFCKKPTRFNPLPTSKLASCDRNGSRLRAYGGRLFFLGR--GNEWILGRDGGFKGKRRLIVARFNQGLGFN--GGGGGGDDGATA
        MTS IRFST  ILHDHFCKKPTRF+PLP  K+  C+RNG RLRAYG RL FLG   GNE I G+D GFKG+R LIVARFNQG GFN  GGGGGGDDGATA
Subjt:  MTSFIRFSTLYILHDHFCKKPTRFNPLPTSKLASCDRNGSRLRAYGGRLFFLGR--GNEWILGRDGGFKGKRRLIVARFNQGLGFN--GGGGGGDDGATA

Query:  RLLGNIALAAGLTYLSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKFVRDSVQ
        R+LGNIALA GLTYLSVTGQLGWVLDAIVSIWLVAVL+PIVGVAAFIWWAGRDIVQ TCPNCGN+FQIFKSTLNEELQLCPFC+QPFSVVDDKFVRDSV+
Subjt:  RLLGNIALAAGLTYLSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKFVRDSVQ

Query:  FSNKTSSTFGQAFSNFTSPRKGKETTGAVVDIEAEVKDVD
        FSNKTSSTFGQAFSNFTSP+KGKET+ AVVDIEAEVKDVD
Subjt:  FSNKTSSTFGQAFSNFTSPRKGKETTGAVVDIEAEVKDVD

XP_022985259.1 uncharacterized protein LOC111483302 [Cucurbita maxima]8.0e-10785.48Show/hide
Query:  MTSFIRFSTL-YILHDHFCKKPTRFNPLPTSKLASCDRNGSRLRAYGGRLFFLGRG--NEWILGRDGGFKGKRRLIVARFNQGLGFN--GGGGGGDDGAT
        MTSFIRFS+   +LHD+F KKPTRFNPLP  K+ASCDRNG RLR  GG+L FL  G  N+ +LG+ G FKGKR LIVARFNQG GFN  GGGGGGDDGAT
Subjt:  MTSFIRFSTL-YILHDHFCKKPTRFNPLPTSKLASCDRNGSRLRAYGGRLFFLGRG--NEWILGRDGGFKGKRRLIVARFNQGLGFN--GGGGGGDDGAT

Query:  ARLLGNIALAAGLTYLSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKFVRDSV
        ARLLGNIALAAGLTYLSVTGQLGW+LDAIVSIWLVAVL+PIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFC+QPFSVVDDKFVRDSV
Subjt:  ARLLGNIALAAGLTYLSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKFVRDSV

Query:  QFSNKTSSTFGQAFSNFTSPRKGKETTGAVVDIEAEVKDVD
         FSNKTSSTFGQAFS+FTSPRKGKET+GAVVDIEAEVKDVD
Subjt:  QFSNKTSSTFGQAFSNFTSPRKGKETTGAVVDIEAEVKDVD

XP_023552443.1 uncharacterized protein LOC111810103 [Cucurbita pepo subsp. pepo]2.6e-10583.82Show/hide
Query:  MTSFIRFSTLYILHDHFCKKPTRFNPLPTSKLASCDRNGSRLRAYGGRLFFL--GRGNEWILGRDGGFKGKRRLIVARFNQGLGFNG---GGGGGDDGAT
        MTSFIRFS+  +LHD+F  KPTRFNPLP  K+ASC RNG RLR  GG+L FL  G GN  + G+ G FKG+R LIVARF+QG GFNG   GGGGGDDGAT
Subjt:  MTSFIRFSTLYILHDHFCKKPTRFNPLPTSKLASCDRNGSRLRAYGGRLFFL--GRGNEWILGRDGGFKGKRRLIVARFNQGLGFNG---GGGGGDDGAT

Query:  ARLLGNIALAAGLTYLSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKFVRDSV
        ARLLGNIALAAGLTYLSVTGQLGW+LDAIVSIWLVAVL+PIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFC+QPFSVVDDKFVRDSV
Subjt:  ARLLGNIALAAGLTYLSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKFVRDSV

Query:  QFSNKTSSTFGQAFSNFTSPRKGKETTGAVVDIEAEVKDVD
         FSNKTSSTFGQAFS+FTSPRKGKET+GAVVDIEAEVKDVD
Subjt:  QFSNKTSSTFGQAFSNFTSPRKGKETTGAVVDIEAEVKDVD

XP_038905451.1 uncharacterized protein LOC120091477 isoform X2 [Benincasa hispida]7.2e-10885.48Show/hide
Query:  MTSFIRFSTLYILHDHFCKKPTRFNPLPTSKLASCDRNGSRLRAYGGRLFFLG--RGNEWILGRDGGFKGKRRLIVARFNQGLGFN---GGGGGGDDGAT
        MTSFIRFSTL ILHD FC KPT+FNPLP  K+AS DRNG RLR YG RL FL   RGN  + G+ GG KGKR LIVARFNQG GFN   GGGGGGDDGAT
Subjt:  MTSFIRFSTLYILHDHFCKKPTRFNPLPTSKLASCDRNGSRLRAYGGRLFFLG--RGNEWILGRDGGFKGKRRLIVARFNQGLGFN---GGGGGGDDGAT

Query:  ARLLGNIALAAGLTYLSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKFVRDSV
        ARLLGNIALA GLTYLSVTGQLGWVLDAIVSIWLVAVL+PIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFC+QPFSVVDDKFVRDSV
Subjt:  ARLLGNIALAAGLTYLSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKFVRDSV

Query:  QFSNKTSSTFGQAFSNFTSPRKGKETTGAVVDIEAEVKDVD
        +FSNKTSSTFGQAF +FTSP+KGKET+GAVVDIEAEVKDVD
Subjt:  QFSNKTSSTFGQAFSNFTSPRKGKETTGAVVDIEAEVKDVD

TrEMBL top hitse value%identityAlignment
A0A0A0L5A4 Uncharacterized protein1.4e-10481.71Show/hide
Query:  MTSFIRFSTLYILHDHFCKKPTRFNPLPTSKLASCDRNGSRLRAYGGRLFFLGRG--NEWILGRDGGFKGKR-RLIVARFNQGLGFNGGGG-------GG
        MTSFIRFST  ILH++FC KPT+FNPLP  K+  C  +G RLR YG RL F G G  ++ + G+ GGFKGKR RLIVARFNQG GFNGGGG       GG
Subjt:  MTSFIRFSTLYILHDHFCKKPTRFNPLPTSKLASCDRNGSRLRAYGGRLFFLGRG--NEWILGRDGGFKGKR-RLIVARFNQGLGFNGGGG-------GG

Query:  DDGATARLLGNIALAAGLTYLSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKF
        DDGATARL+GNIALAAGLTYLSVTGQLGWVLDAIVSIWLVAVL+PIVGVAAFIWWAGRDIVQS CPNCGNEFQIFKSTLNEELQLCPFC+QPFSVVDDKF
Subjt:  DDGATARLLGNIALAAGLTYLSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKF

Query:  VRDSVQFSNKTSSTFGQAFSNFTSPRKGKETTGAVVDIEAEVKDVD
        VRDSV+FSNKTSSTFGQAFS+FTSPRKGKET+GAVVDIEAEVKDVD
Subjt:  VRDSVQFSNKTSSTFGQAFSNFTSPRKGKETTGAVVDIEAEVKDVD

A0A5A7TVG5 Uncharacterized protein1.4e-10483.75Show/hide
Query:  MTSFIRFSTLYILHDHFCKKPTRFNPLPTSKLASCDRNGSRLRAYGGRLFFLGRG--NEWILGRDGGFKGKR-RLIVARFNQGLGFNGGGG-GGDDGATA
        MTSFIRFST  ILH++FC KPT+FNPLP  K+  C+ +G RLR YG RLFF G G   + + G+  GFKGKR RLIVARFNQG GFNGGGG GGDDGATA
Subjt:  MTSFIRFSTLYILHDHFCKKPTRFNPLPTSKLASCDRNGSRLRAYGGRLFFLGRG--NEWILGRDGGFKGKR-RLIVARFNQGLGFNGGGG-GGDDGATA

Query:  RLLGNIALAAGLTYLSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKFVRDSVQ
        RLLGNIALAAGLTYLSVTGQLGWVLDAIVSIWLVAVL+PIVGVAAFIWWAGRDIVQS CPNCGNEFQIFKSTLNEELQLCPFC+QPFSVVDDKFVRDSV 
Subjt:  RLLGNIALAAGLTYLSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKFVRDSVQ

Query:  FSNKTSSTFGQAFSNFTSPRKGKETTGAVVDIEAEVKDVD
        FS KTSSTFGQAFS+FTSPRKGKET+GAVVDIEAEVKDVD
Subjt:  FSNKTSSTFGQAFSNFTSPRKGKETTGAVVDIEAEVKDVD

A0A6J1CP68 uncharacterized protein LOC1110129003.2e-10985.42Show/hide
Query:  MTSFIRFSTLYILHDHFCKKPTRFNPLPTSKLASCDRNGSRLRAYGGRLFFLGR--GNEWILGRDGGFKGKRRLIVARFNQGLGFN--GGGGGGDDGATA
        MTS IRFST  ILHDHFCKKPTRF+PLP  K+  C+RNG RLRAYG RL FLG   GNE I G+D GFKG+R LIVARFNQG GFN  GGGGGGDDGATA
Subjt:  MTSFIRFSTLYILHDHFCKKPTRFNPLPTSKLASCDRNGSRLRAYGGRLFFLGR--GNEWILGRDGGFKGKRRLIVARFNQGLGFN--GGGGGGDDGATA

Query:  RLLGNIALAAGLTYLSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKFVRDSVQ
        R+LGNIALA GLTYLSVTGQLGWVLDAIVSIWLVAVL+PIVGVAAFIWWAGRDIVQ TCPNCGN+FQIFKSTLNEELQLCPFC+QPFSVVDDKFVRDSV+
Subjt:  RLLGNIALAAGLTYLSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKFVRDSVQ

Query:  FSNKTSSTFGQAFSNFTSPRKGKETTGAVVDIEAEVKDVD
        FSNKTSSTFGQAFSNFTSP+KGKET+ AVVDIEAEVKDVD
Subjt:  FSNKTSSTFGQAFSNFTSPRKGKETTGAVVDIEAEVKDVD

A0A6J1EMN9 uncharacterized protein LOC1114359811.2e-10584.23Show/hide
Query:  MTSFIRFSTLYILHDHFCKKPTRFNPLPTSKLASCDRNGSRLRAYGGRLFFL--GRGNEWILGRDGGFKGKRRLIVARFNQGLGFNG---GGGGGDDGAT
        MTSFIRFS+  +LHD+F  KPTRFNPLP  K+ASC RNG RLR   G+L FL  G GN  + G+ G FKGKR LIVARFNQG GFNG   GGGGGDDGAT
Subjt:  MTSFIRFSTLYILHDHFCKKPTRFNPLPTSKLASCDRNGSRLRAYGGRLFFL--GRGNEWILGRDGGFKGKRRLIVARFNQGLGFNG---GGGGGDDGAT

Query:  ARLLGNIALAAGLTYLSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKFVRDSV
        ARLLGNIALAAGLTYLSVTGQLGW+LDAIVSIWLVAVL+PIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFC+QPFSVVDDKFVRDSV
Subjt:  ARLLGNIALAAGLTYLSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKFVRDSV

Query:  QFSNKTSSTFGQAFSNFTSPRKGKETTGAVVDIEAEVKDVD
         FSNKTSSTFGQAFS+FTSPRKGKET+GAVVDIEAEVKDVD
Subjt:  QFSNKTSSTFGQAFSNFTSPRKGKETTGAVVDIEAEVKDVD

A0A6J1JCT5 uncharacterized protein LOC1114833023.9e-10785.48Show/hide
Query:  MTSFIRFSTL-YILHDHFCKKPTRFNPLPTSKLASCDRNGSRLRAYGGRLFFLGRG--NEWILGRDGGFKGKRRLIVARFNQGLGFN--GGGGGGDDGAT
        MTSFIRFS+   +LHD+F KKPTRFNPLP  K+ASCDRNG RLR  GG+L FL  G  N+ +LG+ G FKGKR LIVARFNQG GFN  GGGGGGDDGAT
Subjt:  MTSFIRFSTL-YILHDHFCKKPTRFNPLPTSKLASCDRNGSRLRAYGGRLFFLGRG--NEWILGRDGGFKGKRRLIVARFNQGLGFN--GGGGGGDDGAT

Query:  ARLLGNIALAAGLTYLSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKFVRDSV
        ARLLGNIALAAGLTYLSVTGQLGW+LDAIVSIWLVAVL+PIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFC+QPFSVVDDKFVRDSV
Subjt:  ARLLGNIALAAGLTYLSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKFVRDSV

Query:  QFSNKTSSTFGQAFSNFTSPRKGKETTGAVVDIEAEVKDVD
         FSNKTSSTFGQAFS+FTSPRKGKET+GAVVDIEAEVKDVD
Subjt:  QFSNKTSSTFGQAFSNFTSPRKGKETTGAVVDIEAEVKDVD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G27390.1 unknown protein2.3e-5953.88Show/hide
Query:  YILHDHFCKKPT-RFNPLPTSKLASCDRNGSRLRA----YGGRLFFLGRGNEWILGRDGGFKGKRRLIVARFNQGLGFNGGGGGGDDGATARLLGNIALA
        + +HD F K+P+ R +    S   S  +    LRA    +  R   +G G+  +L  +     KRR++  R   GL FN    GGD+    R+LGN+ALA
Subjt:  YILHDHFCKKPT-RFNPLPTSKLASCDRNGSRLRA----YGGRLFFLGRGNEWILGRDGGFKGKRRLIVARFNQGLGFNGGGGGGDDGATARLLGNIALA

Query:  AGLTYLSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKFVRDSVQFSNKTSSTF
         GLTYLS+TGQLGW+LDAIVS+WL+ V++PI+G+ AF WWA RDIVQS CPNCGNEFQIFKS +++E+QLCPFCTQPFSVVDDKFV++ V+FSN+T++ F
Subjt:  AGLTYLSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKFVRDSVQFSNKTSSTF

Query:  GQAFSNFTS-PRKGKETTGAVVDIEAEVKDVD
        GQ  + F+S P+KGK ++ AVVDIEAEV D D
Subjt:  GQAFSNFTS-PRKGKETTGAVVDIEAEVKDVD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAAGTTTTATCAGATTTTCGACCTTGTACATACTACACGATCATTTCTGCAAGAAACCCACCAGATTTAATCCTCTTCCGACTTCGAAGCTTGCCTCCTGCGATCG
AAATGGGTCCCGTTTGAGGGCTTATGGAGGGAGATTGTTCTTTCTGGGTCGGGGTAACGAATGGATTTTGGGAAGAGATGGAGGGTTTAAGGGAAAGAGAAGGCTAATTG
TAGCAAGGTTTAATCAGGGTTTAGGGTTTAATGGCGGCGGCGGTGGCGGAGACGATGGTGCAACCGCGAGGCTTCTGGGAAATATTGCTTTGGCTGCTGGGTTAACTTAC
CTTTCGGTAACAGGGCAGCTTGGCTGGGTTTTGGATGCGATTGTTTCCATTTGGCTTGTTGCAGTTCTTATACCAATTGTCGGTGTAGCCGCTTTTATCTGGTGGGCAGG
ACGAGATATCGTTCAAAGCACTTGCCCGAACTGTGGAAATGAATTTCAAATCTTCAAATCAACTCTGAATGAAGAGCTGCAACTATGCCCTTTCTGTACCCAACCTTTCT
CTGTGGTGGATGACAAGTTTGTGAGGGATTCTGTGCAGTTCTCCAACAAAACTTCTTCCACCTTTGGACAGGCTTTCAGTAACTTTACTTCTCCCAGAAAAGGGAAGGAA
ACCACCGGCGCTGTGGTTGACATAGAAGCAGAGGTAAAAGATGTGGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGACAAGTTTTATCAGATTTTCGACCTTGTACATACTACACGATCATTTCTGCAAGAAACCCACCAGATTTAATCCTCTTCCGACTTCGAAGCTTGCCTCCTGCGATCG
AAATGGGTCCCGTTTGAGGGCTTATGGAGGGAGATTGTTCTTTCTGGGTCGGGGTAACGAATGGATTTTGGGAAGAGATGGAGGGTTTAAGGGAAAGAGAAGGCTAATTG
TAGCAAGGTTTAATCAGGGTTTAGGGTTTAATGGCGGCGGCGGTGGCGGAGACGATGGTGCAACCGCGAGGCTTCTGGGAAATATTGCTTTGGCTGCTGGGTTAACTTAC
CTTTCGGTAACAGGGCAGCTTGGCTGGGTTTTGGATGCGATTGTTTCCATTTGGCTTGTTGCAGTTCTTATACCAATTGTCGGTGTAGCCGCTTTTATCTGGTGGGCAGG
ACGAGATATCGTTCAAAGCACTTGCCCGAACTGTGGAAATGAATTTCAAATCTTCAAATCAACTCTGAATGAAGAGCTGCAACTATGCCCTTTCTGTACCCAACCTTTCT
CTGTGGTGGATGACAAGTTTGTGAGGGATTCTGTGCAGTTCTCCAACAAAACTTCTTCCACCTTTGGACAGGCTTTCAGTAACTTTACTTCTCCCAGAAAAGGGAAGGAA
ACCACCGGCGCTGTGGTTGACATAGAAGCAGAGGTAAAAGATGTGGACTGA
Protein sequenceShow/hide protein sequence
MTSFIRFSTLYILHDHFCKKPTRFNPLPTSKLASCDRNGSRLRAYGGRLFFLGRGNEWILGRDGGFKGKRRLIVARFNQGLGFNGGGGGGDDGATARLLGNIALAAGLTY
LSVTGQLGWVLDAIVSIWLVAVLIPIVGVAAFIWWAGRDIVQSTCPNCGNEFQIFKSTLNEELQLCPFCTQPFSVVDDKFVRDSVQFSNKTSSTFGQAFSNFTSPRKGKE
TTGAVVDIEAEVKDVD