; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004604 (gene) of Snake gourd v1 genome

Gene IDTan0004604
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGUB_WAK_bind domain-containing protein
Genome locationLG01:23741257..23744357
RNA-Seq ExpressionTan0004604
SyntenyTan0004604
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596874.1 hypothetical protein SDJN03_10054, partial [Cucurbita argyrosperma subsp. sororia]3.3e-10377.91Show/hide
Query:  MEISVLSIFFFFFFLLSPISIKAQ--TTTQCSHGSPRIQFPFNFNLSCSSNTTRIHFKTYDSLSVKSISYDKKRLDLVDLNGCVHGAFLKLNLSLTPFRY
        MEIS L  FF  FFLLSPISIKAQ  TTT CSHGSP I  PF FNLSCSSNTTRIHFK+YDSLS+KSISYD+KRLDL+DLNGCVHGAFLKLNL+LTPFRY
Subjt:  MEISVLSIFFFFFFLLSPISIKAQ--TTTQCSHGSPRIQFPFNFNLSCSSNTTRIHFKTYDSLSVKSISYDKKRLDLVDLNGCVHGAFLKLNLSLTPFRY

Query:  FYVVQDYRYLNCTSKLLSPSPPIPCLSRPRDYYVYVVKQM-ETPRFCKEVKRVKIPFEYSPYLDDSSFGLALTWGV-DDQSKTESQTGCFFKATNYQVLS
        FYVV+DY+YLNCTSKL SPSPPIPCLSRPRDYYVYVV++M ETPRFCKEVK+VKIPFEYSPYLDD SFGL+L+WG  DD+ +TES+ GC +KA NY+VL 
Subjt:  FYVVQDYRYLNCTSKLLSPSPPIPCLSRPRDYYVYVVKQM-ETPRFCKEVKRVKIPFEYSPYLDDSSFGLALTWGV-DDQSKTESQTGCFFKATNYQVLS

Query:  ISLLVAMVAISSMVIMKIYHSKKRKCFKEEADKKMFEHSYEALKPGSDE
        + LLVAMV ISS VI+KI  SKK+K  KEE  KK+FEH YE LKPGS+E
Subjt:  ISLLVAMVAISSMVIMKIYHSKKRKCFKEEADKKMFEHSYEALKPGSDE

XP_022938488.1 uncharacterized protein LOC111444707 isoform X1 [Cucurbita moschata]2.1e-10579.44Show/hide
Query:  MEISVLSIFFFFFFLLSPISIKAQ-TTTQCSHGSPRIQFPFNFNLSCSSNTTRIHFKTYDSLSVKSISYDKKRLDLVDLNGCVHGAFLKLNLSLTPFRYF
        MEIS L  FF  FFLLSPISIKAQ TTT CSHGSP I  PF FNLSCSSNTTRIHFK+YDSLS+KSISYD+KRLDL+DLNGCVHGAFLKLNL+LTPFRYF
Subjt:  MEISVLSIFFFFFFLLSPISIKAQ-TTTQCSHGSPRIQFPFNFNLSCSSNTTRIHFKTYDSLSVKSISYDKKRLDLVDLNGCVHGAFLKLNLSLTPFRYF

Query:  YVVQDYRYLNCTSKLLSPSPPIPCLSRPRDYYVYVVKQM-ETPRFCKEVKRVKIPFEYSPYLDDSSFGLALTWGV-DDQSKTESQTGCFFKATNYQVLSI
        YVV+DY+YLNCTSKL SPSPPIPCLSRPRDYYVYVV++M ETPRFCKEVK+VKIPFEYSPYLDD SFGL+L+WG  DD+ +TES+ GC +KA NY+VL +
Subjt:  YVVQDYRYLNCTSKLLSPSPPIPCLSRPRDYYVYVVKQM-ETPRFCKEVKRVKIPFEYSPYLDDSSFGLALTWGV-DDQSKTESQTGCFFKATNYQVLSI

Query:  SLLVAMVAISSMVIMKIYHSKKRKCFKEEADKKMFEHSYEALKPGSDE
         LL AMV ISSMVI+KI HSKK K  KEE  KK+FEH YEALKPGSDE
Subjt:  SLLVAMVAISSMVIMKIYHSKKRKCFKEEADKKMFEHSYEALKPGSDE

XP_023005514.1 uncharacterized protein LOC111498479 isoform X1 [Cucurbita maxima]9.9e-10880.88Show/hide
Query:  MEISVLSIFFFFFFLLSPISIKAQ----TTTQCSHGSPRIQFPFNFNLSCSSNTTRIHFKTYDSLSVKSISYDKKRLDLVDLNGCVHGAFLKLNLSLTPF
        MEIS L  FFF FFLLSPISIKAQ    TTT CSHGSP I  PF FNLSCSSNTTRIHFK+YDSLS+KSISYDKKRLDL+DLNGCVHGAFLKLNLSLTPF
Subjt:  MEISVLSIFFFFFFLLSPISIKAQ----TTTQCSHGSPRIQFPFNFNLSCSSNTTRIHFKTYDSLSVKSISYDKKRLDLVDLNGCVHGAFLKLNLSLTPF

Query:  RYFYVVQDYRYLNCTSKLLSPSPPIPCLSRPRDYYVYVVKQM-ETPRFCKEVKRVKIPFEYSPYLDDSSFGLALTWGV-DDQSKTESQTGCFFKATNYQV
        RYFYVV+DY+YLNCTSKLL+PSPPIPCLSRPRDYYVYVV++M ETPRFCKEVK+VKIPFEYSPYLDD SFGL+LTWG  DD+ +TESQ GC +KA NY+V
Subjt:  RYFYVVQDYRYLNCTSKLLSPSPPIPCLSRPRDYYVYVVKQM-ETPRFCKEVKRVKIPFEYSPYLDDSSFGLALTWGV-DDQSKTESQTGCFFKATNYQV

Query:  LSISLLVAMVAISSMVIMKIYHSKKRKCFKEEADKKMFEHSYEALKPGSDE
        L +SLLVAMV IS MVI+KI HSKK+K  KEEA KK+FEHSYEA+KP SDE
Subjt:  LSISLLVAMVAISSMVIMKIYHSKKRKCFKEEADKKMFEHSYEALKPGSDE

XP_023005515.1 uncharacterized protein LOC111498479 isoform X2 [Cucurbita maxima]1.6e-10580.08Show/hide
Query:  MEISVLSIFFFFFFLLSPISIKAQ----TTTQCSHGSPRIQFPFNFNLSCSSNTTRIHFKTYDSLSVKSISYDKKRLDLVDLNGCVHGAFLKLNLSLTPF
        MEIS L  FFF FFLLSPISIKAQ    TTT CSHGSP I  PF FNLSCSSNTTRIHFK+YDSLS+KSISYDKKRLDL+DLNGCVHGAFLKLNLSLTPF
Subjt:  MEISVLSIFFFFFFLLSPISIKAQ----TTTQCSHGSPRIQFPFNFNLSCSSNTTRIHFKTYDSLSVKSISYDKKRLDLVDLNGCVHGAFLKLNLSLTPF

Query:  RYFYVVQDYRYLNCTSKLLSPSPPIPCLSRPRDYYVYVVKQM-ETPRFCKEVKRVKIPFEYSPYLDDSSFGLALTWGV-DDQSKTESQTGCFFKATNYQV
        RYFYVV+DY+YLNCTSKLL+PSPPIPCLSRPRDYYVYVV++M ETPRFCKEVK+VKIPFEYSPYLDD SFGL+LTWG  DD+ +TESQ GC +KA NY+ 
Subjt:  RYFYVVQDYRYLNCTSKLLSPSPPIPCLSRPRDYYVYVVKQM-ETPRFCKEVKRVKIPFEYSPYLDDSSFGLALTWGV-DDQSKTESQTGCFFKATNYQV

Query:  LSISLLVAMVAISSMVIMKIYHSKKRKCFKEEADKKMFEHSYEALKPGSDE
          +SLLVAMV IS MVI+KI HSKK+K  KEEA KK+FEHSYEA+KP SDE
Subjt:  LSISLLVAMVAISSMVIMKIYHSKKRKCFKEEADKKMFEHSYEALKPGSDE

XP_038905185.1 putative RING-H2 finger protein ATL21A [Benincasa hispida]3.3e-10376.47Show/hide
Query:  MEISVLSIFFFFFFLLSPISIKAQ---TTTQCSHGSPRIQFPFNFNLSCSSNTTRIHFKTYDSLSVKSISYDKKRLDLVDLNGCVHGAFLKLNLSLTPFR
        MEIS+ SIFFFFFFL+SPISIKAQ   + T C+HG P+IQFPF+FN+SCSSNTTRIHFKTYDSLS+KSISYD+KRLDLVDLN CVH AFLKLNL LTPFR
Subjt:  MEISVLSIFFFFFFLLSPISIKAQ---TTTQCSHGSPRIQFPFNFNLSCSSNTTRIHFKTYDSLSVKSISYDKKRLDLVDLNGCVHGAFLKLNLSLTPFR

Query:  YFYVVQDYRYLNCTSKLLS-PSPPIPCLSRPRDYYVYVVK---QMETPRFCKEVKRVKIPFEYSPYLDDSSFGLALTWGVDDQSKTESQTGCFFKATNYQ
        YFYVV+DY YLNCT++L+S  S PIPCLSR  +YYVY V+      TPR CKE+KRVKIPFEYSPYLDD SFGL+LTWG DD +KT+SQ  CFFKATN+Q
Subjt:  YFYVVQDYRYLNCTSKLLS-PSPPIPCLSRPRDYYVYVVK---QMETPRFCKEVKRVKIPFEYSPYLDDSSFGLALTWGVDDQSKTESQTGCFFKATNYQ

Query:  VLSISLLVAMVAISSMVIMKIYHSKKRKCFKEEADKKMFEHSYEALKPGSDEPLV
        V+ ISLLVAMVAI SMV+MKIYHSK +K  KEEA+KKMFEHSYE LK GSDEPLV
Subjt:  VLSISLLVAMVAISSMVIMKIYHSKKRKCFKEEADKKMFEHSYEALKPGSDEPLV

TrEMBL top hitse value%identityAlignment
A0A0A0L6T1 Uncharacterized protein1.0e-8970.11Show/hide
Query:  ISVLSIFFFFFFLLSPISIKAQ----TTTQCSHGSPRIQFPFNFNLSCSSNTTRIHFKTYDSLSVKSISYDKKRLDLVDLNGCVHGAFLKLNLSLTPFRY
        IS+LS  FFFFFL+SPIS  +Q    +TT C+HG P+IQFPF+FNLSCSSNTTRIHFKTYDSLS+KSISYD+KRLDL+DLN CVH AFL L+LSLTPFRY
Subjt:  ISVLSIFFFFFFLLSPISIKAQ----TTTQCSHGSPRIQFPFNFNLSCSSNTTRIHFKTYDSLSVKSISYDKKRLDLVDLNGCVHGAFLKLNLSLTPFRY

Query:  FYVVQDYRYLNCTSKLL-SPSPPIPCLSRPRDYYVYVVKQ----METPRFCKEVKRVKIPFEYSPYLDDSSFGLALTWGVDDQSKTESQTGCFFKATNYQ
        FYVV+DY YLNCT++L+ S S  IPCLSR  +YYVYVVK        PRFCKEVKRVKIPFEYSPYLDD SFGLALTWG DDQ+KT+SQ  CFFKAT++Q
Subjt:  FYVVQDYRYLNCTSKLL-SPSPPIPCLSRPRDYYVYVVKQ----METPRFCKEVKRVKIPFEYSPYLDDSSFGLALTWGVDDQSKTESQTGCFFKATNYQ

Query:  VLSISLL---VAMVAISSMVIM--KIYHSKKRKCFKEEADKKMFEH-SYEALKPGSDEPLV
        V+ ISLL   VAMVAI +MV+M  K Y SK +   KEE +KKMFEH SYE LK  S++PL+
Subjt:  VLSISLL---VAMVAISSMVIM--KIYHSKKRKCFKEEADKKMFEH-SYEALKPGSDEPLV

A0A6J1FDB4 uncharacterized protein LOC111444707 isoform X21.6e-10378.63Show/hide
Query:  MEISVLSIFFFFFFLLSPISIKAQ-TTTQCSHGSPRIQFPFNFNLSCSSNTTRIHFKTYDSLSVKSISYDKKRLDLVDLNGCVHGAFLKLNLSLTPFRYF
        MEIS L  FF  FFLLSPISIKAQ TTT CSHGSP I  PF FNLSCSSNTTRIHFK+YDSLS+KSISYD+KRLDL+DLNGCVHGAFLKLNL+LTPFRYF
Subjt:  MEISVLSIFFFFFFLLSPISIKAQ-TTTQCSHGSPRIQFPFNFNLSCSSNTTRIHFKTYDSLSVKSISYDKKRLDLVDLNGCVHGAFLKLNLSLTPFRYF

Query:  YVVQDYRYLNCTSKLLSPSPPIPCLSRPRDYYVYVVKQM-ETPRFCKEVKRVKIPFEYSPYLDDSSFGLALTWGV-DDQSKTESQTGCFFKATNYQVLSI
        YVV+DY+YLNCTSKL SPSPPIPCLSRPRDYYVYVV++M ETPRFCKEVK+VKIPFEYSPYLDD SFGL+L+WG  DD+ +TES+ GC +KA NY+   +
Subjt:  YVVQDYRYLNCTSKLLSPSPPIPCLSRPRDYYVYVVKQM-ETPRFCKEVKRVKIPFEYSPYLDDSSFGLALTWGV-DDQSKTESQTGCFFKATNYQVLSI

Query:  SLLVAMVAISSMVIMKIYHSKKRKCFKEEADKKMFEHSYEALKPGSDE
         LL AMV ISSMVI+KI HSKK K  KEE  KK+FEH YEALKPGSDE
Subjt:  SLLVAMVAISSMVIMKIYHSKKRKCFKEEADKKMFEHSYEALKPGSDE

A0A6J1FJ12 uncharacterized protein LOC111444707 isoform X11.0e-10579.44Show/hide
Query:  MEISVLSIFFFFFFLLSPISIKAQ-TTTQCSHGSPRIQFPFNFNLSCSSNTTRIHFKTYDSLSVKSISYDKKRLDLVDLNGCVHGAFLKLNLSLTPFRYF
        MEIS L  FF  FFLLSPISIKAQ TTT CSHGSP I  PF FNLSCSSNTTRIHFK+YDSLS+KSISYD+KRLDL+DLNGCVHGAFLKLNL+LTPFRYF
Subjt:  MEISVLSIFFFFFFLLSPISIKAQ-TTTQCSHGSPRIQFPFNFNLSCSSNTTRIHFKTYDSLSVKSISYDKKRLDLVDLNGCVHGAFLKLNLSLTPFRYF

Query:  YVVQDYRYLNCTSKLLSPSPPIPCLSRPRDYYVYVVKQM-ETPRFCKEVKRVKIPFEYSPYLDDSSFGLALTWGV-DDQSKTESQTGCFFKATNYQVLSI
        YVV+DY+YLNCTSKL SPSPPIPCLSRPRDYYVYVV++M ETPRFCKEVK+VKIPFEYSPYLDD SFGL+L+WG  DD+ +TES+ GC +KA NY+VL +
Subjt:  YVVQDYRYLNCTSKLLSPSPPIPCLSRPRDYYVYVVKQM-ETPRFCKEVKRVKIPFEYSPYLDDSSFGLALTWGV-DDQSKTESQTGCFFKATNYQVLSI

Query:  SLLVAMVAISSMVIMKIYHSKKRKCFKEEADKKMFEHSYEALKPGSDE
         LL AMV ISSMVI+KI HSKK K  KEE  KK+FEH YEALKPGSDE
Subjt:  SLLVAMVAISSMVIMKIYHSKKRKCFKEEADKKMFEHSYEALKPGSDE

A0A6J1KXL9 uncharacterized protein LOC111498479 isoform X27.6e-10680.08Show/hide
Query:  MEISVLSIFFFFFFLLSPISIKAQ----TTTQCSHGSPRIQFPFNFNLSCSSNTTRIHFKTYDSLSVKSISYDKKRLDLVDLNGCVHGAFLKLNLSLTPF
        MEIS L  FFF FFLLSPISIKAQ    TTT CSHGSP I  PF FNLSCSSNTTRIHFK+YDSLS+KSISYDKKRLDL+DLNGCVHGAFLKLNLSLTPF
Subjt:  MEISVLSIFFFFFFLLSPISIKAQ----TTTQCSHGSPRIQFPFNFNLSCSSNTTRIHFKTYDSLSVKSISYDKKRLDLVDLNGCVHGAFLKLNLSLTPF

Query:  RYFYVVQDYRYLNCTSKLLSPSPPIPCLSRPRDYYVYVVKQM-ETPRFCKEVKRVKIPFEYSPYLDDSSFGLALTWGV-DDQSKTESQTGCFFKATNYQV
        RYFYVV+DY+YLNCTSKLL+PSPPIPCLSRPRDYYVYVV++M ETPRFCKEVK+VKIPFEYSPYLDD SFGL+LTWG  DD+ +TESQ GC +KA NY+ 
Subjt:  RYFYVVQDYRYLNCTSKLLSPSPPIPCLSRPRDYYVYVVKQM-ETPRFCKEVKRVKIPFEYSPYLDDSSFGLALTWGV-DDQSKTESQTGCFFKATNYQV

Query:  LSISLLVAMVAISSMVIMKIYHSKKRKCFKEEADKKMFEHSYEALKPGSDE
          +SLLVAMV IS MVI+KI HSKK+K  KEEA KK+FEHSYEA+KP SDE
Subjt:  LSISLLVAMVAISSMVIMKIYHSKKRKCFKEEADKKMFEHSYEALKPGSDE

A0A6J1KZE3 uncharacterized protein LOC111498479 isoform X14.8e-10880.88Show/hide
Query:  MEISVLSIFFFFFFLLSPISIKAQ----TTTQCSHGSPRIQFPFNFNLSCSSNTTRIHFKTYDSLSVKSISYDKKRLDLVDLNGCVHGAFLKLNLSLTPF
        MEIS L  FFF FFLLSPISIKAQ    TTT CSHGSP I  PF FNLSCSSNTTRIHFK+YDSLS+KSISYDKKRLDL+DLNGCVHGAFLKLNLSLTPF
Subjt:  MEISVLSIFFFFFFLLSPISIKAQ----TTTQCSHGSPRIQFPFNFNLSCSSNTTRIHFKTYDSLSVKSISYDKKRLDLVDLNGCVHGAFLKLNLSLTPF

Query:  RYFYVVQDYRYLNCTSKLLSPSPPIPCLSRPRDYYVYVVKQM-ETPRFCKEVKRVKIPFEYSPYLDDSSFGLALTWGV-DDQSKTESQTGCFFKATNYQV
        RYFYVV+DY+YLNCTSKLL+PSPPIPCLSRPRDYYVYVV++M ETPRFCKEVK+VKIPFEYSPYLDD SFGL+LTWG  DD+ +TESQ GC +KA NY+V
Subjt:  RYFYVVQDYRYLNCTSKLLSPSPPIPCLSRPRDYYVYVVKQM-ETPRFCKEVKRVKIPFEYSPYLDDSSFGLALTWGV-DDQSKTESQTGCFFKATNYQV

Query:  LSISLLVAMVAISSMVIMKIYHSKKRKCFKEEADKKMFEHSYEALKPGSDE
        L +SLLVAMV IS MVI+KI HSKK+K  KEEA KK+FEHSYEA+KP SDE
Subjt:  LSISLLVAMVAISSMVIMKIYHSKKRKCFKEEADKKMFEHSYEALKPGSDE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAATCTCAGTTCTCTCCATCTTCTTCTTCTTCTTCTTCCTCCTCTCTCCCATCTCCATTAAAGCCCAAACAACAACTCAGTGCAGCCATGGCAGTCCAAGAATCCA
ATTCCCTTTCAACTTCAACCTTTCTTGCTCATCAAACACCACAAGAATCCACTTCAAAACCTATGATTCACTTTCAGTCAAATCCATTTCCTATGATAAAAAAAGACTTG
ATCTTGTAGACCTCAATGGCTGCGTCCACGGCGCCTTTCTCAAACTAAACCTCTCCCTCACCCCATTTCGCTACTTCTACGTCGTCCAAGATTACCGATACCTTAACTGC
ACGTCGAAATTGTTGTCACCGTCACCGCCGATACCGTGCCTCAGCCGACCCAGGGATTACTATGTCTATGTTGTGAAGCAAATGGAAACACCAAGATTTTGCAAGGAAGT
GAAGAGAGTGAAGATCCCATTTGAGTATAGTCCTTATCTTGATGATAGTTCTTTTGGACTTGCCTTAACTTGGGGGGTTGATGATCAAAGTAAAACAGAGTCCCAAACAG
GGTGTTTCTTTAAAGCAACAAATTATCAAGTGCTTAGCATTAGCTTGCTTGTAGCCATGGTGGCAATATCATCCATGGTGATCATGAAGATATATCACTCAAAAAAACGA
AAATGTTTTAAGGAAGAAGCTGACAAAAAGATGTTTGAACATTCATATGAAGCACTCAAACCTGGCTCAGATGAGCCTTTAGTTTGA
mRNA sequenceShow/hide mRNA sequence
TAATTGTTGAACTAAAGAAAAAACCTTAAAAAATTCAAGAACTAAACTGGTAATTTAACTATCTAATCTATATAAACTTAACCCCTAAGTCAAGACAATCTCCCAAAAGA
AAATTCCCAATTTACTTCTCCCTCAAAAGAACCAAAAAAGAGGAATCCCCATCAATCTATATGGCTGCCTAAATCTTTCTTTGCTTTCAAGAGTTTTAACACTAAAACTC
TTTCTCTTTCTCTCTCTCTGTTTCATTCCTTTTACCTCTTCACCATGGAAATCTCAGTTCTCTCCATCTTCTTCTTCTTCTTCTTCCTCCTCTCTCCCATCTCCATTAAA
GCCCAAACAACAACTCAGTGCAGCCATGGCAGTCCAAGAATCCAATTCCCTTTCAACTTCAACCTTTCTTGCTCATCAAACACCACAAGAATCCACTTCAAAACCTATGA
TTCACTTTCAGTCAAATCCATTTCCTATGATAAAAAAAGACTTGATCTTGTAGACCTCAATGGCTGCGTCCACGGCGCCTTTCTCAAACTAAACCTCTCCCTCACCCCAT
TTCGCTACTTCTACGTCGTCCAAGATTACCGATACCTTAACTGCACGTCGAAATTGTTGTCACCGTCACCGCCGATACCGTGCCTCAGCCGACCCAGGGATTACTATGTC
TATGTTGTGAAGCAAATGGAAACACCAAGATTTTGCAAGGAAGTGAAGAGAGTGAAGATCCCATTTGAGTATAGTCCTTATCTTGATGATAGTTCTTTTGGACTTGCCTT
AACTTGGGGGGTTGATGATCAAAGTAAAACAGAGTCCCAAACAGGGTGTTTCTTTAAAGCAACAAATTATCAAGTGCTTAGCATTAGCTTGCTTGTAGCCATGGTGGCAA
TATCATCCATGGTGATCATGAAGATATATCACTCAAAAAAACGAAAATGTTTTAAGGAAGAAGCTGACAAAAAGATGTTTGAACATTCATATGAAGCACTCAAACCTGGC
TCAGATGAGCCTTTAGTTTGAATTTTTTTTTAAGTTTGTGGGGGTGGAATTTGAACCTCTAATTCTTAGTTGAGGGTATATACCTTAACTAATTAAACTATAGTCGGATT
AGTTAGTTTGAGCCAATATTTTCATATGATTTCTCTCAATCACCAAAAACTTTAATTATAGCTTTTCGCTTAGCTTGCTACACACGTCTGTAGTTCACACAATATCCAAA
ATCTTTTTTAATAGTTTCCTACATCACGAATATTTTAACACCAA
Protein sequenceShow/hide protein sequence
MEISVLSIFFFFFFLLSPISIKAQTTTQCSHGSPRIQFPFNFNLSCSSNTTRIHFKTYDSLSVKSISYDKKRLDLVDLNGCVHGAFLKLNLSLTPFRYFYVVQDYRYLNC
TSKLLSPSPPIPCLSRPRDYYVYVVKQMETPRFCKEVKRVKIPFEYSPYLDDSSFGLALTWGVDDQSKTESQTGCFFKATNYQVLSISLLVAMVAISSMVIMKIYHSKKR
KCFKEEADKKMFEHSYEALKPGSDEPLV