; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009855 (gene) of Snake gourd v1 genome

Gene IDTan0009855
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSWI/SNF complex subunit SWI3C like
Genome locationLG07:64432993..64437127
RNA-Seq ExpressionTan0009855
SyntenyTan0009855
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141102.1 uncharacterized protein LOC101216879 [Cucumis sativus]1.1e-9974.44Show/hide
Query:  MAFAAQFLRPLPRACVFA--SSSSSSSSFFNSARFCCSKFEPLKSFPKNFRSLLCNRVSNLRLAFSGANGIYLWKSHLIGSQFSKGTILGASVVIGSISL
        MAFA++FLRPLPRA VFA  SSSSSSSSF N ARFCC+KFEP K FP NF S LCNR+ NLRLAFSGA GIYL    L+GSQ SK TILG SVV+GSI+ 
Subjt:  MAFAAQFLRPLPRACVFA--SSSSSSSSFFNSARFCCSKFEPLKSFPKNFRSLLCNRVSNLRLAFSGANGIYLWKSHLIGSQFSKGTILGASVVIGSISL

Query:  WPNVSLAMDDRFVDDVQEDLGASDYVKSAKIFWELALKLWLPFLFCWTVLINLNHPVLVVGKVVLFLLSTKPSPLSVYIFVEQLRSGSSQAFRLSKRTEC
        WPN S AM+DR +D  QE +  S YVKS K  WELAL+LWLPFL CWTVLINLNHP+ VVGKVVLFL+STKPSPLSVYIFV++LRS SS    LS   + 
Subjt:  WPNVSLAMDDRFVDDVQEDLGASDYVKSAKIFWELALKLWLPFLFCWTVLINLNHPVLVVGKVVLFLLSTKPSPLSVYIFVEQLRSGSSQAFRLSKRTEC

Query:  LVARKVEVEDYKLLCVAKVEMKLEKFTLVGVLGGWWKWPPLSSADEFIAFMDKLAFLSHRLKSIII
        LVARKVEVEDYK+LCVAKVEMK + FT+VGVLGGWWKWPPLSS DEFIAFMDKLA L+HRLKSI+I
Subjt:  LVARKVEVEDYKLLCVAKVEMKLEKFTLVGVLGGWWKWPPLSSADEFIAFMDKLAFLSHRLKSIII

XP_008443551.1 PREDICTED: uncharacterized protein LOC103487112 isoform X1 [Cucumis melo]3.2e-10479.15Show/hide
Query:  MAFAAQFLRPLPRACVFASSSSSSSSFFNSARFCCSKFEPLKSFPKNFRSLLCNRVSNLRLAFSGANGIYLWKSHLIGSQFSKGTILGASVVIGSISLWP
        MAFA++FLRPLPRA VFASSSSSSSS FN ARFCC+KFEP   FP NF S LCNR+ NLRLAFSGA GIYL    LIGSQFSK TILG SVV+GSIS WP
Subjt:  MAFAAQFLRPLPRACVFASSSSSSSSFFNSARFCCSKFEPLKSFPKNFRSLLCNRVSNLRLAFSGANGIYLWKSHLIGSQFSKGTILGASVVIGSISLWP

Query:  NVSLAMDDRFVDDVQEDLGASDYVKSAKIFWELALKLWLPFLFCWTVLINLNHPVLVVGKVVLFLLSTKPSPLSVYIFVEQLRSGSSQAFRLSKRTECLV
        N SLAMDDR +D  QED+  SDYVKSAK FWELAL+LWLPFL CWTVLINLNHP++VVGKVVLFL+STKPSPLSVYIFVE+LRS SSQ   LS   + LV
Subjt:  NVSLAMDDRFVDDVQEDLGASDYVKSAKIFWELALKLWLPFLFCWTVLINLNHPVLVVGKVVLFLLSTKPSPLSVYIFVEQLRSGSSQAFRLSKRTECLV

Query:  ARKVEVEDYKLLCVAKVEMKLEKFTLVGVLGGWWKWPPLSSADEFIAFMDKLAFLSHRL
        ARKVEVEDYK+LCVAKVEMK + FTLVGVLGGWWKWPPLSS DEFIAFMDKLA L+HRL
Subjt:  ARKVEVEDYKLLCVAKVEMKLEKFTLVGVLGGWWKWPPLSSADEFIAFMDKLAFLSHRL

XP_008443552.1 PREDICTED: uncharacterized protein LOC103487112 isoform X2 [Cucumis melo]1.5e-8568.73Show/hide
Query:  MAFAAQFLRPLPRACVFASSSSSSSSFFNSARFCCSKFEPLKSFPKNFRSLLCNRVSNLRLAFSGANGIYLWKSHLIGSQFSKGTILGASVVIGSISLWP
        MAFA++FLRPLPRA VFASSSSSSSS FN ARFCC+KFEP   FP NF S LCNR+ NLRLAFSGA GIY                              
Subjt:  MAFAAQFLRPLPRACVFASSSSSSSSFFNSARFCCSKFEPLKSFPKNFRSLLCNRVSNLRLAFSGANGIYLWKSHLIGSQFSKGTILGASVVIGSISLWP

Query:  NVSLAMDDRFVDDVQEDLGASDYVKSAKIFWELALKLWLPFLFCWTVLINLNHPVLVVGKVVLFLLSTKPSPLSVYIFVEQLRSGSSQAFRLSKRTECLV
           L + DR +D  QED+  SDYVKSAK FWELAL+LWLPFL CWTVLINLNHP++VVGKVVLFL+STKPSPLSVYIFVE+LRS SSQ   LS   + LV
Subjt:  NVSLAMDDRFVDDVQEDLGASDYVKSAKIFWELALKLWLPFLFCWTVLINLNHPVLVVGKVVLFLLSTKPSPLSVYIFVEQLRSGSSQAFRLSKRTECLV

Query:  ARKVEVEDYKLLCVAKVEMKLEKFTLVGVLGGWWKWPPLSSADEFIAFMDKLAFLSHRL
        ARKVEVEDYK+LCVAKVEMK + FTLVGVLGGWWKWPPLSS DEFIAFMDKLA L+HRL
Subjt:  ARKVEVEDYKLLCVAKVEMKLEKFTLVGVLGGWWKWPPLSSADEFIAFMDKLAFLSHRL

XP_022152133.1 uncharacterized protein LOC111019921 [Momordica charantia]1.4e-8369.02Show/hide
Query:  MAFAAQFLRPLPRACVFA-SSSSSSSSFFNSARFCCSKFEPLKSFPKNFRSLLCNRVSNLRLAFSGANGIYL-WKSHLIGSQFSKGTILGASVVIGSISL
        MA  A+ LRPLPRA VFA SSSSSSSSFFNSARF C++FEP KSFPKNF SLLCNRVSN R  F  ++ IYL  K  L+GSQ S+GTILGASVV GSISL
Subjt:  MAFAAQFLRPLPRACVFA-SSSSSSSSFFNSARFCCSKFEPLKSFPKNFRSLLCNRVSNLRLAFSGANGIYL-WKSHLIGSQFSKGTILGASVVIGSISL

Query:  WPNVSLAMDDRFVDDVQEDLGASDYVKSAKIFWELALKLWLPFLFCWTVLINLNHPVLVVGKVVLFLLSTKPSPLSVYIFVEQLRSGSSQAFRLSKRTEC
        WPNVSLAMDD F+DD Q+DL A D  K     WELAL+LWLPFLFCWTVL NLNHP+LV  KV+LFLLSTKPSPLSVYIFVEQL   S Q  RLS   +C
Subjt:  WPNVSLAMDDRFVDDVQEDLGASDYVKSAKIFWELALKLWLPFLFCWTVLINLNHPVLVVGKVVLFLLSTKPSPLSVYIFVEQLRSGSSQAFRLSKRTEC

Query:  LVARKVEVEDYKLLCVAKVEMKLEKFTLVGVLGGWWKWPPLS-SADEFIAFMDKL
        +VA KVEV+DYK+ CVA+VE++ +K TL+G+LGGWW+ PP S +A +F  F+DKL
Subjt:  LVARKVEVEDYKLLCVAKVEMKLEKFTLVGVLGGWWKWPPLS-SADEFIAFMDKL

XP_038903549.1 uncharacterized protein LOC120090109 [Benincasa hispida]5.9e-11483.21Show/hide
Query:  MAFAAQFLRPLPRACVFASSSSSSSSFFNSARFCCSKFEPLKSFPKNFRSLLCNRVSNLRLAFSGANGIYLWKSHLIGSQFSKGTILGASVVIGSISLWP
        MAFA +FLRPLPRA VFA  SSSSSSFFN  RFCC KF+  KSFP NF SLLCNR+SN RLAFSGANGIYL KS LIGSQFSKGTILGASVV GSISLWP
Subjt:  MAFAAQFLRPLPRACVFASSSSSSSSFFNSARFCCSKFEPLKSFPKNFRSLLCNRVSNLRLAFSGANGIYLWKSHLIGSQFSKGTILGASVVIGSISLWP

Query:  NVSLAMDDRFVDDVQEDLGASDYVKSAKIFWELALKLWLPFLFCWTVLINLNHPVLVVGKVVLFLLSTKPSPLSVYIFVEQLRSGSSQAFRLSKRTECLV
        N SLAMDDR +D  QEDL ASDYVKSAKIFWELAL+LWLPFL CWTVLINLNHP+LVVGKVVLFL+STKPSPLSVYIFVE+LRS SSQ   LS   +CL 
Subjt:  NVSLAMDDRFVDDVQEDLGASDYVKSAKIFWELALKLWLPFLFCWTVLINLNHPVLVVGKVVLFLLSTKPSPLSVYIFVEQLRSGSSQAFRLSKRTECLV

Query:  ARKVEVEDYKLLCVAKVEMKLEKFTLVGVLGGWWKWPPLSSADEFIAFMDKLAFLSHRLKSI
        ARKVEVEDYKLLCVAKVEMK +KFTLVG+LGGWWKWPP SS DEFIAFMDKLAFL+HRLKSI
Subjt:  ARKVEVEDYKLLCVAKVEMKLEKFTLVGVLGGWWKWPPLSSADEFIAFMDKLAFLSHRLKSI

TrEMBL top hitse value%identityAlignment
A0A0A0LI06 Uncharacterized protein5.2e-10074.44Show/hide
Query:  MAFAAQFLRPLPRACVFA--SSSSSSSSFFNSARFCCSKFEPLKSFPKNFRSLLCNRVSNLRLAFSGANGIYLWKSHLIGSQFSKGTILGASVVIGSISL
        MAFA++FLRPLPRA VFA  SSSSSSSSF N ARFCC+KFEP K FP NF S LCNR+ NLRLAFSGA GIYL    L+GSQ SK TILG SVV+GSI+ 
Subjt:  MAFAAQFLRPLPRACVFA--SSSSSSSSFFNSARFCCSKFEPLKSFPKNFRSLLCNRVSNLRLAFSGANGIYLWKSHLIGSQFSKGTILGASVVIGSISL

Query:  WPNVSLAMDDRFVDDVQEDLGASDYVKSAKIFWELALKLWLPFLFCWTVLINLNHPVLVVGKVVLFLLSTKPSPLSVYIFVEQLRSGSSQAFRLSKRTEC
        WPN S AM+DR +D  QE +  S YVKS K  WELAL+LWLPFL CWTVLINLNHP+ VVGKVVLFL+STKPSPLSVYIFV++LRS SS    LS   + 
Subjt:  WPNVSLAMDDRFVDDVQEDLGASDYVKSAKIFWELALKLWLPFLFCWTVLINLNHPVLVVGKVVLFLLSTKPSPLSVYIFVEQLRSGSSQAFRLSKRTEC

Query:  LVARKVEVEDYKLLCVAKVEMKLEKFTLVGVLGGWWKWPPLSSADEFIAFMDKLAFLSHRLKSIII
        LVARKVEVEDYK+LCVAKVEMK + FT+VGVLGGWWKWPPLSS DEFIAFMDKLA L+HRLKSI+I
Subjt:  LVARKVEVEDYKLLCVAKVEMKLEKFTLVGVLGGWWKWPPLSSADEFIAFMDKLAFLSHRLKSIII

A0A1S3B8A6 uncharacterized protein LOC103487112 isoform X11.6e-10479.15Show/hide
Query:  MAFAAQFLRPLPRACVFASSSSSSSSFFNSARFCCSKFEPLKSFPKNFRSLLCNRVSNLRLAFSGANGIYLWKSHLIGSQFSKGTILGASVVIGSISLWP
        MAFA++FLRPLPRA VFASSSSSSSS FN ARFCC+KFEP   FP NF S LCNR+ NLRLAFSGA GIYL    LIGSQFSK TILG SVV+GSIS WP
Subjt:  MAFAAQFLRPLPRACVFASSSSSSSSFFNSARFCCSKFEPLKSFPKNFRSLLCNRVSNLRLAFSGANGIYLWKSHLIGSQFSKGTILGASVVIGSISLWP

Query:  NVSLAMDDRFVDDVQEDLGASDYVKSAKIFWELALKLWLPFLFCWTVLINLNHPVLVVGKVVLFLLSTKPSPLSVYIFVEQLRSGSSQAFRLSKRTECLV
        N SLAMDDR +D  QED+  SDYVKSAK FWELAL+LWLPFL CWTVLINLNHP++VVGKVVLFL+STKPSPLSVYIFVE+LRS SSQ   LS   + LV
Subjt:  NVSLAMDDRFVDDVQEDLGASDYVKSAKIFWELALKLWLPFLFCWTVLINLNHPVLVVGKVVLFLLSTKPSPLSVYIFVEQLRSGSSQAFRLSKRTECLV

Query:  ARKVEVEDYKLLCVAKVEMKLEKFTLVGVLGGWWKWPPLSSADEFIAFMDKLAFLSHRL
        ARKVEVEDYK+LCVAKVEMK + FTLVGVLGGWWKWPPLSS DEFIAFMDKLA L+HRL
Subjt:  ARKVEVEDYKLLCVAKVEMKLEKFTLVGVLGGWWKWPPLSSADEFIAFMDKLAFLSHRL

A0A1S3B8D2 uncharacterized protein LOC103487112 isoform X27.3e-8668.73Show/hide
Query:  MAFAAQFLRPLPRACVFASSSSSSSSFFNSARFCCSKFEPLKSFPKNFRSLLCNRVSNLRLAFSGANGIYLWKSHLIGSQFSKGTILGASVVIGSISLWP
        MAFA++FLRPLPRA VFASSSSSSSS FN ARFCC+KFEP   FP NF S LCNR+ NLRLAFSGA GIY                              
Subjt:  MAFAAQFLRPLPRACVFASSSSSSSSFFNSARFCCSKFEPLKSFPKNFRSLLCNRVSNLRLAFSGANGIYLWKSHLIGSQFSKGTILGASVVIGSISLWP

Query:  NVSLAMDDRFVDDVQEDLGASDYVKSAKIFWELALKLWLPFLFCWTVLINLNHPVLVVGKVVLFLLSTKPSPLSVYIFVEQLRSGSSQAFRLSKRTECLV
           L + DR +D  QED+  SDYVKSAK FWELAL+LWLPFL CWTVLINLNHP++VVGKVVLFL+STKPSPLSVYIFVE+LRS SSQ   LS   + LV
Subjt:  NVSLAMDDRFVDDVQEDLGASDYVKSAKIFWELALKLWLPFLFCWTVLINLNHPVLVVGKVVLFLLSTKPSPLSVYIFVEQLRSGSSQAFRLSKRTECLV

Query:  ARKVEVEDYKLLCVAKVEMKLEKFTLVGVLGGWWKWPPLSSADEFIAFMDKLAFLSHRL
        ARKVEVEDYK+LCVAKVEMK + FTLVGVLGGWWKWPPLSS DEFIAFMDKLA L+HRL
Subjt:  ARKVEVEDYKLLCVAKVEMKLEKFTLVGVLGGWWKWPPLSSADEFIAFMDKLAFLSHRL

A0A5A7UGM7 Uncharacterized protein1.6e-10479.15Show/hide
Query:  MAFAAQFLRPLPRACVFASSSSSSSSFFNSARFCCSKFEPLKSFPKNFRSLLCNRVSNLRLAFSGANGIYLWKSHLIGSQFSKGTILGASVVIGSISLWP
        MAFA++FLRPLPRA VFASSSSSSSS FN ARFCC+KFEP   FP NF S LCNR+ NLRLAFSGA GIYL    LIGSQFSK TILG SVV+GSIS WP
Subjt:  MAFAAQFLRPLPRACVFASSSSSSSSFFNSARFCCSKFEPLKSFPKNFRSLLCNRVSNLRLAFSGANGIYLWKSHLIGSQFSKGTILGASVVIGSISLWP

Query:  NVSLAMDDRFVDDVQEDLGASDYVKSAKIFWELALKLWLPFLFCWTVLINLNHPVLVVGKVVLFLLSTKPSPLSVYIFVEQLRSGSSQAFRLSKRTECLV
        N SLAMDDR +D  QED+  SDYVKSAK FWELAL+LWLPFL CWTVLINLNHP++VVGKVVLFL+STKPSPLSVYIFVE+LRS SSQ   LS   + LV
Subjt:  NVSLAMDDRFVDDVQEDLGASDYVKSAKIFWELALKLWLPFLFCWTVLINLNHPVLVVGKVVLFLLSTKPSPLSVYIFVEQLRSGSSQAFRLSKRTECLV

Query:  ARKVEVEDYKLLCVAKVEMKLEKFTLVGVLGGWWKWPPLSSADEFIAFMDKLAFLSHRL
        ARKVEVEDYK+LCVAKVEMK + FTLVGVLGGWWKWPPLSS DEFIAFMDKLA L+HRL
Subjt:  ARKVEVEDYKLLCVAKVEMKLEKFTLVGVLGGWWKWPPLSSADEFIAFMDKLAFLSHRL

A0A6J1DE17 uncharacterized protein LOC1110199216.9e-8469.02Show/hide
Query:  MAFAAQFLRPLPRACVFA-SSSSSSSSFFNSARFCCSKFEPLKSFPKNFRSLLCNRVSNLRLAFSGANGIYL-WKSHLIGSQFSKGTILGASVVIGSISL
        MA  A+ LRPLPRA VFA SSSSSSSSFFNSARF C++FEP KSFPKNF SLLCNRVSN R  F  ++ IYL  K  L+GSQ S+GTILGASVV GSISL
Subjt:  MAFAAQFLRPLPRACVFA-SSSSSSSSFFNSARFCCSKFEPLKSFPKNFRSLLCNRVSNLRLAFSGANGIYL-WKSHLIGSQFSKGTILGASVVIGSISL

Query:  WPNVSLAMDDRFVDDVQEDLGASDYVKSAKIFWELALKLWLPFLFCWTVLINLNHPVLVVGKVVLFLLSTKPSPLSVYIFVEQLRSGSSQAFRLSKRTEC
        WPNVSLAMDD F+DD Q+DL A D  K     WELAL+LWLPFLFCWTVL NLNHP+LV  KV+LFLLSTKPSPLSVYIFVEQL   S Q  RLS   +C
Subjt:  WPNVSLAMDDRFVDDVQEDLGASDYVKSAKIFWELALKLWLPFLFCWTVLINLNHPVLVVGKVVLFLLSTKPSPLSVYIFVEQLRSGSSQAFRLSKRTEC

Query:  LVARKVEVEDYKLLCVAKVEMKLEKFTLVGVLGGWWKWPPLS-SADEFIAFMDKL
        +VA KVEV+DYK+ CVA+VE++ +K TL+G+LGGWW+ PP S +A +F  F+DKL
Subjt:  LVARKVEVEDYKLLCVAKVEMKLEKFTLVGVLGGWWKWPPLS-SADEFIAFMDKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTTTGCAGCACAGTTTCTTCGACCGCTCCCAAGAGCTTGCGTTTTCGCTTCGTCTTCGTCTTCATCTTCTTCGTTCTTCAATTCCGCCAGATTCTGTTGCTCGAA
ATTTGAGCCCTTGAAATCGTTTCCGAAGAACTTCAGATCATTACTCTGCAACCGTGTTTCCAATCTCAGACTTGCATTTTCTGGCGCTAACGGAATTTATCTGTGGAAAT
CTCATTTGATAGGAAGTCAGTTTAGTAAGGGAACCATTTTGGGTGCATCAGTTGTAATTGGATCAATCAGTTTGTGGCCCAATGTTTCATTGGCTATGGATGACAGATTT
GTGGATGATGTCCAAGAGGATTTAGGTGCTTCAGATTATGTAAAATCTGCAAAAATCTTCTGGGAATTAGCATTAAAACTCTGGTTGCCTTTTCTTTTCTGTTGGACTGT
GTTGATAAACTTGAATCATCCTGTACTAGTTGTGGGCAAAGTGGTTCTATTCCTTCTTAGTACAAAACCCAGTCCTCTCTCTGTTTACATTTTTGTGGAGCAGCTGCGTT
CTGGTTCATCCCAAGCGTTTCGTCTCTCTAAACGGACGGAGTGTTTGGTTGCAAGAAAAGTGGAAGTTGAAGACTACAAGCTTCTGTGTGTAGCTAAAGTTGAAATGAAA
CTTGAAAAGTTCACTCTTGTGGGAGTTCTTGGAGGTTGGTGGAAATGGCCACCTCTGTCCTCTGCTGATGAATTCATTGCTTTTATGGATAAGCTAGCTTTTCTTTCACA
TCGCCTAAAATCTATTATTATTAACTATAGACTAACTATATAA
mRNA sequenceShow/hide mRNA sequence
GTTAATAGTCTGGTCTTTCTGTTTGCTATGCTATAGCCAAATTGAATCTTATACTGCAAACCAAGAGTGTTATATCACAGCCTCGACGCTAAATTTTGAAGAGTTTTCAT
TGATCAAACAGATCGATTGATTTAGTTCTTCGCAATTTCGTGCATGGCCTTTGCAGCACAGTTTCTTCGACCGCTCCCAAGAGCTTGCGTTTTCGCTTCGTCTTCGTCTT
CATCTTCTTCGTTCTTCAATTCCGCCAGATTCTGTTGCTCGAAATTTGAGCCCTTGAAATCGTTTCCGAAGAACTTCAGATCATTACTCTGCAACCGTGTTTCCAATCTC
AGACTTGCATTTTCTGGCGCTAACGGAATTTATCTGTGGAAATCTCATTTGATAGGAAGTCAGTTTAGTAAGGGAACCATTTTGGGTGCATCAGTTGTAATTGGATCAAT
CAGTTTGTGGCCCAATGTTTCATTGGCTATGGATGACAGATTTGTGGATGATGTCCAAGAGGATTTAGGTGCTTCAGATTATGTAAAATCTGCAAAAATCTTCTGGGAAT
TAGCATTAAAACTCTGGTTGCCTTTTCTTTTCTGTTGGACTGTGTTGATAAACTTGAATCATCCTGTACTAGTTGTGGGCAAAGTGGTTCTATTCCTTCTTAGTACAAAA
CCCAGTCCTCTCTCTGTTTACATTTTTGTGGAGCAGCTGCGTTCTGGTTCATCCCAAGCGTTTCGTCTCTCTAAACGGACGGAGTGTTTGGTTGCAAGAAAAGTGGAAGT
TGAAGACTACAAGCTTCTGTGTGTAGCTAAAGTTGAAATGAAACTTGAAAAGTTCACTCTTGTGGGAGTTCTTGGAGGTTGGTGGAAATGGCCACCTCTGTCCTCTGCTG
ATGAATTCATTGCTTTTATGGATAAGCTAGCTTTTCTTTCACATCGCCTAAAATCTATTATTATTAACTATAGACTAACTATATAATTTACTCCCTCATGTTTTGGTATT
CTTTCTTTGGATATAAATTTACCTGACATATATTAGCC
Protein sequenceShow/hide protein sequence
MAFAAQFLRPLPRACVFASSSSSSSSFFNSARFCCSKFEPLKSFPKNFRSLLCNRVSNLRLAFSGANGIYLWKSHLIGSQFSKGTILGASVVIGSISLWPNVSLAMDDRF
VDDVQEDLGASDYVKSAKIFWELALKLWLPFLFCWTVLINLNHPVLVVGKVVLFLLSTKPSPLSVYIFVEQLRSGSSQAFRLSKRTECLVARKVEVEDYKLLCVAKVEMK
LEKFTLVGVLGGWWKWPPLSSADEFIAFMDKLAFLSHRLKSIIINYRLTI