; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016939 (gene) of Snake gourd v1 genome

Gene IDTan0016939
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionF-box-like domain-containing protein
Genome locationLG10:8290932..8293736
RNA-Seq ExpressionTan0016939
SyntenyTan0016939
Gene Ontology termsGO:0000209 - protein polyubiquitination (biological process)
GO:0031146 - SCF-dependent proteasomal ubiquitin-dependent protein catabolic process (biological process)
GO:0019005 - SCF ubiquitin ligase complex (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001810 - F-box domain
IPR036047 - F-box-like domain superfamily
IPR039588 - F-box only protein 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597495.1 hypothetical protein SDJN03_10675, partial [Cucurbita argyrosperma subsp. sororia]2.3e-11788.28Show/hide
Query:  MGMLHLPVDVALKIASSLQISDICALGCCSRFCRELFDSDPLWESLANERWPYI---NATGSSSSTLAKSPISMGWKSFYIQRHIEILGRAQAAVKFIEQ
        MG+LHLP+DVALKIASSLQ SDICALGCCSR CRE+FDSD LWESLA ERWPYI   ++TGSSSST AK PISMGWKSFYI RHIEILGRAQAAVKFIEQ
Subjt:  MGMLHLPVDVALKIASSLQISDICALGCCSRFCRELFDSDPLWESLANERWPYI---NATGSSSSTLAKSPISMGWKSFYIQRHIEILGRAQAAVKFIEQ

Query:  CPPSTPIEGGDYLRTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQQTRRV
        CPPSTPIEGGDYLRT+ GL DLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLL++PA +VMEALQRCKISEKH+CVKWWKLGRWFYGFRMRDEQ TRRV
Subjt:  CPPSTPIEGGDYLRTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQQTRRV

Query:  SLAELITEEGEDVLGVLSRGAVHEVLRVQVSVSDPFGSH
        SLAEL+T EGEDVLGVLSRG VHEVLRVQVSVSDPF SH
Subjt:  SLAELITEEGEDVLGVLSRGAVHEVLRVQVSVSDPFGSH

XP_022950192.1 uncharacterized protein LOC111453355 [Cucurbita moschata]1.0e-11788.7Show/hide
Query:  MGMLHLPVDVALKIASSLQISDICALGCCSRFCRELFDSDPLWESLANERWPYI---NATGSSSSTLAKSPISMGWKSFYIQRHIEILGRAQAAVKFIEQ
        MG+LHLP+DVALKIASSLQ SDICALGCCSR CRELFDSD LWESLA ERWPYI   ++TGSSSST AK PISMGWKSFYI RHIEILGRAQAAVKFIEQ
Subjt:  MGMLHLPVDVALKIASSLQISDICALGCCSRFCRELFDSDPLWESLANERWPYI---NATGSSSSTLAKSPISMGWKSFYIQRHIEILGRAQAAVKFIEQ

Query:  CPPSTPIEGGDYLRTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQQTRRV
        CPPSTPIEGGDYLRT+ GL DLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLL++PA +VMEALQRCKISEKH+CVKWWKLGRWFYGFRMRDEQ TRRV
Subjt:  CPPSTPIEGGDYLRTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQQTRRV

Query:  SLAELITEEGEDVLGVLSRGAVHEVLRVQVSVSDPFGSH
        SLAEL+T EGEDVLGVLSRG VHEVLRVQVSVSDPF SH
Subjt:  SLAELITEEGEDVLGVLSRGAVHEVLRVQVSVSDPFGSH

XP_022973765.1 uncharacterized protein LOC111472328 isoform X1 [Cucurbita maxima]3.0e-11788.7Show/hide
Query:  MGMLHLPVDVALKIASSLQISDICALGCCSRFCRELFDSDPLWESLANERWPYI---NATGSSSSTLAKSPISMGWKSFYIQRHIEILGRAQAAVKFIEQ
        MG+LHLP+DVALKIASSLQ SDICALGCCSR CRELFDSD LWESLA ERWPYI   ++TGSSSST AK PISMGWKSFYI RHIEILGRAQAAVKFIEQ
Subjt:  MGMLHLPVDVALKIASSLQISDICALGCCSRFCRELFDSDPLWESLANERWPYI---NATGSSSSTLAKSPISMGWKSFYIQRHIEILGRAQAAVKFIEQ

Query:  CPPSTPIEGGDYLRTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQQTRRV
        CPP TPIEGGDYLRT+ GL DLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLL++PA +VMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQ TRRV
Subjt:  CPPSTPIEGGDYLRTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQQTRRV

Query:  SLAELITEEGEDVLGVLSRGAVHEVLRVQVSVSDPFGSH
        SLAEL+T EGEDVLGVLSRG VHEVLRVQVSVSDPF SH
Subjt:  SLAELITEEGEDVLGVLSRGAVHEVLRVQVSVSDPFGSH

XP_023540636.1 uncharacterized protein LOC111800940 isoform X1 [Cucurbita pepo subsp. pepo]1.0e-11789.12Show/hide
Query:  MGMLHLPVDVALKIASSLQISDICALGCCSRFCRELFDSDPLWESLANERWPYI---NATGSSSSTLAKSPISMGWKSFYIQRHIEILGRAQAAVKFIEQ
        MG+LHLP+DVALKIASSLQ SDICALGCCSR CRELFDSD LWESLA ERWPYI   ++TGSSSST AK PISMGWKSFYI RHIEILGRAQAAVKFIEQ
Subjt:  MGMLHLPVDVALKIASSLQISDICALGCCSRFCRELFDSDPLWESLANERWPYI---NATGSSSSTLAKSPISMGWKSFYIQRHIEILGRAQAAVKFIEQ

Query:  CPPSTPIEGGDYLRTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQQTRRV
        CPPSTPIEGGDYLRT+ GL DLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLL++PA +VMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQ TRRV
Subjt:  CPPSTPIEGGDYLRTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQQTRRV

Query:  SLAELITEEGEDVLGVLSRGAVHEVLRVQVSVSDPFGSH
        SLAEL+T EGEDVLGVLSRG VHEVLRVQVSVSDPF SH
Subjt:  SLAELITEEGEDVLGVLSRGAVHEVLRVQVSVSDPFGSH

XP_038893805.1 uncharacterized protein LOC120082593 [Benincasa hispida]1.6e-11586.13Show/hide
Query:  MGMLHLPVDVALKIASSLQISDICALGCCSRFCRELFDSDPLWESLANERWPYINATGSSSSTLAKSPISMGWKSFYIQRHIEILGRAQAAVKFIEQCPP
        MG+ HLP+D ALKIASSLQ SDIC+LGCCSRFCR+L DSD LWESLA ERWPYIN  GSS STLA+SPISMGWKS YIQRHIEILGRAQAAVKFIEQC P
Subjt:  MGMLHLPVDVALKIASSLQISDICALGCCSRFCRELFDSDPLWESLANERWPYINATGSSSSTLAKSPISMGWKSFYIQRHIEILGRAQAAVKFIEQCPP

Query:  STPIEGGDYLRTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQQTRRVSLA
        STPIEGGDYLRT+ GLWDLKLSFID QMVLFKPQLN LLNLVGLHYC N L+VPANQVMEALQRCK++E+++CVKWWKLGRWFYGFRMRDEQQTRRVSLA
Subjt:  STPIEGGDYLRTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQQTRRVSLA

Query:  ELITEEGEDVLGVLSRGAVHEVLRVQVSVSDPFGSHQS
        EL+TEEGEDVLGVLSRGAVHEVLRVQVS  DPF SH S
Subjt:  ELITEEGEDVLGVLSRGAVHEVLRVQVSVSDPFGSHQS

TrEMBL top hitse value%identityAlignment
A0A0A0L7T9 F-box domain-containing protein9.7e-11483.47Show/hide
Query:  MGMLHLPVDVALKIASSLQISDICALGCCSRFCRELFDSDPLWESLANERWPYINATGSSSSTLAKSPISMGWKSFYIQRHIEILGRAQAAVKFIEQCPP
        MG+ +L +D ALKIASSLQ+SDIC+LGCCSRFCR+L DSD LWESLA ERWPYIN  GSSSSTLA+S ISMGWKSFYIQRHIEI GRAQAAVKF+EQC P
Subjt:  MGMLHLPVDVALKIASSLQISDICALGCCSRFCRELFDSDPLWESLANERWPYINATGSSSSTLAKSPISMGWKSFYIQRHIEILGRAQAAVKFIEQCPP

Query:  STPIEGGDYLRTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQQTRRVSLA
        STPIEGGDYLRT+ GLWDLKLSF+D QMVLFKPQLN LLNLVGLHYC   L++PANQ+MEALQRCKI+E+H+CVKWWKLGRWFYGFRMRDEQQTRRVSLA
Subjt:  STPIEGGDYLRTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQQTRRVSLA

Query:  ELITEEGEDVLGVLSRGAVHEVLRVQVSVSDPFGSHQSQQST
        EL+TEEGEDVLGVLSRGAVHEVLRVQVSV D F  H S+QST
Subjt:  ELITEEGEDVLGVLSRGAVHEVLRVQVSVSDPFGSHQSQQST

A0A1S3AXY1 uncharacterized protein LOC1034837968.0e-11684.71Show/hide
Query:  MGMLHLPVDVALKIASSLQISDICALGCCSRFCRELFDSDPLWESLANERWPYINATGSSSSTLAKSPISMGWKSFYIQRHIEILGRAQAAVKFIEQCPP
        MG  HLP D ALKIASSLQ+S+IC+LGCCSRFCR+L DSD LWESLA ERWPYIN  GSSSSTLA+SPISMGWKSFYIQRHIEILGRAQAAVKF+EQC P
Subjt:  MGMLHLPVDVALKIASSLQISDICALGCCSRFCRELFDSDPLWESLANERWPYINATGSSSSTLAKSPISMGWKSFYIQRHIEILGRAQAAVKFIEQCPP

Query:  STPIEGGDYLRTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQQTRRVSLA
        STPIEGGDYLRT+ GLWDLKLSF+D QMVLFKPQLN LLNLVGLHYC   L+VPANQ++EALQRCKI+E+H+CVKWWKLGRWFYGFRMRDEQQTRRVSLA
Subjt:  STPIEGGDYLRTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQQTRRVSLA

Query:  ELITEEGEDVLGVLSRGAVHEVLRVQVSVSDPFGSHQSQQST
        EL+TEEGEDVLGVLSRGAVHEVLRVQVSV D F  H S+QST
Subjt:  ELITEEGEDVLGVLSRGAVHEVLRVQVSVSDPFGSHQSQQST

A0A5A7U795 F-box-like domain-containing protein8.0e-11684.71Show/hide
Query:  MGMLHLPVDVALKIASSLQISDICALGCCSRFCRELFDSDPLWESLANERWPYINATGSSSSTLAKSPISMGWKSFYIQRHIEILGRAQAAVKFIEQCPP
        MG  HLP D ALKIASSLQ+S+IC+LGCCSRFCR+L DSD LWESLA ERWPYIN  GSSSSTLA+SPISMGWKSFYIQRHIEILGRAQAAVKF+EQC P
Subjt:  MGMLHLPVDVALKIASSLQISDICALGCCSRFCRELFDSDPLWESLANERWPYINATGSSSSTLAKSPISMGWKSFYIQRHIEILGRAQAAVKFIEQCPP

Query:  STPIEGGDYLRTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQQTRRVSLA
        STPIEGGDYLRT+ GLWDLKLSF+D QMVLFKPQLN LLNLVGLHYC   L+VPANQ++EALQRCKI+E+H+CVKWWKLGRWFYGFRMRDEQQTRRVSLA
Subjt:  STPIEGGDYLRTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQQTRRVSLA

Query:  ELITEEGEDVLGVLSRGAVHEVLRVQVSVSDPFGSHQSQQST
        EL+TEEGEDVLGVLSRGAVHEVLRVQVSV D F  H S+QST
Subjt:  ELITEEGEDVLGVLSRGAVHEVLRVQVSVSDPFGSHQSQQST

A0A6J1GE44 uncharacterized protein LOC1114533555.0e-11888.7Show/hide
Query:  MGMLHLPVDVALKIASSLQISDICALGCCSRFCRELFDSDPLWESLANERWPYI---NATGSSSSTLAKSPISMGWKSFYIQRHIEILGRAQAAVKFIEQ
        MG+LHLP+DVALKIASSLQ SDICALGCCSR CRELFDSD LWESLA ERWPYI   ++TGSSSST AK PISMGWKSFYI RHIEILGRAQAAVKFIEQ
Subjt:  MGMLHLPVDVALKIASSLQISDICALGCCSRFCRELFDSDPLWESLANERWPYI---NATGSSSSTLAKSPISMGWKSFYIQRHIEILGRAQAAVKFIEQ

Query:  CPPSTPIEGGDYLRTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQQTRRV
        CPPSTPIEGGDYLRT+ GL DLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLL++PA +VMEALQRCKISEKH+CVKWWKLGRWFYGFRMRDEQ TRRV
Subjt:  CPPSTPIEGGDYLRTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQQTRRV

Query:  SLAELITEEGEDVLGVLSRGAVHEVLRVQVSVSDPFGSH
        SLAEL+T EGEDVLGVLSRG VHEVLRVQVSVSDPF SH
Subjt:  SLAELITEEGEDVLGVLSRGAVHEVLRVQVSVSDPFGSH

A0A6J1IEA8 uncharacterized protein LOC111472328 isoform X11.5e-11788.7Show/hide
Query:  MGMLHLPVDVALKIASSLQISDICALGCCSRFCRELFDSDPLWESLANERWPYI---NATGSSSSTLAKSPISMGWKSFYIQRHIEILGRAQAAVKFIEQ
        MG+LHLP+DVALKIASSLQ SDICALGCCSR CRELFDSD LWESLA ERWPYI   ++TGSSSST AK PISMGWKSFYI RHIEILGRAQAAVKFIEQ
Subjt:  MGMLHLPVDVALKIASSLQISDICALGCCSRFCRELFDSDPLWESLANERWPYI---NATGSSSSTLAKSPISMGWKSFYIQRHIEILGRAQAAVKFIEQ

Query:  CPPSTPIEGGDYLRTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQQTRRV
        CPP TPIEGGDYLRT+ GL DLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLL++PA +VMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQ TRRV
Subjt:  CPPSTPIEGGDYLRTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQQTRRV

Query:  SLAELITEEGEDVLGVLSRGAVHEVLRVQVSVSDPFGSH
        SLAEL+T EGEDVLGVLSRG VHEVLRVQVSVSDPF SH
Subjt:  SLAELITEEGEDVLGVLSRGAVHEVLRVQVSVSDPFGSH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G15563.1 unknown protein4.1e-3244.74Show/hide
Query:  RHIEILGRAQAAVKFIEQCPPSTPIEGGDYLRTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKL
        +H E+  RA   +K++ + P +  +E G YL  +  +  ++  F DV++  FKP L+ LLNL+GL YC   LK    QV++ L++C ISE+ V VKW  L
Subjt:  RHIEILGRAQAAVKFIEQCPPSTPIEGGDYLRTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKL

Query:  GRWFYGFRMRDEQQTRRVSLAELITEEGEDVLGVLSRGAVHEVLRVQVSVSD
        GRW  G RMRD+  +R+VSL +++T + E VL VL RG VHEVLRV +S  D
Subjt:  GRWFYGFRMRDEQQTRRVSLAELITEEGEDVLGVLSRGAVHEVLRVQVSVSD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGATGCTCCATCTTCCAGTTGATGTTGCCCTGAAAATCGCTTCTTCCCTTCAGATATCGGACATTTGTGCTTTGGGCTGTTGCTCGCGATTTTGTAGGGAACTGTT
CGATTCCGACCCTCTGTGGGAGTCTCTCGCAAACGAAAGATGGCCTTATATCAATGCTACTGGTTCTTCTTCGTCAACTCTCGCAAAATCCCCCATCTCCATGGGATGGA
AAAGCTTTTACATCCAAAGGCATATTGAGATATTAGGAAGAGCCCAAGCAGCAGTCAAGTTTATAGAACAATGCCCTCCTTCTACGCCGATTGAGGGTGGAGACTATCTC
AGGACAATGGCAGGCTTGTGGGATTTGAAGCTTAGTTTTATAGATGTTCAAATGGTGCTTTTCAAACCCCAACTTAATGGGCTGCTGAACTTGGTTGGGTTACACTACTG
TACAAATTTGCTGAAAGTCCCTGCCAATCAAGTCATGGAAGCACTTCAAAGATGCAAGATCTCAGAGAAGCATGTATGTGTGAAGTGGTGGAAGCTGGGGAGATGGTTTT
ATGGCTTCCGCATGAGAGACGAACAACAAACTCGTCGAGTCTCTCTGGCAGAACTCATAACAGAAGAAGGGGAAGACGTTCTTGGGGTGCTTAGCCGAGGTGCTGTTCAT
GAGGTGCTCCGGGTTCAGGTTTCTGTAAGTGATCCCTTTGGCAGTCATCAATCTCAACAAAGCACATAA
mRNA sequenceShow/hide mRNA sequence
GTGTCAGCGTCGTCAAAGCCCTCGCCTCGAGTTCTCAACTTCAGCAAACTCCGTTCGGAAGCAAGCTCGGTTAGGGATTGAGTCCAATCGGTGGTTTTCAAATGTTGATT
TTCTTCTCCCGATGAAATCGATCATTCGAGGGTTTACCAGATTAAATCTCCATTAAGAAACTTGTGCGAAGAAAGAAGAGCGTTGAGAGCAATCGGAAATGGGGATGCTC
CATCTTCCAGTTGATGTTGCCCTGAAAATCGCTTCTTCCCTTCAGATATCGGACATTTGTGCTTTGGGCTGTTGCTCGCGATTTTGTAGGGAACTGTTCGATTCCGACCC
TCTGTGGGAGTCTCTCGCAAACGAAAGATGGCCTTATATCAATGCTACTGGTTCTTCTTCGTCAACTCTCGCAAAATCCCCCATCTCCATGGGATGGAAAAGCTTTTACA
TCCAAAGGCATATTGAGATATTAGGAAGAGCCCAAGCAGCAGTCAAGTTTATAGAACAATGCCCTCCTTCTACGCCGATTGAGGGTGGAGACTATCTCAGGACAATGGCA
GGCTTGTGGGATTTGAAGCTTAGTTTTATAGATGTTCAAATGGTGCTTTTCAAACCCCAACTTAATGGGCTGCTGAACTTGGTTGGGTTACACTACTGTACAAATTTGCT
GAAAGTCCCTGCCAATCAAGTCATGGAAGCACTTCAAAGATGCAAGATCTCAGAGAAGCATGTATGTGTGAAGTGGTGGAAGCTGGGGAGATGGTTTTATGGCTTCCGCA
TGAGAGACGAACAACAAACTCGTCGAGTCTCTCTGGCAGAACTCATAACAGAAGAAGGGGAAGACGTTCTTGGGGTGCTTAGCCGAGGTGCTGTTCATGAGGTGCTCCGG
GTTCAGGTTTCTGTAAGTGATCCCTTTGGCAGTCATCAATCTCAACAAAGCACATAAAAACACCCTTAGCATCTCTATAAACATTCTGCAGCTGCAATTATCTAAGTGCT
AATAGTTGGTCATGCATTTCAAATAATTGATTGTAAATTCATGTATATATAAAAAAAGTAAAATACTTGTTGGGTATTCAGATTTGATTTATAAGTTGCAGTTCGTCAAG
ATATGCACATCAATAATTAGTGTATTCTGTAAATTATTAAGGGAGTTTGGCTCAAAGTATCGGATTTTACACTTTCTTCA
Protein sequenceShow/hide protein sequence
MGMLHLPVDVALKIASSLQISDICALGCCSRFCRELFDSDPLWESLANERWPYINATGSSSSTLAKSPISMGWKSFYIQRHIEILGRAQAAVKFIEQCPPSTPIEGGDYL
RTMAGLWDLKLSFIDVQMVLFKPQLNGLLNLVGLHYCTNLLKVPANQVMEALQRCKISEKHVCVKWWKLGRWFYGFRMRDEQQTRRVSLAELITEEGEDVLGVLSRGAVH
EVLRVQVSVSDPFGSHQSQQST