CuGenDBv2

Lag0026472 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0026472
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: TTHA0068-like domain containing protein
Genome location: chr10:37528554..37530666
RNA-Seq Expression: Lag0026472
Synteny: Lag0026472
Gene Ontology terms: NA
InterPro domains:
IPR005500 - Protein of unknown function DUF309
IPR023203 - TTHA0068-like superfamily
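
The genome location field above uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the field can be split and the genomic span computed as follows. The function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tooling.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr10:37528554..37530666")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 2113 bp on chr10.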


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG7027740.1 hypothetical protein SDJN02_08917, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 4.4e-116 | % identity: 86.35
Query:  MASLPSLYFSSSSPSPLRPRRNSNSTFRHGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDH----EDEQIARDLGFDEAVDLFNEGAYYDCHDVLEILW
        MASLPSL   SS+PS LRP R+SNS FRHGSSLP+PPR    R  TT SLSFRTSYRFAVDH    EDEQIAR+  FDEAVDLFN+GAYYDCHDVLEILW
Subjt:  MASLPSLYFSSSSPSPLRPRRNSNSTFRHGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDH----EDEQIARDLGFDEAVDLFNEGAYYDCHDVLEILW

Query:  NGAEDPSRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGA
        NGAEDP+RTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEF++GPFH FEREISAVLDF+YLTQIELAACDENVCVTMEGSERSYELLG YGA
Subjt:  NGAEDPSRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGA

Query:  GQKLYEIESEVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDYH
        GQKLY+ E E DGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDYH
Subjt:  GQKLYEIESEVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDYH

XP_022945654.1 uncharacterized protein LOC111449825 [Cucurbita moschata] | e value: 1.8e-117 | % identity: 86.75
Query:  MASLPSLYFSSSSPSPLRPRRNSNSTFRHGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDH----EDEQIARDLGFDEAVDLFNEGAYYDCHDVLEILW
        MASLPSL  SSS+PS L P R+SNS FRHGSSLP+PPR    R  TT SLSFRTSYRFAVDH    EDEQIAR+  FDEAVDLFN+GAYYDCHDVLEILW
Subjt:  MASLPSLYFSSSSPSPLRPRRNSNSTFRHGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDH----EDEQIARDLGFDEAVDLFNEGAYYDCHDVLEILW

Query:  NGAEDPSRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGA
        NGAEDP+RTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEF++GPFH FEREISAVLDF+YLTQIELAACDENVCVTMEGSERSYELLG YGA
Subjt:  NGAEDPSRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGA

Query:  GQKLYEIESEVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDYH
        GQKLY+ E EVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDYH
Subjt:  GQKLYEIESEVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDYH

XP_022971602.1 uncharacterized protein LOC111470277 [Cucurbita maxima] | e value: 2.7e-118 | % identity: 87.15
Query:  MASLPSLYFSSSSPSPLRPRRNSNSTFRHGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDH----EDEQIARDLGFDEAVDLFNEGAYYDCHDVLEILW
        MASLPSL  SSS+PS LRP R+SNS FRHGSSLP+PPR    R  TT SLSFRTSYRFAVDH    EDEQIAR+  FDEAVDLFN+GAYYDCHDVLEILW
Subjt:  MASLPSLYFSSSSPSPLRPRRNSNSTFRHGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDH----EDEQIARDLGFDEAVDLFNEGAYYDCHDVLEILW

Query:  NGAEDPSRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGA
        NGAEDP+RTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEF++GPFH FEREISAVLDF+YLTQIELAACDENVCVTMEGSERSYELLG YGA
Subjt:  NGAEDPSRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGA

Query:  GQKLYEIESEVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDYH
        GQKLY+ E EVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDYH
Subjt:  GQKLYEIESEVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDYH

XP_023539066.1 uncharacterized protein LOC111799819 [Cucurbita pepo subsp. pepo] | e value: 3.0e-117 | % identity: 86.69
Query:  MASLPSLYFSSSSPSPLRPRRNSNSTFRHGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDH----EDEQIARDLGFDEAVDLFNEGAYYDCHDVLEILW
        MASLPSL  SSS+PS LRP R+SN+ FRHGSSLP+PPR   +R  TT SLSFRTSYRFAVDH    EDEQIAR+  FDEAVDLFN+GAYYDCHDVLEILW
Subjt:  MASLPSLYFSSSSPSPLRPRRNSNSTFRHGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDH----EDEQIARDLGFDEAVDLFNEGAYYDCHDVLEILW

Query:  NGAEDPSRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGA
        NGAEDP+RTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEF++GPFH FEREISAVLDF+YLTQIELAACDENVCVTMEGSERSYELLG YGA
Subjt:  NGAEDPSRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGA

Query:  GQKLYEIESEVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDY
        GQKLY+ E EVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDY
Subjt:  GQKLYEIESEVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDY

XP_038905159.1 uncharacterized protein LOC120091273 [Benincasa hispida] | e value: 9.8e-116 | % identity: 86.18
Query:  MASLPSLYFSSSSPSPLRPRRNSNSTFRHGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDH--EDEQIARDLGFDEAVDLFNEGAYYDCHDVLEILWNG
        MASLP+LY SSS PSPLRP R+ NS F H +SLP PPRTRSS++ TT SLSFRTSY FA DH  EDEQIARD GFDEAVDLFN+GAYYDCHDVLE LWNG
Subjt:  MASLPSLYFSSSSPSPLRPRRNSNSTFRHGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDH--EDEQIARDLGFDEAVDLFNEGAYYDCHDVLEILWNG

Query:  AEDPSRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGAGQ
        AEDP+RTL HGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKM+F+SGPF+ FEREI+AVLDFVYLTQIELAACDENVCVTMEGSERSYELLG YGAGQ
Subjt:  AEDPSRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGAGQ

Query:  KLYEIESEVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDY
        KLY+IE EVDG MCIVFSPQTSQ HPLRVKLPTLAATKQHLLALDY
Subjt:  KLYEIESEVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDY

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LC38 Uncharacterized protein | e value: 1.3e-105 | % identity: 80.57
Query:  MASLPSLYFSSSSPSPLRPRRNSNSTFRHGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDHE--DEQIARDLGFDEAVDLFNEGAYYDCHDVLEILWNG
        MASL SLY SSS PSPL P R  +S   HGS+L S PRT +SR+ TT  LSFRTSYRF  DHE  DE+I  D GFDEAVDLFN+GAYYDCHDVLE LWN 
Subjt:  MASLPSLYFSSSSPSPLRPRRNSNSTFRHGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDHE--DEQIARDLGFDEAVDLFNEGAYYDCHDVLEILWNG

Query:  AEDPSRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGAGQ
        AEDP+RTLIHGILQCAVGLHHLFNRNHRGAMMELGEG+CKLRKMEF SGPF  FEREI+AVLDFVYLTQIELAACDE+VCVTMEGSERSYELLG YG GQ
Subjt:  AEDPSRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGAGQ

Query:  KLYEIESEVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDYH
        KLY++E +VDGS CIVFS QTSQTHPLRVKLPTL ATKQHLLALD H
Subjt:  KLYEIESEVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDYH

A0A1S3B7D9 uncharacterized protein LOC103486836 | e value: 1.5e-106 | % identity: 81.78
Query:  MASLPSLYFSSSSPSPLRPRRNSNSTFR--HGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDHE--DEQIARDLGFDEAVDLFNEGAYYDCHDVLEILW
        MASL SL+ SSS PSPL P R  NS  R  HGS  PS PRT +SR+ TT S SFRTSYRF  DHE  DE+I  D GFDEAVDLFN+GAYYDCHDVLE LW
Subjt:  MASLPSLYFSSSSPSPLRPRRNSNSTFR--HGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDHE--DEQIARDLGFDEAVDLFNEGAYYDCHDVLEILW

Query:  NGAEDPSRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGA
        N AEDP+RTLIHGILQCAVGLHHLFNRNHRGAMMELGEG+CKLRKMEF SGPFH FEREI+AVLDFVYLTQIELAACDE+VCVTMEGSERSYELLG YG 
Subjt:  NGAEDPSRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGA

Query:  GQKLYEIESEVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALD
        GQKLY++E +VDGSMCIVFSPQTSQTHPLRVKLPTL ATKQHLLALD
Subjt:  GQKLYEIESEVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALD

A0A5A7UHC0 Uncharacterized ypuF | e value: 1.5e-106 | % identity: 81.78
Query:  MASLPSLYFSSSSPSPLRPRRNSNSTFR--HGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDHE--DEQIARDLGFDEAVDLFNEGAYYDCHDVLEILW
        MASL SL+ SSS PSPL P R  NS  R  HGS  PS PRT +SR+ TT S SFRTSYRF  DHE  DE+I  D GFDEAVDLFN+GAYYDCHDVLE LW
Subjt:  MASLPSLYFSSSSPSPLRPRRNSNSTFR--HGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDHE--DEQIARDLGFDEAVDLFNEGAYYDCHDVLEILW

Query:  NGAEDPSRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGA
        N AEDP+RTLIHGILQCAVGLHHLFNRNHRGAMMELGEG+CKLRKMEF SGPFH FEREI+AVLDFVYLTQIELAACDE+VCVTMEGSERSYELLG YG 
Subjt:  NGAEDPSRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGA

Query:  GQKLYEIESEVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALD
        GQKLY++E +VDGSMCIVFSPQTSQTHPLRVKLPTL ATKQHLLALD
Subjt:  GQKLYEIESEVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALD

A0A6J1G1I9 uncharacterized protein LOC111449825 | e value: 8.6e-118 | % identity: 86.75
Query:  MASLPSLYFSSSSPSPLRPRRNSNSTFRHGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDH----EDEQIARDLGFDEAVDLFNEGAYYDCHDVLEILW
        MASLPSL  SSS+PS L P R+SNS FRHGSSLP+PPR    R  TT SLSFRTSYRFAVDH    EDEQIAR+  FDEAVDLFN+GAYYDCHDVLEILW
Subjt:  MASLPSLYFSSSSPSPLRPRRNSNSTFRHGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDH----EDEQIARDLGFDEAVDLFNEGAYYDCHDVLEILW

Query:  NGAEDPSRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGA
        NGAEDP+RTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEF++GPFH FEREISAVLDF+YLTQIELAACDENVCVTMEGSERSYELLG YGA
Subjt:  NGAEDPSRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGA

Query:  GQKLYEIESEVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDYH
        GQKLY+ E EVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDYH
Subjt:  GQKLYEIESEVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDYH

A0A6J1I2E8 uncharacterized protein LOC111470277 | e value: 1.3e-118 | % identity: 87.15
Query:  MASLPSLYFSSSSPSPLRPRRNSNSTFRHGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDH----EDEQIARDLGFDEAVDLFNEGAYYDCHDVLEILW
        MASLPSL  SSS+PS LRP R+SNS FRHGSSLP+PPR    R  TT SLSFRTSYRFAVDH    EDEQIAR+  FDEAVDLFN+GAYYDCHDVLEILW
Subjt:  MASLPSLYFSSSSPSPLRPRRNSNSTFRHGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDH----EDEQIARDLGFDEAVDLFNEGAYYDCHDVLEILW

Query:  NGAEDPSRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGA
        NGAEDP+RTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEF++GPFH FEREISAVLDF+YLTQIELAACDENVCVTMEGSERSYELLG YGA
Subjt:  NGAEDPSRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGA

Query:  GQKLYEIESEVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDYH
        GQKLY+ E EVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDYH
Subjt:  GQKLYEIESEVDGSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDYH

SwissProt top hits (columns: e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, % identity, alignment)
AT2G41120.1 unknown protein | e value: 4.6e-63 | % identity: 54.03
Query:  PSLYFSSSSPSPLRPRRNSNSTFRHGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDHEDEQIARD--LGFDEAVDLFNEGAYYDCHDVLEILWNGAEDP
        PS +FSS  PS   P   + ST R  S+  S  R R SR+        R +     D ED    ++    F+EAV LFN+  YY  HD LE LW  AE+P
Subjt:  PSLYFSSSSPSPLRPRRNSNSTFRHGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDHEDEQIARD--LGFDEAVDLFNEGAYYDCHDVLEILWNGAEDP

Query:  SRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGAGQKLYE
        +RTLIHGILQCAVG HHLFN NH+GAMMELGEG+CKLRKM FE GPFH+FER++SAVL+FVY TQ+ELAAC E++C+TM+ S+RSY+LLG Y AG+ +Y 
Subjt:  SRTLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGAGQKLYE

Query:  IESEVD------GSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDY
        +E+ +D       +  I+FSP  S + P RVKLPTL AT +HLLA  Y
Subjt:  IESEVD------GSMCIVFSPQTSQTHPLRVKLPTLAATKQHLLALDY


Sequences
CDS sequence
ATGGCTTCTCTTCCGTCCCTCTATTTTTCCTCCTCCTCACCATCCCCTCTTCGACCTCGCCGCAATTCGAACTCCACTTTCCGCCATGGCAGCAGCCTCCCAAGCCCTCC
AAGAACCAGAAGCAGTCGAAAACCAACCACAAAATCGCTCTCCTTCCGCACCTCCTACCGATTCGCCGTCGATCACGAAGACGAGCAGATCGCGAGAGATCTCGGCTTCG
ACGAAGCAGTCGATCTCTTCAATGAAGGAGCTTACTACGATTGCCATGACGTCCTCGAGATTCTGTGGAACGGAGCCGAGGATCCTAGCAGAACCCTAATTCATGGCATT
CTTCAGTGCGCCGTGGGGCTTCATCATCTCTTCAATCGGAATCATAGAGGGGCGATGATGGAGCTTGGAGAGGGGCTGTGTAAGCTAAGGAAGATGGAGTTTGAGAGTGG
GCCTTTCCATGATTTTGAGAGGGAGATTTCTGCAGTTTTGGACTTTGTTTACCTCACCCAGATTGAATTAGCTGCCTGTGATGAGAATGTGTGTGTTACAATGGAGGGTT
CAGAGAGATCATATGAATTGCTTGGAATGTATGGTGCTGGACAGAAGCTGTATGAAATTGAGAGTGAAGTTGATGGAAGCATGTGTATTGTCTTCTCTCCTCAAACATCT
CAAACTCATCCACTAAGGGTAAAGCTTCCCACTCTTGCTGCTACAAAACAACACCTCTTAGCCCTTGACTACCATTGA
mRNA sequence
ATGGCTTCTCTTCCGTCCCTCTATTTTTCCTCCTCCTCACCATCCCCTCTTCGACCTCGCCGCAATTCGAACTCCACTTTCCGCCATGGCAGCAGCCTCCCAAGCCCTCC
AAGAACCAGAAGCAGTCGAAAACCAACCACAAAATCGCTCTCCTTCCGCACCTCCTACCGATTCGCCGTCGATCACGAAGACGAGCAGATCGCGAGAGATCTCGGCTTCG
ACGAAGCAGTCGATCTCTTCAATGAAGGAGCTTACTACGATTGCCATGACGTCCTCGAGATTCTGTGGAACGGAGCCGAGGATCCTAGCAGAACCCTAATTCATGGCATT
CTTCAGTGCGCCGTGGGGCTTCATCATCTCTTCAATCGGAATCATAGAGGGGCGATGATGGAGCTTGGAGAGGGGCTGTGTAAGCTAAGGAAGATGGAGTTTGAGAGTGG
GCCTTTCCATGATTTTGAGAGGGAGATTTCTGCAGTTTTGGACTTTGTTTACCTCACCCAGATTGAATTAGCTGCCTGTGATGAGAATGTGTGTGTTACAATGGAGGGTT
CAGAGAGATCATATGAATTGCTTGGAATGTATGGTGCTGGACAGAAGCTGTATGAAATTGAGAGTGAAGTTGATGGAAGCATGTGTATTGTCTTCTCTCCTCAAACATCT
CAAACTCATCCACTAAGGGTAAAGCTTCCCACTCTTGCTGCTACAAAACAACACCTCTTAGCCCTTGACTACCATTGA
Protein sequence
MASLPSLYFSSSSPSPLRPRRNSNSTFRHGSSLPSPPRTRSSRKPTTKSLSFRTSYRFAVDHEDEQIARDLGFDEAVDLFNEGAYYDCHDVLEILWNGAEDPSRTLIHGI
LQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFHDFEREISAVLDFVYLTQIELAACDENVCVTMEGSERSYELLGMYGAGQKLYEIESEVDGSMCIVFSPQTS
QTHPLRVKLPTLAATKQHLLALDYH
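
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (the record does not name a translation table). The `translate` function and `CODON_TABLE` are illustrative names rather than part of any database tooling, and only the final line of the CDS is used as input to keep the example short.

```python
# Standard genetic code in compact form: codons enumerated with bases in the
# order T, C, A, G at each position; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Final line of the CDS as listed in this record (78 nt, ends with TGA).
cds_tail = (
    "CAAACTCATCCACTAAGGGTAAAGCTTCCCACTCTTGCTGCTACAAAACAACACCTC"
    "TTAGCCCTTGACTACCATTGA"
)
print(translate(cds_tail))
```

The translated tail matches the last line of the protein sequence (`QTHPLRVKLPTLAATKQHLLALDYH`). This works on the tail alone because the preceding 660 nt of the CDS are a whole number of codons, so the tail begins in frame.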