; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS000051 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS000051
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUnknown protein
Genome locationscaffold946_1:472125..474199
RNA-Seq ExpressionMS000051
SyntenyMS000051
Gene Ontology termsNA
InterPro domainsIPR005500 - Protein of unknown function DUF309
IPR023203 - TTHA0068-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7027740.1 hypothetical protein SDJN02_08917, partial [Cucurbita argyrosperma subsp. argyrosperma]2.2e-11083Show/hide
Query:  MASLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLPNPTRSSSSSTRRSRSTISLSFRTSYRFSVDHEDEDEEEDQIARDSCFDEAVDLFNRGAYYDCHDVL
        MASL SL VSS+ S  L PHR+ NS FRHGS+LP P R   ++ RR ++T SLSFRTSYRF+VDHEDEDE+E QIAR+  FDEAVDLFN+GAYYDCHDVL
Subjt:  MASLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLPNPTRSSSSSTRRSRSTISLSFRTSYRFSVDHEDEDEEEDQIARDSCFDEAVDLFNRGAYYDCHDVL

Query:  EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSYELLG
        EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNH+GAMMELGEGLCKLRKMEF++GPF TFEREISAVLDF+Y TQIELAACDE VCVTME SERSYELLG
Subjt:  EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSYELLG

Query:  GYGAGQKLYDLEREADGSMCILFSPQTSQTHPLRVRLPTLAATKQHLLALDYH
         YGAGQKLYD E EADGSMCI+FSPQTSQTHPLRV+LPTLAATKQHLLALDYH
Subjt:  GYGAGQKLYDLEREADGSMCILFSPQTSQTHPLRVRLPTLAATKQHLLALDYH

XP_022945654.1 uncharacterized protein LOC111449825 [Cucurbita moschata]1.4e-11283.4Show/hide
Query:  MASLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLPNPTRSSSSSTRRSRSTISLSFRTSYRFSVDHEDEDEEEDQIARDSCFDEAVDLFNRGAYYDCHDVL
        MASL SL VSSS+ S LPPHR+ NS FRHGS+LP P R   ++ RR ++T SLSFRTSYRF+VDHEDEDE+E QIAR+  FDEAVDLFN+GAYYDCHDVL
Subjt:  MASLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLPNPTRSSSSSTRRSRSTISLSFRTSYRFSVDHEDEDEEEDQIARDSCFDEAVDLFNRGAYYDCHDVL

Query:  EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSYELLG
        EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNH+GAMMELGEGLCKLRKMEF++GPF TFEREISAVLDF+Y TQIELAACDE VCVTME SERSYELLG
Subjt:  EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSYELLG

Query:  GYGAGQKLYDLEREADGSMCILFSPQTSQTHPLRVRLPTLAATKQHLLALDYH
         YGAGQKLYD E E DGSMCI+FSPQTSQTHPLRV+LPTLAATKQHLLALDYH
Subjt:  GYGAGQKLYDLEREADGSMCILFSPQTSQTHPLRVRLPTLAATKQHLLALDYH

XP_022971602.1 uncharacterized protein LOC111470277 [Cucurbita maxima]1.5e-11183Show/hide
Query:  MASLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLPNPTRSSSSSTRRSRSTISLSFRTSYRFSVDHEDEDEEEDQIARDSCFDEAVDLFNRGAYYDCHDVL
        MASL SL VSSS+ S L PHR+ NS FRHGS+LP P R   ++ RR ++T SLSFRTSYRF+VDHEDEDE+E QIAR+  FDEAVDLFN+GAYYDCHDVL
Subjt:  MASLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLPNPTRSSSSSTRRSRSTISLSFRTSYRFSVDHEDEDEEEDQIARDSCFDEAVDLFNRGAYYDCHDVL

Query:  EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSYELLG
        EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNH+GAMMELGEGLCKLRKMEF++GPF TFEREISAVLDF+Y TQIELAACDE VCVTME SERSYELLG
Subjt:  EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSYELLG

Query:  GYGAGQKLYDLEREADGSMCILFSPQTSQTHPLRVRLPTLAATKQHLLALDYH
         YGAGQKLYD E E DGSMCI+FSPQTSQTHPLRV+LPTLAATKQHLLALDYH
Subjt:  GYGAGQKLYDLEREADGSMCILFSPQTSQTHPLRVRLPTLAATKQHLLALDYH

XP_023539066.1 uncharacterized protein LOC111799819 [Cucurbita pepo subsp. pepo]8.3e-11082.14Show/hide
Query:  MASLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLPNPTRSSSSSTRRSRSTISLSFRTSYRFSVDHEDEDEEEDQIARDSCFDEAVDLFNRGAYYDCHDVL
        MASL SL VSSS+ S L PHR+ N+ FRHGS+LP P R++ +   R ++T SLSFRTSYRF+VDHEDEDE+E QIAR+  FDEAVDLFN+GAYYDCHDVL
Subjt:  MASLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLPNPTRSSSSSTRRSRSTISLSFRTSYRFSVDHEDEDEEEDQIARDSCFDEAVDLFNRGAYYDCHDVL

Query:  EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSYELLG
        EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNH+GAMMELGEGLCKLRKMEF++GPF TFEREISAVLDF+Y TQIELAACDE VCVTME SERSYELLG
Subjt:  EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSYELLG

Query:  GYGAGQKLYDLEREADGSMCILFSPQTSQTHPLRVRLPTLAATKQHLLALDY
         YGAGQKLYD E E DGSMCI+FSPQTSQTHPLRV+LPTLAATKQHLLALDY
Subjt:  GYGAGQKLYDLEREADGSMCILFSPQTSQTHPLRVRLPTLAATKQHLLALDY

XP_038905159.1 uncharacterized protein LOC120091273 [Benincasa hispida]4.5e-10880.95Show/hide
Query:  MASLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLPNPTRSSSSSTRRSRSTISLSFRTSYRFSVDHEDEDEEEDQIARDSCFDEAVDLFNRGAYYDCHDVL
        MASL +LYVSSS  SPL PHR+LNS F H ++LP P R+ SS   + R+TISLSFRTSY F+ DHEDEDE   QIARD  FDEAVDLFN+GAYYDCHDVL
Subjt:  MASLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLPNPTRSSSSSTRRSRSTISLSFRTSYRFSVDHEDEDEEEDQIARDSCFDEAVDLFNRGAYYDCHDVL

Query:  EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSYELLG
        E LWNGAEDPTRTL HGILQCAVGLHHLFNRNH+GAMMELGEGLCKLRKM+F+SGPF+TFEREI+AVLDFVY TQIELAACDE VCVTME SERSYELLG
Subjt:  EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSYELLG

Query:  GYGAGQKLYDLEREADGSMCILFSPQTSQTHPLRVRLPTLAATKQHLLALDY
         YGAGQKLYD+E+E DG MCI+FSPQTSQ HPLRV+LPTLAATKQHLLALDY
Subjt:  GYGAGQKLYDLEREADGSMCILFSPQTSQTHPLRVRLPTLAATKQHLLALDY

TrEMBL top hitse value%identityAlignment
A0A0A0LC38 Uncharacterized protein1.3e-10076.68Show/hide
Query:  MASLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLPNPTRSSSSSTRRSRSTISLSFRTSYRFSVDHEDEDEEEDQIARDSCFDEAVDLFNRGAYYDCHDVL
        MASL SLY+SSS  SPLPPHR  +S   HGS L +  R+++S   R  +TI LSFRTSYRF+ DHED DE   +I  D  FDEAVDLFN+GAYYDCHDVL
Subjt:  MASLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLPNPTRSSSSSTRRSRSTISLSFRTSYRFSVDHEDEDEEEDQIARDSCFDEAVDLFNRGAYYDCHDVL

Query:  EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSYELLG
        E LWN AEDPTRTLIHGILQCAVGLHHLFNRNH+GAMMELGEG+CKLRKMEF SGPF TFEREI+AVLDFVY TQIELAACDE+VCVTME SERSYELLG
Subjt:  EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSYELLG

Query:  GYGAGQKLYDLEREADGSMCILFSPQTSQTHPLRVRLPTLAATKQHLLALDYH
         YG GQKLYD+E++ DGS CI+FS QTSQTHPLRV+LPTL ATKQHLLALD H
Subjt:  GYGAGQKLYDLEREADGSMCILFSPQTSQTHPLRVRLPTLAATKQHLLALDYH

A0A1S3B7D9 uncharacterized protein LOC1034868361.7e-10076.89Show/hide
Query:  MASLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLPNPTRSSSSSTRRSRSTISLSFRTSYRFSVDHEDEDEEEDQIARDSCFDEAVDLFNRGAYYDCHDVL
        MASL SL++SSS  SPL PHR  NS  RHG     P+   ++++RR  +TIS SFRTSYRF+ DHED DE   +I  D  FDEAVDLFN+GAYYDCHDVL
Subjt:  MASLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLPNPTRSSSSSTRRSRSTISLSFRTSYRFSVDHEDEDEEEDQIARDSCFDEAVDLFNRGAYYDCHDVL

Query:  EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSYELLG
        E LWN AEDPTRTLIHGILQCAVGLHHLFNRNH+GAMMELGEG+CKLRKMEF SGPF TFEREI+AVLDFVY TQIELAACDE+VCVTME SERSYELLG
Subjt:  EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSYELLG

Query:  GYGAGQKLYDLEREADGSMCILFSPQTSQTHPLRVRLPTLAATKQHLLALD
         YG GQKLYD+E++ DGSMCI+FSPQTSQTHPLRV+LPTL ATKQHLLALD
Subjt:  GYGAGQKLYDLEREADGSMCILFSPQTSQTHPLRVRLPTLAATKQHLLALD

A0A5A7UHC0 Uncharacterized ypuF1.7e-10076.89Show/hide
Query:  MASLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLPNPTRSSSSSTRRSRSTISLSFRTSYRFSVDHEDEDEEEDQIARDSCFDEAVDLFNRGAYYDCHDVL
        MASL SL++SSS  SPL PHR  NS  RHG     P+   ++++RR  +TIS SFRTSYRF+ DHED DE   +I  D  FDEAVDLFN+GAYYDCHDVL
Subjt:  MASLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLPNPTRSSSSSTRRSRSTISLSFRTSYRFSVDHEDEDEEEDQIARDSCFDEAVDLFNRGAYYDCHDVL

Query:  EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSYELLG
        E LWN AEDPTRTLIHGILQCAVGLHHLFNRNH+GAMMELGEG+CKLRKMEF SGPF TFEREI+AVLDFVY TQIELAACDE+VCVTME SERSYELLG
Subjt:  EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSYELLG

Query:  GYGAGQKLYDLEREADGSMCILFSPQTSQTHPLRVRLPTLAATKQHLLALD
         YG GQKLYD+E++ DGSMCI+FSPQTSQTHPLRV+LPTL ATKQHLLALD
Subjt:  GYGAGQKLYDLEREADGSMCILFSPQTSQTHPLRVRLPTLAATKQHLLALD

A0A6J1G1I9 uncharacterized protein LOC1114498256.6e-11383.4Show/hide
Query:  MASLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLPNPTRSSSSSTRRSRSTISLSFRTSYRFSVDHEDEDEEEDQIARDSCFDEAVDLFNRGAYYDCHDVL
        MASL SL VSSS+ S LPPHR+ NS FRHGS+LP P R   ++ RR ++T SLSFRTSYRF+VDHEDEDE+E QIAR+  FDEAVDLFN+GAYYDCHDVL
Subjt:  MASLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLPNPTRSSSSSTRRSRSTISLSFRTSYRFSVDHEDEDEEEDQIARDSCFDEAVDLFNRGAYYDCHDVL

Query:  EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSYELLG
        EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNH+GAMMELGEGLCKLRKMEF++GPF TFEREISAVLDF+Y TQIELAACDE VCVTME SERSYELLG
Subjt:  EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSYELLG

Query:  GYGAGQKLYDLEREADGSMCILFSPQTSQTHPLRVRLPTLAATKQHLLALDYH
         YGAGQKLYD E E DGSMCI+FSPQTSQTHPLRV+LPTLAATKQHLLALDYH
Subjt:  GYGAGQKLYDLEREADGSMCILFSPQTSQTHPLRVRLPTLAATKQHLLALDYH

A0A6J1I2E8 uncharacterized protein LOC1114702777.3e-11283Show/hide
Query:  MASLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLPNPTRSSSSSTRRSRSTISLSFRTSYRFSVDHEDEDEEEDQIARDSCFDEAVDLFNRGAYYDCHDVL
        MASL SL VSSS+ S L PHR+ NS FRHGS+LP P R   ++ RR ++T SLSFRTSYRF+VDHEDEDE+E QIAR+  FDEAVDLFN+GAYYDCHDVL
Subjt:  MASLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLPNPTRSSSSSTRRSRSTISLSFRTSYRFSVDHEDEDEEEDQIARDSCFDEAVDLFNRGAYYDCHDVL

Query:  EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSYELLG
        EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNH+GAMMELGEGLCKLRKMEF++GPF TFEREISAVLDF+Y TQIELAACDE VCVTME SERSYELLG
Subjt:  EILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSYELLG

Query:  GYGAGQKLYDLEREADGSMCILFSPQTSQTHPLRVRLPTLAATKQHLLALDYH
         YGAGQKLYD E E DGSMCI+FSPQTSQTHPLRV+LPTLAATKQHLLALDYH
Subjt:  GYGAGQKLYDLEREADGSMCILFSPQTSQTHPLRVRLPTLAATKQHLLALDYH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G41120.1 unknown protein6.0e-6654.58Show/hide
Query:  SLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLP-NPTRSSSSSTRRSRSTISLSFRTSYRFSV---DHEDEDEEEDQIARDS--CFDEAVDLFNRGAYYDC
        ++SS +    SS  LPP    +SFF   S +P +P   ++S+ R S    S+  R S R  V   DH  EDEE+    ++    F+EAV LFN+  YY  
Subjt:  SLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLP-NPTRSSSSSTRRSRSTISLSFRTSYRFSV---DHEDEDEEEDQIARDS--CFDEAVDLFNRGAYYDC

Query:  HDVLEILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSY
        HD LE LW  AE+PTRTLIHGILQCAVG HHLFN NHKGAMMELGEG+CKLRKM F  GPF  FER++SAVL+FVYQTQ+ELAAC E +C+TM++S+RSY
Subjt:  HDVLEILWNGAEDPTRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSY

Query:  ELLGGYGAGQKLYDLEREAD------GSMCILFSPQTSQTHPLRVRLPTLAATKQHLLALDY
        +LLGGY AG+ +Y LE   D       +  ILFSP  S + P RV+LPTL AT +HLLA  Y
Subjt:  ELLGGYGAGQKLYDLEREAD------GSMCILFSPQTSQTHPLRVRLPTLAATKQHLLALDY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCCTTTCATCGCTGTACGTTTCCTCCTCCTCATCATCCCCTCTTCCACCTCACCGCAATTTGAACTCCTTTTTCCGCCATGGAAGCAATCTCCCAAACCCTAC
AAGAAGCAGCAGCAGCAGCACCAGACGAAGCAGATCCACAATCTCGCTCTCCTTCCGTACCTCCTACCGATTCTCTGTCGATCACGAAGACGAAGACGAGGAGGAGGATC
AGATCGCTAGAGATTCCTGCTTCGACGAAGCGGTCGATCTCTTCAATCGAGGGGCATATTACGATTGCCACGACGTGCTCGAGATTCTGTGGAACGGAGCCGAAGACCCT
ACCAGAACCCTAATCCATGGCATTCTTCAGTGCGCCGTGGGGCTTCATCACCTCTTCAATCGGAACCATAAAGGGGCGATGATGGAGCTGGGAGAGGGGCTGTGTAAGCT
ACGGAAGATGGAGTTTAGAAGTGGCCCTTTCTTTACTTTTGAGAGGGAGATTTCTGCAGTTCTGGACTTCGTTTACCAGACCCAGATTGAATTAGCTGCTTGTGATGAGA
CTGTGTGTGTTACAATGGAGCGTTCAGAGAGATCATATGAACTGCTTGGAGGGTATGGTGCAGGACAGAAGCTGTATGATTTAGAGAGAGAAGCTGATGGAAGCATGTGC
ATTCTCTTCTCTCCTCAAACTTCTCAAACTCATCCACTAAGGGTAAGGCTTCCCACTCTTGCTGCTACCAAACAGCACCTTCTAGCCCTTGACTACCAC
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCCTTTCATCGCTGTACGTTTCCTCCTCCTCATCATCCCCTCTTCCACCTCACCGCAATTTGAACTCCTTTTTCCGCCATGGAAGCAATCTCCCAAACCCTAC
AAGAAGCAGCAGCAGCAGCACCAGACGAAGCAGATCCACAATCTCGCTCTCCTTCCGTACCTCCTACCGATTCTCTGTCGATCACGAAGACGAAGACGAGGAGGAGGATC
AGATCGCTAGAGATTCCTGCTTCGACGAAGCGGTCGATCTCTTCAATCGAGGGGCATATTACGATTGCCACGACGTGCTCGAGATTCTGTGGAACGGAGCCGAAGACCCT
ACCAGAACCCTAATCCATGGCATTCTTCAGTGCGCCGTGGGGCTTCATCACCTCTTCAATCGGAACCATAAAGGGGCGATGATGGAGCTGGGAGAGGGGCTGTGTAAGCT
ACGGAAGATGGAGTTTAGAAGTGGCCCTTTCTTTACTTTTGAGAGGGAGATTTCTGCAGTTCTGGACTTCGTTTACCAGACCCAGATTGAATTAGCTGCTTGTGATGAGA
CTGTGTGTGTTACAATGGAGCGTTCAGAGAGATCATATGAACTGCTTGGAGGGTATGGTGCAGGACAGAAGCTGTATGATTTAGAGAGAGAAGCTGATGGAAGCATGTGC
ATTCTCTTCTCTCCTCAAACTTCTCAAACTCATCCACTAAGGGTAAGGCTTCCCACTCTTGCTGCTACCAAACAGCACCTTCTAGCCCTTGACTACCAC
Protein sequenceShow/hide protein sequence
MASLSSLYVSSSSSSPLPPHRNLNSFFRHGSNLPNPTRSSSSSTRRSRSTISLSFRTSYRFSVDHEDEDEEEDQIARDSCFDEAVDLFNRGAYYDCHDVLEILWNGAEDP
TRTLIHGILQCAVGLHHLFNRNHKGAMMELGEGLCKLRKMEFRSGPFFTFEREISAVLDFVYQTQIELAACDETVCVTMERSERSYELLGGYGAGQKLYDLEREADGSMC
ILFSPQTSQTHPLRVRLPTLAATKQHLLALDYH