; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10022175 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10022175
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionLOB domain-containing protein
Genome locationChr05:21595613..21596104
RNA-Seq ExpressionHG10022175
SyntenyHG10022175
Gene Ontology termsNA
InterPro domainsIPR004883 - Lateral organ boundaries, LOB


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037575.1 LOB domain-containing protein 25 [Cucumis melo var. makuwa]9.8e-8092.73Show/hide
Query:  MASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ
        MASSSY++SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ
Subjt:  MASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ

Query:  KELDATNADLVRYASGGARPPSQFGRRAATASV--MRHHSGLSFLSPLSSDDHDDPYSHDKEEGN
        KELDATNADLVRYASG ARPPSQ+GRRA TASV   RHHSGLSFLSPLSS+ HDDPY H+KEEGN
Subjt:  KELDATNADLVRYASGGARPPSQFGRRAATASV--MRHHSGLSFLSPLSSDDHDDPYSHDKEEGN

XP_008458802.1 PREDICTED: LOB domain-containing protein 25 [Cucumis melo]3.7e-7992.12Show/hide
Query:  MASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ
        MASSSY++SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ
Subjt:  MASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ

Query:  KELDATNADLVRYASGGARPPSQFGRRAATASV--MRHHSGLSFLSPLSSDDHDDPYSHDKEEGN
        KELDATNADLVRYASG ARPPSQ+GRRA TASV   RHHSGLSFLSPLSS+ HDDPY H+ EEGN
Subjt:  KELDATNADLVRYASGGARPPSQFGRRAATASV--MRHHSGLSFLSPLSSDDHDDPYSHDKEEGN

XP_011655399.1 LOB domain-containing protein 25 [Cucumis sativus]1.4e-7892.68Show/hide
Query:  MASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ
        MASSSY++SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ
Subjt:  MASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ

Query:  KELDATNADLVRYASGGARPPSQFGRRAATASV-MRHHSGLSFLSPLSSDDHDDPYSHDKEEGN
        KELDATNADLVRYASG ARPPSQ+GRRA TASV  RHHSGLSFLSPLSS  HD  Y HDKEEGN
Subjt:  KELDATNADLVRYASGGARPPSQFGRRAATASV-MRHHSGLSFLSPLSSDDHDDPYSHDKEEGN

XP_022991035.1 LOB domain-containing protein 25-like [Cucurbita maxima]8.6e-6883.95Show/hide
Query:  ASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQK
        +SSSYS+SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLL+EV PHQREDAVNSLAYEAEARMKDPVYGCVGAIS+LQRQVIRLQK
Subjt:  ASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQK

Query:  ELDATNADLVRYASGGARPPSQFGRRAATASVMRHHSGLSFLSPLSSDDHDDPYSHDKEEGN
        ELDATNADLVRYASGGARP   +GRRAA        SG+ F+SPL+S+D  DPY HDKEE N
Subjt:  ELDATNADLVRYASGGARPPSQFGRRAATASVMRHHSGLSFLSPLSSDDHDDPYSHDKEEGN

XP_038890699.1 LOB domain-containing protein 25-like [Benincasa hispida]2.1e-8295.71Show/hide
Query:  MASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ
        MASSSYS+SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ
Subjt:  MASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ

Query:  KELDATNADLVRYASGGARPPSQFGRRAATASVMRHHSGLSFLSPLSSDDHDDPYSHDKEEGN
        KELDATNADLVRYASGGARPPSQFGRRAATAS+ RHHSGLSFLSPLSS+DHD  Y HDKEEGN
Subjt:  KELDATNADLVRYASGGARPPSQFGRRAATASVMRHHSGLSFLSPLSSDDHDDPYSHDKEEGN

TrEMBL top hitse value%identityAlignment
A0A0A0KSE8 LOB domain-containing protein6.8e-7992.68Show/hide
Query:  MASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ
        MASSSY++SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ
Subjt:  MASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ

Query:  KELDATNADLVRYASGGARPPSQFGRRAATASV-MRHHSGLSFLSPLSSDDHDDPYSHDKEEGN
        KELDATNADLVRYASG ARPPSQ+GRRA TASV  RHHSGLSFLSPLSS  HD  Y HDKEEGN
Subjt:  KELDATNADLVRYASGGARPPSQFGRRAATASV-MRHHSGLSFLSPLSSDDHDDPYSHDKEEGN

A0A1S3C8P8 LOB domain-containing protein 251.8e-7992.12Show/hide
Query:  MASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ
        MASSSY++SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ
Subjt:  MASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ

Query:  KELDATNADLVRYASGGARPPSQFGRRAATASV--MRHHSGLSFLSPLSSDDHDDPYSHDKEEGN
        KELDATNADLVRYASG ARPPSQ+GRRA TASV   RHHSGLSFLSPLSS+ HDDPY H+ EEGN
Subjt:  KELDATNADLVRYASGGARPPSQFGRRAATASV--MRHHSGLSFLSPLSSDDHDDPYSHDKEEGN

A0A5A7T800 LOB domain-containing protein 254.7e-8092.73Show/hide
Query:  MASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ
        MASSSY++SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ
Subjt:  MASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ

Query:  KELDATNADLVRYASGGARPPSQFGRRAATASV--MRHHSGLSFLSPLSSDDHDDPYSHDKEEGN
        KELDATNADLVRYASG ARPPSQ+GRRA TASV   RHHSGLSFLSPLSS+ HDDPY H+KEEGN
Subjt:  KELDATNADLVRYASGGARPPSQFGRRAATASV--MRHHSGLSFLSPLSSDDHDDPYSHDKEEGN

A0A6J1E293 LOB domain-containing protein 25-like2.3e-6682.1Show/hide
Query:  ASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQK
        +SSS+S+SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLL+EV PHQREDAVNSLAYEAEARMKDPVYGCVGAIS+LQRQVIRLQK
Subjt:  ASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQK

Query:  ELDATNADLVRYASGGARPPSQFGRRAATASVMRHHSGLSFLSPLSSDDHDDPYSHDKEEGN
        ELDATNADLVRYASGGARP   +GRRAA        SG+ F+SPL+++D  DPY  DKEE N
Subjt:  ELDATNADLVRYASGGARPPSQFGRRAATASVMRHHSGLSFLSPLSSDDHDDPYSHDKEEGN

A0A6J1JRR4 LOB domain-containing protein 25-like4.2e-6883.95Show/hide
Query:  ASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQK
        +SSSYS+SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLL+EV PHQREDAVNSLAYEAEARMKDPVYGCVGAIS+LQRQVIRLQK
Subjt:  ASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQK

Query:  ELDATNADLVRYASGGARPPSQFGRRAATASVMRHHSGLSFLSPLSSDDHDDPYSHDKEEGN
        ELDATNADLVRYASGGARP   +GRRAA        SG+ F+SPL+S+D  DPY HDKEE N
Subjt:  ELDATNADLVRYASGGARPPSQFGRRAATASVMRHHSGLSFLSPLSSDDHDDPYSHDKEEGN

SwissProt top hitse value%identityAlignment
A2WXT0 LOB domain-containing protein 62.5e-3864.55Show/hide
Query:  SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQKELDATNA
        SPCAACKFLRRKC  DC+FAPYFPP+ P KF +VH++FGASNV+KLLNE+ P+QREDAVNSLAYEA+ R++DPVYGCV  IS+LQR + +LQ++L     
Subjt:  SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQKELDATNA

Query:  DLVRYASGGA
        +L +Y    A
Subjt:  DLVRYASGGA

O04479 Protein ASYMMETRIC LEAVES 21.9e-4168.38Show/hide
Query:  MASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ
        MASSS ++SPCAACKFLRRKC  +C+FAPYFPP++P KFANVHK+FGASNV+KLLNE+ P QREDAVNSLAYEA+ R++DPVYGCVG IS+LQ Q+ +LQ
Subjt:  MASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQ

Query:  KELDATNADLVRYASGG
         +L    ++L +Y S G
Subjt:  KELDATNADLVRYASGG

Q32SG3 LOB domain-containing protein 61.9e-3863.16Show/hide
Query:  SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQKELDATNA
        SPCAACKFLRRKC  DC+FAPYFPP+ P KF +VH++FGASNV+KLLNE+ P QREDAVNSLAYEA+ R++DPVYGCVG IS+LQ  + +LQ++L     
Subjt:  SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQKELDATNA

Query:  DLVRYASGGARPPS
        +L +Y +  A   S
Subjt:  DLVRYASGGARPPS

Q8L8Q3 LOB domain-containing protein 252.8e-5379.37Show/hide
Query:  SSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQKEL
        S+Y++SPCAACKFLRRKC +DC+FAPYFPPEEPTKFANVH+IFGASNVSK+L+EV PHQREDAVNSLAYEAEAR+KDPVYGCVGAISVLQRQV+RLQ+EL
Subjt:  SSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQKEL

Query:  DATNADLVRYAS--GGARPPSQFGRR
        + TNADL+RYA   GG    +  GRR
Subjt:  DATNADLVRYAS--GGARPPSQFGRR

Q9FML4 Protein LATERAL ORGAN BOUNDARIES3.2e-4985.09Show/hide
Query:  MASSSYSH-SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRL
        MASSS S+ SPCAACKFLRRKC+  CIFAPYFPPEEP KFANVHKIFGASNV+KLLNE+ PHQREDAVNSLAYEAEAR++DPVYGCVGAIS LQRQV RL
Subjt:  MASSSYSH-SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRL

Query:  QKELDATNADLVRY
        QKELDA NADL  Y
Subjt:  QKELDATNADLVRY

Arabidopsis top hitse value%identityAlignment
AT3G27650.1 LOB domain-containing protein 252.0e-5479.37Show/hide
Query:  SSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQKEL
        S+Y++SPCAACKFLRRKC +DC+FAPYFPPEEPTKFANVH+IFGASNVSK+L+EV PHQREDAVNSLAYEAEAR+KDPVYGCVGAISVLQRQV+RLQ+EL
Subjt:  SSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQKEL

Query:  DATNADLVRYAS--GGARPPSQFGRR
        + TNADL+RYA   GG    +  GRR
Subjt:  DATNADLVRYAS--GGARPPSQFGRR

AT5G63090.1 Lateral organ boundaries (LOB) domain family protein2.3e-5085.09Show/hide
Query:  MASSSYSH-SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRL
        MASSS S+ SPCAACKFLRRKC+  CIFAPYFPPEEP KFANVHKIFGASNV+KLLNE+ PHQREDAVNSLAYEAEAR++DPVYGCVGAIS LQRQV RL
Subjt:  MASSSYSH-SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRL

Query:  QKELDATNADLVRY
        QKELDA NADL  Y
Subjt:  QKELDATNADLVRY

AT5G63090.2 Lateral organ boundaries (LOB) domain family protein2.3e-5085.09Show/hide
Query:  MASSSYSH-SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRL
        MASSS S+ SPCAACKFLRRKC+  CIFAPYFPPEEP KFANVHKIFGASNV+KLLNE+ PHQREDAVNSLAYEAEAR++DPVYGCVGAIS LQRQV RL
Subjt:  MASSSYSH-SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRL

Query:  QKELDATNADLVRY
        QKELDA NADL  Y
Subjt:  QKELDATNADLVRY

AT5G63090.3 Lateral organ boundaries (LOB) domain family protein2.3e-5085.09Show/hide
Query:  MASSSYSH-SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRL
        MASSS S+ SPCAACKFLRRKC+  CIFAPYFPPEEP KFANVHKIFGASNV+KLLNE+ PHQREDAVNSLAYEAEAR++DPVYGCVGAIS LQRQV RL
Subjt:  MASSSYSH-SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRL

Query:  QKELDATNADLVRY
        QKELDA NADL  Y
Subjt:  QKELDATNADLVRY

AT5G63090.4 Lateral organ boundaries (LOB) domain family protein2.3e-5085.09Show/hide
Query:  MASSSYSH-SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRL
        MASSS S+ SPCAACKFLRRKC+  CIFAPYFPPEEP KFANVHKIFGASNV+KLLNE+ PHQREDAVNSLAYEAEAR++DPVYGCVGAIS LQRQV RL
Subjt:  MASSSYSH-SPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRL

Query:  QKELDATNADLVRY
        QKELDA NADL  Y
Subjt:  QKELDATNADLVRY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCAAGCTACTCACACTCACCATGTGCAGCATGCAAGTTCCTAAGAAGAAAATGCTTAGCAGACTGCATATTTGCACCCTACTTTCCTCCTGAAGAGCCAAC
CAAATTTGCCAATGTTCACAAGATTTTTGGAGCTAGCAATGTGAGCAAGCTTCTCAACGAGGTCCAGCCCCATCAGCGTGAGGACGCGGTCAATTCTCTAGCTTACGAGG
CTGAGGCTCGTATGAAAGACCCCGTCTATGGTTGCGTCGGGGCCATCTCCGTTCTACAACGTCAAGTCATTCGATTGCAGAAGGAGCTCGATGCTACAAATGCTGATTTA
GTACGATATGCTAGCGGGGGCGCTCGCCCGCCCTCGCAATTTGGAAGGAGGGCAGCAACCGCCTCGGTGATGCGCCATCATTCTGGTCTGTCTTTTCTTTCTCCCTTGAG
TAGTGATGATCATGATGATCCTTATTCTCATGACAAAGAGGAAGGCAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTTCAAGCTACTCACACTCACCATGTGCAGCATGCAAGTTCCTAAGAAGAAAATGCTTAGCAGACTGCATATTTGCACCCTACTTTCCTCCTGAAGAGCCAAC
CAAATTTGCCAATGTTCACAAGATTTTTGGAGCTAGCAATGTGAGCAAGCTTCTCAACGAGGTCCAGCCCCATCAGCGTGAGGACGCGGTCAATTCTCTAGCTTACGAGG
CTGAGGCTCGTATGAAAGACCCCGTCTATGGTTGCGTCGGGGCCATCTCCGTTCTACAACGTCAAGTCATTCGATTGCAGAAGGAGCTCGATGCTACAAATGCTGATTTA
GTACGATATGCTAGCGGGGGCGCTCGCCCGCCCTCGCAATTTGGAAGGAGGGCAGCAACCGCCTCGGTGATGCGCCATCATTCTGGTCTGTCTTTTCTTTCTCCCTTGAG
TAGTGATGATCATGATGATCCTTATTCTCATGACAAAGAGGAAGGCAATTGA
Protein sequenceShow/hide protein sequence
MASSSYSHSPCAACKFLRRKCLADCIFAPYFPPEEPTKFANVHKIFGASNVSKLLNEVQPHQREDAVNSLAYEAEARMKDPVYGCVGAISVLQRQVIRLQKELDATNADL
VRYASGGARPPSQFGRRAATASVMRHHSGLSFLSPLSSDDHDDPYSHDKEEGN