CuGenDBv2

Spg014413 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg014413
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein.
Genome location: scaffold3:34927593..34928264
RNA-Seq Expression: Spg014413
Synteny: Spg014413
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0036575.1 uncharacterized protein E6C27_scaffold191G00850 [Cucumis melo var. makuwa] (e value: 3.3e-78; % identity: 71.5)
Query:  THKDLDLQLSLRPPAG---------AANCGRASSLTSMRVTRSFGNRRPSLRRCNSQ--LTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRCR
        T++ LDLQLSLRPP+G         A    RA++LT+ R+TR+ G RR SLRRCNS+   TT TI PPYPWSTN RA+V TLN L+S+QIL +TGDVRCR
Subjt:  THKDLDLQLSLRPPAG---------AANCGRASSLTSMRVTRSFGNRRPSLRRCNSQ--LTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRCR

Query:  RCQRQYEIEYDVVSKFEEIASFVEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRTGAK
        +CQ +Y IEYD+VSKFEEIASFVE+NK+ F DRAP SWMNPN+PTCRFCG ENG+RPVIP+E R+INWLFLLLG+MLG LNLNHLKYFCSYT NHRTGAK
Subjt:  RCQRQYEIEYDVVSKFEEIASFVEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRTGAK

Query:  NRLLYLT
        NRLLYLT
Subjt:  NRLLYLT

XP_008447299.1 PREDICTED: uncharacterized protein LOC103489770 [Cucumis melo] (e value: 3.5e-88; % identity: 72.65)
Query:  THKDLDLQLSLRPPAG---------AANCGRASSLTSMRVTRSFGNRRPSLRRCNSQ--LTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRCR
        T++ LDLQLSLRPP+G         A    RA++LT+ R+TR+ G RR SLRRCNS+   TT TI PPYPWSTN RA+V TLN L+S+QIL +TGDVRCR
Subjt:  THKDLDLQLSLRPPAG---------AANCGRASSLTSMRVTRSFGNRRPSLRRCNSQ--LTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRCR

Query:  RCQRQYEIEYDVVSKFEEIASFVEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRTGAK
        +CQ +Y IEYD+VSKFEEIASFVE+NK+ F DRAP SWMNPN+PTCRFCG ENG+RPVIP+E R+INWLFLLLG+MLG LNLNHLKYFCSYT NHRTGAK
Subjt:  RCQRQYEIEYDVVSKFEEIASFVEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRTGAK

Query:  NRLLYLTYLTLCHQVDPSGRFRR
        NRLLYLTY+TLCHQVDPSGRF R
Subjt:  NRLLYLTYLTLCHQVDPSGRFRR

XP_011659748.1 uncharacterized protein LOC105436256 [Cucumis sativus] (e value: 2.3e-87; % identity: 72.37)
Query:  NQSNETHKDLDLQLSLRPPAG-------AANCG--RASSLTSMRVTRSFGNRRPSLRRCNSQ--LTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTG
        NQ NE H  LDL+LSLRPP+G       AA  G  R +++T+MRVTRS G RR S +RCNS+   TT TI PPYPWSTN RA+V TLN LKSNQIL +TG
Subjt:  NQSNETHKDLDLQLSLRPPAG-------AANCG--RASSLTSMRVTRSFGNRRPSLRRCNSQ--LTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTG

Query:  DVRCRRCQRQYEIEYDVVSKFEEIASFVEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNH
        DV+CR+CQ +Y IEYD+ SKFEEIASFVE+NK+SF DRAP SWMNPN+PTCRFCG ENG+RPVIP++ R+INWLFLLLG+MLG LNLNHLKYFCS T NH
Subjt:  DVRCRRCQRQYEIEYDVVSKFEEIASFVEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNH

Query:  RTGAKNRLLYLTYLTLCHQVDPSGRFRR
        RTGAKNRLLYLTY+TLCHQVDPSGRF R
Subjt:  RTGAKNRLLYLTYLTLCHQVDPSGRFRR

XP_022952797.1 uncharacterized protein LOC111455388 [Cucurbita moschata] (e value: 1.3e-69; % identity: 59.91)
Query:  LDLQLSLRPPAGAANCGRASS---------------LTSMRVTRSFGNRRPSLR-RCNSQLTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRC
        +DL+LSL  P+ A     A++               L+S+R   + G R+ SLR R ++  TT  I PPYPWST+  AVVHTL++L SNQILT+TG+V+C
Subjt:  LDLQLSLRPPAGAANCGRASS---------------LTSMRVTRSFGNRRPSLR-RCNSQLTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRC

Query:  RRCQRQYEIEYDVVSKFEEIASFVEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRTGA
        ++C+R YEIEYDVVSKF EI SFVE N +SF DRAP  WM PN+PTCRFCG E G +PVIP+E  +INW+FLLLG+M+G L LNHLKYFCSYTKNHRTG+
Subjt:  RRCQRQYEIEYDVVSKFEEIASFVEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRTGA

Query:  KNRLLYLTYLTLCHQVDPSGRF
        K+RL+YLTY+TLC Q+DPSGRF
Subjt:  KNRLLYLTYLTLCHQVDPSGRF

XP_022972401.1 uncharacterized protein LOC111470968 [Cucurbita maxima] (e value: 2.8e-69; % identity: 65.5)
Query:  AGAANCGRASSLTSMRVTRSFGNRRPSLRR--CNSQLTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRCRRCQRQYEIEYDVVSKFEEIASFV
        A AA       L+S+R     G R+ SLRR  CNS  TT  I PPYPWST+  AVVHTL++L  NQILT+TGDV+C++C+R YEIEY+VVSKF EI SFV
Subjt:  AGAANCGRASSLTSMRVTRSFGNRRPSLRR--CNSQLTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRCRRCQRQYEIEYDVVSKFEEIASFV

Query:  EKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRTGAKNRLLYLTYLTLCHQVDPSGRFRR
        E N +SF DRAP  WM PN+PTCRFCG E G +PVIP+E  +INW+FLLLG+M+G L LNHLKYFCSYTKNHRTG+K+RL+YLTY+TLC Q+DPSGRF R
Subjt:  EKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRTGAKNRLLYLTYLTLCHQVDPSGRFRR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K3Q8 Uncharacterized protein (e value: 1.1e-87; % identity: 72.37)
Query:  NQSNETHKDLDLQLSLRPPAG-------AANCG--RASSLTSMRVTRSFGNRRPSLRRCNSQ--LTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTG
        NQ NE H  LDL+LSLRPP+G       AA  G  R +++T+MRVTRS G RR S +RCNS+   TT TI PPYPWSTN RA+V TLN LKSNQIL +TG
Subjt:  NQSNETHKDLDLQLSLRPPAG-------AANCG--RASSLTSMRVTRSFGNRRPSLRRCNSQ--LTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTG

Query:  DVRCRRCQRQYEIEYDVVSKFEEIASFVEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNH
        DV+CR+CQ +Y IEYD+ SKFEEIASFVE+NK+SF DRAP SWMNPN+PTCRFCG ENG+RPVIP++ R+INWLFLLLG+MLG LNLNHLKYFCS T NH
Subjt:  DVRCRRCQRQYEIEYDVVSKFEEIASFVEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNH

Query:  RTGAKNRLLYLTYLTLCHQVDPSGRFRR
        RTGAKNRLLYLTY+TLCHQVDPSGRF R
Subjt:  RTGAKNRLLYLTYLTLCHQVDPSGRFRR

A0A1S3BHR1 uncharacterized protein LOC103489770 (e value: 1.7e-88; % identity: 72.65)
Query:  THKDLDLQLSLRPPAG---------AANCGRASSLTSMRVTRSFGNRRPSLRRCNSQ--LTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRCR
        T++ LDLQLSLRPP+G         A    RA++LT+ R+TR+ G RR SLRRCNS+   TT TI PPYPWSTN RA+V TLN L+S+QIL +TGDVRCR
Subjt:  THKDLDLQLSLRPPAG---------AANCGRASSLTSMRVTRSFGNRRPSLRRCNSQ--LTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRCR

Query:  RCQRQYEIEYDVVSKFEEIASFVEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRTGAK
        +CQ +Y IEYD+VSKFEEIASFVE+NK+ F DRAP SWMNPN+PTCRFCG ENG+RPVIP+E R+INWLFLLLG+MLG LNLNHLKYFCSYT NHRTGAK
Subjt:  RCQRQYEIEYDVVSKFEEIASFVEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRTGAK

Query:  NRLLYLTYLTLCHQVDPSGRFRR
        NRLLYLTY+TLCHQVDPSGRF R
Subjt:  NRLLYLTYLTLCHQVDPSGRFRR

A0A5A7T547 Uncharacterized protein (e value: 1.6e-78; % identity: 71.5)
Query:  THKDLDLQLSLRPPAG---------AANCGRASSLTSMRVTRSFGNRRPSLRRCNSQ--LTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRCR
        T++ LDLQLSLRPP+G         A    RA++LT+ R+TR+ G RR SLRRCNS+   TT TI PPYPWSTN RA+V TLN L+S+QIL +TGDVRCR
Subjt:  THKDLDLQLSLRPPAG---------AANCGRASSLTSMRVTRSFGNRRPSLRRCNSQ--LTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRCR

Query:  RCQRQYEIEYDVVSKFEEIASFVEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRTGAK
        +CQ +Y IEYD+VSKFEEIASFVE+NK+ F DRAP SWMNPN+PTCRFCG ENG+RPVIP+E R+INWLFLLLG+MLG LNLNHLKYFCSYT NHRTGAK
Subjt:  RCQRQYEIEYDVVSKFEEIASFVEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRTGAK

Query:  NRLLYLT
        NRLLYLT
Subjt:  NRLLYLT

A0A6J1GLD4 uncharacterized protein LOC111455388 (e value: 6.1e-70; % identity: 59.91)
Query:  LDLQLSLRPPAGAANCGRASS---------------LTSMRVTRSFGNRRPSLR-RCNSQLTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRC
        +DL+LSL  P+ A     A++               L+S+R   + G R+ SLR R ++  TT  I PPYPWST+  AVVHTL++L SNQILT+TG+V+C
Subjt:  LDLQLSLRPPAGAANCGRASS---------------LTSMRVTRSFGNRRPSLR-RCNSQLTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRC

Query:  RRCQRQYEIEYDVVSKFEEIASFVEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRTGA
        ++C+R YEIEYDVVSKF EI SFVE N +SF DRAP  WM PN+PTCRFCG E G +PVIP+E  +INW+FLLLG+M+G L LNHLKYFCSYTKNHRTG+
Subjt:  RRCQRQYEIEYDVVSKFEEIASFVEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRTGA

Query:  KNRLLYLTYLTLCHQVDPSGRF
        K+RL+YLTY+TLC Q+DPSGRF
Subjt:  KNRLLYLTYLTLCHQVDPSGRF

A0A6J1I5V9 uncharacterized protein LOC111470968 (e value: 1.4e-69; % identity: 65.5)
Query:  AGAANCGRASSLTSMRVTRSFGNRRPSLRR--CNSQLTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRCRRCQRQYEIEYDVVSKFEEIASFV
        A AA       L+S+R     G R+ SLRR  CNS  TT  I PPYPWST+  AVVHTL++L  NQILT+TGDV+C++C+R YEIEY+VVSKF EI SFV
Subjt:  AGAANCGRASSLTSMRVTRSFGNRRPSLRR--CNSQLTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRCRRCQRQYEIEYDVVSKFEEIASFV

Query:  EKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRTGAKNRLLYLTYLTLCHQVDPSGRFRR
        E N +SF DRAP  WM PN+PTCRFCG E G +PVIP+E  +INW+FLLLG+M+G L LNHLKYFCSYTKNHRTG+K+RL+YLTY+TLC Q+DPSGRF R
Subjt:  EKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRTGAKNRLLYLTYLTLCHQVDPSGRFRR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G49330.1 hydroxyproline-rich glycoprotein family protein (e value: 9.7e-44; % identity: 42.45)
Query:  NQSNETHKDLDLQLSLRPPAGAANCGRASSLTSMRVTRSF-GNRRPSLRRCNSQLTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRCRRCQRQ
        N   +T     L     PP+G A    +S+LT   V R   G+ R    R      + TI PP+PW+TN R  + +L +L+SNQI T+TG+V+CR C++ 
Subjt:  NQSNETHKDLDLQLSLRPPAGAANCGRASSLTSMRVTRSF-GNRRPSLRRCNSQLTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRCRRCQRQ

Query:  YEIEYDVVSKFEEIASFVEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRTGAKNRLLY
        Y++ Y++  +F E+  F    K    DRA   W  P    C  CG E   +PVI E   +INWLFLLLGQ LG+  L  LK FC ++KNHRTGAK+R+LY
Subjt:  YEIEYDVVSKFEEIASFVEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRTGAKNRLLY

Query:  LTYLTLCHQVDP
        LTY+ LC  + P
Subjt:  LTYLTLCHQVDP

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1) (e value: 5.0e-40; % identity: 41.21)
Query:  RPPAGAANCGRASSLTSMRVTRSFGNRRPSLRRCNSQLTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRCRRCQRQYEIEYDVVSKFEEIASF
        RPP G A   R S      V R+ G+R               I PPYPW+T     + +   L SN I  ++G V C+ C R   +EY++  KF E+  +
Subjt:  RPPAGAANCGRASSLTSMRVTRSFGNRRPSLRRCNSQLTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRCRRCQRQYEIEYDVVSKFEEIASF

Query:  VEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRTGAKNRLLYLTYLTLCHQVDPSGRF
        ++ NK+    RAP SW  P    CR C  E   +PV+ E    INWLFLLLGQMLG   L+ L+YFC     HRTG+K+R++Y+TYL+LC Q+DP G F
Subjt:  VEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRTGAKNRLLYLTYLTLCHQVDPSGRF

AT2G16190.2 FUNCTIONS IN: molecular_function unknown (e value: 2.0e-25; % identity: 37.71)
Query:  RPPAGAANCGRASSLTSMRVTRSFGNRRPSLRRCNSQLTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRCRRCQRQYEIEYDVVSKFEEIASF
        RPP G A   R S      V R+ G+R               I PPYPW+T     + +   L SN I  ++G V C+ C R   +EY++  KF E+  +
Subjt:  RPPAGAANCGRASSLTSMRVTRSFGNRRPSLRRCNSQLTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRCRRCQRQYEIEYDVVSKFEEIASF

Query:  VEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRT
        ++ NK+    RAP SW  P    CR C  E   +PV+ E    INWLFLLLGQMLG   L+ L    S  K+H T
Subjt:  VEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRT


Sequences
CDS sequence
ATGGAGAATAACTCCATAAACCAAAGCAATGAAACTCACAAAGATCTCGACCTGCAGCTCTCTCTCCGGCCACCGGCCGGCGCTGCCAACTGTGGCAGGGCGAGTTCCCT
GACCTCAATGAGGGTCACTCGCAGCTTCGGAAATCGTCGACCCTCTCTCCGCCGCTGCAATTCCCAACTGACGACGGTGACGATCCCCCCGCCGTATCCGTGGTCGACAA
ACCACCGAGCGGTCGTCCACACCCTGAACCACCTGAAATCAAATCAAATCCTGACGGTCACAGGCGACGTCCGGTGCCGTCGGTGCCAGAGACAGTACGAGATCGAATAC
GACGTCGTGTCGAAGTTCGAGGAGATTGCGAGCTTCGTGGAGAAGAACAAGGATTCATTTCACGACCGAGCGCCGAGCTCGTGGATGAACCCTAATTTTCCGACGTGTAG
ATTTTGCGGTCTTGAAAACGGATCGAGGCCGGTGATCCCAGAGGAAATGAGAAGGATCAATTGGCTGTTCTTGCTTTTGGGACAAATGCTTGGATATTTGAACCTTAATC
ATCTGAAATACTTCTGCAGTTACACTAAAAATCATCGTACGGGTGCTAAGAATCGGCTTCTTTATCTCACTTATCTCACTTTGTGCCACCAAGTTGATCCCTCTGGCCGT
TTCCGGCGCTGA
mRNA sequence
ATGGAGAATAACTCCATAAACCAAAGCAATGAAACTCACAAAGATCTCGACCTGCAGCTCTCTCTCCGGCCACCGGCCGGCGCTGCCAACTGTGGCAGGGCGAGTTCCCT
GACCTCAATGAGGGTCACTCGCAGCTTCGGAAATCGTCGACCCTCTCTCCGCCGCTGCAATTCCCAACTGACGACGGTGACGATCCCCCCGCCGTATCCGTGGTCGACAA
ACCACCGAGCGGTCGTCCACACCCTGAACCACCTGAAATCAAATCAAATCCTGACGGTCACAGGCGACGTCCGGTGCCGTCGGTGCCAGAGACAGTACGAGATCGAATAC
GACGTCGTGTCGAAGTTCGAGGAGATTGCGAGCTTCGTGGAGAAGAACAAGGATTCATTTCACGACCGAGCGCCGAGCTCGTGGATGAACCCTAATTTTCCGACGTGTAG
ATTTTGCGGTCTTGAAAACGGATCGAGGCCGGTGATCCCAGAGGAAATGAGAAGGATCAATTGGCTGTTCTTGCTTTTGGGACAAATGCTTGGATATTTGAACCTTAATC
ATCTGAAATACTTCTGCAGTTACACTAAAAATCATCGTACGGGTGCTAAGAATCGGCTTCTTTATCTCACTTATCTCACTTTGTGCCACCAAGTTGATCCCTCTGGCCGT
TTCCGGCGCTGA
Protein sequence
MENNSINQSNETHKDLDLQLSLRPPAGAANCGRASSLTSMRVTRSFGNRRPSLRRCNSQLTTVTIPPPYPWSTNHRAVVHTLNHLKSNQILTVTGDVRCRRCQRQYEIEY
DVVSKFEEIASFVEKNKDSFHDRAPSSWMNPNFPTCRFCGLENGSRPVIPEEMRRINWLFLLLGQMLGYLNLNHLKYFCSYTKNHRTGAKNRLLYLTYLTLCHQVDPSGR
FRR
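
As a rough consistency check on this record, the genome location can be compared against the sequence lengths. The sketch below is a minimal illustration in Python and is not part of CuGenDBv2; the function names `parse_location`, `span_length`, and `expected_cds_length` are hypothetical. It assumes the coordinates are 1-based and inclusive, and that the gene is a single exon without UTRs (which the identical CDS and mRNA sequences suggest). The protein length of 223 residues is counted from the protein sequence listed above.

```python
import re


def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


def span_length(location):
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


def expected_cds_length(protein_length):
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_length + 1)


location = "scaffold3:34927593..34928264"  # taken from the record above
protein_length = 223                       # residues in the protein sequence above

print(span_length(location))
print(expected_cds_length(protein_length))
```

Under these assumptions both values come out to 672, so the 223-residue protein plus a stop codon accounts for the whole genomic span.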