CuGenDBv2

Moc08g10310 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g10310
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Ribonuclease 3-like protein 1
Genome location: chr8:7685634..7687946
RNA-Seq Expression: Moc08g10310
Synteny: Moc08g10310
Gene Ontology terms:
GO:0030422 - production of siRNA involved in RNA interference (biological process)
GO:0090501 - RNA phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0004525 - ribonuclease III activity (molecular function)
InterPro domains: IPR014720 - Double-stranded RNA-binding domain


Homology
GenBank top hits (e value, % identity, alignment)
XP_022153429.1 uncharacterized protein LOC111020939 isoform X1 [Momordica charantia] | e value: 1.5e-108 | identity: 100%
Query:  MEQTPPYPRRLNVNLKNLPPINPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMKPDTAGALNTKTIPEASSDSFVAKNENSREIASTCSCGHRS
        MEQTPPYPRRLNVNLKNLPPINPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMKPDTAGALNTKTIPEASSDSFVAKNENSREIASTCSCGHRS
Subjt:  MEQTPPYPRRLNVNLKNLPPINPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMKPDTAGALNTKTIPEASSDSFVAKNENSREIASTCSCGHRS

Query:  AKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGYSHL
        AKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGYSHL
Subjt:  AKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGYSHL

XP_022153430.1 uncharacterized protein LOC111020939 isoform X2 [Momordica charantia] | e value: 9.1e-77 | identity: 100%
Query:  MEQTPPYPRRLNVNLKNLPPINPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMKPDTAGALNTKTIPEASSDSFVAKNENSREIASTCSCGHRS
        MEQTPPYPRRLNVNLKNLPPINPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMKPDTAGALNTKTIPEASSDSFVAKNENSREIASTCSCGHRS
Subjt:  MEQTPPYPRRLNVNLKNLPPINPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMKPDTAGALNTKTIPEASSDSFVAKNENSREIASTCSCGHRS

Query:  AKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAK
        AKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAK
Subjt:  AKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAK

XP_038878769.1 ribonuclease 3-like protein 1 isoform X1 [Benincasa hispida] | e value: 4.5e-68 | identity: 70.85%
Query:  MEQTPPYPRRLNVNLKNLPPINPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPE------MKPDTAG---ALNTKTIPEASSDSFVAKNENSREIA
        ME T   PRRLN NLKNLPPIN       +H SI AKFEKSRYIRRVP F P NHRP       MKP  +G   ALN+KTIPEA SDSFV K +NS+EIA
Subjt:  MEQTPPYPRRLNVNLKNLPPINPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPE------MKPDTAG---ALNTKTIPEASSDSFVAKNENSREIA

Query:  STCS-CGHRSAKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGY
        S  S CGH S KEGT EKRAAKSLLFE+CTAN+W+PPLFECCEEEGPSHAKK+RFKV VEMKGA + VLECYGN Q RKKVAA++AAEGA+W LK+LGY
Subjt:  STCS-CGHRSAKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGY

XP_038878786.1 ribonuclease 3-like protein 1 isoform X2 [Benincasa hispida] | e value: 3.6e-65 | identity: 70.83%
Query:  MEQTPPYPRRLNVNLKNLPPINPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPE------MKPDTAG---ALNTKTIPEASSDSFVAKNENSREIA
        ME T   PRRLN NLKNLPPIN       +H SI AKFEKSRYIRRVP F P NHRP       MKP  +G   ALN+KTIPEA SDSFV K +NS+EIA
Subjt:  MEQTPPYPRRLNVNLKNLPPINPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPE------MKPDTAG---ALNTKTIPEASSDSFVAKNENSREIA

Query:  STCS-CGHRSAKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALW
        S  S CGH S KEGT EKRAAKSLLFE+CTAN+W+PPLFECCEEEGPSHAKK+RFKV VEMKGA + VLECYGN Q RKKVAA++AAEGA+W
Subjt:  STCS-CGHRSAKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALW

XP_038878793.1 ribonuclease 3-like protein 1 isoform X3 [Benincasa hispida] | e value: 4.5e-68 | identity: 70.85%
Query:  MEQTPPYPRRLNVNLKNLPPINPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPE------MKPDTAG---ALNTKTIPEASSDSFVAKNENSREIA
        ME T   PRRLN NLKNLPPIN       +H SI AKFEKSRYIRRVP F P NHRP       MKP  +G   ALN+KTIPEA SDSFV K +NS+EIA
Subjt:  MEQTPPYPRRLNVNLKNLPPINPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPE------MKPDTAG---ALNTKTIPEASSDSFVAKNENSREIA

Query:  STCS-CGHRSAKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGY
        S  S CGH S KEGT EKRAAKSLLFE+CTAN+W+PPLFECCEEEGPSHAKK+RFKV VEMKGA + VLECYGN Q RKKVAA++AAEGA+W LK+LGY
Subjt:  STCS-CGHRSAKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGY

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KYI7 DRBM domain-containing protein | e value: 3.0e-65 | identity: 66.5%
Query:  MEQTPPY-PRRLNVNLKNLPPI-NPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMK-------------PDTAGALNTKTIPEA-SSDSFVAKN
        ME  P   P RLN+NLKNLPPI NPR  +G+H  SIPAKFEKSRYIRRVP F P +HRPE++              D+A ALNTKT P A S+DSFV K 
Subjt:  MEQTPPY-PRRLNVNLKNLPPI-NPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMK-------------PDTAGALNTKTIPEA-SSDSFVAKN

Query:  ENSREIASTCS-CGHRSAKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWC
        +  +++AS CS C H S KEGT EKRAAKSLLFE+CTAN+W+PPLFECCEEEGPSHAKK+RFKV +EMKG C+AV+ECYGN Q RKKVAA++AAEGALW 
Subjt:  ENSREIASTCS-CGHRSAKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWC

Query:  LKHLGY
        L HLGY
Subjt:  LKHLGY

A0A1S3BXK0 uncharacterized protein LOC103494682 isoform X1 | e value: 2.5e-64 | identity: 64.88%
Query:  EQTPPYPRRLNVNLKNLPPI-NPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMK--------------PDTAGALNTKTIPEA-SSDSFVAKNE
        +  P  P RLN+NLKNLPPI NPR   G H  SIPAKFEKSRYIRRVP F P +HRPE++               D+A A+NTKTIP A S+D FV K +
Subjt:  EQTPPYPRRLNVNLKNLPPI-NPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMK--------------PDTAGALNTKTIPEA-SSDSFVAKNE

Query:  NSREIASTCS-CGHRSAKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCL
        N +++AS CS C H S KEGT EKRAAKSLLFE+CTAN+W+PPLFECCEEEGPSHA+K+RFKV +EMKG C+AV+ECYGN Q RKK+AA++AAEGALW L
Subjt:  NSREIASTCS-CGHRSAKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCL

Query:  KHLGY
        K+LGY
Subjt:  KHLGY

A0A5D3E0Z9 Ribonuclease 3-like protein 1 | e value: 2.5e-64 | identity: 64.88%
Query:  EQTPPYPRRLNVNLKNLPPI-NPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMK--------------PDTAGALNTKTIPEA-SSDSFVAKNE
        +  P  P RLN+NLKNLPPI NPR   G H  SIPAKFEKSRYIRRVP F P +HRPE++               D+A A+NTKTIP A S+D FV K +
Subjt:  EQTPPYPRRLNVNLKNLPPI-NPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMK--------------PDTAGALNTKTIPEA-SSDSFVAKNE

Query:  NSREIASTCS-CGHRSAKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCL
        N +++AS CS C H S KEGT EKRAAKSLLFE+CTAN+W+PPLFECCEEEGPSHA+K+RFKV +EMKG C+AV+ECYGN Q RKK+AA++AAEGALW L
Subjt:  NSREIASTCS-CGHRSAKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCL

Query:  KHLGY
        K+LGY
Subjt:  KHLGY

A0A6J1DGU1 uncharacterized protein LOC111020939 isoform X2 | e value: 4.4e-77 | identity: 100%
Query:  MEQTPPYPRRLNVNLKNLPPINPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMKPDTAGALNTKTIPEASSDSFVAKNENSREIASTCSCGHRS
        MEQTPPYPRRLNVNLKNLPPINPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMKPDTAGALNTKTIPEASSDSFVAKNENSREIASTCSCGHRS
Subjt:  MEQTPPYPRRLNVNLKNLPPINPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMKPDTAGALNTKTIPEASSDSFVAKNENSREIASTCSCGHRS

Query:  AKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAK
        AKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAK
Subjt:  AKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAK

A0A6J1DJ12 uncharacterized protein LOC111020939 isoform X1 | e value: 7.5e-109 | identity: 100%
Query:  MEQTPPYPRRLNVNLKNLPPINPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMKPDTAGALNTKTIPEASSDSFVAKNENSREIASTCSCGHRS
        MEQTPPYPRRLNVNLKNLPPINPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMKPDTAGALNTKTIPEASSDSFVAKNENSREIASTCSCGHRS
Subjt:  MEQTPPYPRRLNVNLKNLPPINPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMKPDTAGALNTKTIPEASSDSFVAKNENSREIASTCSCGHRS

Query:  AKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGYSHL
        AKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGYSHL
Subjt:  AKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGYSHL

SwissProt top hits (e value, % identity, alignment)
A7LFZ6 Endoribonuclease Dicer homolog 4 | e value: 2.9e-25 | identity: 52.17%
Query:  HRSAKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGY
        ++    G +  + A+S LFE+C ANYWKPP F+ C+EEGPSH +KF +KV+VE+KGA   +LEC+ + + +KK A ++AA+GALWCLK LG+
Subjt:  HRSAKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGY

P84634 Dicer-like protein 4 | e value: 4.9e-25 | identity: 57.95%
Query:  KEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGY
        K G    + AKSLL E C AN WKPP FECCEEEGP H K F +KVI+E++ A +  LECYG ++A KK AA++AA+ A+WCLKH G+
Subjt:  KEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGY

Q9M8N2 Ribonuclease 3-like protein 1 | e value: 1.3e-17 | identity: 28.12%
Query:  RRLNVNLKNLPPINPRT------------GNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMKPDTAGALNTKTIPEASSDSFVAKNENSREIASTCSC
        R + ++LK++PP++P +              G   F     F+       +  F+  N + +     + +L  K  P+   +        S++       
Subjt:  RRLNVNLKNLPPINPRT------------GNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMKPDTAGALNTKTIPEASSDSFVAKNENSREIASTCSC

Query:  GHRSAKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGAC-DAVLECYGNSQARKKVAAQNAAEGALWCLKHL
              E    + +AKS+L E+C +  W+PP++ECC  +GP H + F +KV+VE++ +    VLEC+G+ + +KK AA++AAEGALW L+H+
Subjt:  GHRSAKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGAC-DAVLECYGNSQARKKVAAQNAAEGALWCLKHL

Arabidopsis top hits (e value, % identity, alignment)
AT1G80650.1 RNAse THREE-like protein 1 | e value: 9.3e-19 | identity: 28.12%
Query:  RRLNVNLKNLPPINPRT------------GNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMKPDTAGALNTKTIPEASSDSFVAKNENSREIASTCSC
        R + ++LK++PP++P +              G   F     F+       +  F+  N + +     + +L  K  P+   +        S++       
Subjt:  RRLNVNLKNLPPINPRT------------GNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMKPDTAGALNTKTIPEASSDSFVAKNENSREIASTCSC

Query:  GHRSAKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGAC-DAVLECYGNSQARKKVAAQNAAEGALWCLKHL
              E    + +AKS+L E+C +  W+PP++ECC  +GP H + F +KV+VE++ +    VLEC+G+ + +KK AA++AAEGALW L+H+
Subjt:  GHRSAKEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGAC-DAVLECYGNSQARKKVAAQNAAEGALWCLKHL

AT4G00420.2 Double-stranded RNA-binding domain (DsRBD)-containing protein | e value: 1.7e-17 | identity: 49.44%
Query:  EGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMK-GACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGYS
        E   ++ +AKS L+ +C+  +WK PL+E    EGP H K F  KV VEMK  +   VLEC+GN Q +KK+AA+ AAE ALW LK++GY+
Subjt:  EGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMK-GACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGYS

AT4G00420.3 Double-stranded RNA-binding domain (DsRBD)-containing protein | e value: 1.7e-17 | identity: 49.44%
Query:  EGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMK-GACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGYS
        E   ++ +AKS L+ +C+  +WK PL+E    EGP H K F  KV VEMK  +   VLEC+GN Q +KK+AA+ AAE ALW LK++GY+
Subjt:  EGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMK-GACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGYS

AT5G20320.1 dicer-like 4 | e value: 3.5e-26 | identity: 57.95%
Query:  KEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGY
        K G    + AKSLL E C AN WKPP FECCEEEGP H K F +KVI+E++ A +  LECYG ++A KK AA++AA+ A+WCLKH G+
Subjt:  KEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGY

AT5G20320.2 dicer-like 4 | e value: 3.5e-26 | identity: 57.95%
Query:  KEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGY
        K G    + AKSLL E C AN WKPP FECCEEEGP H K F +KVI+E++ A +  LECYG ++A KK AA++AA+ A+WCLKH G+
Subjt:  KEGTLEKRAAKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGY
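
The % identity values listed for the hits above can be approximated from an alignment's Query line and the unlabeled match line beneath it. The following is a minimal sketch, not the database's own method. It assumes the BLAST-style convention in which a letter on the match line marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The function name `percent_identity` is hypothetical. For a hit whose alignment spans several blocks, the Query lines and match lines would first need to be concatenated in order.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns that are identical residues.

    Assumption: `midline` is a BLAST-style match line, where a letter
    means an identical residue, '+' a conservative substitution, and a
    space a mismatch or gap. `query` is the aligned query string
    (gap characters included), which sets the alignment length.
    """
    if not query:
        raise ValueError("query line is empty")
    # Only letters count as identities; '+' and spaces do not.
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query), 2)


# Small illustrative inputs (not taken from a specific hit):
print(percent_identity("MEQT", "ME T"))   # 3 of 4 columns identical
print(percent_identity("AKEG", "AKEG"))   # all columns identical
```

The alignment length is taken from the Query line rather than the match line because trailing spaces on the match line are easily lost when the page is copied as text.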


Sequences
CDS sequence
ATGGAGCAGACTCCTCCGTACCCTCGACGATTAAATGTCAATCTCAAAAATCTTCCTCCAATTAATCCCCGCACTGGAAATGGAAATCATCACTTCTCGATTCCGGCCAA
GTTTGAGAAATCCAGATATATTAGGCGCGTACCTCACTTTACGCCGCCGAATCACCGGCCGGAAATGAAGCCGGACACCGCTGGAGCTCTGAATACGAAGACGATTCCAG
AGGCTTCGTCGGATTCTTTCGTCGCGAAGAACGAGAACAGCCGGGAAATAGCTTCCACTTGCAGTTGCGGACATCGCTCAGCCAAAGAAGGTACTTTAGAAAAGAGAGCG
GCAAAGTCCCTTCTGTTTGAGGTCTGCACTGCAAATTACTGGAAACCTCCTCTGTTTGAATGCTGCGAGGAAGAAGGGCCAAGCCATGCAAAAAAGTTTAGGTTCAAGGT
TATTGTGGAGATGAAGGGAGCTTGTGATGCAGTTTTAGAATGCTATGGAAATTCTCAGGCAAGAAAGAAAGTAGCAGCACAGAATGCTGCAGAAGGAGCATTATGGTGTT
TAAAGCATTTGGGATATTCTCATCTCTAA
mRNA sequence
ATGGAGCAGACTCCTCCGTACCCTCGACGATTAAATGTCAATCTCAAAAATCTTCCTCCAATTAATCCCCGCACTGGAAATGGAAATCATCACTTCTCGATTCCGGCCAA
GTTTGAGAAATCCAGATATATTAGGCGCGTACCTCACTTTACGCCGCCGAATCACCGGCCGGAAATGAAGCCGGACACCGCTGGAGCTCTGAATACGAAGACGATTCCAG
AGGCTTCGTCGGATTCTTTCGTCGCGAAGAACGAGAACAGCCGGGAAATAGCTTCCACTTGCAGTTGCGGACATCGCTCAGCCAAAGAAGGTACTTTAGAAAAGAGAGCG
GCAAAGTCCCTTCTGTTTGAGGTCTGCACTGCAAATTACTGGAAACCTCCTCTGTTTGAATGCTGCGAGGAAGAAGGGCCAAGCCATGCAAAAAAGTTTAGGTTCAAGGT
TATTGTGGAGATGAAGGGAGCTTGTGATGCAGTTTTAGAATGCTATGGAAATTCTCAGGCAAGAAAGAAAGTAGCAGCACAGAATGCTGCAGAAGGAGCATTATGGTGTT
TAAAGCATTTGGGATATTCTCATCTCTAA
Protein sequence
MEQTPPYPRRLNVNLKNLPPINPRTGNGNHHFSIPAKFEKSRYIRRVPHFTPPNHRPEMKPDTAGALNTKTIPEASSDSFVAKNENSREIASTCSCGHRSAKEGTLEKRA
AKSLLFEVCTANYWKPPLFECCEEEGPSHAKKFRFKVIVEMKGACDAVLECYGNSQARKKVAAQNAAEGALWCLKHLGYSHL
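
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch under that assumption (standard nuclear code, reading frame starting at the first base). The names `CODON_TABLE` and `translate` are illustrative and are not part of CuGenDB.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, amino acids listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence pasted
    as several wrapped lines can be passed in directly.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the CDS listed above:
print(translate("ATGGAGCAGACTCCTCCG"))  # MEQTPP
print(translate("TCTCATCTCTAA"))        # SHL
```

Passing the full CDS text to `translate` and comparing the result with the protein sequence (with line breaks removed) gives a quick consistency check for the record.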