CuGenDBv2

Lag0029972 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0029972
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ribonuclease H-like superfamily protein
Genome location: chr8:43490383..43497079
RNA-Seq Expression: Lag0029972
Synteny: Lag0029972
Gene Ontology terms:
GO:0010190 - cytochrome b6f complex assembly (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type
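
The genome location field above can be split into chromosome, start, and end for downstream use. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

# Hypothetical helper: split a "chrom:start..end" location string.
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")

def parse_location(location: str) -> tuple[str, int, int]:
    """Return (chromosome, start, end) from a string like 'chr8:100..200'."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match["chrom"], int(match["start"]), int(match["end"])

chrom, start, end = parse_location("chr8:43490383..43497079")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # chr8 43490383 43497079 6697
```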


Homology

GenBank top hits (e-value, % identity, alignment)
KGN51853.2 hypothetical protein Csa_008870 [Cucumis sativus] (e-value: 1.4e-58; % identity: 83.33)
Query:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATE
        M+E+PS+ATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVK+PWMVGRRCLQ+ARKKRKKRKL+ R+GECDGA AA   E
Subjt:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATE

Query:  TGGGPAAVEGLPEILPGSGEDEEGMGNFSARFEAERIWLQLYQV
        TGG     +GLPE+ PGSGE++E  GNFSARFEAERIWLQLYQV
Subjt:  TGGGPAAVEGLPEILPGSGEDEEGMGNFSARFEAERIWLQLYQV

XP_008458773.1 PREDICTED: uncharacterized protein LOC103498078 [Cucumis melo] (e-value: 4.9e-59; % identity: 84.72)
Query:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATE
        M+E+PSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVK+PWMVGRRCLQ+ARKKRKKRKL+ R+GECDGA AA   E
Subjt:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATE

Query:  TGGGPAAVEGLPEILPGSGEDEEGMGNFSARFEAERIWLQLYQV
        TGG     +GLPEI PGSGE++E  GNFSARFEAERIWLQLYQV
Subjt:  TGGGPAAVEGLPEILPGSGEDEEGMGNFSARFEAERIWLQLYQV

XP_011655414.1 uncharacterized protein LOC105435525 [Cucumis sativus] (e-value: 1.4e-58; % identity: 83.33)
Query:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATE
        M+E+PS+ATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVK+PWMVGRRCLQ+ARKKRKKRKL+ R+GECDGA AA   E
Subjt:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATE

Query:  TGGGPAAVEGLPEILPGSGEDEEGMGNFSARFEAERIWLQLYQV
        TGG     +GLPE+ PGSGE++E  GNFSARFEAERIWLQLYQV
Subjt:  TGGGPAAVEGLPEILPGSGEDEEGMGNFSARFEAERIWLQLYQV

XP_022989907.1 uncharacterized protein LOC111486958 [Cucurbita maxima] (e-value: 1.2e-54; % identity: 79.08)
Query:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATE
        MEE+PS  TRRRRF VDDG DLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVK+PWM+GRRCLQRA   RKKRKLI RK ECDG  AAAA+E
Subjt:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATE

Query:  TGGGPAAVEGLPEILPGS---------GEDEEGMGNFSARFEAERIWLQLYQV
        TG GPA  EGLPEI+PGS          E+EEG+GNFSARFEAERIWLQLYQ+
Subjt:  TGGGPAAVEGLPEILPGS---------GEDEEGMGNFSARFEAERIWLQLYQV

XP_038891069.1 uncharacterized protein LOC120080480 [Benincasa hispida] (e-value: 6.8e-61; % identity: 86.81)
Query:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATE
        M+E+PSR TRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALAL+K+PWMVGRRCLQRARKKRKKRKLI R+GECDGA AA   E
Subjt:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATE

Query:  TGGGPAAVEGLPEILPGSGEDEEGMGNFSARFEAERIWLQLYQV
        TGG  A+ EGLPEI PGSGE+EE +GNFSARFEAERIWLQLYQV
Subjt:  TGGGPAAVEGLPEILPGSGEDEEGMGNFSARFEAERIWLQLYQV
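
The % identity values reported for these hits are consistent with counting identical columns in the alignment midline (letters mark identities, `+` marks conservative substitutions, spaces mark mismatches or gaps) and dividing by the alignment length. For the first GenBank hit, 120 letters over 144 columns gives 83.33. The sketch below illustrates that arithmetic only; `percent_identity` is a hypothetical helper, and it assumes the midline has already been trimmed to the same columns as the Query residues.

```python
def percent_identity(midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    Assumes `midline` covers exactly the aligned columns (no label padding).
    """
    if not midline:
        raise ValueError("midline must not be empty")
    # Letters in the midline mark columns where query and subject agree.
    identical = sum(ch.isalpha() for ch in midline)
    return round(100.0 * identical / len(midline), 2)

# Toy midline: 7 identical columns out of 10.
print(percent_identity("M+E+PS ATR"))  # 70.0
```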

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KR58 Uncharacterized protein (e-value: 6.9e-59; % identity: 83.33)
Query:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATE
        M+E+PS+ATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVK+PWMVGRRCLQ+ARKKRKKRKL+ R+GECDGA AA   E
Subjt:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATE

Query:  TGGGPAAVEGLPEILPGSGEDEEGMGNFSARFEAERIWLQLYQV
        TGG     +GLPE+ PGSGE++E  GNFSARFEAERIWLQLYQV
Subjt:  TGGGPAAVEGLPEILPGSGEDEEGMGNFSARFEAERIWLQLYQV

A0A1S3C8M3 uncharacterized protein LOC103498078 (e-value: 2.4e-59; % identity: 84.72)
Query:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATE
        M+E+PSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVK+PWMVGRRCLQ+ARKKRKKRKL+ R+GECDGA AA   E
Subjt:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATE

Query:  TGGGPAAVEGLPEILPGSGEDEEGMGNFSARFEAERIWLQLYQV
        TGG     +GLPEI PGSGE++E  GNFSARFEAERIWLQLYQV
Subjt:  TGGGPAAVEGLPEILPGSGEDEEGMGNFSARFEAERIWLQLYQV

A0A5A7T2E5 Uncharacterized protein (e-value: 2.4e-59; % identity: 84.72)
Query:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATE
        M+E+PSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVK+PWMVGRRCLQ+ARKKRKKRKL+ R+GECDGA AA   E
Subjt:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATE

Query:  TGGGPAAVEGLPEILPGSGEDEEGMGNFSARFEAERIWLQLYQV
        TGG     +GLPEI PGSGE++E  GNFSARFEAERIWLQLYQV
Subjt:  TGGGPAAVEGLPEILPGSGEDEEGMGNFSARFEAERIWLQLYQV

A0A6J1HEE2 uncharacterized protein LOC111463206 (e-value: 1.1e-53; % identity: 80.41)
Query:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATE
        MEE+PS  TRRRRF VDDG DLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVK+PWM+GRRCLQRA   RKKRKLI RK ECDG  AAAA+E
Subjt:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATE

Query:  TGGGPAAVEGLPEILPGS----GEDEEGMGNFSARFEAERIWLQLYQV
        T  G A  EGLPEI+PGS     E+EEG+GNFSARFEAERIWLQLYQ+
Subjt:  TGGGPAAVEGLPEILPGS----GEDEEGMGNFSARFEAERIWLQLYQV

A0A6J1JH40 uncharacterized protein LOC111486958 (e-value: 6.0e-55; % identity: 79.08)
Query:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATE
        MEE+PS  TRRRRF VDDG DLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVK+PWM+GRRCLQRA   RKKRKLI RK ECDG  AAAA+E
Subjt:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATE

Query:  TGGGPAAVEGLPEILPGS---------GEDEEGMGNFSARFEAERIWLQLYQV
        TG GPA  EGLPEI+PGS          E+EEG+GNFSARFEAERIWLQLYQ+
Subjt:  TGGGPAAVEGLPEILPGS---------GEDEEGMGNFSARFEAERIWLQLYQV

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT3G01516.1 unknown protein (e-value: 3.8e-17; % identity: 40)
Query:  CSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECD-------GAAAAAATETGGGPAAVEG-------
        CSGK CRS  A  +ADCVA+CCCPC+VV+   LA VKVPWM+GR+C+ R    +K+ K I R+            A   +    GGG    E        
Subjt:  CSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECD-------GAAAAAATETGGGPAAVEG-------

Query:  ------LPEILPGSGEDEEGMGNFSARFEAERIWLQLYQV
                E    +   EE     SAR EAER+WL+LYQ+
Subjt:  ------LPEILPGSGEDEEGMGNFSARFEAERIWLQLYQV

AT3G11690.1 unknown protein (e-value: 2.2e-04; % identity: 34.41)
Query:  NPSRATRRR---RFAVDDGADLIDCSGKHCRSCTAGLVADCVAV-CCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDG
        +PSR+ RR+   + ++   +    C G        G  A C AV CCCPC +V+ L LA+ KVP  + RR ++  R+K+  +  IL     DG
Subjt:  NPSRATRRR---RFAVDDGADLIDCSGKHCRSCTAGLVADCVAV-CCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDG

AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e-value: 3.9e-06; % identity: 32.33)
Query:  VTLFADAAVLPTLDGAGLGVLLVNEVGNLCGAMEI--YESSALS---PFAAEVQAIMHGLRLTARMEIQRVHVYSNSLNAIRMINGEIGEWLEVRHWISE
        VT+F DAA        G G ++ N     C  +    Y+S+A +   P  AE  A+   L+    + I ++ + S+S   I  I  E     E    I +
Subjt:  VTLFADAAVLPTLDGAGLGVLLVNEVGNLCGAMEI--YESSALS---PFAAEVQAIMHGLRLTARMEIQRVHVYSNSLNAIRMINGEIGEWLEVRHWISE

Query:  IHDLCKGFLSVCFSYIPRDKNRKADALAKDALM
        I +L  GF  V FS++PR +NR AD LAK +L+
Subjt:  IHDLCKGFLSVCFSYIPRDKNRKADALAKDALM

AT5G06380.1 unknown protein (e-value: 8.2e-04; % identity: 29.85)
Query:  GKHCRSCTAGLVADCVAVC-CCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATETGGGPAAVEGLPEILPGSGEDEEGM
        G     C  G  A C A+C C PCSVV+ + LA+ K+P  + RR ++R R+KR  +K  +  G   G         G    AV  L        E+EE  
Subjt:  GKHCRSCTAGLVADCVAVC-CCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATETGGGPAAVEGLPEILPGSGEDEEGM

Query:  GNFS------ARFEAERIWLQLYQVDDHIPVENN
           +      +RF +   W  L Q +     +NN
Subjt:  GNFS------ARFEAERIWLQLYQVDDHIPVENN

AT5G14690.1 unknown protein (e-value: 3.1e-19; % identity: 34.8)
Query:  MEENPSRATRR-------RRFAVDD-----------GADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRK
        MEENP R +RR       +  AVD+             D + CS K CRS  A  +ADCVA+CCCPC++++ L L LVKVPWM+GRRCL    + +KKR+
Subjt:  MEENPSRATRR-------RRFAVDD-----------GADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRK

Query:  LILRK---------------------------GECDGAAAAAATETGGGPAAVEGLPEILPGS---------------GEDEEGMGNFSARFEAERIWLQ
        +I R+                           GE  G         GGG            GS               GED +     SAR EAER+WL+
Subjt:  LILRK---------------------------GECDGAAAAAATETGGGPAAVEGLPEILPGS---------------GEDEEGMGNFSARFEAERIWLQ

Query:  LYQV
        LYQ+
Subjt:  LYQV


Sequences

CDS sequence
ATGGAAGAAAACCCATCTCGAGCAACACGCCGGCGGAGGTTCGCCGTGGACGACGGCGCCGATCTTATCGACTGCTCCGGCAAGCATTGCCGGTCCTGCACCGCCGGCCT
GGTGGCCGATTGCGTCGCCGTCTGCTGCTGCCCGTGTTCCGTAGTCAGCTTCTTGGCTCTGGCCCTCGTCAAAGTGCCCTGGATGGTCGGCCGGCGGTGTCTGCAGCGGG
CGAGAAAGAAAAGGAAGAAGAGGAAATTGATTCTCCGGAAAGGGGAATGCGACGGCGCTGCGGCGGCGGCAGCGACGGAGACTGGTGGGGGTCCGGCGGCAGTGGAGGGG
TTGCCGGAGATTCTGCCGGGGTCCGGCGAGGATGAAGAAGGGATGGGGAATTTCAGTGCGAGGTTTGAAGCAGAGAGAATTTGGTTGCAGTTGTATCAAGTGGATGATCA
TATCCCCGTGGAGAATAATTTTGCCGATCGCTTAATTCACCTTGCAAGGCAGCTGAAGGAAGCAGACTTTGAAAAGGCTTGTATTGCGTTTTGGTCTGTATGGAATAATA
GAAACACACGGCGTTTGGGCAGTCCGATTGTTGAATGGACACATCAGTGTGAGTGGGTTTTTAACTATTGGGCAGAAATTTCAAGGGGGAAACAGGAGGTCGTGTCGGAC
CATACCCATTTGAATACATGTGCACTGGACGCTGACGGGGTTGTTACTCTCTTTGCAGACGCGGCCGTCCTCCCGACCTTGGATGGAGCTGGCCTAGGTGTCTTATTGGT
TAATGAAGTCGGCAATCTTTGTGGAGCAATGGAGATATATGAGAGTTCAGCTTTGTCACCGTTCGCAGCGGAAGTTCAGGCTATAATGCATGGACTAAGATTGACTGCAC
GCATGGAAATACAAAGAGTTCATGTGTACTCGAACTCTTTAAATGCGATTCGGATGATCAATGGAGAGATTGGAGAGTGGCTGGAAGTTCGACACTGGATATCAGAAATT
CATGATCTATGCAAAGGGTTCCTTTCAGTGTGTTTCTCTTATATCCCAAGGGATAAAAATAGGAAAGCAGATGCTCTAGCGAAGGATGCGTTAATGCACCAGCAAACTAT
GTTGTGGTTAGAAATTTTCCGATATGGCTATTATCAATGGGTAACCAATCCCACGATAAGAGTATTGTTACTAATCAAATCAAGTTGCTATGATTTGCTCCATACGTGTC
AGGACCAGGAAAGGGACCTAGAGGGAGACCATACCGACGGGCCGGGCCAACGTGGCCCGACCCGTACGGTCGGCCTCGGCCTTGGGCCGAGGCCGAGCATTATGGTCGGC
CTCGGCCCAAGGCCGAGGCCGACCACTCGGCCCGCTTGCACGGGCCGAGTCCGTTTGCCTCCGCTCGGCCCCTACCGCTTCCAGCTGCCTCGGTCCAGCCTGCTTCGTCC
CAGAACGCCTCCAAACCCTAGGAGTCCGAGCAGGCATCGGAGGCGGTGTGGCCTACACCACACCGGTGTCCAGCGATTCTTGCCGGTCTTGCAGGTCACGTCTTCCCCAG
CTTCTACAAATTCATTGTTGGTGTCACGTGAAGGGCAGGGGTTGCCCAAGCAGCAGAAGCCTGAAGCCAGGCGAGCAGAGAGAGATCGTTAG

mRNA sequence
ATGGAAGAAAACCCATCTCGAGCAACACGCCGGCGGAGGTTCGCCGTGGACGACGGCGCCGATCTTATCGACTGCTCCGGCAAGCATTGCCGGTCCTGCACCGCCGGCCT
GGTGGCCGATTGCGTCGCCGTCTGCTGCTGCCCGTGTTCCGTAGTCAGCTTCTTGGCTCTGGCCCTCGTCAAAGTGCCCTGGATGGTCGGCCGGCGGTGTCTGCAGCGGG
CGAGAAAGAAAAGGAAGAAGAGGAAATTGATTCTCCGGAAAGGGGAATGCGACGGCGCTGCGGCGGCGGCAGCGACGGAGACTGGTGGGGGTCCGGCGGCAGTGGAGGGG
TTGCCGGAGATTCTGCCGGGGTCCGGCGAGGATGAAGAAGGGATGGGGAATTTCAGTGCGAGGTTTGAAGCAGAGAGAATTTGGTTGCAGTTGTATCAAGTGGATGATCA
TATCCCCGTGGAGAATAATTTTGCCGATCGCTTAATTCACCTTGCAAGGCAGCTGAAGGAAGCAGACTTTGAAAAGGCTTGTATTGCGTTTTGGTCTGTATGGAATAATA
GAAACACACGGCGTTTGGGCAGTCCGATTGTTGAATGGACACATCAGTGTGAGTGGGTTTTTAACTATTGGGCAGAAATTTCAAGGGGGAAACAGGAGGTCGTGTCGGAC
CATACCCATTTGAATACATGTGCACTGGACGCTGACGGGGTTGTTACTCTCTTTGCAGACGCGGCCGTCCTCCCGACCTTGGATGGAGCTGGCCTAGGTGTCTTATTGGT
TAATGAAGTCGGCAATCTTTGTGGAGCAATGGAGATATATGAGAGTTCAGCTTTGTCACCGTTCGCAGCGGAAGTTCAGGCTATAATGCATGGACTAAGATTGACTGCAC
GCATGGAAATACAAAGAGTTCATGTGTACTCGAACTCTTTAAATGCGATTCGGATGATCAATGGAGAGATTGGAGAGTGGCTGGAAGTTCGACACTGGATATCAGAAATT
CATGATCTATGCAAAGGGTTCCTTTCAGTGTGTTTCTCTTATATCCCAAGGGATAAAAATAGGAAAGCAGATGCTCTAGCGAAGGATGCGTTAATGCACCAGCAAACTAT
GTTGTGGTTAGAAATTTTCCGATATGGCTATTATCAATGGGTAACCAATCCCACGATAAGAGTATTGTTACTAATCAAATCAAGTTGCTATGATTTGCTCCATACGTGTC
AGGACCAGGAAAGGGACCTAGAGGGAGACCATACCGACGGGCCGGGCCAACGTGGCCCGACCCGTACGGTCGGCCTCGGCCTTGGGCCGAGGCCGAGCATTATGGTCGGC
CTCGGCCCAAGGCCGAGGCCGACCACTCGGCCCGCTTGCACGGGCCGAGTCCGTTTGCCTCCGCTCGGCCCCTACCGCTTCCAGCTGCCTCGGTCCAGCCTGCTTCGTCC
CAGAACGCCTCCAAACCCTAGGAGTCCGAGCAGGCATCGGAGGCGGTGTGGCCTACACCACACCGGTGTCCAGCGATTCTTGCCGGTCTTGCAGGTCACGTCTTCCCCAG
CTTCTACAAATTCATTGTTGGTGTCACGTGAAGGGCAGGGGTTGCCCAAGCAGCAGAAGCCTGAAGCCAGGCGAGCAGAGAGAGATCGTTAG

Protein sequence
MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALVKVPWMVGRRCLQRARKKRKKRKLILRKGECDGAAAAAATETGGGPAAVEG
LPEILPGSGEDEEGMGNFSARFEAERIWLQLYQVDDHIPVENNFADRLIHLARQLKEADFEKACIAFWSVWNNRNTRRLGSPIVEWTHQCEWVFNYWAEISRGKQEVVSD
HTHLNTCALDADGVVTLFADAAVLPTLDGAGLGVLLVNEVGNLCGAMEIYESSALSPFAAEVQAIMHGLRLTARMEIQRVHVYSNSLNAIRMINGEIGEWLEVRHWISEI
HDLCKGFLSVCFSYIPRDKNRKADALAKDALMHQQTMLWLEIFRYGYYQWVTNPTIRVLLLIKSSCYDLLHTCQDQERDLEGDHTDGPGQRGPTRTVGLGLGPRPSIMVG
LGPRPRPTTRPACTGRVRLPPLGPYRFQLPRSSLLRPRTPPNPRSPSRHRRRCGLHHTGVQRFLPVLQVTSSPASTNSLLVSREGQGLPKQQKPEARRAERDR
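
The protein sequence above corresponds to the CDS read codon by codon. As a rough illustration, the sketch below translates the first 39 nucleotides of the CDS using the standard genetic code. The function name `translate` and the table construction are illustrative choices, not something the database page provides, and the use of the standard (nuclear) code is an assumption.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence; stop codons become '*'.

    Trailing bases that do not fill a codon are ignored, and codons
    containing non-ACGT characters are translated as 'X'.
    """
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )

# First 39 nucleotides of the CDS sequence listed on this page.
cds_prefix = "ATGGAAGAAAACCCATCTCGAGCAACACGCCGGCGGAGG"
print(translate(cds_prefix))  # MEENPSRATRRRR
```

Applying the same function to the full CDS (with line breaks removed) and dropping the trailing `*` would be one way to check it against the listed protein sequence.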