; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018334 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018334
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr5:23580631..23589236
RNA-Seq ExpressionLag0018334
SyntenyLag0018334
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PNY15174.1 ribonuclease H, partial [Trifolium pratense]9.4e-2247.9Show/hide
Query:  DDNLIFFKALMEEGERIKEVLSTYEKATGQKVNLDKSACLISKNVALPKALEICKRMGIKRIESLGQYLVLPEQSGRKKVDLFKRMRERVWKTLQGWKGS
        DD+L+F +A   E E I E+LSTY+KA+GQ VN+DKS    S+N+   +   IC +MG+K +++  +YL LP   GR K  +F  +++RVWK L+GWK  
Subjt:  DDNLIFFKALMEEGERIKEVLSTYEKATGQKVNLDKSACLISKNVALPKALEICKRMGIKRIESLGQYLVLPEQSGRKKVDLFKRMRERVWKTLQGWKGS

Query:  LFSLGGKEILIKTIVQVLP
          S  GKEILIK+IVQ +P
Subjt:  LFSLGGKEILIKTIVQVLP

XP_010690177.1 PREDICTED: uncharacterized protein LOC104903764 [Beta vulgaris subsp. vulgaris]7.2e-2231.94Show/hide
Query:  DDNLIFFKALMEEGERIKEVLSTYEKATGQKVNLDKSACLISKNVALPKALEICKRMGIKRIESLGQYLVLPEQSGRKKVDLFKRMRERVWKTLQGWKGS
        DD+++F KA ++E  R+ +++STYE+A+GQKVNL K+    S NV   +  +I + +G++ ++   +YL LP   GR K  +F  ++ER+WK LQGWK  
Subjt:  DDNLIFFKALMEEGERIKEVLSTYEKATGQKVNLDKSACLISKNVALPKALEICKRMGIKRIESLGQYLVLPEQSGRKKVDLFKRMRERVWKTLQGWKGS

Query:  LFSLGGKEILIKTIVQVLP-------------IPQMHGPQAR-KWGSVAMLLAQRRDAVTSTNLFAKKWDSVAMLSPQLLHAVQIPELGRRDASWCEKEG
        L S  GKEI+IK + Q +P             I ++H   AR  WGS            +   L   KW+ + +  P+ +  +   +L   +A+   K+ 
Subjt:  LFSLGGKEILIKTIVQVLP-------------IPQMHGPQAR-KWGSVAMLLAQRRDAVTSTNLFAKKWDSVAMLSPQLLHAVQIPELGRRDASWCEKEG

Query:  RGGLGWSIRDSNGSLI
            GW +    G+L+
Subjt:  RGGLGWSIRDSNGSLI

XP_023894138.1 uncharacterized protein LOC112006071 [Quercus suber]1.4e-2040.34Show/hide
Query:  DDNLIFFKALMEEGERIKEVLSTYEKATGQKVNLDKSACLISKNVALPKALEICKRMGIKRIESLGQYLVLPEQSGRKKVDLFKRMRERVWKTLQGWKGS
        DD+L+F +A ++E  ++ E+L+TYE+ +GQ++N +K+A   SK+ AL    +I   +G+ +++   +YL LP   GR K   F ++++RVWK LQGW+G 
Subjt:  DDNLIFFKALMEEGERIKEVLSTYEKATGQKVNLDKSACLISKNVALPKALEICKRMGIKRIESLGQYLVLPEQSGRKKVDLFKRMRERVWKTLQGWKGS

Query:  LFSLGGKEILIKTIVQVLP
        L S  G+E+LIK+++Q +P
Subjt:  LFSLGGKEILIKTIVQVLP

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]2.7e-2145.38Show/hide
Query:  DDNLIFFKALMEEGERIKEVLSTYEKATGQKVNLDKSACLISKNVALPKALEICKRMGIKRIESLGQYLVLPEQSGRKKVDLFKRMRERVWKTLQGWKGS
        DD+L+F +A   EGE I E+L  YE+A+GQ +NL+KS+   S N +  +  +I + +G+K ++   +YL LP   GR K + F  +++RVWK LQGWKG 
Subjt:  DDNLIFFKALMEEGERIKEVLSTYEKATGQKVNLDKSACLISKNVALPKALEICKRMGIKRIESLGQYLVLPEQSGRKKVDLFKRMRERVWKTLQGWKGS

Query:  LFSLGGKEILIKTIVQVLP
        L S  GKEILIK + Q +P
Subjt:  LFSLGGKEILIKTIVQVLP

XP_030933459.1 uncharacterized protein LOC115959256 [Quercus lobata]1.4e-2040Show/hide
Query:  DDNLIFFKALMEEGERIKEVLSTYEKATGQKVNLDKSACLISKNVALPKALEICKRMGIKRIESLGQYLVLPEQSGRKKVDLFKRMRERVWKTLQGWKGS
        DD+L+F +A  +EGE I E+L  YE+A+GQ +NL+KS+   S N +  +  +I   +G+K +     YL LP   GR K   F  +++R+WK LQGWKG+
Subjt:  DDNLIFFKALMEEGERIKEVLSTYEKATGQKVNLDKSACLISKNVALPKALEICKRMGIKRIESLGQYLVLPEQSGRKKVDLFKRMRERVWKTLQGWKGS

Query:  LFSLGGKEILIKTIVQVLPIPQMHGPQARKWGSVAMLLAQRRDAVTSTNLFAKKW
        L S  GKEILIK + Q +P   M   Q      + + L    DA     L+AK W
Subjt:  LFSLGGKEILIKTIVQVLPIPQMHGPQARKWGSVAMLLAQRRDAVTSTNLFAKKW

TrEMBL top hitse value%identityAlignment
A0A0J8B9Y6 Reverse transcriptase domain-containing protein1.1e-2043.9Show/hide
Query:  DDNLIFFKALMEEGERIKEVLSTYEKATGQKVNLDKSACLISKNVALPKALEICKRMGIKRIESLGQYLVLPEQSGRKKVDLFKRMRERVWKTLQGWKGS
        DD+++F +A ++E   + E+LSTYE+A+GQK+N DKS    SK+V   + + I    G++ +E   +YL LP   GR K  +F  ++ERVWK LQGWK  
Subjt:  DDNLIFFKALMEEGERIKEVLSTYEKATGQKVNLDKSACLISKNVALPKALEICKRMGIKRIESLGQYLVLPEQSGRKKVDLFKRMRERVWKTLQGWKGS

Query:  LFSLGGKEILIKTIVQVLPIPQM
        L S  GKE+L+K I+Q +P   M
Subjt:  LFSLGGKEILIKTIVQVLPIPQM

A0A2K3PIT6 Ribonuclease H (Fragment)4.6e-2247.9Show/hide
Query:  DDNLIFFKALMEEGERIKEVLSTYEKATGQKVNLDKSACLISKNVALPKALEICKRMGIKRIESLGQYLVLPEQSGRKKVDLFKRMRERVWKTLQGWKGS
        DD+L+F +A   E E I E+LSTY+KA+GQ VN+DKS    S+N+   +   IC +MG+K +++  +YL LP   GR K  +F  +++RVWK L+GWK  
Subjt:  DDNLIFFKALMEEGERIKEVLSTYEKATGQKVNLDKSACLISKNVALPKALEICKRMGIKRIESLGQYLVLPEQSGRKKVDLFKRMRERVWKTLQGWKGS

Query:  LFSLGGKEILIKTIVQVLP
          S  GKEILIK+IVQ +P
Subjt:  LFSLGGKEILIKTIVQVLP

A0A6J1DUG8 uncharacterized protein LOC1110241351.5e-2050Show/hide
Query:  DDNLIFFKALMEEGERIKEVLSTYEKA-TGQKVNLDKSACLISKNVALPKALEICKRMGIKRIESLGQYLVLPEQSGRKKVDLFKRMRERVWKTLQGWKG
        DD L+FFKA       IK +L +YEKA +GQ +NLDKS  ++SKN        I + + +   ESLGQYL LP Q+GR K  +F  +++RVWK LQGWKG
Subjt:  DDNLIFFKALMEEGERIKEVLSTYEKA-TGQKVNLDKSACLISKNVALPKALEICKRMGIKRIESLGQYLVLPEQSGRKKVDLFKRMRERVWKTLQGWKG

Query:  SLFSLGGKEILI
         LFS+GG+E+L+
Subjt:  SLFSLGGKEILI

A0A803QE56 Uncharacterized protein1.9e-2045Show/hide
Query:  VDDNLIFFKALMEEGERIKEVLSTYEKATGQKVNLDKSACLISKNVALPKALEICKRMGIKRIESLGQYLVLPEQSGRKKVDLFKRMRERVWKTLQGWKG
        VDD+L+FF A  E     K +L  Y KA+GQ VN  KS     + V     ++I + +G+K +++ G+YL LP   GR K  LF  ++ RVW  L+GWKG
Subjt:  VDDNLIFFKALMEEGERIKEVLSTYEKATGQKVNLDKSACLISKNVALPKALEICKRMGIKRIESLGQYLVLPEQSGRKKVDLFKRMRERVWKTLQGWKG

Query:  SLFSLGGKEILIKTIVQVLP
        S+FS+ GKE+LIK IVQ +P
Subjt:  SLFSLGGKEILIKTIVQVLP

A0A803QJV0 Uncharacterized protein6.6e-2141.61Show/hide
Query:  DDNLIFFKALMEEGERIKEVLSTYEKATGQKVNLDKSACLISKNVALPKALEICKRMGIKRIESLGQYLVLPEQSGRKKVDLFKRMRERVWKTLQGWKGS
        DD++IFF A  E  +R   +LS Y  A+GQ VN  KS     KNVA    LE+ + +G+K +++ G+YL L    GR K  +F  ++ RVW +L+GWKG 
Subjt:  DDNLIFFKALMEEGERIKEVLSTYEKATGQKVNLDKSACLISKNVALPKALEICKRMGIKRIESLGQYLVLPEQSGRKKVDLFKRMRERVWKTLQGWKGS

Query:  LFSLGGKEILIKTIVQVLP-------------IPQMHGPQAR-KWGSVA
        +FS+GG E+LIK IVQ +P             I  +H   AR  WGS A
Subjt:  LFSLGGKEILIKTIVQVLP-------------IPQMHGPQAR-KWGSVA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G27870.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein5.5e-0443.14Show/hide
Query:  NHVSWTPPPPKFLKLNSDGSWNERLEVGGIGWVIRDSAGSLVLAHKRITRK
        NH  W  P   ++K N DGS+         GWV+RDS GS +LA + I RK
Subjt:  NHVSWTPPPPKFLKLNSDGSWNERLEVGGIGWVIRDSAGSLVLAHKRITRK

AT2G04420.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.9e-0428.08Show/hide
Query:  HVSWTPPPPKFLKLNSDGSWNERLEVGGIGWVIRDSAGSLVLAHKRI--TRKRGINQMELTAIKEGLEAYLSLDRRRRTALIAEADAMDAIKVLNYE---
        H  W  PP  ++K N DGS+N R +    GW+IRD  G    A + +  T    + + EL A+   ++   S   R+   +I E D+    ++LN +   
Subjt:  HVSWTPPPPKFLKLNSDGSWNERLEVGGIGWVIRDSAGSLVLAHKRI--TRKRGINQMELTAIKEGLEAYLSLDRRRRTALIAEADAMDAIKVLNYE---

Query:  ---INDISEAKAIVIDIEHLTTKLGDVSFVKCPREGNRVAHGLART
            N I EA +        + +  +V F   PR  N+ A  LA++
Subjt:  ---INDISEAKAIVIDIEHLTTKLGDVSFVKCPREGNRVAHGLART

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.2e-1130.28Show/hide
Query:  VSWTPPPPKFLKLNSDGSWNERLEVGGIGWVIR-DSAGSLVLAHKRITRKRGINQMELTAIKEGLEAYLSLDRRRRTALIAEADAMDAIKVLNYEINDIS
        V W  PP +++K N+D +W       GIGW++R +S G L +  + + R + + + EL A++    A L++ R     +I E+DA   + +LN + +   
Subjt:  VSWTPPPPKFLKLNSDGSWNERLEVGGIGWVIR-DSAGSLVLAHKRITRKRGINQMELTAIKEGLEAYLSLDRRRRTALIAEADAMDAIKVLNYEINDIS

Query:  EAKAIVIDIEHLTTKLGDVSFVKCPREGNRVAHGLARTTAGF
          +  + DI+ L     +V F   PR GN+VA  +AR +  F
Subjt:  EAKAIVIDIEHLTTKLGDVSFVKCPREGNRVAHGLARTTAGF

AT4G29090.1 Ribonuclease H-like superfamily protein2.4e-1533.33Show/hide
Query:  WTPPPPKFLKLNSDGSWNERLEVGGIGWVIRDSAGSLV-LAHKRITRKRGINQMELTAIKEGLEAYLSLDRRRRTALIAEADAMDAIKVLNYEINDISEA
        W PPP +++K N+D +WN   E  GIGWV+R+  G +  +  + + + + + + EL A++    A LSL R +   +I E+D+   I++LN +       
Subjt:  WTPPPPKFLKLNSDGSWNERLEVGGIGWVIRDSAGSLV-LAHKRITRKRGINQMELTAIKEGLEAYLSLDRRRRTALIAEADAMDAIKVLNYEINDISEA

Query:  KAIVIDIEHLTTKLGDVSFVKCPREGNRVAHGLARTTAGFL
        K  + D++ L ++  +V FV  PREGN +A  +AR +  FL
Subjt:  KAIVIDIEHLTTKLGDVSFVKCPREGNRVAHGLARTTAGFL

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.2e-0441.46Show/hide
Query:  HVSWTPPPPKFLKLNSDGSWNERLEVGGIGWVIRDSAGSLV
        +  W+PP    LK N D S +ER  V G+GW++R+S G+++
Subjt:  HVSWTPPPPKFLKLNSDGSWNERLEVGGIGWVIRDSAGSLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATTTCGCCTGGAGAGCCTTCAAAATCACGTGAGTTGGACCCCTCCTCCTCCAAAATTCCTCAAATTAAACTCGGATGGCTCTTGGAATGAAAGGCTCGAGGTTGG
GGGTATTGGTTGGGTCATCCGTGATTCCGCAGGATCTCTGGTGTTGGCCCACAAGCGAATCACACGAAAAAGGGGGATCAACCAAATGGAGCTGACGGCGATTAAGGAAG
GGTTAGAAGCGTACCTTTCTCTGGATCGAAGAAGGCGTACTGCTCTGATTGCTGAAGCAGACGCCATGGACGCGATAAAAGTACTAAATTACGAAATCAACGACATCTCT
GAAGCCAAGGCGATCGTTATTGACATTGAGCATTTGACGACTAAATTAGGAGATGTTTCCTTCGTGAAATGCCCAAGGGAGGGCAATCGAGTGGCGCACGGTCTTGCGCG
AACGACGGCAGGGTTCTTGCCGTCCTCCTCTCTGTCGATTGATGTTGTTGACGGTTTTTTTGAATGTAATTCTTCTTCTTCTTCCATGTTGGAAGCTCCTTCACGTGATA
CCTCTGGCGACGTGGTTCTTCGTCTCCGGCACTTGTGGCGTCTCCGGCGAGCAGACCCAGTCATGGTGAGCGGCGACACGTGCAGTTTCAAGCAGTTTTCTTCACTCATC
TTCAGCGGGCGTTCGTGGGTCTCCGACAAGCAAGGCGGCGCAACGTCAGCAGCAGGCGTTCGCGACGGTCCGAGCAGCTTCAGGCGAGCGATCTCCGACGTCTTCTTCGT
GGGTTCTTCTAGCGGAGCAGCGACGATGCAGGCACGACGCAACGGGTGTGTGGATGACAACTTAATCTTCTTCAAAGCATTGATGGAGGAAGGTGAAAGGATTAAAGAAG
TGCTTAGCACATACGAGAAAGCTACTGGGCAAAAAGTCAATCTGGATAAGTCTGCCTGTTTGATTAGTAAAAATGTGGCCCTGCCAAAAGCTTTGGAAATTTGCAAAAGG
ATGGGGATTAAACGCATTGAGTCCCTTGGGCAATACCTGGTGCTCCCGGAGCAATCGGGAAGAAAGAAGGTGGATCTGTTCAAGAGGATGAGAGAGCGAGTGTGGAAAAC
TCTTCAAGGGTGGAAGGGAAGCTTGTTCTCGCTCGGAGGGAAAGAAATTCTCATAAAAACTATTGTCCAAGTGTTGCCAATACCACAAATGCACGGGCCGCAAGCTCGGA
AATGGGGTAGCGTCGCGATGCTATTGGCACAGCGTCGCGACGCTGTGACGTCAACCAACTTGTTCGCCAAGAAATGGGACAGCGTCGCGATGCTGTCCCCACAGCTTCTC
CACGCTGTGCAGATTCCAGAACTGGGCAGGCGTGATGCTTCCTGGTGTGAGAAGGAAGGAAGAGGTGGGCTTGGTTGGTCCATTCGTGACTCTAACGGATCTCTTATTGG
AGCTGGGTGCAAATCGTTATCTCAGAAAAGGCCGATCAAGTGGCTGGAGGCGACTGCTCTTATGGAAGGGTTGAAATCGCTTGAGAATCAGATGGGAGAAAGGATCTCGA
TTCCTTTTTTGTCGTTGATCGTCGAGTCGGATTCTGTGGAAGTGGTGAAAATTTTAAACCACGAAGAGGAAGATTTATCTGAAATCTCCATGGTGATGGATGAAATTGAA
GCTGTGGCCCGACAAGTTAATGTCTCCTCCTTTTCCAAGTGTTCGAGAAAGGAGAATCAAGAGGCTCACCTTCTTGCACGCGCAGCTGTGCGACATGGAAACTGTAGTTA
TTTTTTTGGACGCTCCTTGTACCCTGGTATAGGGGAATCTTTTTTGTGTAGGGATGTCGTTATCCCTTCTTGGTTCTCTTCAATCATTTATGAAGGGGTTGGTGTAACTG
AATTTTATCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAATTTCGCCTGGAGAGCCTTCAAAATCACGTGAGTTGGACCCCTCCTCCTCCAAAATTCCTCAAATTAAACTCGGATGGCTCTTGGAATGAAAGGCTCGAGGTTGG
GGGTATTGGTTGGGTCATCCGTGATTCCGCAGGATCTCTGGTGTTGGCCCACAAGCGAATCACACGAAAAAGGGGGATCAACCAAATGGAGCTGACGGCGATTAAGGAAG
GGTTAGAAGCGTACCTTTCTCTGGATCGAAGAAGGCGTACTGCTCTGATTGCTGAAGCAGACGCCATGGACGCGATAAAAGTACTAAATTACGAAATCAACGACATCTCT
GAAGCCAAGGCGATCGTTATTGACATTGAGCATTTGACGACTAAATTAGGAGATGTTTCCTTCGTGAAATGCCCAAGGGAGGGCAATCGAGTGGCGCACGGTCTTGCGCG
AACGACGGCAGGGTTCTTGCCGTCCTCCTCTCTGTCGATTGATGTTGTTGACGGTTTTTTTGAATGTAATTCTTCTTCTTCTTCCATGTTGGAAGCTCCTTCACGTGATA
CCTCTGGCGACGTGGTTCTTCGTCTCCGGCACTTGTGGCGTCTCCGGCGAGCAGACCCAGTCATGGTGAGCGGCGACACGTGCAGTTTCAAGCAGTTTTCTTCACTCATC
TTCAGCGGGCGTTCGTGGGTCTCCGACAAGCAAGGCGGCGCAACGTCAGCAGCAGGCGTTCGCGACGGTCCGAGCAGCTTCAGGCGAGCGATCTCCGACGTCTTCTTCGT
GGGTTCTTCTAGCGGAGCAGCGACGATGCAGGCACGACGCAACGGGTGTGTGGATGACAACTTAATCTTCTTCAAAGCATTGATGGAGGAAGGTGAAAGGATTAAAGAAG
TGCTTAGCACATACGAGAAAGCTACTGGGCAAAAAGTCAATCTGGATAAGTCTGCCTGTTTGATTAGTAAAAATGTGGCCCTGCCAAAAGCTTTGGAAATTTGCAAAAGG
ATGGGGATTAAACGCATTGAGTCCCTTGGGCAATACCTGGTGCTCCCGGAGCAATCGGGAAGAAAGAAGGTGGATCTGTTCAAGAGGATGAGAGAGCGAGTGTGGAAAAC
TCTTCAAGGGTGGAAGGGAAGCTTGTTCTCGCTCGGAGGGAAAGAAATTCTCATAAAAACTATTGTCCAAGTGTTGCCAATACCACAAATGCACGGGCCGCAAGCTCGGA
AATGGGGTAGCGTCGCGATGCTATTGGCACAGCGTCGCGACGCTGTGACGTCAACCAACTTGTTCGCCAAGAAATGGGACAGCGTCGCGATGCTGTCCCCACAGCTTCTC
CACGCTGTGCAGATTCCAGAACTGGGCAGGCGTGATGCTTCCTGGTGTGAGAAGGAAGGAAGAGGTGGGCTTGGTTGGTCCATTCGTGACTCTAACGGATCTCTTATTGG
AGCTGGGTGCAAATCGTTATCTCAGAAAAGGCCGATCAAGTGGCTGGAGGCGACTGCTCTTATGGAAGGGTTGAAATCGCTTGAGAATCAGATGGGAGAAAGGATCTCGA
TTCCTTTTTTGTCGTTGATCGTCGAGTCGGATTCTGTGGAAGTGGTGAAAATTTTAAACCACGAAGAGGAAGATTTATCTGAAATCTCCATGGTGATGGATGAAATTGAA
GCTGTGGCCCGACAAGTTAATGTCTCCTCCTTTTCCAAGTGTTCGAGAAAGGAGAATCAAGAGGCTCACCTTCTTGCACGCGCAGCTGTGCGACATGGAAACTGTAGTTA
TTTTTTTGGACGCTCCTTGTACCCTGGTATAGGGGAATCTTTTTTGTGTAGGGATGTCGTTATCCCTTCTTGGTTCTCTTCAATCATTTATGAAGGGGTTGGTGTAACTG
AATTTTATCATTAA
Protein sequenceShow/hide protein sequence
MEFRLESLQNHVSWTPPPPKFLKLNSDGSWNERLEVGGIGWVIRDSAGSLVLAHKRITRKRGINQMELTAIKEGLEAYLSLDRRRRTALIAEADAMDAIKVLNYEINDIS
EAKAIVIDIEHLTTKLGDVSFVKCPREGNRVAHGLARTTAGFLPSSSLSIDVVDGFFECNSSSSSMLEAPSRDTSGDVVLRLRHLWRLRRADPVMVSGDTCSFKQFSSLI
FSGRSWVSDKQGGATSAAGVRDGPSSFRRAISDVFFVGSSSGAATMQARRNGCVDDNLIFFKALMEEGERIKEVLSTYEKATGQKVNLDKSACLISKNVALPKALEICKR
MGIKRIESLGQYLVLPEQSGRKKVDLFKRMRERVWKTLQGWKGSLFSLGGKEILIKTIVQVLPIPQMHGPQARKWGSVAMLLAQRRDAVTSTNLFAKKWDSVAMLSPQLL
HAVQIPELGRRDASWCEKEGRGGLGWSIRDSNGSLIGAGCKSLSQKRPIKWLEATALMEGLKSLENQMGERISIPFLSLIVESDSVEVVKILNHEEEDLSEISMVMDEIE
AVARQVNVSSFSKCSRKENQEAHLLARAAVRHGNCSYFFGRSLYPGIGESFLCRDVVIPSWFSSIIYEGVGVTEFYH