; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg032838 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg032838
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold11:15757694..15764286
RNA-Seq ExpressionSpg032838
SyntenySpg032838
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR005135 - Endonuclease/exonuclease/phosphatase
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA3477524.1 reverse transcriptase [Gossypium australe]2.1e-3738.1Show/hide
Query:  IQGGGCGTAPPRAMKTLCWNVCGLGNPQTFQAVRDGIRHYDPHLLFLLEMKCSSNARNLNKIKVSLNYSGCFTMDCNGRSGRLCLFWKEEMDVTIRSFSQ
        ++GGG   APPRAMK LCWN  G+GNP   + ++  +   DP ++FL E KC SN  +L+ ++      GC  +D  GRSG L L W+E + VT++++S+
Subjt:  IQGGGCGTAPPRAMKTLCWNVCGLGNPQTFQAVRDGIRHYDPHLLFLLEMKCSSNARNLNKIKVSLNYSGCFTMDCNGRSGRLCLFWKEEMDVTIRSFSQ

Query:  FHIDAPIQW-DSKVWRFSGIYGNPNASHRNHTWNLFRRLHNNDDLAWIVGGDFNTTLLNEEKEGG---------------------------DMFTWSNR
        +HID+ +   DS+  RF+G YG  +   +  +W++ +R+ +     WIVGGDFN  L NEEKEGG                             FTWSN 
Subjt:  FHIDAPIQW-DSKVWRFSGIYGNPNASHRNHTWNLFRRLHNNDDLAWIVGGDFNTTLLNEEKEGG---------------------------DMFTWSNR

Query:  KDRESQINERLDRFVVNEAFIQL--FLNALV
        ++    + ERLDRF+V++ FI    F+NA+V
Subjt:  KDRESQINERLDRFVVNEAFIQL--FLNALV

KAF5443558.1 hypothetical protein F2P56_036105, partial [Juglans regia]7.4e-3843.58Show/hide
Query:  MKTLCWNVCGLGNPQTFQAVRDGIRHYDPHLLFLLEMKCSSNARNLNKIKVSLNYSGCFTMDCNGRSGRLCLFWKEEMDVTIRSFSQFHIDAPIQ-WDSK
        MK LCWN  GLGNPQ  + +RD I + DP L+FL E K    AR +   K  L+ + CFT+DC GRSG L L WK ++ V ++SFS  HIDA IQ  D  
Subjt:  MKTLCWNVCGLGNPQTFQAVRDGIRHYDPHLLFLLEMKCSSNARNLNKIKVSLNYSGCFTMDCNGRSGRLCLFWKEEMDVTIRSFSQFHIDAPIQ-WDSK

Query:  VWRFSGIYGNPNASHRNHTWNLFRRLHNNDDLAWIVGGDFNTTLLNEEKEG---------------------------GDMFTWSNRKDRESQINERLDR
         WRF+G+YGNP   +R  TWNL RRL++  D  W+VGGDFN  L   EK G                           G  +TW N +     I+ERLDR
Subjt:  VWRFSGIYGNPNASHRNHTWNLFRRLHNNDDLAWIVGGDFNTTLLNEEKEG---------------------------GDMFTWSNRKDRESQINERLDR

Query:  FVVNEAFIQLFLNALVEH
        F+ N  F  LF   +V H
Subjt:  FVVNEAFIQLFLNALVEH

KAF5449841.1 hypothetical protein F2P56_030246 [Juglans regia]8.2e-3742.66Show/hide
Query:  MKTLCWNVCGLGNPQTFQAVRDGIRHYDPHLLFLLEMKCSSNARNLNKIKVSLNYSGCFTMDCNGRSGRLCLFWKEEMDVTIRSFSQFHIDAPIQ-WDSK
        MK LCWN  GLGNPQ  +A+RD I   DP L+FL E K  + A  +   K    +  CF +DC GRSG L L WK E+ V+I SFS++HIDA IQ  D  
Subjt:  MKTLCWNVCGLGNPQTFQAVRDGIRHYDPHLLFLLEMKCSSNARNLNKIKVSLNYSGCFTMDCNGRSGRLCLFWKEEMDVTIRSFSQFHIDAPIQ-WDSK

Query:  VWRFSGIYGNPNASHRNHTWNLFRRLHNNDDLAWIVGGDFNTTLLNEEKEG---------------------------GDMFTWSNRKDRESQINERLDR
         WRF+G+YG+P+AS+R  TWNL R L  +    W+VGGDFN  L   EK G                           G  FTW N +    +I+ERLDR
Subjt:  VWRFSGIYGNPNASHRNHTWNLFRRLHNNDDLAWIVGGDFNTTLLNEEKEG---------------------------GDMFTWSNRKDRESQINERLDR

Query:  FVVNEAFIQLFLNALVEH
        F  N+   ++F    V H
Subjt:  FVVNEAFIQLFLNALVEH

XP_042950313.1 uncharacterized protein LOC122282426 [Carya illinoinensis]5.6e-3841.44Show/hide
Query:  MKTLCWNVCGLGNPQTFQAVRDGIRHYDPHLLFLLEMKCSSNARNLNKIKVSLNYSGCFTMDCNGRSGRLCLFWKEEMDVTIRSFSQFHIDAPIQWDS-K
        MKT+CWN  GLGNP   +A+RD I    P LLFL E K S+  + ++ +K  L +  CF++D  GRSG L L W  ++ V +RSFSQ+HID  I+ D   
Subjt:  MKTLCWNVCGLGNPQTFQAVRDGIRHYDPHLLFLLEMKCSSNARNLNKIKVSLNYSGCFTMDCNGRSGRLCLFWKEEMDVTIRSFSQFHIDAPIQWDS-K

Query:  VWRFSGIYGNPNASHRNHTWNLFRRLHNNDDLAWIVGGDFNTTLLNEEKEG---------------------------GDMFTWSNRKDRESQINERLDR
        VWRF+G+YG+P+ S R  TWNL R L + + L W+VGGD N  L   EK G                           G  FTW N +   + I E LDR
Subjt:  VWRFSGIYGNPNASHRNHTWNLFRRLHNNDDLAWIVGGDFNTTLLNEEKEG---------------------------GDMFTWSNRKDRESQINERLDR

Query:  FVVNEAFIQLFLNALVEHLNWA
        FV N+    LF + +V+H N A
Subjt:  FVVNEAFIQLFLNALVEHLNWA

XP_042980077.1 uncharacterized protein LOC122310261 [Carya illinoinensis]4.3e-3840.99Show/hide
Query:  MKTLCWNVCGLGNPQTFQAVRDGIRHYDPHLLFLLEMKCSSNARNLNKIKVSLNYSGCFTMDCNGRSGRLCLFWKEEMDVTIRSFSQFHIDAPIQWDS-K
        MKT+CWN CGLGNP   +A+RD I    P LLFL E K  +  + ++ +K  L +  CF++D  GRSG L L W  ++ V +RSFS++HID  I+ D   
Subjt:  MKTLCWNVCGLGNPQTFQAVRDGIRHYDPHLLFLLEMKCSSNARNLNKIKVSLNYSGCFTMDCNGRSGRLCLFWKEEMDVTIRSFSQFHIDAPIQWDS-K

Query:  VWRFSGIYGNPNASHRNHTWNLFRRLHNNDDLAWIVGGDFNTTLLNEEKEG---------------------------GDMFTWSNRKDRESQINERLDR
        VWRF+G+YG+P+A  R  TWNL R L + + + W+VGGD N  L   EK G                           G  FTW N +   + I ERLDR
Subjt:  VWRFSGIYGNPNASHRNHTWNLFRRLHNNDDLAWIVGGDFNTTLLNEEKEG---------------------------GDMFTWSNRKDRESQINERLDR

Query:  FVVNEAFIQLFLNALVEHLNWA
        FV N+    LF + +V+H N A
Subjt:  FVVNEAFIQLFLNALVEHLNWA

TrEMBL top hitse value%identityAlignment
A0A2N9FR55 Reverse transcriptase domain-containing protein7.2e-3933.75Show/hide
Query:  GGGCGTAPPRAMKTLCWNVCGLGNPQTFQAVRDGIRHYDPHLLFLLEMKCSSNARNLNKIKVSLNYSGCFTMDCNGRSGRLCLFWKEEMDVTIRSFSQFH
        GGGCG AP   M+ L  N  GLGNP++ + +   ++   P  LFL+E     N R L ++++ L YS C  +D  G+ G L L W +++ ++I ++S+F+
Subjt:  GGGCGTAPPRAMKTLCWNVCGLGNPQTFQAVRDGIRHYDPHLLFLLEMKCSSNARNLNKIKVSLNYSGCFTMDCNGRSGRLCLFWKEEMDVTIRSFSQFH

Query:  IDAPIQWDSKV-WRFSGIYGNPNASHRNHTWNLFRRLHNNDDLAWIVGGDFNTTLLNEEKEG---------------------------GDMFTWSNRKD
        ID  I   + V W F+G YG+P  ++R  +W L R LH+     W+V GDFN  +   EK G                           G  +TWSNR+D
Subjt:  IDAPIQWDSKV-WRFSGIYGNPNASHRNHTWNLFRRLHNNDDLAWIVGGDFNTTLLNEEKEG---------------------------GDMFTWSNRKD

Query:  RESQINERLDRFVVNEAFIQLFLNALVEHLNWASPITAQSFLMANESHLYTA-TIRNVSPELQN-----SSPKMGQR----------VELQLDKALEEEE
         +++   RLDR V ++ ++ L  NA ++H++           +A   HL  A      + ELQN     S  K+GQR          + ++L+  LEE+E
Subjt:  RESQINERLDRFVVNEAFIQLFLNALVEHLNWASPITAQSFLMANESHLYTA-TIRNVSPELQN-----SSPKMGQR----------VELQLDKALEEEE

Query:  IYWKQRSRENWLKWGDRNTR
         YW QRSR +WLK GD NT+
Subjt:  IYWKQRSRENWLKWGDRNTR

A0A2N9GKW3 Reverse transcriptase domain-containing protein1.4e-3734.94Show/hide
Query:  GGCGTAPPRAMKTLCWNVCGLGNPQTFQAVRDGIRHYDPHLLFLLEMKCSSNARNLNKIKVSLNYSGCFTMDCNGRSGRLCLFWKEEMDVTIRSFSQFHI
        GGCG APP AM  L WN  GLGNP+T Q +   +R  DP ++FL+E     +   L +++  L +   F      + G LCLFWK+E+ + + SFS  HI
Subjt:  GGCGTAPPRAMKTLCWNVCGLGNPQTFQAVRDGIRHYDPHLLFLLEMKCSSNARNLNKIKVSLNYSGCFTMDCNGRSGRLCLFWKEEMDVTIRSFSQFHI

Query:  DAPI-QWDSKVWRFSGIYGNPNASHRNHTWNLFRRLHNNDDLAWIVGGDFNTTLLNEEKEG---------------------------GDMFTWSNRKDR
        DA + +     WRF+G YG P    R  +WNL RRL+    L W   GDFN  +  EEK G                           G  FTW+N + R
Subjt:  DAPI-QWDSKVWRFSGIYGNPNASHRNHTWNLFRRLHNNDDLAWIVGGDFNTTLLNEEKEG---------------------------GDMFTWSNRKDR

Query:  ESQINERLDRFVVNEAFIQLFLNALVEHL-NWASPITAQSFLMANESHLYTATIRNVSPELQNSSPKMGQ--------RVELQLDKALEEEEIYWKQRSR
        +    ERLDR V    ++  F +A V HL  W   + + S    N      A I+ V   L+ +     Q        ++  +L   L +EE  W+QRSR
Subjt:  ESQINERLDRFVVNEAFIQLFLNALVEHL-NWASPITAQSFLMANESHLYTATIRNVSPELQNSSPKMGQ--------RVELQLDKALEEEEIYWKQRSR

Query:  ENWLKWGDRNTR
          WL+ GDRNTR
Subjt:  ENWLKWGDRNTR

A0A2N9IZB6 Reverse transcriptase domain-containing protein6.7e-3733.99Show/hide
Query:  GGGCGTAPPRAMKTLCWNVCGLGNPQTFQAVRDGIRHYDPHLLFLLEMKCSSNARNLNKIKVSLNYSGCFTMDCNGRSGRLCLFWKEEMDVTIRSFSQFH
        GGGC  APP  M  L  N  GLGNP+T   + D +R   P+++FL+E +     R L  + V L   GCF +D +G  G L L W   + V I+S+S +H
Subjt:  GGGCGTAPPRAMKTLCWNVCGLGNPQTFQAVRDGIRHYDPHLLFLLEMKCSSNARNLNKIKVSLNYSGCFTMDCNGRSGRLCLFWKEEMDVTIRSFSQFH

Query:  IDAPI-QWDSKVWRFSGIYGNPNASHRNHTWNLFRRLHNNDDLAWIVGGDFNTTLLNEEKEG---------------------------GDMFTWSNRKD
        IDA +   D + WR +G YG+P  S R  TW L RRL   ++  W+V  DFN  +  +E+ G                           G  FTWSNR+ 
Subjt:  IDAPI-QWDSKVWRFSGIYGNPNASHRNHTWNLFRRLHNNDDLAWIVGGDFNTTLLNEEKEG---------------------------GDMFTWSNRKD

Query:  RESQINERLDRFVVNEAFIQLFLNALVEHLNWASPITAQSFLMANESHLYTATIRNVSPELQNSSPKMGQRVELQLDKALEEEEIYWKQRSRENWLKWGD
         E  +  RLDR V N A+  LF NA V H+   S    +  +    +      +   S  +     +    +  ++   L +EE+ W+QRSR NWL  GD
Subjt:  RESQINERLDRFVVNEAFIQLFLNALVEHLNWASPITAQSFLMANESHLYTATIRNVSPELQNSSPKMGQRVELQLDKALEEEEIYWKQRSRENWLKWGD

Query:  RNT
        +NT
Subjt:  RNT

A0A5B6W8D4 Reverse transcriptase1.0e-3738.1Show/hide
Query:  IQGGGCGTAPPRAMKTLCWNVCGLGNPQTFQAVRDGIRHYDPHLLFLLEMKCSSNARNLNKIKVSLNYSGCFTMDCNGRSGRLCLFWKEEMDVTIRSFSQ
        ++GGG   APPRAMK LCWN  G+GNP   + ++  +   DP ++FL E KC SN  +L+ ++      GC  +D  GRSG L L W+E + VT++++S+
Subjt:  IQGGGCGTAPPRAMKTLCWNVCGLGNPQTFQAVRDGIRHYDPHLLFLLEMKCSSNARNLNKIKVSLNYSGCFTMDCNGRSGRLCLFWKEEMDVTIRSFSQ

Query:  FHIDAPIQW-DSKVWRFSGIYGNPNASHRNHTWNLFRRLHNNDDLAWIVGGDFNTTLLNEEKEGG---------------------------DMFTWSNR
        +HID+ +   DS+  RF+G YG  +   +  +W++ +R+ +     WIVGGDFN  L NEEKEGG                             FTWSN 
Subjt:  FHIDAPIQW-DSKVWRFSGIYGNPNASHRNHTWNLFRRLHNNDDLAWIVGGDFNTTLLNEEKEGG---------------------------DMFTWSNR

Query:  KDRESQINERLDRFVVNEAFIQL--FLNALV
        ++    + ERLDRF+V++ FI    F+NA+V
Subjt:  KDRESQINERLDRFVVNEAFIQL--FLNALV

A0A803Q9W0 Uncharacterized protein2.0e-3630.63Show/hide
Query:  PKQMIQGGGCGTAPPRAMKTLCWNVCGLGNPQTFQAVRDGIRHYDPHLLFLLEMKCSSNARNLNKIKVSLNYSGCFTMDCNGRSGRLCLFWKEEMDVTIR
        PK  I G  C    PR M +L WNV GLGNP T  A+ + ++ + P ++FL E +  SN   L  I++ L + GCF +D  G+SG L L WKE   V + 
Subjt:  PKQMIQGGGCGTAPPRAMKTLCWNVCGLGNPQTFQAVRDGIRHYDPHLLFLLEMKCSSNARNLNKIKVSLNYSGCFTMDCNGRSGRLCLFWKEEMDVTIR

Query:  SFSQFHIDAPIQWDSKV-WRFSGIYGNPNASHRNHTWNLFRRLHNNDDLAWIVGGDFN---------------------------TTLLNEEKEGGDMFT
        SF+ FHIDA I  +  + WRF+G YG+P+ S R H+W L +R+  N +  W+ GGDFN                           T  L E    G  FT
Subjt:  SFSQFHIDAPIQWDSKV-WRFSGIYGNPNASHRNHTWNLFRRLHNNDDLAWIVGGDFN---------------------------TTLLNEEKEGGDMFT

Query:  WSNRKDRESQINERLDRFVVNEAFIQLFLNALVEHLN-----------------------------------WASPITAQSFL--MANESHLYT------
        W N +   + I ERLDR +VN  + +++  A V+HL+                                   WA     Q  +  +  E +  T      
Subjt:  WSNRKDRESQINERLDRFVVNEAFIQLFLNALVEHLN-----------------------------------WASPITAQSFL--MANESHLYT------

Query:  ----------------------ATIRNVSPELQNSSPKMG-------QRVELQLDKALEEEEIYWKQRSRENWLKWGDRNTR
                              A I+++  E++  S +         + +E  L+  L +EE++WKQRSR  WL  GDRNTR
Subjt:  ----------------------ATIRNVSPELQNSSPKMG-------QRVELQLDKALEEEEIYWKQRSRENWLKWGDRNTR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTCATCCGAAGCAAATGATCCAAGGTGGAGGTTGTGGTACAGCCCCGCCGAGAGCCATGAAAACGCTATGTTGGAATGTCTGTGGTCTGGGGAATCCTCAGACATT
CCAAGCTGTTCGTGACGGTATACGTCACTATGATCCTCACCTACTTTTTCTGTTGGAAATGAAATGTAGTTCGAATGCTCGAAATCTTAATAAAATAAAGGTGTCTTTGA
ATTACTCGGGTTGTTTTACTATGGACTGTAATGGTCGTAGTGGTCGTCTATGTTTGTTTTGGAAAGAGGAAATGGACGTTACCATAAGATCTTTCTCTCAATTCCATATT
GATGCTCCGATCCAATGGGATTCCAAAGTATGGCGTTTCTCTGGTATTTACGGCAATCCAAATGCTAGCCATCGAAATCATACTTGGAATTTATTTCGTCGATTGCACAA
TAATGATGATTTGGCTTGGATTGTTGGTGGTGATTTTAATACAACTCTATTGAATGAGGAGAAGGAAGGTGGTGACATGTTTACTTGGTCAAATAGAAAAGACAGGGAAT
CCCAAATAAATGAACGCCTTGACAGATTCGTAGTGAACGAAGCTTTCATTCAATTATTTCTGAATGCATTAGTGGAGCATCTGAATTGGGCCAGTCCGATCACCGCCCAG
TCATTCTTAATGGCGAACGAGAGTCACCTTTACACGGCCACAATTAGAAATGTGTCTCCAGAATTGCAGAACTCGTCTCCAAAAATGGGGCAAAGGGTGGAGCTTCAGTT
GGATAAAGCACTAGAAGAAGAAGAGATTTACTGGAAGCAGCGATCTCGAGAGAACTGGCTTAAGTGGGGCGATAGAAATACGAGAAATAATTTAAACAATCACATGGCAG
TAATGGATTGGTCTAAGCGTTGTGAGTGGATCTATGATTATATGGAAGAGACAAGACCTATGCAAAATACCAACAAGGCACCTAAGAAGAGGCAGGAGGACACGACGTCT
ACGGTATTGAACAGAGAGAATCACACGGTATTGCAAAATAACACTGTCCGGCTGTATACGGACTGTGCTGTACGGACGCAGGCAGACGGGTCTGGGTATGGGGCAGTGGT
TGTCGGCATGGATGGGCAGGTTTGTGGTACGATGGAATTTTATGATTCAACCTCACATTCGCCACTTGCAGGGGAGGTAAATGCTTTACTTAATGGTCTTCGCCTTTTGC
AGCGTATGCAAATTTCTTGTGCTCATGTTTATTCCGACTCCACGAATGCCATCAGGATGATTATAGGAGATACTCAGATTACATCTGACGTGTATCATTGGATAATGCAA
ATTCGAAAGATGTCTGAGTCTTTTGAATTTGTATCTTTCAATCATGTGTCTAGAAATTGTAATAGGAGGGCGGATTCTCTTGCGAAACATGCCTTAACTCAGCGCAATTC
TATGCTTTGGCTAGGGAATGTCTCACAGGAGCTCGCATCAATGAGTGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACTCATCCGAAGCAAATGATCCAAGGTGGAGGTTGTGGTACAGCCCCGCCGAGAGCCATGAAAACGCTATGTTGGAATGTCTGTGGTCTGGGGAATCCTCAGACATT
CCAAGCTGTTCGTGACGGTATACGTCACTATGATCCTCACCTACTTTTTCTGTTGGAAATGAAATGTAGTTCGAATGCTCGAAATCTTAATAAAATAAAGGTGTCTTTGA
ATTACTCGGGTTGTTTTACTATGGACTGTAATGGTCGTAGTGGTCGTCTATGTTTGTTTTGGAAAGAGGAAATGGACGTTACCATAAGATCTTTCTCTCAATTCCATATT
GATGCTCCGATCCAATGGGATTCCAAAGTATGGCGTTTCTCTGGTATTTACGGCAATCCAAATGCTAGCCATCGAAATCATACTTGGAATTTATTTCGTCGATTGCACAA
TAATGATGATTTGGCTTGGATTGTTGGTGGTGATTTTAATACAACTCTATTGAATGAGGAGAAGGAAGGTGGTGACATGTTTACTTGGTCAAATAGAAAAGACAGGGAAT
CCCAAATAAATGAACGCCTTGACAGATTCGTAGTGAACGAAGCTTTCATTCAATTATTTCTGAATGCATTAGTGGAGCATCTGAATTGGGCCAGTCCGATCACCGCCCAG
TCATTCTTAATGGCGAACGAGAGTCACCTTTACACGGCCACAATTAGAAATGTGTCTCCAGAATTGCAGAACTCGTCTCCAAAAATGGGGCAAAGGGTGGAGCTTCAGTT
GGATAAAGCACTAGAAGAAGAAGAGATTTACTGGAAGCAGCGATCTCGAGAGAACTGGCTTAAGTGGGGCGATAGAAATACGAGAAATAATTTAAACAATCACATGGCAG
TAATGGATTGGTCTAAGCGTTGTGAGTGGATCTATGATTATATGGAAGAGACAAGACCTATGCAAAATACCAACAAGGCACCTAAGAAGAGGCAGGAGGACACGACGTCT
ACGGTATTGAACAGAGAGAATCACACGGTATTGCAAAATAACACTGTCCGGCTGTATACGGACTGTGCTGTACGGACGCAGGCAGACGGGTCTGGGTATGGGGCAGTGGT
TGTCGGCATGGATGGGCAGGTTTGTGGTACGATGGAATTTTATGATTCAACCTCACATTCGCCACTTGCAGGGGAGGTAAATGCTTTACTTAATGGTCTTCGCCTTTTGC
AGCGTATGCAAATTTCTTGTGCTCATGTTTATTCCGACTCCACGAATGCCATCAGGATGATTATAGGAGATACTCAGATTACATCTGACGTGTATCATTGGATAATGCAA
ATTCGAAAGATGTCTGAGTCTTTTGAATTTGTATCTTTCAATCATGTGTCTAGAAATTGTAATAGGAGGGCGGATTCTCTTGCGAAACATGCCTTAACTCAGCGCAATTC
TATGCTTTGGCTAGGGAATGTCTCACAGGAGCTCGCATCAATGAGTGCTTAG
Protein sequenceShow/hide protein sequence
MTHPKQMIQGGGCGTAPPRAMKTLCWNVCGLGNPQTFQAVRDGIRHYDPHLLFLLEMKCSSNARNLNKIKVSLNYSGCFTMDCNGRSGRLCLFWKEEMDVTIRSFSQFHI
DAPIQWDSKVWRFSGIYGNPNASHRNHTWNLFRRLHNNDDLAWIVGGDFNTTLLNEEKEGGDMFTWSNRKDRESQINERLDRFVVNEAFIQLFLNALVEHLNWASPITAQ
SFLMANESHLYTATIRNVSPELQNSSPKMGQRVELQLDKALEEEEIYWKQRSRENWLKWGDRNTRNNLNNHMAVMDWSKRCEWIYDYMEETRPMQNTNKAPKKRQEDTTS
TVLNRENHTVLQNNTVRLYTDCAVRTQADGSGYGAVVVGMDGQVCGTMEFYDSTSHSPLAGEVNALLNGLRLLQRMQISCAHVYSDSTNAIRMIIGDTQITSDVYHWIMQ
IRKMSESFEFVSFNHVSRNCNRRADSLAKHALTQRNSMLWLGNVSQELASMSA