; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028176 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028176
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDNA-directed DNA polymerase
Genome locationchr8:14940270..14944163
RNA-Seq ExpressionLag0028176
SyntenyLag0028176
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN05661.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]8.3e-13048.03Show/hide
Query:  VNQV--TDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQKKVNQPGFAKAQVMPQQNKSALPQQNSGNS
        VNQV  T   C  CGE H  + CP +  S+ FV N R   NNPYSN YNPGWR HPNFSW    NN Q Q      G A       Q +   P Q    S
Subjt:  VNQV--TDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQKKVNQPGFAKAQVMPQQNKSALPQQNSGNS

Query:  LEAMMKEFMAHTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVNAVTLRSGKPLEERI-EPSKTQVINNNGDRNNNVVVEKE
        LE  + +FMA       S  A+ + +E Q+GQLAN + +RPQG LPS+TE +PR++GK Q  AVTLR+G+ L+E + EP+K++            V+ +E
Subjt:  LEAMMKEFMAHTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVNAVTLRSGKPLEERI-EPSKTQVINNNGDRNNNVVVEKE

Query:  LEFGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQR-QKPKNQD-------------------EAIEQMPNYAKFLKDILTKKKRLGEFETVSL
         E                   +VE P     P     PFPQR QK K +                    EA+EQMP+Y KF+KDIL+KK+RLG++ETV+L
Subjt:  LEFGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQR-QKPKNQD-------------------EAIEQMPNYAKFLKDILTKKKRLGEFETVSL

Query:  TEECSAILNNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIIL
        TEECSAI+ N LPPK KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TYP+G IED+LVKVDKFIFP DF++L
Subjt:  TEECSAILNNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIIL

Query:  DYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILES-----AVIETAIQDSTNKHSENHGEVSVEDFEFCS
        D E D +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+PNE ++C  + + ++     ++ E ++        +   E + ED+E   
Subjt:  DYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILES-----AVIETAIQDSTNKHSENHGEVSVEDFEFCS

Query:  LDRKNEKQLFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGLAH
        +   +  + F+   V ESL   +R AP   +KPS+ E PTL+LKPLP HL Y YLGES+TLP+I++S L     E L+++L+ ++ AIGWT+ DI+G++ 
Subjt:  LDRKNEKQLFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGLAH

Query:  LFVCTKSL
         F   K L
Subjt:  LFVCTKSL

PIN22487.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]1.1e-12947.72Show/hide
Query:  VNQV--TDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQKKVNQPGFAKAQVMPQQNKSALPQQNSGNS
        VNQV  T   C  CGE H  + CP +  S+ FV N R   NNPYSN YNPGWR HPNFSW    NN Q Q      G A       Q +   P Q    S
Subjt:  VNQV--TDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQKKVNQPGFAKAQVMPQQNKSALPQQNSGNS

Query:  LEAMMKEFMAHTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVNAVTLRSGKPLEERI-EPSKTQVINNNGDRNNNVVVEKE
        LE  + +FMA       S  A+ + +E Q+GQLAN + +RPQG LPS+TE +PR++GK Q  AVTLR+G+ L+E + EP+K++            V+ +E
Subjt:  LEAMMKEFMAHTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVNAVTLRSGKPLEERI-EPSKTQVINNNGDRNNNVVVEKE

Query:  LEFGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQR-QKPKNQD-------------------EAIEQMPNYAKFLKDILTKKKRLGEFETVSL
         E                   +VE P     P     PFPQR QK K +                    EA+EQMP+Y KF+KDIL+KK+RLG++ETV+L
Subjt:  LEFGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQR-QKPKNQD-------------------EAIEQMPNYAKFLKDILTKKKRLGEFETVSL

Query:  TEECSAILNNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIIL
        TEECSAI+ N LPPK KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TYP+G IED+LVKVDKFIFP DF++L
Subjt:  TEECSAILNNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIIL

Query:  DYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILESAVIETAIQDSTNKHSENHGEVSVEDFEFCSLDRKN
        D E D +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+PNE ++C  + + +    + A  +S  +   +  E ++ D     LD +N
Subjt:  DYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILESAVIETAIQDSTNKHSENHGEVSVEDFEFCSLDRKN

Query:  EKQLFRCEDVFESLDLD-----------QRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLAD
        E+ L    +V ++LD             +R  P   +KPS+ + PTL+LKPLP HL Y YLGES+TLP+I++S L     E L+++L+ ++ AIGWT+AD
Subjt:  EKQLFRCEDVFESLDLD-----------QRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLAD

Query:  IQGLAHLFVCTKSL
        I+G++  F   K L
Subjt:  IQGLAHLFVCTKSL

PIN26668.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]2.0e-12847.2Show/hide
Query:  VNQV--TDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQKKVNQPGFAKAQVMPQQNKSALPQQNSGNS
        VNQV  T   C  CGE H    CP++  S+ FV N R   NNPYSN YNPGWR HPNFSW    NN Q Q      G A       Q +   P Q    S
Subjt:  VNQV--TDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQKKVNQPGFAKAQVMPQQNKSALPQQNSGNS

Query:  LEAMMKEFMAHTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVNAVTLRSGKPLEERI-EPSKTQVINNNGDRNNNVVVEKE
        LE  + +FMA       S   +++ +E Q+GQLAN + +RPQG L S+TE +PR++GK Q  AVTLR+G+ L+E + EP+K+        +   V+ EKE
Subjt:  LEAMMKEFMAHTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVNAVTLRSGKPLEERI-EPSKTQVINNNGDRNNNVVVEKE

Query:  LEFGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQ-----------------DEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEE
         +                   +VE           PL   Q+QK K Q                  EA+EQMP+Y KF+K IL+KK+RLG++ETV+LTEE
Subjt:  LEFGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQ-----------------DEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEE

Query:  CSAILNNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYE
        CSAI+ N LPPK KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLA+RS+TYP+G IED+LVKVDKFIFP DF++LD E
Subjt:  CSAILNNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYE

Query:  ADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILESAV--------IETAIQDSTNKHSENHGEVSVEDFEFCS
         D +VPIILGRPFLATGR LIDVQKG+LTMRV ++++ FNVFKAMK+PNE ++C  + + ++          +E A+ D  ++ +E   EV         
Subjt:  ADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILESAV--------IETAIQDSTNKHSENHGEVSVEDFEFCS

Query:  LDRKNEKQLFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGLAH
        +   +  + F+   V ESL   +R AP   +KPS+ E+PTL+LKPLP HL Y YLGES+TLP+I++S L     E L+++ + ++ AIGWT+ADI+G++H
Subjt:  LDRKNEKQLFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGLAH

Query:  LFVCTKSL
         F   K L
Subjt:  LFVCTKSL

XP_017239676.1 PREDICTED: uncharacterized protein LOC108212460 [Daucus carota subsp. sativus]8.3e-13045.38Show/hide
Query:  ASLAKNGQVYNEVISHQQPPAMEPAAVVNQVTD--EACVYCGEDHNYEFCP------SNPASVFFVG---NQRNNPYSNFYNPGWRNHPNFSWGGQGNNV
        A +    +  N+ +        +P    +QV +    C  CGE H  + CP      +  +SV +VG   NQ+NNP+SN YNPGWRNHPNFSW    NNV
Subjt:  ASLAKNGQVYNEVISHQQPPAMEPAAVVNQVTD--EACVYCGEDHNYEFCP------SNPASVFFVG---NQRNNPYSNFYNPGWRNHPNFSWGGQGNNV

Query:  QAQKKVNQ---PGFAKAQVMPQQNKSALPQQNSGNSLEAMMKEFMAHTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVNAV
        +      Q   PGF       QQN      +   N+ E ++ ++M  TDA IQS  ASMRALE+QVGQLA+ +  RP G LPS+TE +P+ + +E   A+
Subjt:  QAQKKVNQ---PGFAKAQVMPQQNKSALPQQNSGNSLEAMMKEFMAHTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVNAV

Query:  TLRSGKPLEERIEPSKTQVINNNGDRNNNVVVEKELEFGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQD----------------
        TLRSGK +E       T+ +++ GD    +  E  +           N  A  S P     +V PPP     PFPQR + + QD                
Subjt:  TLRSGKPLEERIEPSKTQVINNNGDRNNNVVVEKELEFGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQD----------------

Query:  ----EAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILNNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTV
            EA+EQM +Y KF+KDIL++K+RL EFETV+LTEECSAIL   LPPK KDPGSFTIP +IG +  G+ALCDLGAS+NLMPLS++ KLG+GE +PT+V
Subjt:  ----EAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILNNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTV

Query:  TLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILES
         LQLADRS+ YP G +EDVLVKVDKFIFP DFI+LD E D D+P++LGRPFLATGR LIDVQKGELTMRV +E+V FNVF AMK+ N+ E C  +     
Subjt:  TLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILES

Query:  A-----VIETAIQDSTNKHSENHGEVSVEDFEFCSLDRKNEKQLFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVAS
              ++E    D         G+ S E+   C  +        R    FES ++   K+   KPS+ E P L+LK LP HLKY +LGE  TLP+I++S
Subjt:  A-----VIETAIQDSTNKHSENHGEVSVEDFEFCSLDRKNEKQLFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVAS

Query:  DLMPEHEEALIKLLQQYRKAIGWTLADIQGLAHLFVCTK
         L  EHEE L+++L++Y++AIGW +ADI+G++  F   K
Subjt:  DLMPEHEEALIKLLQQYRKAIGWTLADIQGLAHLFVCTK

XP_024028757.1 uncharacterized protein LOC112093792 [Morus notabilis]1.2e-14449.52Show/hide
Query:  AMEPAAVVNQVTDEA--CVYCGEDHNYEFCPSNPASVFFVG--NQRNNPYSNFYNPGWRNHPNFSWGGQGNNVQ--AQKKVNQPGFAKAQVMPQ----QN
        ++  AA  N  T  A  CVYCG +H++E CPSNP SV +V   N+ NNPYSN YN GW+ HPNFSW  Q  N      K    PGF + Q   Q    Q+
Subjt:  AMEPAAVVNQVTDEA--CVYCGEDHNYEFCPSNPASVFFVG--NQRNNPYSNFYNPGWRNHPNFSWGGQGNNVQ--AQKKVNQPGFAKAQVMPQ----QN

Query:  KSALPQQNSGNSLEAMMKEFMAHTD-------AAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREG----KEQVNAVTLRSGKPLEERIEP
            P Q S   +EA++KE+MA  D       A +QS  AS+R LE QVGQLAN L  RPQG LPSDT++PRR+G    KE   A+TL++G+ +E+    
Subjt:  KSALPQQNSGNSLEAMMKEFMAHTD-------AAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREG----KEQVNAVTLRSGKPLEERIEP

Query:  SKTQVINNNGDRNNNVVVEKELEFGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQD--------------------EAIEQMPNYA
        ++      +       V +   E  Q        A    + P+  PP           PFPQR + + QD                    EA+EQMP+Y 
Subjt:  SKTQVINNNGDRNNNVVVEKELEFGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQD--------------------EAIEQMPNYA

Query:  KFLKDILTKKKRLGEFETVSLTEECSAILNNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEG
        KF+KDILTKK+RLGEFETV+LTEECSAIL N LPPK KDPGSFTIP SIG + +G+ALCDLGASINLMP+S++RKLGIGE  PTTVTLQLADRS  +PEG
Subjt:  KFLKDILTKKKRLGEFETVSLTEECSAILNNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEG

Query:  KIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILESAVIETAIQDSTNKH
        KIEDVLV+VDKFIFP DFI+LDYEADK+VPIILGRPFLATG+ LIDVQKGELTMRV +++V FNVFKAM++ +E+E+CS + +L+S V     +    K 
Subjt:  KIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILESAVIETAIQDSTNKH

Query:  SENHGEVSVEDFEFCSLDRKNEKQLFRCED---------VFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEAL
              +  E  E       N+KQ+ R E           FESLDL        KPS+ E P L+L+PLP HL+Y YLG+S+TLP+I+AS L    E  L
Subjt:  SENHGEVSVEDFEFCSLDRKNEKQLFRCED---------VFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEAL

Query:  IKLLQQYRKAIGWTLADIQGLA
        +++L+++++AIGWT+ADI+G++
Subjt:  IKLLQQYRKAIGWTLADIQGLA

TrEMBL top hitse value%identityAlignment
A0A2G9GK35 Reverse transcriptase4.0e-13048.03Show/hide
Query:  VNQV--TDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQKKVNQPGFAKAQVMPQQNKSALPQQNSGNS
        VNQV  T   C  CGE H  + CP +  S+ FV N R   NNPYSN YNPGWR HPNFSW    NN Q Q      G A       Q +   P Q    S
Subjt:  VNQV--TDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQKKVNQPGFAKAQVMPQQNKSALPQQNSGNS

Query:  LEAMMKEFMAHTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVNAVTLRSGKPLEERI-EPSKTQVINNNGDRNNNVVVEKE
        LE  + +FMA       S  A+ + +E Q+GQLAN + +RPQG LPS+TE +PR++GK Q  AVTLR+G+ L+E + EP+K++            V+ +E
Subjt:  LEAMMKEFMAHTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVNAVTLRSGKPLEERI-EPSKTQVINNNGDRNNNVVVEKE

Query:  LEFGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQR-QKPKNQD-------------------EAIEQMPNYAKFLKDILTKKKRLGEFETVSL
         E                   +VE P     P     PFPQR QK K +                    EA+EQMP+Y KF+KDIL+KK+RLG++ETV+L
Subjt:  LEFGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQR-QKPKNQD-------------------EAIEQMPNYAKFLKDILTKKKRLGEFETVSL

Query:  TEECSAILNNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIIL
        TEECSAI+ N LPPK KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TYP+G IED+LVKVDKFIFP DF++L
Subjt:  TEECSAILNNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIIL

Query:  DYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILES-----AVIETAIQDSTNKHSENHGEVSVEDFEFCS
        D E D +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+PNE ++C  + + ++     ++ E ++        +   E + ED+E   
Subjt:  DYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILES-----AVIETAIQDSTNKHSENHGEVSVEDFEFCS

Query:  LDRKNEKQLFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGLAH
        +   +  + F+   V ESL   +R AP   +KPS+ E PTL+LKPLP HL Y YLGES+TLP+I++S L     E L+++L+ ++ AIGWT+ DI+G++ 
Subjt:  LDRKNEKQLFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGLAH

Query:  LFVCTKSL
         F   K L
Subjt:  LFVCTKSL

A0A2G9HH15 Reverse transcriptase4.1e-12747.55Show/hide
Query:  VNQV--TDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQKKVNQPGFAKAQVMPQQNKSALPQQNSGNS
        VNQV  T   C  CGE H  + CP +  S+ FV N R   NNPYSN YNPGWR HPNFSW    NN Q Q      G A       Q +   P Q    S
Subjt:  VNQV--TDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQKKVNQPGFAKAQVMPQQNKSALPQQNSGNS

Query:  LEAMMKEFMAHTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVNAVTLRSGKPLEERI-EPSKTQVINNNGDRNNNVVVEKE
        LE  + +FMA       S  A+ + +E Q+GQLAN + +RP+  LPS+TE +PR++ K Q  AVTLR+G  L+E + EP+K++              EKE
Subjt:  LEAMMKEFMAHTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVNAVTLRSGKPLEERI-EPSKTQVINNNGDRNNNVVVEKE

Query:  LEFGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILNNGLPPKAKDPG
        +          E  G      ++E P                     + +A+EQMP+Y KF+KDIL+KK+RLG++ETV+LTEECSAI+ N LPPK KDPG
Subjt:  LEFGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILNNGLPPKAKDPG

Query:  SFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATG
        SFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TYP G IED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATG
Subjt:  SFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATG

Query:  RALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILESAVIETAIQDSTNKHSENHGEVSVEDFEFCSLDRKNEKQLFRCEDV--------FE
        R LIDVQKGELTMRV ++++ FNVFKAMK+PNE ++C  + + +               +E+  E  ++  E   LD  +E+    CE V        F+
Subjt:  RALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILESAVIETAIQDSTNKHSENHGEVSVEDFEFCSLDRKNEKQLFRCEDV--------FE

Query:  SLDLD--QRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGLAHLFVCTKSL
        S  ++  +R AP   +KPS+ E PTL+LKPLP HL Y YLGES+TLP+I++S L     E L+++L+ ++ AIGWT+ADI+G++  F   K L
Subjt:  SLDLD--QRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGLAHLFVCTKSL

A0A2G9HYA0 Reverse transcriptase5.2e-13047.72Show/hide
Query:  VNQV--TDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQKKVNQPGFAKAQVMPQQNKSALPQQNSGNS
        VNQV  T   C  CGE H  + CP +  S+ FV N R   NNPYSN YNPGWR HPNFSW    NN Q Q      G A       Q +   P Q    S
Subjt:  VNQV--TDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQKKVNQPGFAKAQVMPQQNKSALPQQNSGNS

Query:  LEAMMKEFMAHTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVNAVTLRSGKPLEERI-EPSKTQVINNNGDRNNNVVVEKE
        LE  + +FMA       S  A+ + +E Q+GQLAN + +RPQG LPS+TE +PR++GK Q  AVTLR+G+ L+E + EP+K++            V+ +E
Subjt:  LEAMMKEFMAHTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVNAVTLRSGKPLEERI-EPSKTQVINNNGDRNNNVVVEKE

Query:  LEFGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQR-QKPKNQD-------------------EAIEQMPNYAKFLKDILTKKKRLGEFETVSL
         E                   +VE P     P     PFPQR QK K +                    EA+EQMP+Y KF+KDIL+KK+RLG++ETV+L
Subjt:  LEFGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQR-QKPKNQD-------------------EAIEQMPNYAKFLKDILTKKKRLGEFETVSL

Query:  TEECSAILNNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIIL
        TEECSAI+ N LPPK KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TYP+G IED+LVKVDKFIFP DF++L
Subjt:  TEECSAILNNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIIL

Query:  DYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILESAVIETAIQDSTNKHSENHGEVSVEDFEFCSLDRKN
        D E D +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+PNE ++C  + + +    + A  +S  +   +  E ++ D     LD +N
Subjt:  DYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILESAVIETAIQDSTNKHSENHGEVSVEDFEFCSLDRKN

Query:  EKQLFRCEDVFESLDLD-----------QRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLAD
        E+ L    +V ++LD             +R  P   +KPS+ + PTL+LKPLP HL Y YLGES+TLP+I++S L     E L+++L+ ++ AIGWT+AD
Subjt:  EKQLFRCEDVFESLDLD-----------QRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLAD

Query:  IQGLAHLFVCTKSL
        I+G++  F   K L
Subjt:  IQGLAHLFVCTKSL

A0A2G9HYD8 Reverse transcriptase4.1e-12747.72Show/hide
Query:  EDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQKKVNQPGFAKAQVMPQQNKSALPQQNSGNSLEAMMKEFMAHTDAA
        E H  + CP +  S+ FV N R   NNPYSN YNPGWR HPNFSW    NN Q Q   + P F +    P Q     P Q    SLE  + +FMA     
Subjt:  EDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQKKVNQPGFAKAQVMPQQNKSALPQQNSGNSLEAMMKEFMAHTDAA

Query:  IQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVNAVTLRSGKPLEERI-EPSKTQVINNNGDRNNNVVVEKELEFGQGAGGSKENAG
          S  A+ + +E Q+GQLAN + +RPQG LPS+TE +PR++GK Q  AVTLR+G+ L+E + +P+K++            V+ KE E             
Subjt:  IQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVNAVTLRSGKPLEERI-EPSKTQVINNNGDRNNNVVVEKELEFGQGAGGSKENAG

Query:  ASGSVPDVEPPYVPPPPYVPPLPFPQR-QKPKNQD-------------------EAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILNNGLPPK
              +VE P     P     PFPQ+ QK K +                    EA+EQMP+Y KF+KDIL+KK+RLG++ET +LTEEC+AI+ N LPPK
Subjt:  ASGSVPDVEPPYVPPPPYVPPLPFPQR-QKPKNQD-------------------EAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILNNGLPPK

Query:  AKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRP
         KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TYP+G IED+LVKVDKFIFP DF++LD E D +VPIILGRP
Subjt:  AKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRP

Query:  FLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILES-----AVIETAIQDSTNKHSENHGEVSVEDFEFCSLDRKNEKQLFRCEDV
        FLATGR LIDVQKGELTMRV ++++ FNVFKAMK+PNE ++C  + + ++     ++ E  +        +   E + ED E   +   N  + F+   V
Subjt:  FLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILES-----AVIETAIQDSTNKHSENHGEVSVEDFEFCSLDRKNEKQLFRCEDV

Query:  FESLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGLAHLFVCTKSL
         ESL   +R  P   +KPS+ + PTL+LKPLP+HL YVYLGES+TLP+I++S L     E L+++L+ ++ AIGWT+ADI+G++  F   K L
Subjt:  FESLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGLAHLFVCTKSL

A0A2G9IA86 DNA-directed DNA polymerase9.9e-12947.2Show/hide
Query:  VNQV--TDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQKKVNQPGFAKAQVMPQQNKSALPQQNSGNS
        VNQV  T   C  CGE H    CP++  S+ FV N R   NNPYSN YNPGWR HPNFSW    NN Q Q      G A       Q +   P Q    S
Subjt:  VNQV--TDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQKKVNQPGFAKAQVMPQQNKSALPQQNSGNS

Query:  LEAMMKEFMAHTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVNAVTLRSGKPLEERI-EPSKTQVINNNGDRNNNVVVEKE
        LE  + +FMA       S   +++ +E Q+GQLAN + +RPQG L S+TE +PR++GK Q  AVTLR+G+ L+E + EP+K+        +   V+ EKE
Subjt:  LEAMMKEFMAHTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVNAVTLRSGKPLEERI-EPSKTQVINNNGDRNNNVVVEKE

Query:  LEFGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQ-----------------DEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEE
         +                   +VE           PL   Q+QK K Q                  EA+EQMP+Y KF+K IL+KK+RLG++ETV+LTEE
Subjt:  LEFGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQ-----------------DEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEE

Query:  CSAILNNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYE
        CSAI+ N LPPK KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLA+RS+TYP+G IED+LVKVDKFIFP DF++LD E
Subjt:  CSAILNNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYE

Query:  ADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILESAV--------IETAIQDSTNKHSENHGEVSVEDFEFCS
         D +VPIILGRPFLATGR LIDVQKG+LTMRV ++++ FNVFKAMK+PNE ++C  + + ++          +E A+ D  ++ +E   EV         
Subjt:  ADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILESAV--------IETAIQDSTNKHSENHGEVSVEDFEFCS

Query:  LDRKNEKQLFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGLAH
        +   +  + F+   V ESL   +R AP   +KPS+ E+PTL+LKPLP HL Y YLGES+TLP+I++S L     E L+++ + ++ AIGWT+ADI+G++H
Subjt:  LDRKNEKQLFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGLAH

Query:  LFVCTKSL
         F   K L
Subjt:  LFVCTKSL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCCCTGCCCACTCTATAGCACGAGAGGGACTTCTGTTTGTTGGTTGGACCTCAAACAGGTTGTTCATTAGAGGAGCACTGGGACTTAAGGATCAAGAGCGAAGAAA
CTACAAGGAGGCTGTTGCGTTTTCGTTTGTTGAAGCGTCGTTGGCGAAGAACGGTCAAGTCTACAACGAAGTGATTAGTCATCAGCAACCACCAGCTATGGAGCCTGCAG
CAGTGGTGAATCAAGTCACGGACGAAGCATGTGTCTATTGCGGTGAAGACCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGG
AACAACCCTTATTCTAACTTCTATAATCCAGGTTGGCGCAACCACCCCAACTTCTCATGGGGAGGTCAAGGAAATAACGTACAAGCACAAAAAAAGGTGAACCAGCCGGG
ATTTGCTAAAGCGCAGGTAATGCCCCAGCAAAATAAGTCGGCTTTGCCCCAGCAAAATTCGGGAAATTCTCTCGAGGCGATGATGAAAGAATTTATGGCTCACACAGACG
CTGCAATTCAAAGTAATCAAGCTTCAATGAGAGCCCTAGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTCAAGGCAAGGCCTCAAGGGAAACTTCCATCAGATACTGAA
CACCCTAGAAGGGAAGGTAAGGAGCAGGTAAATGCAGTTACTCTTAGGAGTGGTAAACCATTAGAAGAAAGAATTGAGCCTAGTAAAACCCAGGTTATAAATAATAATGG
TGATAGAAATAATAATGTTGTTGTTGAGAAAGAATTGGAGTTTGGTCAGGGAGCTGGAGGCAGCAAAGAGAATGCTGGAGCATCTGGTTCTGTGCCAGATGTAGAACCAC
CATATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGACAAAAGCCTAAGAATCAGGATGAAGCTATTGAGCAAATGCCTAATTATGCTAAATTCCTT
AAGGATATTTTGACTAAAAAGAAAAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGCTATTCTCAACAATGGGCTACCCCCCAAGGCTAAGGATCC
AGGGTCATTCACCATACCTGTGTCTATAGGTGGAAAGGAGTTAGGTAGAGCACTCTGTGATTTAGGTGCGAGCATTAACCTTATGCCTCTTTCGGTCTATCGAAAGTTAG
GTATTGGTGAAGCTAGACCTACCACAGTTACACTCCAATTAGCTGATAGGTCTATCACATATCCAGAGGGTAAAATTGAGGATGTCTTAGTAAAAGTTGATAAGTTCATA
TTTCCTGTTGATTTTATTATTTTAGACTATGAGGCTGATAAAGATGTCCCAATTATTCTAGGTCGTCCATTTTTGGCTACTGGTAGGGCATTAATAGATGTTCAAAAAGG
AGAATTAACAATGAGAGTCTGTAATGAGGAAGTGAAATTTAATGTCTTTAAAGCCATGAAATATCCAAACGAGATGGAGGATTGCTCCTTCATCAGGATTCTGGAGAGCG
CAGTTATTGAGACAGCAATACAGGATTCGACTAATAAGCATTCGGAAAATCATGGAGAGGTTAGTGTAGAAGATTTTGAATTTTGTTCTTTAGATAGAAAAAATGAAAAA
CAATTGTTTAGGTGTGAGGATGTTTTTGAGTCTTTAGATTTAGATCAAAGAAAGGCTCCCCCAATTAAACCATCTCTGATTGAGGCACCCACTTTAGATTTGAAACCTTT
ACCGGATCATCTAAAGTATGTGTATCTTGGGGAAAGTGAGACGTTGCCCATTATTGTTGCATCAGATTTAATGCCAGAGCATGAGGAGGCCTTAATAAAATTGCTGCAGC
AATACCGCAAAGCTATAGGTTGGACATTGGCTGATATTCAGGGATTAGCCCATCTTTTTGTATGCACAAAATCACTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCCCTGCCCACTCTATAGCACGAGAGGGACTTCTGTTTGTTGGTTGGACCTCAAACAGGTTGTTCATTAGAGGAGCACTGGGACTTAAGGATCAAGAGCGAAGAAA
CTACAAGGAGGCTGTTGCGTTTTCGTTTGTTGAAGCGTCGTTGGCGAAGAACGGTCAAGTCTACAACGAAGTGATTAGTCATCAGCAACCACCAGCTATGGAGCCTGCAG
CAGTGGTGAATCAAGTCACGGACGAAGCATGTGTCTATTGCGGTGAAGACCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGG
AACAACCCTTATTCTAACTTCTATAATCCAGGTTGGCGCAACCACCCCAACTTCTCATGGGGAGGTCAAGGAAATAACGTACAAGCACAAAAAAAGGTGAACCAGCCGGG
ATTTGCTAAAGCGCAGGTAATGCCCCAGCAAAATAAGTCGGCTTTGCCCCAGCAAAATTCGGGAAATTCTCTCGAGGCGATGATGAAAGAATTTATGGCTCACACAGACG
CTGCAATTCAAAGTAATCAAGCTTCAATGAGAGCCCTAGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTCAAGGCAAGGCCTCAAGGGAAACTTCCATCAGATACTGAA
CACCCTAGAAGGGAAGGTAAGGAGCAGGTAAATGCAGTTACTCTTAGGAGTGGTAAACCATTAGAAGAAAGAATTGAGCCTAGTAAAACCCAGGTTATAAATAATAATGG
TGATAGAAATAATAATGTTGTTGTTGAGAAAGAATTGGAGTTTGGTCAGGGAGCTGGAGGCAGCAAAGAGAATGCTGGAGCATCTGGTTCTGTGCCAGATGTAGAACCAC
CATATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGACAAAAGCCTAAGAATCAGGATGAAGCTATTGAGCAAATGCCTAATTATGCTAAATTCCTT
AAGGATATTTTGACTAAAAAGAAAAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGCTATTCTCAACAATGGGCTACCCCCCAAGGCTAAGGATCC
AGGGTCATTCACCATACCTGTGTCTATAGGTGGAAAGGAGTTAGGTAGAGCACTCTGTGATTTAGGTGCGAGCATTAACCTTATGCCTCTTTCGGTCTATCGAAAGTTAG
GTATTGGTGAAGCTAGACCTACCACAGTTACACTCCAATTAGCTGATAGGTCTATCACATATCCAGAGGGTAAAATTGAGGATGTCTTAGTAAAAGTTGATAAGTTCATA
TTTCCTGTTGATTTTATTATTTTAGACTATGAGGCTGATAAAGATGTCCCAATTATTCTAGGTCGTCCATTTTTGGCTACTGGTAGGGCATTAATAGATGTTCAAAAAGG
AGAATTAACAATGAGAGTCTGTAATGAGGAAGTGAAATTTAATGTCTTTAAAGCCATGAAATATCCAAACGAGATGGAGGATTGCTCCTTCATCAGGATTCTGGAGAGCG
CAGTTATTGAGACAGCAATACAGGATTCGACTAATAAGCATTCGGAAAATCATGGAGAGGTTAGTGTAGAAGATTTTGAATTTTGTTCTTTAGATAGAAAAAATGAAAAA
CAATTGTTTAGGTGTGAGGATGTTTTTGAGTCTTTAGATTTAGATCAAAGAAAGGCTCCCCCAATTAAACCATCTCTGATTGAGGCACCCACTTTAGATTTGAAACCTTT
ACCGGATCATCTAAAGTATGTGTATCTTGGGGAAAGTGAGACGTTGCCCATTATTGTTGCATCAGATTTAATGCCAGAGCATGAGGAGGCCTTAATAAAATTGCTGCAGC
AATACCGCAAAGCTATAGGTTGGACATTGGCTGATATTCAGGGATTAGCCCATCTTTTTGTATGCACAAAATCACTCTAG
Protein sequenceShow/hide protein sequence
MGPAHSIAREGLLFVGWTSNRLFIRGALGLKDQERRNYKEAVAFSFVEASLAKNGQVYNEVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR
NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQKKVNQPGFAKAQVMPQQNKSALPQQNSGNSLEAMMKEFMAHTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE
HPRREGKEQVNAVTLRSGKPLEERIEPSKTQVINNNGDRNNNVVVEKELEFGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDEAIEQMPNYAKFL
KDILTKKKRLGEFETVSLTEECSAILNNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFI
FPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPNEMEDCSFIRILESAVIETAIQDSTNKHSENHGEVSVEDFEFCSLDRKNEK
QLFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGWTLADIQGLAHLFVCTKSL