CuGenDBv2

Lag0001169 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0001169
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr4:25947235..25951306
RNA-Seq Expression: Lag0001169
Synteny: Lag0001169
Gene Ontology terms: NA
InterPro domains:
    IPR021109 - Aspartic peptidase domain superfamily
    IPR043502 - DNA/RNA polymerase superfamily
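
As a small illustration of how the genome location string in this record can be used, the following Python sketch splits `chr4:25947235..25951306` into chromosome, start, and end, and computes the span length. The helper names `parse_location` and `span_length` are hypothetical, and the 1-based inclusive coordinate convention is an assumption; the page itself does not state which convention it uses.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


chrom, start, end = parse_location("chr4:25947235..25951306")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus spans 4072 bp on chr4.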


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
PIN22487.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]; e value: 5.4e-117; % identity: 41.73
Query:  VNQVAEE--ACVYCGENHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGIAQ----------------------QNKQALPQQNSGSSLEAMMKEFMA
        VNQV      C  CGE H  + CP +  S+ FV N R   NNPYSN YNPG  Q                      Q +   P Q    SLE  + +FMA
Subjt:  VNQVAEE--ACVYCGENHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGIAQ----------------------QNKQALPQQNSGSSLEAMMKEFMA

Query:  RTNAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERI-EPSKTQVIDKNGDKNVVVEQELESGQGAGGRNK
               S  A+ + +E Q+GQLAN + +RPQG LPS+TE +PR++GK Q +AVTLR+G+ L+E + EP+K++      +K V+ E++ +          
Subjt:  RTNAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERI-EPSKTQVIDKNGDKNVVVEQELESGQGAGGRNK

Query:  DAGASGPVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFRKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSVILKNGL
                 +VE P     P     PFPQR + +  + QF KFLE+ K+LHINIP  EA+EQMP+Y KF+KDIL+KK+RLG++ETV+LTEECS I++N L
Subjt:  DAGASGPVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFRKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSVILKNGL

Query:  PPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKI----LESTVIETAIHDSASKHSE----------------------------------
        PPK KDPGSFTIP +IG    GRALCDLGASINLMP S+YR +     + T I   + D +  + +                                  
Subjt:  PPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKI----LESTVIETAIHDSASKHSE----------------------------------

Query:  --------------KHGEVS--VEDFEIC---------------------------------------------SLERKNEKELFRCEEVFESLDLD---
                      + GE++  V+D +I                                               L+ +NE++L    EV ++LD     
Subjt:  --------------KHGEVS--VEDFEIC---------------------------------------------SLERKNEKELFRCEEVFESLDLD---

Query:  --------QRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASNLMPGHEEALIKLLQQYQKAIGWTLADIQGISPSFCMHKITLEEGSF
                +R  P   +KPS+ + PTL+LKPLP HL Y YLGE +TLP+I++S+L     E L+++L+ ++ AIGWT+ADI+GISPSFCMHKI LE+   
Subjt:  --------QRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASNLMPGHEEALIKLLQQYQKAIGWTLADIQGISPSFCMHKITLEEGSF

Query:  RSIEQQRKLNPAMKEVVKNEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVIVVSNKDNELIPTRTVTGWR
         S+E QR+LNP MKEVVK E+IKWLDAGIIYPI+DS+WVSPVQCVPKKGG+ VV N  NELIPTRTVTGWR
Subjt:  RSIEQQRKLNPAMKEVVKNEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVIVVSNKDNELIPTRTVTGWR

PIN22518.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]; e value: 3.1e-112; % identity: 41.1
Query:  ENHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGIAQ----------------------QNKQALPQQNSGSSLEAMMKEFMARTNAAIQSNQASMRA
        E+H  + CP +  S+ FV N R   NNPYSN YNPG  Q                      Q     P Q    SLE  + +FMA       S  A+ + 
Subjt:  ENHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGIAQ----------------------QNKQALPQQNSGSSLEAMMKEFMARTNAAIQSNQASMRA

Query:  LELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERI-EPSKTQVIDKNGDKNVVVEQELESGQGAGGRNKDAGASGPVPDVEPPY
        +E Q+GQLAN + +RPQG LPS+TE +PR++GK Q +AVTLR+G+ L+E + +P+K++       +  V+ +E E                   +VE P 
Subjt:  LELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERI-EPSKTQVIDKNGDKNVVVEQELESGQGAGGRNKDAGASGPVPDVEPPY

Query:  VPPPPYVPPLPFPQRQKPKNQDGQFRKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPGSFTIPVS
            P     PFPQ+ + +  + QF KFLE+ K+LHINIP  EA+EQMP+Y KF+KDIL+KK+RLG++ET +LTEEC+ I++N LPPK KDPGSFTIP +
Subjt:  VPPPPYVPPLPFPQRQKPKNQDGQFRKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPGSFTIPVS

Query:  IGGKELGRALCDLGASINLMPLSVYRK-------------------------ILESTVIET---------------------------------------
        IG    GRALCDLGASINLMP S+YR                          ++E  +++                                        
Subjt:  IGGKELGRALCDLGASINLMPLSVYRK-------------------------ILESTVIET---------------------------------------

Query:  ------AIHD---------------------SASKHSEKHGEVSVEDFEICSLERK--------NEKEL-----FRCEEVFESLDLD--QRKAPP--IKP
               + D                     S S      G  S+ +  + SLER         NE++L         + F+S  ++  +R  P   +KP
Subjt:  ------AIHD---------------------SASKHSEKHGEVSVEDFEICSLERK--------NEKEL-----FRCEEVFESLDLD--QRKAPP--IKP

Query:  SLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASNLMPGHEEALIKLLQQYQKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRKLNPAMKEVVKN
        S+ + PTL+LKPLP+HL YVYLGE +TLP+I++S+L     E L+++L+ ++ AIGWT+ADI+GISPSFCMHKI LE+    S+E QR+LN  MKEVVK 
Subjt:  SLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASNLMPGHEEALIKLLQQYQKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRKLNPAMKEVVKN

Query:  EVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVIVVSNKDNELIPTRTVTGWR
        E+IKWLDAGIIYPI+DS+WVSPVQCVPKKGG+ VV N  NELIPTRTVTGWR
Subjt:  EVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVIVVSNKDNELIPTRTVTGWR

XP_017227899.1 PREDICTED: uncharacterized protein LOC108203467 [Daucus carota subsp. sativus]; e value: 1.3e-113; % identity: 44.5
Query:  MMKEFM-------ARTNAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERIEPSK-TQVIDKNGDKNVVVEQ
        M+KE++       ++T A + S  AS+R LE QVGQLANEL+ RP G L SDTE P+  G E  KA+TL+SGK L      +K    ++ +G++ +  ++
Subjt:  MMKEFM-------ARTNAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERIEPSK-TQVIDKNGDKNVVVEQ

Query:  ELESGQGAGGRNKDAGASGPVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFRKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVS
        E E+ +           S P   +E  ++ P P     PFPQR + + QD QF+KFL++LKQLHINIPLVEA+EQMPNY KF+KDILTKK+RLGEFETV+
Subjt:  ELESGQGAGGRNKDAGASGPVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFRKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVS

Query:  LTEECSVILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKI----LESTVIETAIHDSASKHSE---------------------
        LT+ECS  L++ LP K KDPGSFTIP +IG    G ALCDLGASINLMP+SV+RK+    +  T +   + D +  H E                     
Subjt:  LTEECSVILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKI----LESTVIETAIHDSASKHSE---------------------

Query:  ---------------------------KHGEVSV------------------EDFEICSL------------------------------ERKNEKELF-
                                   ++GE+++                  +D E CS                               E  +  EL  
Subjt:  ---------------------------KHGEVSV------------------EDFEICSL------------------------------ERKNEKELF-

Query:  ---------RCEEVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASNLMPGHEEALIKLLQQYQKAIGWTLADIQGISPSFC
                 R    FESLDL  R+    K S+ E P L+LK LP HLKY YLG   TLP+I+++ L    EE L++LL+ ++KAIGW++ADI+GISPS C
Subjt:  ---------RCEEVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASNLMPGHEEALIKLLQQYQKAIGWTLADIQGISPSFC

Query:  MHKITLEEGSFRSIEQQRKLNPAMKEVVKNEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVIVVSNKDNELIPTRTVTGWR
        MHKI LE+G+  SIE QR+LNP MKEVVK EVIKWLDAGIIYPI+DS WVSPVQCVPKKGG+ V+ N++NELIPTR VTGWR
Subjt:  MHKITLEEGSFRSIEQQRKLNPAMKEVVKNEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVIVVSNKDNELIPTRTVTGWR

XP_017239676.1 PREDICTED: uncharacterized protein LOC108212460 [Daucus carota subsp. sativus]; e value: 1.4e-117; % identity: 40.64
Query:  EPIAVVNQV--AEEACVYCGENHNYEFCP------SNPASVFFVG---NQRNNPYSNFYNPG------IAQQN--------KQALP---QQN-------S
        +P+   +QV      C  CGE H  + CP      +  +SV +VG   NQ+NNP+SN YNPG       +  N        KQ +P   QQN        
Subjt:  EPIAVVNQV--AEEACVYCGENHNYEFCP------SNPASVFFVG---NQRNNPYSNFYNPG------IAQQN--------KQALP---QQN-------S

Query:  GSSLEAMMKEFMARTNAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERIEPSKTQVIDKNGDKNVVVEQE
          + E ++ ++M +T+A IQS  ASMRALE+QVGQLA+ +  RP G LPS+TE +P+ + +E  KA+TLRSGK +E       T+ +D  GD   V+ +E
Subjt:  GSSLEAMMKEFMARTNAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERIEPSKTQVIDKNGDKNVVVEQE

Query:  LESGQGAGGRNKDAGASGPVPDVEPP--YVPPPPYVPPLPFPQRQKPKNQDGQFRKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETV
                     +  S P  D   P  +V PPP     PFPQR + + QD QF+KF+++ K+L INIP  EA+EQM +Y KF+KDIL++K+RL EFETV
Subjt:  LESGQGAGGRNKDAGASGPVPDVEPP--YVPPPPYVPPLPFPQRQKPKNQDGQFRKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETV

Query:  SLTEECSVILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKILESTVIETAIHDSASKHSEKHGEVSVE-------------DFE
        +LTEECS IL+  LPPK KDPGSFTIP +IG +  G+ALCDLGAS+NLMPLS++ K+    V  T++    +  S  +    VE             DF 
Subjt:  SLTEECSVILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKILESTVIETAIHDSASKHSEKHGEVSVE-------------DFE

Query:  ICSLER--------------------------------------------------------------------------------------KNEKELFR
        +  +E                                                                                       ++ +E+  
Subjt:  ICSLER--------------------------------------------------------------------------------------KNEKELFR

Query:  CEEVFESLDLDQR------------KAPPIKPSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASNLMPGHEEALIKLLQQYQKAIGWTLADIQGISPS
        C +   +L   +R            K+   KPS+ E P L+LK LP HLKY +LGE  TLP+I++S L   HEE L+++L++Y++AIGW +ADI+GISPS
Subjt:  CEEVFESLDLDQR------------KAPPIKPSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASNLMPGHEEALIKLLQQYQKAIGWTLADIQGISPS

Query:  FCMHKITLEEGSFRSIEQQRKLNPAMKEVVKNEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVIVVSNKDNELIPTRTVTGWR
        FCMHKI++E+    +IE QR+LNP MKEVVK E+IKWLDAGIIYPI+DS+WVSP+QCVPKKGG+ VV+N+ NELIPTRTVTGWR
Subjt:  FCMHKITLEEGSFRSIEQQRKLNPAMKEVVKNEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVIVVSNKDNELIPTRTVTGWR

XP_023522102.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785979 [Cucurbita pepo subsp. pepo]; e value: 4.2e-117; % identity: 44.46
Query:  AMMKEFMARTNAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERIEPSK-------TQVIDKNGDKNVVVEQ
        +++KE+MA+ +AAIQS QAS+R LE+QVGQLANEL+ RP  KLP+DTE P+REG EQ +A+ LRSGK +  R E  K        +  D    K   V Q
Subjt:  AMMKEFMARTNAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERIEPSK-------TQVIDKNGDKNVVVEQ

Query:  ELESGQGAGGRNKDAGASGPVPDVEPPYVPPPP--------YVPPLPFPQRQKPKNQDGQFRKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKR
        E  +   A    +   +      V+PP              Y P  PFPQR K K ++  F KF++I K++HINIPLVEA++QMPNY KFLKD+LT +++
Subjt:  ELESGQGAGGRNKDAGASGPVPDVEPPYVPPPP--------YVPPLPFPQRQKPKNQDGQFRKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKR

Query:  LGEFETVSLTEECSVILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKILESTVIETAIHDSASKHSEKHGEVSVEDF-------
          EF+ V L EECS ILKN +P K KDPGSFTIP+SIGGK+LGRALCDLG+SINLMPLS+Y+K+       T +    +  S  + E  +ED        
Subjt:  LGEFETVSLTEECSVILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKILESTVIETAIHDSASKHSEKHGEVSVEDF-------

Query:  ---------------------------------------------------------------EICSL----------------ERKNEKE---------
                                                                       E CS                 E   E++         
Subjt:  ---------------------------------------------------------------EICSL----------------ERKNEKE---------

Query:  ---LFRCEEVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASNLMPGHEEALIKLLQQYQKAIGWTLADIQGISPSFCMHKI
           L      FESL+ + RK+ P++PS+ EAP LDLKPLP +LKY YLG+ +TLPII+++ L    E+ L++ L++++ AIGWTLADI+GISPS CMHKI
Subjt:  ---LFRCEEVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASNLMPGHEEALIKLLQQYQKAIGWTLADIQGISPSFCMHKI

Query:  TLEEGSFRSIEQQRKLNPAMKEVVKNEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVIVVSNKDNELIPTRTVTGWR
         LEEG  +SIEQQR+LNP MKEVV+ E++KWLDAGIIYPIA+S+ VSP+QCVPKKGG+ V++N++NELI TR V GWR
Subjt:  TLEEGSFRSIEQQRKLNPAMKEVVKNEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVIVVSNKDNELIPTRTVTGWR
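
The alignment blocks above follow the usual BLAST pairwise layout, in which the middle line shows a letter where query and subject residues are identical, `+` where the substitution scores positively, and a space otherwise. As a rough sketch, identities and positives for one alignment segment can be counted from that middle line as below. The function name `midline_stats` is hypothetical, and the caller is assumed to have already removed the 8-character label column (`Query:  `) from both strings.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, alignment_length) for one segment.

    `query` is the aligned query row (gaps included) and `midline` is the
    row printed beneath it. Trailing spaces are often lost when a page is
    copied, so the midline is padded to the query length first.
    """
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)  # letters mark identical residues
    positives = identities + midline.count("+")       # '+' marks positive-scoring substitutions
    return identities, positives, len(query)


# First 24 columns of the PIN22487.1 alignment, label column removed.
query = "VNQVAEE--ACVYCGENHNYEFCP"
midline = "VNQV      C  CGE H  + CP"
identities, positives, length = midline_stats(query, midline)
print(f"{identities}/{length} identities, {positives}/{length} positives")
```

Summing these counts over all segments of a hit and dividing identities by the total alignment length gives a percent identity comparable to the value reported for that hit, provided the segments cover the whole alignment.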

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A2G9G6G2 Reverse transcriptase; e value: 8.8e-105; % identity: 40.55
Query:  VNQVAEE--ACVYCGENHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGIAQ----------------------QNKQALPQQNSGSSLEAMMKEFMA
        VNQV      C  CGE H  + CP +  S+ FV N R   NNPYSN YNPG  Q                      Q +   P Q    SLE  + +FMA
Subjt:  VNQVAEE--ACVYCGENHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGIAQ----------------------QNKQALPQQNSGSSLEAMMKEFMA

Query:  RTNAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERI-EPSKTQVIDKNGDKNVVVEQELESGQGAGGRNKD
               S  A+ + +E Q+GQLAN + +RPQG LPS+TE   R+       VTLR+G+ L+E + EP+K++      +K V+ E++ +           
Subjt:  RTNAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERI-EPSKTQVIDKNGDKNVVVEQELESGQGAGGRNKD

Query:  AGASGPVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFRKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSVILKNGLP
                +VE P                                L++LHINIP  EA+EQMP+Y KF+KDIL+KK+RLG++E V+LTEECS I++N LP
Subjt:  AGASGPVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFRKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSVILKNGLP

Query:  PKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKI----LESTVIETAIHD------------------------------------------
        PK K+PGSFTIP +IG    GRALCDLGASINLMP S+YR +     + T I   + D                                          
Subjt:  PKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKI----LESTVIETAIHD------------------------------------------

Query:  ---------------------------SASKHSEKHGEVSVEDFEICSLER--------KNEKELFRCEEV--------FESLDLD--QRKAPP--IKPS
                                   + S      G  S+ +  +  LER        +NEK+   CE V        F+S  ++  +R AP   +KPS
Subjt:  ---------------------------SASKHSEKHGEVSVEDFEICSLER--------KNEKELFRCEEV--------FESLDLD--QRKAPP--IKPS

Query:  LIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASNLMPGHEEALIKLLQQYQKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRKLNPAMKEVVKNE
        + E PTL+LKPLP HL Y YLGE +TLP+I++S+L     E L+++L+ ++  IGWT+ADI+GISPSFCMHKI LE+    SIE QR+LNP MKEVVK E
Subjt:  LIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASNLMPGHEEALIKLLQQYQKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRKLNPAMKEVVKNE

Query:  VIKWLDAGIIYPIADSNWVSPVQCVPKKGGVIVVSNKDNELIPTRTVTGWR
        +IKWLDAGIIYPI+DS+WVSPVQCVPKKGG+ VV N  NELIPTRTVTGWR
Subjt:  VIKWLDAGIIYPIADSNWVSPVQCVPKKGGVIVVSNKDNELIPTRTVTGWR

A0A2G9H2I8 DNA-directed DNA polymerase; e value: 1.6e-106; % identity: 38.28
Query:  REQRRNQMENVSQLPQVP---EGPTDA--DPQNRVLQQNPLFEHNEQQNNQAENPILIANDRTRAIRAVDLAMIANALK--NVTVISHQQPLAMEPIA--
        R Q R       +LP V     G TD   D  N +   + LF    + +N   N  L+AN   +          A  +K   VT ++ +    M+ +   
Subjt:  REQRRNQMENVSQLPQVP---EGPTDA--DPQNRVLQQNPLFEHNEQQNNQAENPILIANDRTRAIRAVDLAMIANALK--NVTVISHQQPLAMEPIA--

Query:  VVNQVAEE--ACVYCGENHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGIAQ----------------------QNKQALPQQNSGSSLEAMMKEFM
         VNQV      C  CGE H  + CP +  S+ FV N R   NNPYSN YNP   Q                      Q +   P Q    SLE  + +FM
Subjt:  VVNQVAEE--ACVYCGENHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGIAQ----------------------QNKQALPQQNSGSSLEAMMKEFM

Query:  ARTNAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERI-EPSKTQVIDKNGDKNVVVEQELESGQGAGGRN
        A       S  A+ + +E Q+GQLAN + ++PQG LPS+TE +PR+ GK Q +AVTLR+G+ L+E I EP+K++            E+E+E+        
Subjt:  ARTNAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERI-EPSKTQVIDKNGDKNVVVEQELESGQGAGGRN

Query:  KDAGASGPVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFRKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSVILKNG
                     P  V  P  + P  FPQ  + +  + QF KFLE+ K+LHINIP  EA+EQMP+Y KF+KDIL+KK+RLG++ETV+LTEECS I++N 
Subjt:  KDAGASGPVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFRKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSVILKNG

Query:  LPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKI----LESTVIETAIHDSASKHSE---------------------------------
        LPPK KDPGSF IP +IG    GRALCDLGASINLMP S+YR +     + T I   + D +  + +                                 
Subjt:  LPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKI----LESTVIETAIHDSASKHSE---------------------------------

Query:  ---------------KHGEVS--VEDFEIC---------------------------------------------SLERKNEKE-----LFRCEEVFESL
                       + GE++  V+D +I                                               L+ +NE++     +    + F+S 
Subjt:  ---------------KHGEVS--VEDFEIC---------------------------------------------SLERKNEKE-----LFRCEEVFESL

Query:  DLD--QRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASNLMPGHEEALIKLLQQYQKAIGWTLADIQGISPSFCMHKITLEEGSFRSI
         ++  +R AP   +KPS+ E PTL+LKPLP+HL YVYLGE +TLP+I++ +L     E L+++L+ ++ AIGWT+ADI+GISPSFC+HKI LE+    S+
Subjt:  DLD--QRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASNLMPGHEEALIKLLQQYQKAIGWTLADIQGISPSFCMHKITLEEGSFRSI

Query:  EQQRKLNPAMKEVVKNEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVIVVSNKDNELIPTRTVTGWR
        E QR+LNP M EVVK E+IKWLDAGII+PI  S+WVSPVQCVPKKGG+ VV N  NELIPTRTVTGWR
Subjt:  EQQRKLNPAMKEVVKNEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVIVVSNKDNELIPTRTVTGWR

A0A2G9HYA0 Reverse transcriptase; e value: 2.6e-117; % identity: 41.73
Query:  VNQVAEE--ACVYCGENHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGIAQ----------------------QNKQALPQQNSGSSLEAMMKEFMA
        VNQV      C  CGE H  + CP +  S+ FV N R   NNPYSN YNPG  Q                      Q +   P Q    SLE  + +FMA
Subjt:  VNQVAEE--ACVYCGENHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGIAQ----------------------QNKQALPQQNSGSSLEAMMKEFMA

Query:  RTNAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERI-EPSKTQVIDKNGDKNVVVEQELESGQGAGGRNK
               S  A+ + +E Q+GQLAN + +RPQG LPS+TE +PR++GK Q +AVTLR+G+ L+E + EP+K++      +K V+ E++ +          
Subjt:  RTNAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERI-EPSKTQVIDKNGDKNVVVEQELESGQGAGGRNK

Query:  DAGASGPVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFRKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSVILKNGL
                 +VE P     P     PFPQR + +  + QF KFLE+ K+LHINIP  EA+EQMP+Y KF+KDIL+KK+RLG++ETV+LTEECS I++N L
Subjt:  DAGASGPVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFRKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSVILKNGL

Query:  PPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKI----LESTVIETAIHDSASKHSE----------------------------------
        PPK KDPGSFTIP +IG    GRALCDLGASINLMP S+YR +     + T I   + D +  + +                                  
Subjt:  PPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKI----LESTVIETAIHDSASKHSE----------------------------------

Query:  --------------KHGEVS--VEDFEIC---------------------------------------------SLERKNEKELFRCEEVFESLDLD---
                      + GE++  V+D +I                                               L+ +NE++L    EV ++LD     
Subjt:  --------------KHGEVS--VEDFEIC---------------------------------------------SLERKNEKELFRCEEVFESLDLD---

Query:  --------QRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASNLMPGHEEALIKLLQQYQKAIGWTLADIQGISPSFCMHKITLEEGSF
                +R  P   +KPS+ + PTL+LKPLP HL Y YLGE +TLP+I++S+L     E L+++L+ ++ AIGWT+ADI+GISPSFCMHKI LE+   
Subjt:  --------QRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASNLMPGHEEALIKLLQQYQKAIGWTLADIQGISPSFCMHKITLEEGSF

Query:  RSIEQQRKLNPAMKEVVKNEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVIVVSNKDNELIPTRTVTGWR
         S+E QR+LNP MKEVVK E+IKWLDAGIIYPI+DS+WVSPVQCVPKKGG+ VV N  NELIPTRTVTGWR
Subjt:  RSIEQQRKLNPAMKEVVKNEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVIVVSNKDNELIPTRTVTGWR

A0A2G9HYD8 Reverse transcriptase; e value: 1.5e-112; % identity: 41.1
Query:  ENHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGIAQ----------------------QNKQALPQQNSGSSLEAMMKEFMARTNAAIQSNQASMRA
        E+H  + CP +  S+ FV N R   NNPYSN YNPG  Q                      Q     P Q    SLE  + +FMA       S  A+ + 
Subjt:  ENHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGIAQ----------------------QNKQALPQQNSGSSLEAMMKEFMARTNAAIQSNQASMRA

Query:  LELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERI-EPSKTQVIDKNGDKNVVVEQELESGQGAGGRNKDAGASGPVPDVEPPY
        +E Q+GQLAN + +RPQG LPS+TE +PR++GK Q +AVTLR+G+ L+E + +P+K++       +  V+ +E E                   +VE P 
Subjt:  LELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERI-EPSKTQVIDKNGDKNVVVEQELESGQGAGGRNKDAGASGPVPDVEPPY

Query:  VPPPPYVPPLPFPQRQKPKNQDGQFRKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPGSFTIPVS
            P     PFPQ+ + +  + QF KFLE+ K+LHINIP  EA+EQMP+Y KF+KDIL+KK+RLG++ET +LTEEC+ I++N LPPK KDPGSFTIP +
Subjt:  VPPPPYVPPLPFPQRQKPKNQDGQFRKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPGSFTIPVS

Query:  IGGKELGRALCDLGASINLMPLSVYRK-------------------------ILESTVIET---------------------------------------
        IG    GRALCDLGASINLMP S+YR                          ++E  +++                                        
Subjt:  IGGKELGRALCDLGASINLMPLSVYRK-------------------------ILESTVIET---------------------------------------

Query:  ------AIHD---------------------SASKHSEKHGEVSVEDFEICSLERK--------NEKEL-----FRCEEVFESLDLD--QRKAPP--IKP
               + D                     S S      G  S+ +  + SLER         NE++L         + F+S  ++  +R  P   +KP
Subjt:  ------AIHD---------------------SASKHSEKHGEVSVEDFEICSLERK--------NEKEL-----FRCEEVFESLDLD--QRKAPP--IKP

Query:  SLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASNLMPGHEEALIKLLQQYQKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRKLNPAMKEVVKN
        S+ + PTL+LKPLP+HL YVYLGE +TLP+I++S+L     E L+++L+ ++ AIGWT+ADI+GISPSFCMHKI LE+    S+E QR+LN  MKEVVK 
Subjt:  SLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASNLMPGHEEALIKLLQQYQKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRKLNPAMKEVVKN

Query:  EVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVIVVSNKDNELIPTRTVTGWR
        E+IKWLDAGIIYPI+DS+WVSPVQCVPKKGG+ VV N  NELIPTRTVTGWR
Subjt:  EVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVIVVSNKDNELIPTRTVTGWR

A0A2G9IA86 DNA-directed DNA polymerase; e value: 1.8e-110; % identity: 41.18
Query:  VNQVAEE--ACVYCGENHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGIAQ----------------------QNKQALPQQNSGSSLEAMMKEFMA
        VNQV      C  CGE H    CP++  S+ FV N R   NNPYSN YNPG  Q                      Q +   P Q    SLE  + +FMA
Subjt:  VNQVAEE--ACVYCGENHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGIAQ----------------------QNKQALPQQNSGSSLEAMMKEFMA

Query:  RTNAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERI-EPSKTQVIDKNGDKNVVVEQELESGQGAGGRNK
               S   +++ +E Q+GQLAN + +RPQG L S+TE +PR++GK Q +AVTLR+G+ L+E + EP+K++       K V+ E+E           K
Subjt:  RTNAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERI-EPSKTQVIDKNGDKNVVVEQELESGQGAGGRNK

Query:  DAGASGPVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFRKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSVILKNGL
        +  A                   PL   Q+QK K    QF KFLE+ K+LHIN P  EA+EQMP+Y KF+K IL+KK+RLG++ETV+LTEECS I++N L
Subjt:  DAGASGPVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFRKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSVILKNGL

Query:  PPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKILESTVIETAIHDSASKHSEKHGEVSVE-------------DFEICSLERKNE-----
        PPK KDPGSFTIP +IG    GRALCDLGASINLMP S+YR +       T+I    +  S  + +  +E             DF +  +E  +E     
Subjt:  PPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKILESTVIETAIHDSASKHSEKHGEVSVE-------------DFEICSLERKNE-----

Query:  ---------------------------------------KELFRC----------------------------------EEVFESLDLD-----------
                                                E   C                                   EV ++LD             
Subjt:  ---------------------------------------KELFRC----------------------------------EEVFESLDLD-----------

Query:  QRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASNLMPGHEEALIKLLQQYQKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRK
        +R AP   +KPS+ E+PTL+LKPLP HL Y YLGE +TLP+I++S+L     E L+++ + ++ AIGWT+ADI+GIS SFCMHKI LE+    S+E QR+
Subjt:  QRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASNLMPGHEEALIKLLQQYQKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRK

Query:  LNPAMKEVVKNEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVIVVSNKDNELIPTRTVTGWR
        LNP MKEVVK E+IKW+DAGIIYPI+DS+WVSPVQCVPKKGG+ VV N  NELIPTRTVTGWR
Subjt:  LNPAMKEVVKNEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVIVVSNKDNELIPTRTVTGWR

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGAGCGATCCGCCTGGGGTAAGGTTCGAGCTTGATCCAGAAATTGAAAGAACATTCAGGATAAGAAGGAGAGAGCAGCGCAGAAACCAGATGGAGAACGTGTCGCAACT
TCCGCAGGTTCCTGAAGGTCCAACAGACGCAGACCCCCAGAATCGTGTGCTGCAGCAAAACCCGCTGTTTGAACATAATGAGCAGCAAAATAATCAGGCTGAGAATCCTA
TCTTGATAGCGAACGATAGAACTAGAGCCATTCGAGCGGTTGATCTTGCTATGATTGCTAATGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACTAGCTATG
GAGCCTATTGCAGTGGTGAACCAAGTGGCAGAGGAAGCATGTGTCTATTGTGGTGAAAATCACAACTACGAGTTTTGCCCCAGTAATCCAGCTTCTGTGTTTTTTGTAGG
TAATCAGAGGAATAACCCTTACTCTAACTTTTATAATCCAGGTATTGCCCAGCAAAATAAGCAGGCTTTGCCCCAGCAAAATTCGGGAAGTTCTCTCGAGGCGATGATGA
AAGAATTTATGGCTCGCACAAATGCCGCAATTCAAAGTAATCAAGCTTCGATGAGAGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTGAAGGCAAGGCCTCAA
GGGAAACTTCCATCAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAGGAGTGGCAAGCCATTAGAAGAAAGAATTGAGCCTAGTAA
AACCCAGGTTATAGATAAAAATGGTGATAAAAATGTTGTTGTCGAACAAGAGTTGGAGTCTGGTCAGGGTGCTGGAGGCAGAAATAAAGATGCTGGAGCATCTGGTCCTG
TTCCAGATGTGGAACCACCTTATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAGGATGGTCAATTTAGGAAGTTCTTA
GAGATTCTTAAGCAATTGCATATAAATATCCCTTTAGTAGAAGCTATTGAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATATTTTGACTAAAAAGAAGAGATTAGG
TGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGTTATTCTTAAGAATGGGCTACCCCCCAAGGCTAAGGATCCAGGGTCATTTACCATACCCGTGTCTATAGGTG
GAAAAGAATTAGGTAGAGCACTCTGTGATTTAGGCGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGGAAGATTCTGGAGAGCACAGTTATTGAGACAGCAATACAT
GATTCGGCTAGTAAGCATTCGGAAAAGCATGGAGAGGTTAGTGTAGAGGATTTTGAAATTTGTTCTTTAGAAAGAAAAAATGAAAAAGAGTTGTTTAGGTGTGAGGAGGT
TTTTGAGTCTTTAGATTTAGATCAAAGGAAGGCTCCTCCCATTAAGCCATCCCTGATTGAGGCACCTACTTTAGATTTGAAACCCTTACCGGATCATCTAAAATATGTGT
ATCTGGGGGAAGGTGAGACGTTGCCCATTATTGTTGCATCAAATTTAATGCCAGGGCATGAAGAGGCCTTAATAAAATTACTACAGCAATACCAGAAGGCTATAGGTTGG
ACATTGGCTGATATTCAGGGAATTAGCCCATCTTTTTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAAAGCTTAACCCTGCAAT
GAAAGAGGTTGTTAAGAATGAGGTAATTAAATGGTTGGATGCTGGGATCATTTATCCAATTGCCGATAGCAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAAGGAG
GTGTCATTGTGGTGAGCAATAAAGACAATGAGTTGATCCCAACCAGGACCGTAACTGGCTGGAGGGAGCCCAACACTCATGCTCCTAGGGTTTTTAGGAATTTGAAGACA
TTTCGAGACAAACCAGACGGAACCGTGGCGGTCAGAGGCACAAGGGAACAAACAGAGGCGACGAAGCTCGGCCTCGGCCATGGTAGAGGCCGTGCAGGGGGTCGGGCCAA
AAGCCTGATCCTTTCGGCTTTGGCCCAACCCTTTGGCCTATTCTTCCTCCGGGTTCCGTTTCCTGACTGTCTCCTTGGGTTGGTATCGCGTTGTCCTCATCAATTCCTCG
CATATCGAAGTGATCCAAAATTACCTATAACAACTATTATATTTTATTAA
mRNA sequence
ATGAGCGATCCGCCTGGGGTAAGGTTCGAGCTTGATCCAGAAATTGAAAGAACATTCAGGATAAGAAGGAGAGAGCAGCGCAGAAACCAGATGGAGAACGTGTCGCAACT
TCCGCAGGTTCCTGAAGGTCCAACAGACGCAGACCCCCAGAATCGTGTGCTGCAGCAAAACCCGCTGTTTGAACATAATGAGCAGCAAAATAATCAGGCTGAGAATCCTA
TCTTGATAGCGAACGATAGAACTAGAGCCATTCGAGCGGTTGATCTTGCTATGATTGCTAATGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACTAGCTATG
GAGCCTATTGCAGTGGTGAACCAAGTGGCAGAGGAAGCATGTGTCTATTGTGGTGAAAATCACAACTACGAGTTTTGCCCCAGTAATCCAGCTTCTGTGTTTTTTGTAGG
TAATCAGAGGAATAACCCTTACTCTAACTTTTATAATCCAGGTATTGCCCAGCAAAATAAGCAGGCTTTGCCCCAGCAAAATTCGGGAAGTTCTCTCGAGGCGATGATGA
AAGAATTTATGGCTCGCACAAATGCCGCAATTCAAAGTAATCAAGCTTCGATGAGAGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTGAAGGCAAGGCCTCAA
GGGAAACTTCCATCAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAGGAGTGGCAAGCCATTAGAAGAAAGAATTGAGCCTAGTAA
AACCCAGGTTATAGATAAAAATGGTGATAAAAATGTTGTTGTCGAACAAGAGTTGGAGTCTGGTCAGGGTGCTGGAGGCAGAAATAAAGATGCTGGAGCATCTGGTCCTG
TTCCAGATGTGGAACCACCTTATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAGGATGGTCAATTTAGGAAGTTCTTA
GAGATTCTTAAGCAATTGCATATAAATATCCCTTTAGTAGAAGCTATTGAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATATTTTGACTAAAAAGAAGAGATTAGG
TGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGTTATTCTTAAGAATGGGCTACCCCCCAAGGCTAAGGATCCAGGGTCATTTACCATACCCGTGTCTATAGGTG
GAAAAGAATTAGGTAGAGCACTCTGTGATTTAGGCGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGGAAGATTCTGGAGAGCACAGTTATTGAGACAGCAATACAT
GATTCGGCTAGTAAGCATTCGGAAAAGCATGGAGAGGTTAGTGTAGAGGATTTTGAAATTTGTTCTTTAGAAAGAAAAAATGAAAAAGAGTTGTTTAGGTGTGAGGAGGT
TTTTGAGTCTTTAGATTTAGATCAAAGGAAGGCTCCTCCCATTAAGCCATCCCTGATTGAGGCACCTACTTTAGATTTGAAACCCTTACCGGATCATCTAAAATATGTGT
ATCTGGGGGAAGGTGAGACGTTGCCCATTATTGTTGCATCAAATTTAATGCCAGGGCATGAAGAGGCCTTAATAAAATTACTACAGCAATACCAGAAGGCTATAGGTTGG
ACATTGGCTGATATTCAGGGAATTAGCCCATCTTTTTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAAAGCTTAACCCTGCAAT
GAAAGAGGTTGTTAAGAATGAGGTAATTAAATGGTTGGATGCTGGGATCATTTATCCAATTGCCGATAGCAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAAGGAG
GTGTCATTGTGGTGAGCAATAAAGACAATGAGTTGATCCCAACCAGGACCGTAACTGGCTGGAGGGAGCCCAACACTCATGCTCCTAGGGTTTTTAGGAATTTGAAGACA
TTTCGAGACAAACCAGACGGAACCGTGGCGGTCAGAGGCACAAGGGAACAAACAGAGGCGACGAAGCTCGGCCTCGGCCATGGTAGAGGCCGTGCAGGGGGTCGGGCCAA
AAGCCTGATCCTTTCGGCTTTGGCCCAACCCTTTGGCCTATTCTTCCTCCGGGTTCCGTTTCCTGACTGTCTCCTTGGGTTGGTATCGCGTTGTCCTCATCAATTCCTCG
CATATCGAAGTGATCCAAAATTACCTATAACAACTATTATATTTTATTAA
Protein sequence
MSDPPGVRFELDPEIERTFRIRRREQRRNQMENVSQLPQVPEGPTDADPQNRVLQQNPLFEHNEQQNNQAENPILIANDRTRAIRAVDLAMIANALKNVTVISHQQPLAM
EPIAVVNQVAEEACVYCGENHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGIAQQNKQALPQQNSGSSLEAMMKEFMARTNAAIQSNQASMRALELQVGQLANELKARPQ
GKLPSDTEHPRREGKEQVKAVTLRSGKPLEERIEPSKTQVIDKNGDKNVVVEQELESGQGAGGRNKDAGASGPVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFRKFL
EILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKILESTVIETAIH
DSASKHSEKHGEVSVEDFEICSLERKNEKELFRCEEVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASNLMPGHEEALIKLLQQYQKAIGW
TLADIQGISPSFCMHKITLEEGSFRSIEQQRKLNPAMKEVVKNEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVIVVSNKDNELIPTRTVTGWREPNTHAPRVFRNLKT
FRDKPDGTVAVRGTREQTEATKLGLGHGRGRAGGRAKSLILSALAQPFGLFFLRVPFPDCLLGLVSRCPHQFLAYRSDPKLPITTIIFY
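
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal Python sketch that assumes the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are hypothetical and are not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (line breaks allowed) into a protein string.

    A single trailing stop codon is dropped. Ambiguous bases such as N
    are not handled and will raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# The first 39 bases of the CDS give the first 13 residues of the protein.
print(translate("ATGAGCGATCCGCCTGGGGTAAGGTTCGAGCTTGATCCA"))
```

Passing the full CDS text to `translate` and comparing the result with the listed protein sequence is a quick consistency check on the record.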