; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028646 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028646
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr8:27437394..27439718
RNA-Seq ExpressionLag0028646
SyntenyLag0028646
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica]2.8e-15039.06Show/hide
Query:  MSKAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPI
        M+KAYDRVEW FL  ++L LGF    V+ +M C+S+ ++S L  G  +GH+ P RGLRQG PLSPYLFL+C EG + +L   E    + G+++AR +P +
Subjt:  MSKAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPI

Query:  SHLFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKYVKDRLWANL
        +HL FADD +LF +A   +   +    + + ++TG +INY KS + LSPN +      I   L V  V  H+ YLGLP +    +    +++KD+LW ++
Subjt:  SHLFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKYVKDRLWANL

Query:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLES
          WK    S  G+E+LIKAVLQ +PTYSMSCFR+PK L ++ N I+ARFWW    + R +HWV W  +C SKF GGLGFRDLE FN+AL AKQ WR+L +
Subjt:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLES

Query:  PESLVGRVLRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDGVWDSAVVN
        PESLV R+ R +Y     FLEA+     SF+WRSL WG+ LL +GLRWRVG G S+ +  D WLP  S  K++   ++    RV  L    G W+  ++ 
Subjt:  PESLVGRVLRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDGVWDSAVVN

Query:  DWFGEMDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSGYRLACHLRETAS---CSSPELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRG
        D F + +   IL IP     G D LIWHYE++G+Y+V+SGYRLA   ++  S    +  +L   +W+ +W+ +IP+KIKFF WR   +FL     L NR 
Subjt:  DWFGEMDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSGYRLACHLRETAS---CSSPELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRG

Query:  MEVSGICPRCRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLWWCKDKLPPASFDE--FLGLCWWIWNRHS------KGEELWEGGGRIE
        +  + ICP+C R+ ES  HAVW C+  K+ WR S +  + E   +    +L  W   +L  +  ++  F  LCW +WNR +      K E   +   R+ 
Subjt:  MEVSGICPRCRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLWWCKDKLPPASFDE--FLGLCWWIWNRHS------KGEELWEGGGRIE

Query:  KAS----------------WPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQEVGFRRLEVETDS
        K +                   P+ P        A         +  ++RN  G+ M   ++  H        E MA  +GL  A ++GF    +E D+
Subjt:  KAS----------------WPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQEVGFRRLEVETDS

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]2.5e-15138.33Show/hide
Query:  MSKAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPI
        M+KAYDRVEW FL +++L LGF    V  +M C+S+ ++S L  G  +GH+ P RGLRQG PLSPYLFL+C EG + +L   E    + G+++AR  P +
Subjt:  MSKAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPI

Query:  SHLFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKYVKDRLWANL
        +HL FADD +LF +A       +    + + +++G +INY KS   LSPN +      I   L V  V  H++YLGLP +    +    +++KD+LW ++
Subjt:  SHLFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKYVKDRLWANL

Query:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLES
          WK    S  G+E+L+KAVLQ +PTYSMSCFR+PK L ++ N I+ARFWW    + R +HWV W  +C SKF GGLGFRDLE FN+AL AKQ WR+L +
Subjt:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLES

Query:  PESLVGRVLRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDGVWDSAVVN
        PESLV R+ R +Y     FLEA+     SF+WRSL WG+ LL +GLRWRVG+G S+ +  D WLP  S  K++   ++     V  L    G W+  ++ 
Subjt:  PESLVGRVLRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDGVWDSAVVN

Query:  DWFGEMDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSGYRLACHLRETASCSSP---ELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRG
        D F + +    L IP     G D LIWHYE++G+Y+V+SGYRLAC  ++  S       +L   +W+ +W+ +IP+KIKFF WR   +FL     L NR 
Subjt:  DWFGEMDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSGYRLACHLRETASCSSP---ELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRG

Query:  MEVSGICPRCRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLWWCKDKLPPASFDE--FLGLCWWIWNRHS------KGEELWEGGGRIE
        +  + ICP C R+ ES  HAVW C+  K+ WR S +  + E   +    +L  W   +L  +  ++  F  LCW +WNR +      K E   +   R+ 
Subjt:  MEVSGICPRCRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLWWCKDKLPPASFDE--FLGLCWWIWNRHS------KGEELWEGGGRIE

Query:  KAS--------------------------WPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQEVGF
        K +                          W PP    YK+N D A         +  ++RN  G+ M   ++           E MA  +GL  A ++GF
Subjt:  KAS--------------------------WPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQEVGF

Query:  RRLEVETDSAR-VAALISSE
            +E D+   + +++S+E
Subjt:  RRLEVETDSAR-VAALISSE

XP_017250619.1 PREDICTED: uncharacterized protein LOC108221234 [Daucus carota subsp. sativus]2.0e-14838.54Show/hide
Query:  MSKAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPI
        MSKAYDRVEW F+EK++  LGF +Q V  IM CV+SV+YSF +NG+  G V PSRGLRQGDPLSPYLFL+CAEG + +L   E    I GLKIAR +P I
Subjt:  MSKAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPI

Query:  SHLFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKYVKDRLWANL
        SHLFFADD LLF +A     N +      +S+ +G  IN+ KS +  SPN +  +R + +++L ++     + YLGLP +   +K +  + +K+++WA L
Subjt:  SHLFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKYVKDRLWANL

Query:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLES
        H W+   FS GG+E+L+KAV+Q +PTY MSCF++P+   Q+   +LAR+WWG     RK+HW +W K+   K  GGLGFR    +N+AL AKQ WR+L  
Subjt:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLES

Query:  PESLVGRVLRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDGVWDSAVVN
        P SL+ +VL+ KYF  +SFL++K+ R  S  WRS+VWG++LL EGLR R+G+G+S    +D WL R  S   I RG  +    VE + G  G W+  ++ 
Subjt:  PESLVGRVLRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDGVWDSAVVN

Query:  DWFGEMDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSGYRLACHLRETASCSSPELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRGMEV
          F   D ++IL IP  R    D   WHY   G YTV+SGY+L  +L +  S SS +++  WW++ W+ +IP KI  F WR +HE L T   L  R + +
Subjt:  DWFGEMDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSGYRLACHLRETASCSSPELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRGMEV

Query:  SGICPRCRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLWWCKDKLPPASFDEFLGLCWWIWNRHSKGEELWEGGGRIEKASWPPPKFPN
           CP C   ++S  HAV+ C   ++ W L  + F++  +      D+L +  + L     D  L   W IW   +K     +     +   W    +  
Subjt:  SGICPRCRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLWWCKDKLPPASFDEFLGLCWWIWNRHSKGEELWEGGGRIEKASWPPPKFPN

Query:  YK-----------------------------LNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQEVGFRRLEVETD
         K                             L  DAA  +  +R  + A I         T+ K    +  V   EA+A+  GL  AQ  G+   +V TD
Subjt:  YK-----------------------------LNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQEVGFRRLEVETD

Query:  SARVAALISSE
        S  +   ++SE
Subjt:  SARVAALISSE

XP_024956542.1 uncharacterized protein LOC112498908 [Citrus sinensis]4.0e-14937.69Show/hide
Query:  MSKAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPI
        +SKAYDR+EW FLE+++  LGF  Q ++LIM C+SSVS+S +ING   G +KP RGLRQG P+SPYLF+ CAE  + +L   E+ + I GL   +    I
Subjt:  MSKAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPI

Query:  SHLFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKYVKDRLWANL
        SHL FADD L+F RA VA+  ++   L  +S  +G   N+ KS + LS NV +    +I     +  V  ++ YLGLPA++   +S     +K ++   +
Subjt:  SHLFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKYVKDRLWANL

Query:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLES
          W+H  FS GG+EVLIKA +Q +P ++MS F++P  + +D   I+  FWWG   E R +HW  W K+  +K  GG+GFRD   FN+AL AKQGWR+ + 
Subjt:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLES

Query:  PESLVGRVLRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDGVWDSAVVN
        P+SLV RVL+ +YFH ++FL AK     S++WRS++WGR ++  G RWR+G+G+ V+I + NW+P+  + K + +  +  EA V  L+  +  WD  ++ 
Subjt:  PESLVGRVLRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDGVWDSAVVN

Query:  DWFGEMDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSGYRLACHLRETASCSSPELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRGMEV
          F +MD+ VI  IP PR++  D+LIWH+ K G YTV+SGY+ A  +R  A  SS E  K  W  +WS  +P KI+ F WR     L +  NL  R +  
Subjt:  DWFGEMDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSGYRLACHLRETASCSSPELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRGMEV

Query:  SGICPRCRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLWWCKDKLPPASFDEFLGLCWWIWNRH----------------SKGEELWEG
           C  C+   E+ +HA+  CK  KK WRLS F   ++  P + +  LL   K     A  D F  + W  WN                  +K E + E 
Subjt:  SGICPRCRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLWWCKDKLPPASFDEFLGLCWWIWNRH----------------SKGEELWEG

Query:  GGRIEKAS--------------WPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQEVGFRRLEVET
          R++ ++              W PP+    K+N+DAAT+     + + A+IR+E G +  T +K       V   EA A+  GL VA++   + + +E+
Subjt:  GGRIEKAS--------------WPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQEVGFRRLEVET

Query:  DSARVAALISSERVDLSEV
        DS  V +L+++ +   SE+
Subjt:  DSARVAALISSERVDLSEV

XP_030483669.1 uncharacterized protein LOC115700241 [Cannabis sativa]5.4e-14637.13Show/hide
Query:  MSKAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPI
        MSKA+DRVEW +L +++L +GF   V++LI+ C+ SV+YSFL+NG+ +G + P+RG+RQGDPLSPYLFL+CAEGL+R+L   E++  + GLK++R +P +
Subjt:  MSKAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPI

Query:  SHLFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKYVKDRLWANL
        SHLFFADD +LF RA    A  +   L  + + +G  +N  K  +  SPN   Q +      L +   P H+ YLGLP+     K+     + D++W  L
Subjt:  SHLFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKYVKDRLWANL

Query:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLES
          WK   FS GG+EVL+KAV+Q +PTY+MSCFRLP  L     S+++ FWWG    G  +HW +W  +C +K +GGLGFR+  LFN+AL AKQ WRLLES
Subjt:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLES

Query:  PESLVGRVLRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDGVWDSAVVN
        P SL+GRVL  +YF   + L A      S  WRS+VWG+ LL +GL+WRVG G ++    D+WLP  ++         D   +V  L+     WD   ++
Subjt:  PESLVGRVLRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDGVWDSAVVN

Query:  DWFGEMDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSGYRLACHLRETASCSSPELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRGMEV
          FG  D   IL IP       D LIW+    G YTV+SGY+ A  L ++   ++   + +WW   W  ++PSKI+ F W+ FH  L   S L  + +  
Subjt:  DWFGEMDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSGYRLACHLRETASCSSPELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRGMEV

Query:  SGICPRCRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLWWCKDKLPPASFDEFLGLCWWIW--------NRHSK---------------
        +  CP C+  +E+  HA++ C   K  WR S   F  +       AD L +         F++FL +CW IW        N+ SK               
Subjt:  SGICPRCRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLWWCKDKLPPASFDEFLGLCWWIW--------NRHSK---------------

Query:  -----GEELWEGGGRIEKAS----------------WPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLV
                    G     AS                W  P     KLNSDAA ++A  +  I A++R+  G I+  + K        + +EA+A+   L 
Subjt:  -----GEELWEGGGRIEKAS----------------WPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLV

Query:  VAQEVGFRRLEVETDSARV
         A  +G     +ETDS  V
Subjt:  VAQEVGFRRLEVETDSARV

TrEMBL top hitse value%identityAlignment
A0A251NPF0 Reverse transcriptase domain-containing protein1.3e-15039.06Show/hide
Query:  MSKAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPI
        M+KAYDRVEW FL  ++L LGF    V+ +M C+S+ ++S L  G  +GH+ P RGLRQG PLSPYLFL+C EG + +L   E    + G+++AR +P +
Subjt:  MSKAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPI

Query:  SHLFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKYVKDRLWANL
        +HL FADD +LF +A   +   +    + + ++TG +INY KS + LSPN +      I   L V  V  H+ YLGLP +    +    +++KD+LW ++
Subjt:  SHLFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKYVKDRLWANL

Query:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLES
          WK    S  G+E+LIKAVLQ +PTYSMSCFR+PK L ++ N I+ARFWW    + R +HWV W  +C SKF GGLGFRDLE FN+AL AKQ WR+L +
Subjt:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLES

Query:  PESLVGRVLRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDGVWDSAVVN
        PESLV R+ R +Y     FLEA+     SF+WRSL WG+ LL +GLRWRVG G S+ +  D WLP  S  K++   ++    RV  L    G W+  ++ 
Subjt:  PESLVGRVLRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDGVWDSAVVN

Query:  DWFGEMDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSGYRLACHLRETAS---CSSPELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRG
        D F + +   IL IP     G D LIWHYE++G+Y+V+SGYRLA   ++  S    +  +L   +W+ +W+ +IP+KIKFF WR   +FL     L NR 
Subjt:  DWFGEMDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSGYRLACHLRETAS---CSSPELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRG

Query:  MEVSGICPRCRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLWWCKDKLPPASFDE--FLGLCWWIWNRHS------KGEELWEGGGRIE
        +  + ICP+C R+ ES  HAVW C+  K+ WR S +  + E   +    +L  W   +L  +  ++  F  LCW +WNR +      K E   +   R+ 
Subjt:  MEVSGICPRCRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLWWCKDKLPPASFDE--FLGLCWWIWNRHS------KGEELWEGGGRIE

Query:  KAS----------------WPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQEVGFRRLEVETDS
        K +                   P+ P        A         +  ++RN  G+ M   ++  H        E MA  +GL  A ++GF    +E D+
Subjt:  KAS----------------WPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQEVGFRRLEVETDS

A0A5E4FZN9 PREDICTED: retrotransposon1.2e-15138.33Show/hide
Query:  MSKAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPI
        M+KAYDRVEW FL +++L LGF    V  +M C+S+ ++S L  G  +GH+ P RGLRQG PLSPYLFL+C EG + +L   E    + G+++AR  P +
Subjt:  MSKAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPI

Query:  SHLFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKYVKDRLWANL
        +HL FADD +LF +A       +    + + +++G +INY KS   LSPN +      I   L V  V  H++YLGLP +    +    +++KD+LW ++
Subjt:  SHLFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKYVKDRLWANL

Query:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLES
          WK    S  G+E+L+KAVLQ +PTYSMSCFR+PK L ++ N I+ARFWW    + R +HWV W  +C SKF GGLGFRDLE FN+AL AKQ WR+L +
Subjt:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLES

Query:  PESLVGRVLRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDGVWDSAVVN
        PESLV R+ R +Y     FLEA+     SF+WRSL WG+ LL +GLRWRVG+G S+ +  D WLP  S  K++   ++     V  L    G W+  ++ 
Subjt:  PESLVGRVLRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDGVWDSAVVN

Query:  DWFGEMDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSGYRLACHLRETASCSSP---ELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRG
        D F + +    L IP     G D LIWHYE++G+Y+V+SGYRLAC  ++  S       +L   +W+ +W+ +IP+KIKFF WR   +FL     L NR 
Subjt:  DWFGEMDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSGYRLACHLRETASCSSP---ELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRG

Query:  MEVSGICPRCRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLWWCKDKLPPASFDE--FLGLCWWIWNRHS------KGEELWEGGGRIE
        +  + ICP C R+ ES  HAVW C+  K+ WR S +  + E   +    +L  W   +L  +  ++  F  LCW +WNR +      K E   +   R+ 
Subjt:  MEVSGICPRCRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLWWCKDKLPPASFDE--FLGLCWWIWNRHS------KGEELWEGGGRIE

Query:  KAS--------------------------WPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQEVGF
        K +                          W PP    YK+N D A         +  ++RN  G+ M   ++           E MA  +GL  A ++GF
Subjt:  KAS--------------------------WPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQEVGF

Query:  RRLEVETDSAR-VAALISSE
            +E D+   + +++S+E
Subjt:  RRLEVETDSAR-VAALISSE

A0A803PAX6 Uncharacterized protein4.3e-14936.29Show/hide
Query:  MSKAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPI
        MSKAYDRV W F+E +++ LG+++Q VN IM C+++VS+S LING+  G ++P+ G+RQGDPLSPYLFLLCAEGL+ ++   E    I GLK  +    +
Subjt:  MSKAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPI

Query:  SHLFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKYVKDRLWANL
        SHLFFADD  +F  A   E  ++   L  +S L+G +IN+ KS +C+   + ++    +   LGVK V  H +YLG+PA +   K    + ++ ++   L
Subjt:  SHLFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKYVKDRLWANL

Query:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLES
          W+ + FS  GREVL+KA++Q +PTY MSCFRLPK+L++D ++++ARFWWG  D   K+HW  W K+C  K  GG+GF++LE FN++L AKQGW++L +
Subjt:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLES

Query:  PESLVGRVLRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDGVWDSAVVN
        P SL+ R+L+  YF  +SFLEAK     SF+WRS++ GR ++++G+RWRV  G+ + +  D WLPR S+  +    ++     +++L   +G W   ++ 
Subjt:  PESLVGRVLRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDGVWDSAVVN

Query:  DWFGEMDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSGYRLACHLRETASCSSPELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRGMEV
        + F   D  +ILG+   ++   D LIWH+  DGLYTVRSGY +A      A  SS    + WW  +W  + P KI+ F WR  + ++   + LQ RGM +
Subjt:  DWFGEMDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSGYRLACHLRETASCSSPELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRGMEV

Query:  SGICPRCRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLWWCKDKLPPASFDEFLGLCWWIWNRHSK--------GEELWEGGGRIE---
        +  C  C + EE+  H +W C   K+ W+  P+  +L+ Q    + D+L   ++++    F+ F+ + W IWNR +K         +E W    ++E   
Subjt:  SGICPRCRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLWWCKDKLPPASFDEFLGLCWWIWNRHSK--------GEELWEGGGRIE---

Query:  ------------------KASWPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQEVGFRRLEVETD
                             W PP    + +NSDA+         +  +IR  KG++     +  + V  ++  EA+AI+ G+ +A +       +++D
Subjt:  ------------------KASWPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQEVGFRRLEVETD

Query:  SARVAALISSE
          RV   I  +
Subjt:  SARVAALISSE

A0A803PMU7 Uncharacterized protein6.2e-14837.34Show/hide
Query:  MSKAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPI
        MSKAYDRVEW FLE +++ LG+DEQ +  IM CV++VS+S LING   G  +P RGLRQGDPLSP+LFLLC+EGL+ ++   E   RI GL+       +
Subjt:  MSKAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPI

Query:  SHLFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKYVKDRLWANL
        SHL FADD L+F  A   ++N + + L  + KL+G  IN+ K+ +C+   V + M  ++   +GV  V  H +YLG+P  +  +K      +++++ A L
Subjt:  SHLFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKYVKDRLWANL

Query:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLES
          WK   FS  G+E+LIKAV+Q +P Y MSCFR+ K ++ +   ++A+FWWG      K+HW +W+K+C  K NGG+GFRDLE FN++L AKQGW+L+  
Subjt:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLES

Query:  PESLVGRVLRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDGVWDSAVVN
        P+  + ++L+  YF   SF EAK+    S VW S++WGR LL +G RW +GDG  + I  D+W+PR     +  + ++  EA V SLL  +G W    V 
Subjt:  PESLVGRVLRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDGVWDSAVVN

Query:  DWFGEMDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSGYRLACHLRETASCSSPELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRGMEV
         WF   D   +LGI +P     D L W    +G+Y+V SGY+L       A CS+   +KAWW+F+W   +  K+K F WR F+ ++ T   L  RGM +
Subjt:  DWFGEMDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSGYRLACHLRETASCSSPELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRGMEV

Query:  SGICPRCRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLWWCKDKLPPASFDEFLGLCWWIWNRHSK--------GEELW----------
           C  C  ++E+  HA+W C +VK  W+   F  I+    I+  AD+LWW KD LP  +F +F+GL W +W R +          + +W          
Subjt:  SGICPRCRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLWWCKDKLPPASFDEFLGLCWWIWNRHSK--------GEELW----------

Query:  --EGGGRIEKA-------SWPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQEVGFRRLEVETDSA
          E   + +K        +W  P    + +N+DA+         +SA+IR+  G +++           V   EA+AI+ G+ +A     ++ +V +DS 
Subjt:  --EGGGRIEKA-------SWPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQEVGFRRLEVETDSA

Query:  RVAALISSERVDLSE
         +   I S   + +E
Subjt:  RVAALISSERVDLSE

M5W5F3 Reverse transcriptase domain-containing protein (Fragment)1.3e-15039.06Show/hide
Query:  MSKAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPI
        M+KAYDRVEW FL  ++L LGF    V+ +M C+S+ ++S L  G  +GH+ P RGLRQG PLSPYLFL+C EG + +L   E    + G+++AR +P +
Subjt:  MSKAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPI

Query:  SHLFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKYVKDRLWANL
        +HL FADD +LF +A   +   +    + + ++TG +INY KS + LSPN +      I   L V  V  H+ YLGLP +    +    +++KD+LW ++
Subjt:  SHLFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKYVKDRLWANL

Query:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLES
          WK    S  G+E+LIKAVLQ +PTYSMSCFR+PK L ++ N I+ARFWW    + R +HWV W  +C SKF GGLGFRDLE FN+AL AKQ WR+L +
Subjt:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLES

Query:  PESLVGRVLRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDGVWDSAVVN
        PESLV R+ R +Y     FLEA+     SF+WRSL WG+ LL +GLRWRVG G S+ +  D WLP  S  K++   ++    RV  L    G W+  ++ 
Subjt:  PESLVGRVLRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDGVWDSAVVN

Query:  DWFGEMDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSGYRLACHLRETAS---CSSPELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRG
        D F + +   IL IP     G D LIWHYE++G+Y+V+SGYRLA   ++  S    +  +L   +W+ +W+ +IP+KIKFF WR   +FL     L NR 
Subjt:  DWFGEMDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSGYRLACHLRETAS---CSSPELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRG

Query:  MEVSGICPRCRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLWWCKDKLPPASFDE--FLGLCWWIWNRHS------KGEELWEGGGRIE
        +  + ICP+C R+ ES  HAVW C+  K+ WR S +  + E   +    +L  W   +L  +  ++  F  LCW +WNR +      K E   +   R+ 
Subjt:  MEVSGICPRCRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLWWCKDKLPPASFDE--FLGLCWWIWNRHS------KGEELWEGGGRIE

Query:  KAS----------------WPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQEVGFRRLEVETDS
        K +                   P+ P        A         +  ++RN  G+ M   ++  H        E MA  +GL  A ++GF    +E D+
Subjt:  KAS----------------WPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQEVGFRRLEVETDS

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein3.3e-2124.58Show/hide
Query:  KAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPISH
        KA+D+++  F+ K +  LG D   + +I A     + + ++NG+++       G RQG PLSP LF +  E L R +    ++K I G+++ +    +S 
Subjt:  KAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPISH

Query:  LFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKY--VKDRLWANL
          FADD +++    +  A  + K +  FSK++G++IN  KS   L  N + Q    I+  L         +YLG+             Y  +   +  + 
Subjt:  LFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKY--VKDRLWANL

Query:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSC--FRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGW
        +KWK+   S  GR  ++K  +     Y  +    +LP     +      +F W      +K   ++   +      GG+   D +L+ KA   K  W
Subjt:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSC--FRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGW

P0C2F6 Putative ribonuclease H protein At1g657503.2e-4028.21Show/hide
Query:  VKDRLWANLHKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFA
        + +R+ + +  W+  + S  GR  L KAVL  +P +SMS   LP+ ++   + +   F WG   E +K H V W KVC  K  GGLG R  +  N+AL +
Subjt:  VKDRLWANLHKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFA

Query:  KQGWRLLESPESLVGRVLRGKYFHGTSFLEAK---QKRGESFVWRSLVWG-RSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGE--IDPEARVE
        K GWRLL+   SL   VL+ KY H     +++    K   S  WRS+  G R ++  G+ W  GDG+ +    D W+  +  L+ +  GE   D +  V 
Subjt:  KQGWRLLESPESLVGRVLRGKYFHGTSFLEAK---QKRGESFVWRSLVWG-RSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGE--IDPEARVE

Query:  SLLGLDG-VWDSAVVNDWFGEMDSKVILGIPRPRQMGG-DKLIWHYEKDGLYTVRSGYRLACHLRETASCSSPELVKAWWRFVWSRQIPSKIKFFCWRFF
          L + G  WD A ++ +        +  +      G  D+L W + +DG ++VRS Y +      T        + +++  +W  ++P ++K F W   
Subjt:  SLLGLDG-VWDSAVVNDWFGEMDSKVILGIPRPRQMGG-DKLIWHYEKDGLYTVRSGYRLACHLRETASCSSPELVKAWWRFVWSRQIPSKIKFFCWRFF

Query:  HEFLLTMSNLQNRGMEVSGICPRCRRREESTYHAVWGCKEVKKQW-RLSPFAFILEFQPIECVADLLWWCKDKLPPASFDE-------FLGLCWWIW
        ++ ++T      R +  S +C  C+   ES  H +  C      W R+ P       Q       L  W  D L   S  E       F  + WW W
Subjt:  HEFLLTMSNLQNRGMEVSGICPRCRRREESTYHAVWGCKEVKKQW-RLSPFAFILEFQPIECVADLLWWCKDKLPPASFDE-------FLGLCWWIW

P11369 LINE-1 retrotransposable element ORF2 protein1.9e-2124.58Show/hide
Query:  KAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPISH
        KA+D+++  F+ K++   G     +N+I A  S    +  +NG+++  +    G RQG PLSPYLF +  E L R +    + K I G++I +    IS 
Subjt:  KAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPISH

Query:  LFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLG--LPAVMSGSKSLSLKYVKDRLWANL
        L  ADD +++         ++   + +F ++ G++IN  KS   L    ++Q  + I  T     V  + +YLG  L   +      + K +K  +  +L
Subjt:  LFFADDCLLFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLG--LPAVMSGSKSLSLKYVKDRLWANL

Query:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSC--FRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGW
         +WK    S  GR  ++K  +     Y  +    ++P     +    + +F W       K   ++   +   + +GG+   DL+L+ +A+  K  W
Subjt:  HKWKHSSFSAGGREVLIKAVLQGVPTYSMSC--FRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGW

P92555 Uncharacterized mitochondrial protein AtMg012501.2e-1555.88Show/hide
Query:  FLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPISHLFFADD
        F+ING   G V PSRGLRQGDPLSPYLF+LC E L+ +    +E  R+ G++++  SP I+HL FADD
Subjt:  FLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPISHLFFADD

P93295 Uncharacterized mitochondrial protein AtMg003102.9e-3343.92Show/hide
Query:  VPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSK-FNGGLGFRDLELFNKALFAKQGWRLLESPESLVGRVLRGKYFHGTSFLEA
        +P Y+MSCFRL K L +   S +  FWW   +  RK+ WV+W+K+C SK  +GGLGFRDL  FN+AL AKQ +R++  P +L+ R+LR +YF  +S +E 
Subjt:  VPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSK-FNGGLGFRDLELFNKALFAKQGWRLLESPESLVGRVLRGKYFHGTSFLEA

Query:  KQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSL
              S+ WRS++ GR LL  GL   +GDG    +  D W+  E+ L
Subjt:  KQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSL

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein8.0e-3126.38Show/hide
Query:  LRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDG---VWDSAVVNDWFGE
        ++ +YF   S L+AK ++ +S+ W SL+ G +LLK+G R  +GDG+++ I  DN +      + +   E   E  + +L    G    WD + ++ +  +
Subjt:  LRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDG---VWDSAVVNDWFGE

Query:  MDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSGYRLACHLRET--ASCSSPELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRGMEVSGI
         D   I  I   +    DK+IW+Y   G YTVRSGY L  H   T   + + P         +W+  I  K+K F WR   + L T   L  RGM +   
Subjt:  MDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSGYRLACHLRET--ASCSSPELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRGMEVSGI

Query:  CPRCRRREESTYHAVWGCKEVKKQWRLSPFAFIL------EFQPIECVADLLWWCKDKLPPASFDEFL--GLCWWIWN----------RHSKGEELWEGG
        CPRC R  ES  HA++ C      WRLS  + I       +F+  E ++++L + +D    + F + L   L W IW           R S  + +    
Subjt:  CPRCRRREESTYHAVWGCKEVKKQWRLSPFAFIL------EFQPIECVADLLWWCKDKLPPASFDEFL--GLCWWIWN----------RHSKGEELWEGG

Query:  GRI----------------------EKASWPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQEVGF
                                  K  W  P     K N DA  D     ++   IIRN  G  +     +  H ++    E  A+   L      G+
Subjt:  GRI----------------------EKASWPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQEVGF

Query:  RRLEVETDSARVAALIS
         ++ +E D   +  LI+
Subjt:  RRLEVETDSARVAALIS

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.1e-1924.32Show/hide
Query:  RYLGLPAVMSGSKSLSLKYVKDRLWANLHKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSK
        RYLGLP +     +     + +++   + KW     S  GR  LI +V+  +  + MS FRLP   +++ +SI + F W G +   K   V+W  VC  K
Subjt:  RYLGLPAVMSGSKSLSLKYVKDRLWANLHKWKHSSFSAGGREVLIKAVLQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSK

Query:  FNGGLGFRDLELFNKALFAKQGWRLLESPESLVGRVLRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKV
          GGLG R L+  NK  F    W       S+ G    G                 S++W+ ++  R+L    ++  + +G + +   DNW      + V
Subjt:  FNGGLGFRDLELFNKALFAKQGWRLLESPESLVGRVLRGKYFHGTSFLEAKQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKV

Query:  I-QRGEIDPEARVESLLGLDGVWDSAVVNDWFGEMDSKVILGIPRPRQMGGDKL------IWHYEKDGLY----TVR-----SGYRLACHLRET-ASCSS
           RG ID    + + +        AVVN               RPR+   D L      I      GL     TVR       ++   + +ET A+   
Subjt:  I-QRGEIDPEARVESLLGLDGVWDSAVVNDWFGEMDSKVILGIPRPRQMGGDKL------IWHYEKDGLY----TVR-----SGYRLACHLRET-ASCSS

Query:  PELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRGMEVSGICPRCRRREESTYHAVWGC
        P+L   W++ VW      K     W      L T   + +        C  C    E+  H  + C
Subjt:  PELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRGMEVSGICPRCRRREESTYHAVWGC

AT4G29090.1 Ribonuclease H-like superfamily protein1.4e-5427.57Show/hide
Query:  VPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLESPESLVGRVLRGKYFHGTSFLEAK
        +PTY+M+CF LPK + +   S+LA FWW    E + +HW +W  +   K  GG+GF+D+E FN AL  KQ WR+L  PESL+ +V + +YFH +  L A 
Subjt:  VPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLESPESLVGRVLRGKYFHGTSFLEAK

Query:  QKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPE--ARVESLLGLDGV-------WDSAVVNDWFGEMDSKVILGI
             SFVW+S+   + +L++G R  VG+G  + I R  WL  + +   ++   + P+  A V S+L +  +       W   V+   F E++ K+I G 
Subjt:  QKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPE--ARVESLLGLDGV-------WDSAVVNDWFGEMDSKVILGI

Query:  PRPRQMGG----DKLIWHYEKDGLYTVRSGYRLACHLRETAS----CSSPELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRGMEVSGICPR
         RP   GG    D   W Y   G YTV+SGY +   +    S     S P L   + + +W  Q   KI+ F W+     L     L  R +     C R
Subjt:  PRPRQMGG----DKLIWHYEKDGLYTVRSGYRLACHLRETAS----CSSPELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRGMEVSGICPR

Query:  CRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLWWC------KDKLPPASFDEFLGLCWWIWNRHSK-----------------GEELWE
        C   +E+  H ++ C   +  W +S     L  +  + +   L+W         +   AS      L W +W   ++                  ++L E
Subjt:  CRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLWWC------KDKLPPASFDEFLGLCWWIWNRHSK-----------------GEELWE

Query:  GGGRIEKAS--------------WPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQEVGFRRLEVE
           R E  S              W PP     K N+DA  +R  +R  I  ++RNEKG++     +    +  V   E  A+R  ++      +  +  E
Subjt:  GGGRIEKAS--------------WPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQEVGFRRLEVE

Query:  TDSARVAALISSERV
        +DS  +  +++++ +
Subjt:  TDSARVAALISSERV

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.0e-3443.92Show/hide
Query:  VPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSK-FNGGLGFRDLELFNKALFAKQGWRLLESPESLVGRVLRGKYFHGTSFLEA
        +P Y+MSCFRL K L +   S +  FWW   +  RK+ WV+W+K+C SK  +GGLGFRDL  FN+AL AKQ +R++  P +L+ R+LR +YF  +S +E 
Subjt:  VPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSK-FNGGLGFRDLELFNKALFAKQGWRLLESPESLVGRVLRGKYFHGTSFLEA

Query:  KQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSL
              S+ WRS++ GR LL  GL   +GDG    +  D W+  E+ L
Subjt:  KQKRGESFVWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)8.6e-1755.88Show/hide
Query:  FLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPISHLFFADD
        F+ING   G V PSRGLRQGDPLSPYLF+LC E L+ +    +E  R+ G++++  SP I+HL FADD
Subjt:  FLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPISHLFFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTAAGGCCTACGATAGGGTCGAATGGTTTTTCCTTGAGAAGCTTATATTGGGGCTCGGTTTTGATGAGCAAGTGGTTAATCTGATTATGGCTTGTGTGTCATCGGT
GTCTTATTCCTTTCTGATTAACGGGAAGAGAATGGGGCATGTTAAACCATCGAGGGGTCTTAGGCAAGGTGACCCGTTGTCTCCCTACTTGTTTTTGTTGTGTGCGGAAG
GTTTAACAAGAATTCTAGGATGGATGGAGGAGGACAAGAGAATTAGTGGGCTTAAAATTGCACGTGCTAGCCCCCCCATTTCTCACCTTTTCTTTGCGGATGACTGTTTA
TTGTTTTTCAGGGCTGAGGTGGCTGAAGCAAATCAGGTTGCCAAATGCCTGAAGGCTTTTTCCAAGCTAACAGGCCATGAAATCAACTATGGGAAGTCTGGTATCTGTTT
GAGCCCAAATGTGAGTGAGCAAATGAGGCGAAGTATAGTGTTGACGCTGGGAGTAAAGTTTGTCCCCTTTCATGATCGCTATCTAGGCCTTCCAGCTGTTATGTCTGGGA
GCAAGAGTTTGTCGTTGAAATATGTTAAGGACCGGTTGTGGGCTAATCTCCATAAGTGGAAGCATTCTTCATTCTCTGCTGGGGGAAGGGAGGTTCTTATTAAAGCTGTG
TTGCAAGGTGTGCCTACCTATTCGATGTCATGCTTCCGTTTGCCAAAGGATCTTGTTCAAGATTATAACAGCATTTTGGCTAGATTCTGGTGGGGAGGAGATGATGAAGG
TAGAAAGGTTCATTGGGTTTCGTGGAGAAAGGTATGTGTTTCAAAATTTAATGGAGGATTAGGTTTTAGGGATCTTGAGTTGTTTAACAAAGCCCTTTTTGCGAAACAAG
GGTGGAGGTTGTTGGAGAGCCCTGAGTCATTGGTTGGAAGGGTGTTGAGAGGTAAATACTTCCATGGGACCTCCTTTTTGGAGGCTAAACAAAAGAGAGGGGAATCTTTT
GTTTGGAGGAGTCTGGTGTGGGGAAGATCTTTATTGAAGGAAGGGCTTCGGTGGAGAGTTGGTGATGGGAGGTCTGTTAACATTTTGAGGGATAATTGGCTTCCAAGGGA
GTCATCGTTGAAAGTTATCCAGAGGGGTGAGATTGATCCAGAGGCGAGGGTGGAAAGTTTGTTGGGGCTTGATGGGGTGTGGGATAGTGCAGTTGTGAATGATTGGTTTG
GTGAGATGGATTCGAAGGTTATCCTGGGGATCCCTAGACCGCGGCAGATGGGGGGTGATAAGCTTATATGGCACTATGAGAAGGATGGGCTTTATACAGTTCGTAGTGGG
TATCGATTGGCCTGCCATCTAAGGGAGACTGCCTCTTGCTCGAGTCCTGAGCTAGTGAAGGCGTGGTGGAGATTTGTGTGGAGTAGGCAGATTCCATCCAAGATCAAGTT
TTTCTGTTGGCGGTTTTTTCATGAGTTTCTCCTGACGATGAGCAATTTACAGAACAGAGGTATGGAGGTCTCTGGGATCTGTCCGAGATGTAGAAGAAGGGAGGAGTCGA
CCTATCATGCTGTTTGGGGTTGTAAAGAGGTGAAGAAGCAGTGGAGGTTATCACCGTTTGCCTTCATTTTAGAGTTCCAACCTATAGAGTGTGTTGCCGATCTTTTGTGG
TGGTGCAAGGACAAGCTGCCACCAGCAAGCTTTGATGAGTTTTTGGGGTTGTGCTGGTGGATATGGAACCGGCATTCCAAGGGAGAAGAGTTGTGGGAGGGGGGGGGTCG
GATTGAGAAGGCATCTTGGCCTCCTCCAAAATTCCCTAATTATAAGCTAAATTCAGATGCAGCGACAGATAGAGCTTTCCAGAGAAGCAGTATTAGTGCGATAATCCGGA
ATGAGAAGGGAGACATTATGCTTACCGTTTTGAAGGAGTTCCACCATGTGACTGACGTTGATTCAGTTGAAGCTATGGCCATTAGAGACGGTTTAGTTGTTGCGCAGGAG
GTGGGTTTCCGAAGGCTGGAAGTTGAAACGGACTCAGCTCGTGTGGCGGCGTTGATTAGCTCCGAGAGGGTGGATCTGTCAGAGGTGGTGCCGGAGAGAGACGAACGGTT
TGGCCCATGCGGCGACAAAGCTGGTGCTGGAGCAGGGGGTCGTCGGATTCTAGGTCGAGGAGGTGTCGGCGCAGCTACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTAAGGCCTACGATAGGGTCGAATGGTTTTTCCTTGAGAAGCTTATATTGGGGCTCGGTTTTGATGAGCAAGTGGTTAATCTGATTATGGCTTGTGTGTCATCGGT
GTCTTATTCCTTTCTGATTAACGGGAAGAGAATGGGGCATGTTAAACCATCGAGGGGTCTTAGGCAAGGTGACCCGTTGTCTCCCTACTTGTTTTTGTTGTGTGCGGAAG
GTTTAACAAGAATTCTAGGATGGATGGAGGAGGACAAGAGAATTAGTGGGCTTAAAATTGCACGTGCTAGCCCCCCCATTTCTCACCTTTTCTTTGCGGATGACTGTTTA
TTGTTTTTCAGGGCTGAGGTGGCTGAAGCAAATCAGGTTGCCAAATGCCTGAAGGCTTTTTCCAAGCTAACAGGCCATGAAATCAACTATGGGAAGTCTGGTATCTGTTT
GAGCCCAAATGTGAGTGAGCAAATGAGGCGAAGTATAGTGTTGACGCTGGGAGTAAAGTTTGTCCCCTTTCATGATCGCTATCTAGGCCTTCCAGCTGTTATGTCTGGGA
GCAAGAGTTTGTCGTTGAAATATGTTAAGGACCGGTTGTGGGCTAATCTCCATAAGTGGAAGCATTCTTCATTCTCTGCTGGGGGAAGGGAGGTTCTTATTAAAGCTGTG
TTGCAAGGTGTGCCTACCTATTCGATGTCATGCTTCCGTTTGCCAAAGGATCTTGTTCAAGATTATAACAGCATTTTGGCTAGATTCTGGTGGGGAGGAGATGATGAAGG
TAGAAAGGTTCATTGGGTTTCGTGGAGAAAGGTATGTGTTTCAAAATTTAATGGAGGATTAGGTTTTAGGGATCTTGAGTTGTTTAACAAAGCCCTTTTTGCGAAACAAG
GGTGGAGGTTGTTGGAGAGCCCTGAGTCATTGGTTGGAAGGGTGTTGAGAGGTAAATACTTCCATGGGACCTCCTTTTTGGAGGCTAAACAAAAGAGAGGGGAATCTTTT
GTTTGGAGGAGTCTGGTGTGGGGAAGATCTTTATTGAAGGAAGGGCTTCGGTGGAGAGTTGGTGATGGGAGGTCTGTTAACATTTTGAGGGATAATTGGCTTCCAAGGGA
GTCATCGTTGAAAGTTATCCAGAGGGGTGAGATTGATCCAGAGGCGAGGGTGGAAAGTTTGTTGGGGCTTGATGGGGTGTGGGATAGTGCAGTTGTGAATGATTGGTTTG
GTGAGATGGATTCGAAGGTTATCCTGGGGATCCCTAGACCGCGGCAGATGGGGGGTGATAAGCTTATATGGCACTATGAGAAGGATGGGCTTTATACAGTTCGTAGTGGG
TATCGATTGGCCTGCCATCTAAGGGAGACTGCCTCTTGCTCGAGTCCTGAGCTAGTGAAGGCGTGGTGGAGATTTGTGTGGAGTAGGCAGATTCCATCCAAGATCAAGTT
TTTCTGTTGGCGGTTTTTTCATGAGTTTCTCCTGACGATGAGCAATTTACAGAACAGAGGTATGGAGGTCTCTGGGATCTGTCCGAGATGTAGAAGAAGGGAGGAGTCGA
CCTATCATGCTGTTTGGGGTTGTAAAGAGGTGAAGAAGCAGTGGAGGTTATCACCGTTTGCCTTCATTTTAGAGTTCCAACCTATAGAGTGTGTTGCCGATCTTTTGTGG
TGGTGCAAGGACAAGCTGCCACCAGCAAGCTTTGATGAGTTTTTGGGGTTGTGCTGGTGGATATGGAACCGGCATTCCAAGGGAGAAGAGTTGTGGGAGGGGGGGGGTCG
GATTGAGAAGGCATCTTGGCCTCCTCCAAAATTCCCTAATTATAAGCTAAATTCAGATGCAGCGACAGATAGAGCTTTCCAGAGAAGCAGTATTAGTGCGATAATCCGGA
ATGAGAAGGGAGACATTATGCTTACCGTTTTGAAGGAGTTCCACCATGTGACTGACGTTGATTCAGTTGAAGCTATGGCCATTAGAGACGGTTTAGTTGTTGCGCAGGAG
GTGGGTTTCCGAAGGCTGGAAGTTGAAACGGACTCAGCTCGTGTGGCGGCGTTGATTAGCTCCGAGAGGGTGGATCTGTCAGAGGTGGTGCCGGAGAGAGACGAACGGTT
TGGCCCATGCGGCGACAAAGCTGGTGCTGGAGCAGGGGGTCGTCGGATTCTAGGTCGAGGAGGTGTCGGCGCAGCTACTTGA
Protein sequenceShow/hide protein sequence
MSKAYDRVEWFFLEKLILGLGFDEQVVNLIMACVSSVSYSFLINGKRMGHVKPSRGLRQGDPLSPYLFLLCAEGLTRILGWMEEDKRISGLKIARASPPISHLFFADDCL
LFFRAEVAEANQVAKCLKAFSKLTGHEINYGKSGICLSPNVSEQMRRSIVLTLGVKFVPFHDRYLGLPAVMSGSKSLSLKYVKDRLWANLHKWKHSSFSAGGREVLIKAV
LQGVPTYSMSCFRLPKDLVQDYNSILARFWWGGDDEGRKVHWVSWRKVCVSKFNGGLGFRDLELFNKALFAKQGWRLLESPESLVGRVLRGKYFHGTSFLEAKQKRGESF
VWRSLVWGRSLLKEGLRWRVGDGRSVNILRDNWLPRESSLKVIQRGEIDPEARVESLLGLDGVWDSAVVNDWFGEMDSKVILGIPRPRQMGGDKLIWHYEKDGLYTVRSG
YRLACHLRETASCSSPELVKAWWRFVWSRQIPSKIKFFCWRFFHEFLLTMSNLQNRGMEVSGICPRCRRREESTYHAVWGCKEVKKQWRLSPFAFILEFQPIECVADLLW
WCKDKLPPASFDEFLGLCWWIWNRHSKGEELWEGGGRIEKASWPPPKFPNYKLNSDAATDRAFQRSSISAIIRNEKGDIMLTVLKEFHHVTDVDSVEAMAIRDGLVVAQE
VGFRRLEVETDSARVAALISSERVDLSEVVPERDERFGPCGDKAGAGAGGRRILGRGGVGAAT