; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021894 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021894
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:13609448..13612536
RNA-Seq ExpressionLag0021894
SyntenyLag0021894
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023872411.1 uncharacterized protein LOC111985024 [Quercus suber]1.5e-10831.48Show/hide
Query:  CGLQDLDYLGETFTWTNRQMQQSQINERLDRFIANETFYQIFPSPSVTHLDWARSDHCPIMLDVCPEEVQEGYTKRRQIFYFEEVWTQNPDCKNIISK--
        CGL+++ + G  FTW  ++   +QI ERLDR +A+  +  +FPS  + H   + SDH P+ML +  ++  +   K ++IF FE +W ++  C+ I+++  
Subjt:  CGLQDLDYLGETFTWTNRQMQQSQINERLDRFIANETFYQIFPSPSVTHLDWARSDHCPIMLDVCPEEVQEGYTKRRQIFYFEEVWTQNPDCKNIISK--

Query:  ---------------------------------LGHKFAR----------DGEAPRLEHFLNRFRGSLKSW-------GKERKD---VKATFESYFREMF
                                         +G K A              +P +   L   R  L  W        K+R D   ++  F  Y+ ++F
Subjt:  ---------------------------------LGHKFAR----------DGEAPRLEHFLNRFRGSLKSW-------GKERKD---VKATFESYFREMF

Query:  QSSSPQ--IDVMDSVLQEV----------------------------------------------QIKSAFVLGRSIFYNVIVGHECLHSIKSRRTGRMS
         SS P    +++++V  +V                                                +SAFV GR I  NV+V  E +H I  ++ G   
Subjt:  QSSSPQ--IDVMDSVLQEV----------------------------------------------QIKSAFVLGRSIFYNVIVGHECLHSIKSRRTGRMS

Query:  WVALKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPTGRIVPQRGLRQGDPLSPYLFLLCSEVLSSLITTTSSRGTITGIK-
         +ALKLDMSKA D VEW  L+K+M K+GFH +W  L+M C+ + T+++ +NG P+G I+P RGLRQGDPLSPYLFLLC+E LS+LI   ++ G + GI  
Subjt:  WVALKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPTGRIVPQRGLRQGDPLSPYLFLLCSEVLSSLITTTSSRGTITGIK-

Query:  ----------------------------------PAYKRASGQVVNTDKSTLYFSPNVSKDFQIVLSNILGMPLVEDIGRYLGVPSTFTHRRRDDFKEIK
                                            Y++ASGQ +N  K++L+FSPN +K+ Q  +    G  ++    +YLG+PS     +++ FKE+K
Subjt:  ----------------------------------PAYKRASGQVVNTDKSTLYFSPNVSKDFQIVLSNILGMPLVEDIGRYLGVPSTFTHRRRDDFKEIK

Query:  QRVWQTLQGWKRTI---GGKEVLIKSVAQAIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMTFNKALLAK
         ++ + L GWK  +    GKEVLIK+VAQAIPTY MSCF+IP +L D++  ++  FWWG    +R K+ W  W+ LC PK  GG  F+ L  FN A+LAK
Subjt:  QRVWQTLQGWKRTI---GGKEVLIKSVAQAIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMTFNKALLAK

Query:  Q----------------------------------RGFVW-----ARDLLQLGLRKQVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTVSEFITP
        Q                                    F W     A+ L+Q G R QVGNG+SI+ +K+ W+P  +T++ +    +   +D  V++ I  
Subjt:  Q----------------------------------RGFVW-----ARDLLQLGLRKQVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTVSEFITP

Query:  TMG-WNLAKLQCAVTEDDAHVIASIPISVSNESDKWIWHYTPSGEYSVKSGYKVA---RHTSMIQECSQSYDQRRWWKTLWKAHIPNKIKIFIWKAYHNC
          G W    ++      +A VI  I +S +   DK +W  T +G +SV+S YK+A   R  +  +  S     RR+W+T+W  +IP+KI+ F W+A  + 
Subjt:  TMG-WNLAKLQCAVTEDDAHVIASIPISVSNESDKWIWHYTPSGEYSVKSGYKVA---RHTSMIQECSQSYDQRRWWKTLWKAHIPNKIKIFIWKAYHNC

Query:  LPTKFCLWKR
        LPTK  L +R
Subjt:  LPTKFCLWKR

XP_030477990.1 uncharacterized protein LOC115695032 [Cannabis sativa]3.6e-10737.81Show/hide
Query:  VMDSVLQEVQIKSAFVLGRSIFYNVIVGHECLHSIKSRRTGRMSWVALKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPTG
        V+  V+ E Q  SAF+  R I  N++V  E +H +K +  G+  + ALKLDMSKA DGVEW F+  +M K+GF ++W+ LI++ ++T   S I+NG  +G
Subjt:  VMDSVLQEVQIKSAFVLGRSIFYNVIVGHECLHSIKSRRTGRMSWVALKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPTG

Query:  RIVPQRGLRQGDPLSPYLFLLCSEVLSSLITTTSSRGTITG-----------------------------------IKPAYKRASGQVVNTDKSTLYFSP
         + PQRGLRQGDPLSPYLFL+CSE LS L+    S G + G                                   +   Y +ASGQ +NTDKS + FSP
Subjt:  RIVPQRGLRQGDPLSPYLFLLCSEVLSSLITTTSSRGTITG-----------------------------------IKPAYKRASGQVVNTDKSTLYFSP

Query:  NVSKDFQIVLSNILGMPLVEDIGRYLGVPSTFTHRRRDDFKEIKQRVWQTLQGWKR---TIGGKEVLIKSVAQAIPTYFMSCFRIPQTLYDDLHKIMARF
        N S+  +    NILGMP+ E    YLG+P+     ++  F +IK+R+W+ L  W     ++GGKEVL+K+V Q+IPTY MSCF++P     ++  +M+ F
Subjt:  NVSKDFQIVLSNILGMPLVEDIGRYLGVPSTFTHRRRDDFKEIKQRVWQTLQGWKR---TIGGKEVLIKSVAQAIPTYFMSCFRIPQTLYDDLHKIMARF

Query:  WWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMTFNKALLAKQ---------------------------------------RGFVWARDLLQLGLRK
        WWGST  K KKIHWK+W  LC  K  GG  F++ + FN+ALLAKQ                                       +G  W R+LL+ G+R 
Subjt:  WWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMTFNKALLAKQ---------------------------------------RGFVWARDLLQLGLRK

Query:  QVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTVSEFITPTMGWNLAKLQCAVTEDDAHVIASIPISVSNESDKWIWHYTPSGEYSVKSGYKVARH
        QVGNG  I    DPWIP  +   P+   G       TV+++ITP   WN++KL    +  D   I S+P+S    SD W+WH T  G+Y VKSGY VA  
Subjt:  QVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTVSEFITPTMGWNLAKLQCAVTEDDAHVIASIPISVSNESDKWIWHYTPSGEYSVKSGYKVARH

Query:  TSMIQECSQSYDQRRWWKTLWKAHIPNKIKIFIWKAYHNCLPTKFCLWKRGMDVSPRYNFADRIVCL-----------ATNLLEEAFELACILF---WSI
         +     S S     WWK+ W+  +P K+KIF WKA HN LP    L+KR         FA   +CL           +       +++    F    SI
Subjt:  TSMIQECSQSYDQRRWWKTLWKAHIPNKIKIFIWKAYHNCLPTKFCLWKRGMDVSPRYNFADRIVCL-----------ATNLLEEAFELACILF---WSI

Query:  WNERNNYQSGR
        W++RNN   G+
Subjt:  WNERNNYQSGR

XP_030479133.1 uncharacterized protein LOC115696372 [Cannabis sativa]1.9e-10837.05Show/hide
Query:  DVMDSVLQEVQIKSAFVLGRSIFYNVIVGHECLHSIKSRRTGRMSWVALKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPT
        +V+ SV+ E Q  SAF+  R I  N++V  E +HS+K R+ G   + ALK DMSKA D VEW F+  +M K+GF+++W+ LIM C+ T  FS  +NG   
Subjt:  DVMDSVLQEVQIKSAFVLGRSIFYNVIVGHECLHSIKSRRTGRMSWVALKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPT

Query:  GRIVPQRGLRQGDPLSPYLFLLCSEVLSSLITTTSSRGTITG--------------------------------IKPA---YKRASGQVVNTDKSTLYFS
        G + P RGLRQGDPLSPYLFL+CSE LS L+      G + G                                IK A   Y RASGQ +N DKS + FS
Subjt:  GRIVPQRGLRQGDPLSPYLFLLCSEVLSSLITTTSSRGTITG--------------------------------IKPA---YKRASGQVVNTDKSTLYFS

Query:  PNVSKDFQIVLSNILGMPLVEDIGRYLGVPSTFTHRRRDDFKEIKQRVWQTLQGWKR---TIGGKEVLIKSVAQAIPTYFMSCFRIPQTLYDDLHKIMAR
        PN     Q     ILGMP+ E    YLG+P+     +   F  IK+++W+ +  W     +IGGKEVL+K+V Q+IPTY MSCFR+P  L +++  +MA+
Subjt:  PNVSKDFQIVLSNILGMPLVEDIGRYLGVPSTFTHRRRDDFKEIKQRVWQTLQGWKR---TIGGKEVLIKSVAQAIPTYFMSCFRIPQTLYDDLHKIMAR

Query:  FWWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMTFNKALLAKQ---------------------------------------RGFVWARDLLQLGLR
        FWWGS+ +  KKIHWKKW  LC  K  GG  FR  + FN+ALLAKQ                                       +G VW R+LL  GLR
Subjt:  FWWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMTFNKALLAKQ---------------------------------------RGFVWARDLLQLGLR

Query:  KQVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTVSEFITPTMGWNLAKLQCAVTEDDAHVIASIPISVSNESDKWIWHYTPSGEYSVKSGYKVAR
         +VG G +I    D WIP    FKP    G        V+++IT T  WN+  LQ   +  D   I  IP+S    +D+WIWHY  SG+YSV SGY +A 
Subjt:  KQVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTVSEFITPTMGWNLAKLQCAVTEDDAHVIASIPISVSNESDKWIWHYTPSGEYSVKSGYKVAR

Query:  HTSMIQECSQSYDQRRWWKTLWKAHIPNKIKIFIWKAYHNCLPTKFCLWKR-----------------------------------GMDV----SPRYNF
                S S+ Q  WWK+ WK ++P+K+KIF WK   + +P    L+ R                                   G  +    + R   
Subjt:  HTSMIQECSQSYDQRRWWKTLWKAHIPNKIKIFIWKAYHNCLPTKFCLWKR-----------------------------------GMDV----SPRYNF

Query:  ADRIVCLATNLLEEAFELACILFWSIWNERNNYQSGR
         D ++ L++   + AFE    L W IW++RNN+  G+
Subjt:  ADRIVCLATNLLEEAFELACILFWSIWNERNNYQSGR

XP_030505962.1 uncharacterized protein LOC115720894 [Cannabis sativa]2.7e-11038.29Show/hide
Query:  DVMDSVLQEVQIKSAFVLGRSIFYNVIVGHECLHSIKSRRTGRMSWVALKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPT
        DV+  V+ E Q  SAF+  R I  NV+V  E +HS+K R+ G   + A+KLDMSKA D VEW FL  +M K+GF+++W+ LIM C+ T +FS  +NG   
Subjt:  DVMDSVLQEVQIKSAFVLGRSIFYNVIVGHECLHSIKSRRTGRMSWVALKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPT

Query:  GRIVPQRGLRQGDPLSPYLFLLCSEVLSSLITTTSSRGTITG--------------------------------IKPA---YKRASGQVVNTDKSTLYFS
        G I PQRGLRQGDPLSPYLFL+CSE LS L+      G + G                                IK A   Y RASGQ +N DKS + FS
Subjt:  GRIVPQRGLRQGDPLSPYLFLLCSEVLSSLITTTSSRGTITG--------------------------------IKPA---YKRASGQVVNTDKSTLYFS

Query:  PNVSKDFQIVLSNILGMPLVEDIGRYLGVPSTFTHRRRDDFKEIKQRVWQTLQGWKR---TIGGKEVLIKSVAQAIPTYFMSCFRIPQTLYDDLHKIMAR
        PN     Q    NILGMP+ E   +YLG+ +     ++  F +IK+R+W+ +  W     +IGGKEVL+K+V Q+IPTY MSCF++P T    L ++MA 
Subjt:  PNVSKDFQIVLSNILGMPLVEDIGRYLGVPSTFTHRRRDDFKEIKQRVWQTLQGWKR---TIGGKEVLIKSVAQAIPTYFMSCFRIPQTLYDDLHKIMAR

Query:  FWWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMTFNKALLAKQRGFVWARDLLQLGLRKQVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTV
        FWWGS D  +KK HW+KW +LC  K  G                   G  W RDLL  GLR ++G+G S+    DPWIP+ + F P    G        V
Subjt:  FWWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMTFNKALLAKQRGFVWARDLLQLGLRKQVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTV

Query:  SEFITPTMGWNLAKLQCAVTEDDAHVIASIPISVSNESDKWIWHYTPSGEYSVKSGYKVARHTSMIQECSQSYDQRRWWKTLWKAHIPNKIKIFIWKAYH
        S +IT    WNL  L C  +  D   I SIP+S S+  D+WIWH+T S EY+V++GY +A      +  + S  Q  WWK  W   +P+K+KIF+WKA+H
Subjt:  SEFITPTMGWNLAKLQCAVTEDDAHVIASIPISVSNESDKWIWHYTPSGEYSVKSGYKVARHTSMIQECSQSYDQRRWWKTLWKAHIPNKIKIFIWKAYH

Query:  NCLPTKFCL---------------------------------------WKRGMDVSPRYNFADRIVCLATNLLEEAFELACILFWSIWNERNNYQSGR
          +PT   L                                       +    D+  R  F+D +V L+T       E      W +W +RNNY  G+
Subjt:  NCLPTKFCL---------------------------------------WKRGMDVSPRYNFADRIVCLATNLLEEAFELACILFWSIWNERNNYQSGR

XP_030509336.1 uncharacterized protein LOC115724021 [Cannabis sativa]2.9e-11237.92Show/hide
Query:  DVMDSVLQEVQIKSAFVLGRSIFYNVIVGHECLHSIKSRRTGRMSWVALKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPT
        DV+  V+ E Q  SAF+  R I  NV+V  E +HS+K R+ G   + A+KLDMSKA D VEW FL  +M K+GF+++W+ LIM C+ T +FS  +NG   
Subjt:  DVMDSVLQEVQIKSAFVLGRSIFYNVIVGHECLHSIKSRRTGRMSWVALKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPT

Query:  GRIVPQRGLRQGDPLSPYLFLLCSEVLSSLITTTSSRGTITG--------------------------------IKPA---YKRASGQVVNTDKSTLYFS
        G ++PQRGLRQGDPLSPYLFL+CSE LS L+      G + G                                IK A   Y RASGQ +N DKS + FS
Subjt:  GRIVPQRGLRQGDPLSPYLFLLCSEVLSSLITTTSSRGTITG--------------------------------IKPA---YKRASGQVVNTDKSTLYFS

Query:  PNVSKDFQIVLSNILGMPLVEDIGRYLGVPSTFTHRRRDDFKEIKQRVWQTLQGWKR---TIGGKEVLIKSVAQAIPTYFMSCFRIPQTLYDDLHKIMAR
        PN     Q    NILGMP+ +   +YLG+P+     ++  F +IK+R+W+ +  W     +IGGKEVL+K+V Q+IPTY MSCF++P T    L ++MA 
Subjt:  PNVSKDFQIVLSNILGMPLVEDIGRYLGVPSTFTHRRRDDFKEIKQRVWQTLQGWKR---TIGGKEVLIKSVAQAIPTYFMSCFRIPQTLYDDLHKIMAR

Query:  FWWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMTFNKALLAKQ---------------------------------------RGFVWARDLLQLGLR
        FWWGS D  +KK HW+KW +LC  K  GG  FR  + FN+ALLAKQ                                       +G  W RDLL  GLR
Subjt:  FWWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMTFNKALLAKQ---------------------------------------RGFVWARDLLQLGLR

Query:  KQVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTVSEFITPTMGWNLAKLQCAVTEDDAHVIASIPISVSNESDKWIWHYTPSGEYSVKSGYKVAR
         ++G+G S+    DPWIP+ + F P+   G        VS +IT    WNL  L    +  D   I SIP+S S+ SD+WIWH+T S EY+V+SGY +A 
Subjt:  KQVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTVSEFITPTMGWNLAKLQCAVTEDDAHVIASIPISVSNESDKWIWHYTPSGEYSVKSGYKVAR

Query:  HTSMIQECSQSYDQRRWWKTLWKAHIPNKIKIFIWKAYHNCLPTKFCLWKRGMDVSPRYNFADRIVCLATNLLEEAFELACILFWSIWNERNNYQSGRGS
                + S +Q  WWK  W   +P+K+KIF+WKA+H  +P           VS R      +  +   +      + C  F++   +RNN+  G+  
Subjt:  HTSMIQECSQSYDQRRWWKTLWKAHIPNKIKIFIWKAYHNCLPTKFCLWKRGMDVSPRYNFADRIVCLATNLLEEAFELACILFWSIWNERNNYQSGRGS

Query:  IH----WSKRCDGILGYWQEITRNK
        +     W+K       + Q+ T +K
Subjt:  IH----WSKRCDGILGYWQEITRNK

TrEMBL top hitse value%identityAlignment
A0A2N9F707 Uncharacterized protein1.5e-11435.78Show/hide
Query:  MDECGLQDLDYLGETFTWTNRQMQQSQINERLDRFIANETFYQIFPSPSVTHLDWARSDHCPIMLDVCPEEVQEGYTKRRQIFYFEEVWTQNPDCKNIIS
        +  C L+D+ + G  FTW N +  +  + ERLDR + +  +   FP   V H+ +  SDH  + LD   +E +     +R  FYFE  W Q   C+ +I 
Subjt:  MDECGLQDLDYLGETFTWTNRQMQQSQINERLDRFIANETFYQIFPSPSVTHLDWARSDHCPIMLDVCPEEVQEGYTKRRQIFYFEEVWTQNPDCKNIIS

Query:  KLGHKFARDGEAPRLEHFLNRFRGSLKSWGKERKDVKATFESYFREMFQSSSPQIDVMDSVLQEVQIKSAFVLGRSIFYNVIVGHECLHSIKSRRTGRMS
                                 + +W       K     Y         P   V+  V+ + Q  SAFV GR I  N+++  E LH +KS+R GR +
Subjt:  KLGHKFARDGEAPRLEHFLNRFRGSLKSWGKERKDVKATFESYFREMFQSSSPQIDVMDSVLQEVQIKSAFVLGRSIFYNVIVGHECLHSIKSRRTGRMS

Query:  WVALKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPTGRIVPQRGLRQGDPLSPYLFLLCSEVLSSLITTTSSRGT------
         +A+KLDMSKA D VEW F+  +MLK+GF  +WV LIM+CV+T ++S+++NG PTG I P RGLRQGDPLSPYLFL+C+E L+SL++     G+      
Subjt:  WVALKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPTGRIVPQRGLRQGDPLSPYLFLLCSEVLSSLITTTSSRGT------

Query:  ITGIKPAYKRASGQVVNTDKSTLYFSPNVSKDFQIVLSNILGMPLVEDIGRYLGVPSTFTHRRRDDFKEIKQRVWQTLQGWKRTI---GGKEVLIKSVAQ
        +T +   Y+RASGQ +N +K++L+FS N   D +  +   L      D+G+YLG+P      ++  F ++K ++   LQGWK  +    GKE+LIKSVAQ
Subjt:  ITGIKPAYKRASGQVVNTDKSTLYFSPNVSKDFQIVLSNILGMPLVEDIGRYLGVPSTFTHRRRDDFKEIKQRVWQTLQGWKRTI---GGKEVLIKSVAQ

Query:  AIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMTFNKALLAKQ----------------------------
        AIP Y MSCFRIP  L  +++ ++ +FWWG   T+ KKIHW+KW  LC  K+ GG  FRDL  FN ALLAKQ                            
Subjt:  AIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMTFNKALLAKQ----------------------------

Query:  ------RGFVW-----ARDLLQLGLRKQVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTVSEFI-TPTMGWNLAKLQCAVTEDDAHVIASIPISV
                F W     AR +++LG R ++GNG  +  +KD WI   +  K + L+     ++  V + I   +  W  + +       +A  I SIP+  
Subjt:  ------RGFVW-----ARDLLQLGLRKQVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTVSEFI-TPTMGWNLAKLQCAVTEDDAHVIASIPISV

Query:  SNESDKWIWHYTPSGEYSVKSGYKV--ARHTSMIQECSQSYDQRRWWKTLWKAHIPNKIKIFIWKAYHNCLPTKFCLWKRGM
        S   D  +W  TP+G++S +S Y++  A  +S+    S       +WK+LW   +PNKIK+F+W+A  + LPTK  L+ RG+
Subjt:  SNESDKWIWHYTPSGEYSVKSGYKV--ARHTSMIQECSQSYDQRRWWKTLWKAHIPNKIKIFIWKAYHNCLPTKFCLWKRGM

A0A2N9J5D9 CCHC-type domain-containing protein1.2e-11130.82Show/hide
Query:  MDECGLQDLDYLGETFTWTNRQMQQSQINERLDRFIANETFYQIFPSPSVTHLDWARSDHCPIMLDVCPEEVQEGYTKRRQIFYFEEVWTQNPDCKNIIS
        +++C L DL + G  FTWTN + Q   ++ERLDR +A   +  +FP  ++ H  +A SDH  ++L+        G+ ++++ F+FE  W +   C+ +I+
Subjt:  MDECGLQDLDYLGETFTWTNRQMQQSQINERLDRFIANETFYQIFPSPSVTHLDWARSDHCPIMLDVCPEEVQEGYTKRRQIFYFEEVWTQNPDCKNIIS

Query:  KLGHKFARDGEAPRLEHFLNRFRGSLKSWGKE-----------------------------------RKD------------------------------
        +            RL   + + R +L SW K                                    R+D                              
Subjt:  KLGHKFARDGEAPRLEHFLNRFRGSLKSWGKE-----------------------------------RKD------------------------------

Query:  ------------------------------VKATFESYFREMFQSSSPQIDVMDSVLQEVQ-----------------IKSAFVLGRSIFYNVIVGHECL
                                      ++   ++YF  ++ SS+P    +D+V QEV+                  +SAFV GR I  N+++  E L
Subjt:  ------------------------------VKATFESYFREMFQSSSPQIDVMDSVLQEVQ-----------------IKSAFVLGRSIFYNVIVGHECL

Query:  HSIKSRRTGRMSWVALKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPTGRIVPQRGLRQGDPLSPYLFLLCSEVLSSLI--
        H +K++R G++  +A+KLDMSKA D VEW +L+K+MLK+GF  +WV LIM+CV + ++SI+VNG P G + P RGLRQGDPLSPYLFL+C+E  S +   
Subjt:  HSIKSRRTGRMSWVALKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPTGRIVPQRGLRQGDPLSPYLFLLCSEVLSSLI--

Query:  TTTSSRGTITGIKPAYKRASGQVVNTDKSTLYFSPNVSKDFQIVLSNILGMPLVEDIGRYLGVPSTFTHRRRDDFKEIKQRVWQTLQGWKR---TIGGKE
         T +    +  I   Y+ ASGQ +NT K+ L+FS N S   +  + N  G        +YLG+P      ++  F EIK R+W+ LQGWK    +  GK 
Subjt:  TTTSSRGTITGIKPAYKRASGQVVNTDKSTLYFSPNVSKDFQIVLSNILGMPLVEDIGRYLGVPSTFTHRRRDDFKEIKQRVWQTLQGWKR---TIGGKE

Query:  VLIKSVAQAIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMTFNKALLAKQRGFVW-----ARDLLQLGLR
        VLIK+V QAIPTY MSCF+ P  L +++  +  RFWWG  +T R KIHW     LC  K+ G + F    +F +A +     ++W     ++ +L+ G+R
Subjt:  VLIKSVAQAIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMTFNKALLAKQRGFVW-----ARDLLQLGLR

Query:  KQVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTVSEFIT-PTMGWNLAKLQCAVTEDDAHVIASIPISVSNESDKWIWHYTPSGEYSVKSGYKVA
         +VG+G+SI  +KD WIP  +T+K +    H   ++ +V   I   +M WN+  LQ      +  +I  IP+SV    D  IW  T  G ++VKS Y + 
Subjt:  KQVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTVSEFIT-PTMGWNLAKLQCAVTEDDAHVIASIPISVSNESDKWIWHYTPSGEYSVKSGYKVA

Query:  RHTSMIQEC--SQSYDQRRWWKTLWKAHIPNKIKIFIWKAYHNCLPTKFCLWKRGMDVS-----------------------------------PRY---
         H S   E   S S +   +WK LW   +  K+K+F W+A  N +PTK  L+++G+  +                                   P Y   
Subjt:  RHTSMIQEC--SQSYDQRRWWKTLWKAHIPNKIKIFIWKAYHNCLPTKFCLWKRGMDVS-----------------------------------PRY---

Query:  -NFADRIVCLATNLLEEAFELACILFWSIWNERN
         +F D + CL  +L     E+     WS+W  RN
Subjt:  -NFADRIVCLATNLLEEAFELACILFWSIWNERN

A0A7N2LPF9 Uncharacterized protein1.2e-11631.52Show/hide
Query:  CGLQDLDYLGETFTWTNRQMQQSQINERLDRFIANETFYQIFPSPSVTHLDWARSDHCPIMLDVCPEEVQEGYTKRRQIFYFEEVWTQNPDCKNIISKL-
        CGL++  + G  FTW  ++   +QI ERLDR +A   ++ +FP+  + H   + SDH P++L +  ++ ++   K ++IF FE +W ++  C++I+++  
Subjt:  CGLQDLDYLGETFTWTNRQMQQSQINERLDRFIANETFYQIFPSPSVTHLDWARSDHCPIMLDVCPEEVQEGYTKRRQIFYFEEVWTQNPDCKNIISKL-

Query:  GHKFARDGEAPRLE---------------------HFLNRFRGSL--------KSWGKERKDVKATFESYFREMFQSSSP--------------------
        G       + P L                         +RF+ +L          W  ++++V+  F  Y+ ++F SS+P                    
Subjt:  GHKFARDGEAPRLE---------------------HFLNRFRGSL--------KSWGKERKDVKATFESYFREMFQSSSP--------------------

Query:  ------QIDVMDSVLQEVQIKSAFVLGRSIFYNVIVGHECLHSIKSRRTGRMSWVALKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFS
              Q   +   L++++ +SAFV GR I  NV+V  E +H I  ++TG M  +ALKLDMSKA D VEW  L+K+M K+GF+ KW +L+M C+ + T++
Subjt:  ------QIDVMDSVLQEVQIKSAFVLGRSIFYNVIVGHECLHSIKSRRTGRMSWVALKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFS

Query:  IIVNGLPTGRIVPQRGLRQGDPLSPYLFLLCSEVLSSLITTTSSRGTITGIKPA-----------------------------------YKRASGQVVNT
        + +NG P+G I P RGLRQGDPLSPYLFLLC+E LS+LI   ++ G + GI  +                                   Y++ASGQ +N 
Subjt:  IIVNGLPTGRIVPQRGLRQGDPLSPYLFLLCSEVLSSLITTTSSRGTITGIKPA-----------------------------------YKRASGQVVNT

Query:  DKSTLYFSPNVSKDFQIVLSNILGMPLVEDIGRYLGVPSTFTHRRRDDFKEIKQRVWQTLQGWKRTI---GGKEVLIKSVAQAIPTYFMSCFRIPQTLYD
         K++L+FSPN +K+ Q  +    G  ++    +YLG+PS     +++ FKE+K+++ + L GWK  +    GKEVLIK+VAQAIPTY MS F+I  +L D
Subjt:  DKSTLYFSPNVSKDFQIVLSNILGMPLVEDIGRYLGVPSTFTHRRRDDFKEIKQRVWQTLQGWKRTI---GGKEVLIKSVAQAIPTYFMSCFRIPQTLYD

Query:  DLHKIMARFWWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMTFNKALLAKQ----------------------------------RGFVW-----AR
        ++  ++  FWWG    +R KI W  W+ LC PK  GG  F+ L  FN A+L KQ                                    F W     A+
Subjt:  DLHKIMARFWWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMTFNKALLAKQ----------------------------------RGFVW-----AR

Query:  DLLQLGLRKQVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTVSEFITPTMG-WNLAKLQCAVTEDDAHVIASIPISVSNESDKWIWHYTPSGEYS
        D++Q G R QVGNG+SI  +KD W+P  +T++ +    +   +D  V E I    G W +  +       +A +I  I +  +   DK +W  T +G +S
Subjt:  DLLQLGLRKQVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTVSEFITPTMG-WNLAKLQCAVTEDDAHVIASIPISVSNESDKWIWHYTPSGEYS

Query:  VKSGYKVA---RHTSMIQECSQSYDQRRWWKTLWKAHIPNKIKIFIWKAYHNCLPTKFCLWKRGMDVSPRY-NFADRI--VCLATNLLEEAFELACILFW
        V+S YK+A   R    +   S     RR+W+++W  +IP+KI+ F W+A  + LPTK  L KR       + +F D +  V +         E   ++ W
Subjt:  VKSGYKVA---RHTSMIQECSQSYDQRRWWKTLWKAHIPNKIKIFIWKAYHNCLPTKFCLWKRGMDVSPRY-NFADRI--VCLATNLLEEAFELACILFW

Query:  SIWNERNNYQSGRGSIHWSKRCDGILGY
        ++W+ RN  ++            G L Y
Subjt:  SIWNERNNYQSGRGSIHWSKRCDGILGY

A0A803NI87 Uncharacterized protein2.9e-11832.35Show/hide
Query:  MDECGLQDLDYLGETFTWTNRQMQQSQINERLDRFIANETFYQIFPSPSVTHLDWARSDHCPIMLDVCPEEVQEGYTKRRQIFYFEEVWTQNPDCKNIIS
        +D C LQD  + G+ FT    +   S + ERLD  + N+ +      P +THLD+  SDH  +++++      +  T RR  F FE++W  + +C  IIS
Subjt:  MDECGLQDLDYLGETFTWTNRQMQQSQINERLDRFIANETFYQIFPSPSVTHLDWARSDHCPIMLDVCPEEVQEGYTKRRQIFYFEEVWTQNPDCKNIIS

Query:  KLG------------HK----FARDGEAPRLEHFLNRFRGSLKSWGKE-------RKDV------------------KATF-------------------
                       H      A++ +   L H L+    ++     E       R+DV                   A F                   
Subjt:  KLG------------HK----FARDGEAPRLEHFLNRFRGSLKSWGKE-------RKDV------------------KATF-------------------

Query:  ------ESYFREMFQSSSPQI-------------------------------DVMDSVLQEVQIKSAFVLGRSIFYNVIVGHECLHSIKSRRTGRMSWVA
               + F +   +  P++                               DV+  V+ E Q  SAF+  R I  NV+V  E +HS+K R+ G   + A
Subjt:  ------ESYFREMFQSSSPQI-------------------------------DVMDSVLQEVQIKSAFVLGRSIFYNVIVGHECLHSIKSRRTGRMSWVA

Query:  LKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPTGRIVPQRGLRQGDPLSPYLFLLCSEVLSSLITTTSSRGTITG------
        +KLDMSKA D VEW FL  +M K+GF+++W+ LIM C+ T +FS  +NG   G I PQRGLRQGDPLSPYLFL+CSE LS L+      G + G      
Subjt:  LKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPTGRIVPQRGLRQGDPLSPYLFLLCSEVLSSLITTTSSRGTITG------

Query:  --------------------------IKPA---YKRASGQVVNTDKSTLYFSPNVSKDFQIVLSNILGMPLVEDIGRYLGVPSTFTHRRRDDFKEIKQRV
                                  IK A   Y RASGQ +N DKS + FSPN     Q    NILGMP+ E   +YLG+ +     ++  F +IK+R+
Subjt:  --------------------------IKPA---YKRASGQVVNTDKSTLYFSPNVSKDFQIVLSNILGMPLVEDIGRYLGVPSTFTHRRRDDFKEIKQRV

Query:  WQTLQGWKR---TIGGKEVLIKSVAQAIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMTFNKALLAKQRG
        W+ +  W     +IGGKEVL+K+V Q+IPTY MSCF++P T    L ++MA FWWGS D  +KK HW+KW +LC  K  G                   G
Subjt:  WQTLQGWKR---TIGGKEVLIKSVAQAIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMTFNKALLAKQRG

Query:  FVWARDLLQLGLRKQVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTVSEFITPTMGWNLAKLQCAVTEDDAHVIASIPISVSNESDKWIWHYTPS
          W RDLL  GLR ++G+G S+    DPWIP+ + F P    G        VS +IT    WNL  L C  +  D   I SIP+S S+  D+WIWH+T S
Subjt:  FVWARDLLQLGLRKQVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTVSEFITPTMGWNLAKLQCAVTEDDAHVIASIPISVSNESDKWIWHYTPS

Query:  GEYSVKSGYKVARHTSMIQECSQSYDQRRWWKTLWKAHIPNKIKIFIWKAYHNCLPTKFCL---------------------------------------
         EY+V++GY +A      +  + S  Q  WWK  W   +P+K+KIF+WKA+H  +PT   L                                       
Subjt:  GEYSVKSGYKVARHTSMIQECSQSYDQRRWWKTLWKAHIPNKIKIFIWKAYHNCLPTKFCL---------------------------------------

Query:  WKRGMDVSPRYNFADRIVCLATNLLEEAFELACILFWSIWNERNNYQSGR
        +    D+  R  F+D +V L+T       E      W +W +RNNY  G+
Subjt:  WKRGMDVSPRYNFADRIVCLATNLLEEAFELACILFWSIWNERNNYQSGR

A0A803PZ04 Uncharacterized protein7.7e-11131.15Show/hide
Query:  MDECGLQDLDYLGETFTWTNRQMQQSQINERLDRFIANETFYQIFPSPSVTHLDWARSDHCPIMLDVCPEEVQEGYTKRRQIFYFEEVWTQNPDCKNI--
        +D   L  + Y G+ +TWTN+    + + ERLDR   N  +   F    ++HLD   SDH  I  +V   +     T R   F FE+ W ++ DC++   
Subjt:  MDECGLQDLDYLGETFTWTNRQMQQSQINERLDRFIANETFYQIFPSPSVTHLDWARSDHCPIMLDVCPEEVQEGYTKRRQIFYFEEVWTQNPDCKNI--

Query:  ------ISKLG---------HKFA---RDGEAPRLEHF------------------LNRFRGSLKSW--------------GKER---------------
              +SK G         HK +    +     L+HF                  L   + +  +W               K R               
Subjt:  ------ISKLG---------HKFA---RDGEAPRLEHF------------------LNRFRGSLKSW--------------GKER---------------

Query:  -----KDVKATFESYFREMFQSSSPQIDVMDSVLQEVQI-------------------------------------------------------KSAFVL
             +D+     SYF ++F S+   I+ ++ VL  +Q                                                        +SAF+ 
Subjt:  -----KDVKATFESYFREMFQSSSPQIDVMDSVLQEVQI-------------------------------------------------------KSAFVL

Query:  GRSIFYNVIVGHECLHSIKSRRTGRMSWVALKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPTGRIVPQRGLRQGDPLSPY
         R I  NV++  E +H +K+++ GR+ +  LKLDMSKA D VEW FL ++M K+GF + WV+L+M CV T T S  +NG   G +VPQRGLRQGDPLSPY
Subjt:  GRSIFYNVIVGHECLHSIKSRRTGRMSWVALKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPTGRIVPQRGLRQGDPLSPY

Query:  LFLLCSEVLSSLITTTSSRGTITGIKPA-----------------------------------YKRASGQVVNTDKSTLYFSPNVSKDFQIVLSNILGMP
        LFL+CSE LSSL+    S G + G+  A                                   Y RASGQ++NT+K+ + FSPN S+  +   +++LGMP
Subjt:  LFLLCSEVLSSLITTTSSRGTITGIKPA-----------------------------------YKRASGQVVNTDKSTLYFSPNVSKDFQIVLSNILGMP

Query:  LVEDIGRYLGVPSTFTHRRRDDFKEIKQRVWQTLQGWKR---TIGGKEVLIKSVAQAIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIHWKKW
        + +   +YLG+PS     +++ F  IK  +W+ +  W     +IGG+E+L+K+V Q+IPTY MSCF +P+   + L  +MA FWWGS +    KIHWKKW
Subjt:  LVEDIGRYLGVPSTFTHRRRDDFKEIKQRVWQTLQGWKR---TIGGKEVLIKSVAQAIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIHWKKW

Query:  DMLCTPKELGGRNFRDLMTFNKALLAKQR----------------------------------GFVW-----ARDLLQLGLRKQVGNGQSILFFKDPWIP
          LC+ K  GG  FR  + FN+ALLAK                                       W      ++LL  GLR ++GNGQS+   KDPW+P
Subjt:  DMLCTPKELGGRNFRDLMTFNKALLAKQR----------------------------------GFVW-----ARDLLQLGLRKQVGNGQSILFFKDPWIP

Query:  KETTFKPLQLQGHEFQDDVTVSEFITPTMGWNLAKLQCAVTEDDAHVIASIPISVSNESDKWIWHYTPSGEYSVKSGYKVARHTSMIQECSQSYDQRRWW
          TTF P    G       TV+ +IT    W+L  L+      D   I SIP++ S   D  +WH++ +  Y+VKSGY +A     ++  S S   R+WW
Subjt:  KETTFKPLQLQGHEFQDDVTVSEFITPTMGWNLAKLQCAVTEDDAHVIASIPISVSNESDKWIWHYTPSGEYSVKSGYKVARHTSMIQECSQSYDQRRWW

Query:  KTLWKAHIPNKIKIFIWKAYHNCLPTKFCLWKRGMDVS
          LW   +P K+KIF W+  ++ LPT   L  R +  S
Subjt:  KTLWKAHIPNKIKIFIWKAYHNCLPTKFCLWKRGMDVS

SwissProt top hitse value%identityAlignment
P08548 LINE-1 reverse transcriptase homolog4.2e-1323.65Show/hide
Query:  LKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPTGRIVPQRGLRQGDPLSPYLFLLCSEVLSSLI-----------------
        L +D  KA D ++  F+ + + KIG    ++ LI      PT +II+NG+       + G RQG PLSP LF +  EVL+  I                 
Subjt:  LKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPTGRIVPQRGLRQGDPLSPYLFLLCSEVLSSLI-----------------

Query:  -------------TTTSSRGTITGIKPAYKRASGQVVNTDKSTLYFSPNVSKDFQIVLSNILGMPLVEDIGRYLGVPST--FTHRRRDDFKEIKQRVWQT
                      T  S   +  +   Y   SG  +NT KS  +   N ++  + V  +I    +V    +YLGV  T       +++++ +++ + + 
Subjt:  -------------TTTSSRGTITGIKPAYKRASGQVVNTDKSTLYFSPNVSKDFQIVLSNILGMPLVEDIGRYLGVPST--FTHRRRDDFKEIKQRVWQT

Query:  LQGWKRT----IGGKEVLIKSV-AQAIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMTFNKALLAK
        +  WK      +G   ++  S+  +AI  +     + P + + DL KI+  F W     +  K       +L    + GG    DL  + K+++ K
Subjt:  LQGWKRT----IGGKEVLIKSV-AQAIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMTFNKALLAK

P0C2F6 Putative ribonuclease H protein At1g657501.6e-2525Show/hide
Query:  RDDFKEIKQRVWQTLQGWKR---TIGGKEVLIKSVAQAIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMT
        +D F EI +RV   + GW+    +  G+  L K+V  ++P + MS   +PQ++ + L ++   F WGST  ++KK H  KW  +C+PK+ GG   R   +
Subjt:  RDDFKEIKQRVWQTLQGWKR---TIGGKEVLIKSVAQAIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMT

Query:  FNKALLAK-------QRGFVWA-----------------------------------RDLLQLGLRKQVGNGQSILFFKDPWIPKETTFKP-LQLQGHEF
         N+AL++K       ++  +W                                    RD++  G+    G+GQ I F+ D W+    + KP L+L   E 
Subjt:  FNKALLAK-------QRGFVWA-----------------------------------RDLLQLGLRKQVGNGQSILFFKDPWIPKETTFKP-LQLQGHEF

Query:  Q---DDVTVSEFITPTMGWNLAKLQCAVTEDDAHVIASIPIS-VSNESDKWIWHYTPSGEYSVKSGYKVARHTSMIQECSQSYDQRRWWKTLWKAHIPNK
            D V   +   P  GW+ AK+    T +    + ++ +  V+   D+  W ++  G++SV+S Y++     + +    S+     +  LWK  +P +
Subjt:  Q---DDVTVSEFITPTMGWNLAKLQCAVTEDDAHVIASIPIS-VSNESDKWIWHYTPSGEYSVKSGYKVARHTSMIQECSQSYDQRRWWKTLWKAHIPNK

Query:  IKIFIWKAYHNCLPTK
        +K F+W   +  + T+
Subjt:  IKIFIWKAYHNCLPTK

P11369 LINE-1 retrotransposable element ORF2 protein3.5e-1222.35Show/hide
Query:  FVLGRSIFYNVIVGHECLHSIKSRRTGRMSWVALKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPTGRIVPQRGLRQGDPL
        F+ G   ++N+      +H I   +    + + + LD  KA D ++  F+ K++ + G    ++++I      P  +I VNG     I  + G RQG PL
Subjt:  FVLGRSIFYNVIVGHECLHSIKSRRTGRMSWVALKLDMSKANDGVEWCFLEKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPTGRIVPQRGLRQGDPL

Query:  SPYLFLLCSEVLSSLI------------------------------TTTSSRGTITGIKPAYKRASGQVVNTDKSTLYFSPNVSKDFQIVLSNILGMPLV
        SPYLF +  EVL+  I                                 +S   +  +  ++    G  +N++KS + F    +K  +  +       +V
Subjt:  SPYLFLLCSEVLSSLI------------------------------TTTSSRGTITGIKPAYKRASGQVVNTDKSTLYFSPNVSKDFQIVLSNILGMPLV

Query:  EDIGRYLGVPSTFTHRRRD----DFKEIKQRVWQTLQGWK----RTIGGKEVLIKSV-AQAIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIH
         +  +YLGV  T T   +D    +FK +K+ + + L+ WK      IG   ++  ++  +AI  +     +IP   +++L   + +F W +   +  K  
Subjt:  EDIGRYLGVPSTFTHRRRD----DFKEIKQRVWQTLQGWK----RTIGGKEVLIKSV-AQAIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIH

Query:  WKKWDMLCTPKELGGRNFRDLMTFNKALLAKQRGFVWARD
             +L   +  GG    DL  + +A++ K   + W RD
Subjt:  WKKWDMLCTPKELGGRNFRDLMTFNKALLAKQRGFVWARD

P92555 Uncharacterized mitochondrial protein AtMg012502.5e-1058.82Show/hide
Query:  IVNGLPTGRIVPQRGLRQGDPLSPYLFLLCSEVLSSLITTTSSRGTITGIK
        I+NG P G + P RGLRQGDPLSPYLF+LC+EVLS L      +G + GI+
Subjt:  IVNGLPTGRIVPQRGLRQGDPLSPYLFLLCSEVLSSLITTTSSRGTITGIK

P93295 Uncharacterized mitochondrial protein AtMg003103.4e-1535.29Show/hide
Query:  AIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIHWKKWDMLCTPKE-LGGRNFRDLMTFNKALLAKQ---------------------------
        A+P Y MSCFR+ + L   L   M  FWW S + KR KI W  W  LC  KE  GG  FRDL  FN+ALLAKQ                           
Subjt:  AIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIHWKKWDMLCTPKE-LGGRNFRDLMTFNKALLAKQ---------------------------

Query:  ------------RGFVWARDLLQLGLRKQVGNGQSILFFKDPWIPKETTFKPL
                    R  +  R+LL  GL + +G+G     + D WI  ET   PL
Subjt:  ------------RGFVWARDLLQLGLRKQVGNGQSILFFKDPWIPKETTFKPL

Arabidopsis top hitse value%identityAlignment
AT2G22440.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT4G29090.1)8.6e-0636.56Show/hide
Query:  FKDPWIPKETTFKPLQLQGHEFQDDVTVSEFITP-TMGWNLAKLQCAVTEDDAHVIASIPISVSNESDKWIWHYTPSGEYSVKSGYKVARHTS
        +KDPWIP     +P +   +     + V++ I   T  W L +LQ  +   D  +I  I  S +  SD + W +T SG Y+VKSGY VAR  S
Subjt:  FKDPWIPKETTFKPLQLQGHEFQDDVTVSEFITP-TMGWNLAKLQCAVTEDDAHVIASIPISVSNESDKWIWHYTPSGEYSVKSGYKVARHTS

AT3G09510.1 Ribonuclease H-like superfamily protein3.9e-1428.14Show/hide
Query:  RNFRDLMTFNKALLAKQRGFVWAR-----DLLQLGLRKQVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTVSEFITPTMG---WNLAKLQCAVTE
        R F+D+   + A + KQ+ + WA       LL+ G R  +G+GQ+I    D  +    +  P  L   E   ++T++           W+ +K+   V +
Subjt:  RNFRDLMTFNKALLAKQRGFVWAR-----DLLQLGLRKQVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTVSEFITPTMG---WNLAKLQCAVTE

Query:  DDAHVIASIPISVSNESDKWIWHYTPSGEYSVKSGYKVARH--TSMIQECSQSYDQRRWWKTLWKAHIPNKIKIFIWKAYHNCLPTKFCLWKRGMDVSP
         D   I  I ++ S + DK IW+Y  +GEY+V+SGY +  H  ++ I   +  +        +W   I  K+K F+W+A    L T   L  RGM + P
Subjt:  DDAHVIASIPISVSNESDKWIWHYTPSGEYSVKSGYKVARH--TSMIQECSQSYDQRRWWKTLWKAHIPNKIKIFIWKAYHNCLPTKFCLWKRGMDVSP

AT4G29090.1 Ribonuclease H-like superfamily protein6.1e-2827.86Show/hide
Query:  AIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMTFNKALLAKQR---------------------------
        A+PTY M+CF +P+T+   +  ++A FWW +   + K +HWK WD L   K  GG  F+D+  FN ALL KQ                            
Subjt:  AIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIHWKKWDMLCTPKELGGRNFRDLMTFNKALLAKQR---------------------------

Query:  -------GFVW-----ARDLLQLGLRKQVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTVSEFITPT-------MGWNLAKLQCAVTEDDAHVIA
                FVW     ++++L+ G R  VGNG+ I+ ++  W+  +     L++Q    Q+  +VS  +  +         W    ++    E +  +I 
Subjt:  -------GFVW-----ARDLLQLGLRKQVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTVSEFITPT-------MGWNLAKLQCAVTEDDAHVIA

Query:  SIPISVSNESDKWIWHYTPSGEYSVKSGY----KVARHTSMIQECSQSYDQRRWWKTLWKAHIPNKIKIFIWKAYHNCLP
         +        D + W YT SG+Y+VKSGY    ++    S  QE S+       ++ +WK+    KI+ F+WK   N LP
Subjt:  SIPISVSNESDKWIWHYTPSGEYSVKSGY----KVARHTSMIQECSQSYDQRRWWKTLWKAHIPNKIKIFIWKAYHNCLP

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.4e-1635.29Show/hide
Query:  AIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIHWKKWDMLCTPKE-LGGRNFRDLMTFNKALLAKQ---------------------------
        A+P Y MSCFR+ + L   L   M  FWW S + KR KI W  W  LC  KE  GG  FRDL  FN+ALLAKQ                           
Subjt:  AIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIHWKKWDMLCTPKE-LGGRNFRDLMTFNKALLAKQ---------------------------

Query:  ------------RGFVWARDLLQLGLRKQVGNGQSILFFKDPWIPKETTFKPL
                    R  +  R+LL  GL + +G+G     + D WI  ET   PL
Subjt:  ------------RGFVWARDLLQLGLRKQVGNGQSILFFKDPWIPKETTFKPL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.8e-1158.82Show/hide
Query:  IVNGLPTGRIVPQRGLRQGDPLSPYLFLLCSEVLSSLITTTSSRGTITGIK
        I+NG P G + P RGLRQGDPLSPYLF+LC+EVLS L      +G + GI+
Subjt:  IVNGLPTGRIVPQRGLRQGDPLSPYLFLLCSEVLSSLITTTSSRGTITGIK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGAATGTGGGCTCCAGGATCTGGATTATTTGGGTGAGACGTTTACGTGGACAAACAGACAAATGCAACAGTCTCAAATCAATGAGAGATTGGATAGATTCATTGC
AAATGAAACTTTTTACCAAATCTTTCCTTCCCCCTCTGTCACACACTTGGATTGGGCCCGCTCTGACCACTGCCCAATCATGTTGGATGTTTGTCCTGAGGAGGTTCAAG
AGGGATACACCAAGAGGCGGCAGATTTTTTATTTTGAGGAAGTCTGGACTCAGAATCCTGATTGTAAAAACATTATCTCCAAATTGGGGCACAAGTTTGCTCGTGATGGT
GAGGCTCCACGCTTGGAACATTTCCTCAATCGATTTCGTGGCAGTTTGAAATCGTGGGGCAAGGAAAGAAAAGATGTGAAAGCAACCTTTGAATCTTATTTCAGGGAGAT
GTTCCAATCATCTAGTCCTCAAATCGATGTCATGGACTCTGTTCTACAAGAAGTTCAAATAAAGTCTGCTTTTGTACTTGGACGGTCCATTTTTTATAATGTAATTGTAG
GTCATGAGTGTTTGCATTCCATTAAATCACGGAGGACAGGTCGGATGAGTTGGGTAGCTCTTAAGCTGGATATGAGTAAAGCAAACGACGGGGTAGAGTGGTGTTTTTTG
GAGAAACTCATGCTAAAAATTGGGTTCCATCTGAAATGGGTGGATCTAATTATGGATTGTGTGAAAACTCCAACCTTTTCAATCATTGTTAATGGGCTTCCCACTGGAAG
AATCGTTCCTCAACGTGGTTTAAGACAGGGTGACCCTCTTTCACCGTACCTTTTCTTACTTTGTTCAGAAGTCTTATCTTCCTTGATTACGACGACATCATCCCGAGGCA
CGATCACTGGTATAAAACCAGCGTACAAGAGGGCATCAGGTCAGGTGGTTAATACAGACAAATCAACCTTATATTTTTCTCCTAATGTGTCGAAAGATTTTCAGATTGTG
CTTTCAAATATATTGGGTATGCCTTTGGTGGAAGATATTGGACGCTATCTAGGCGTGCCATCTACTTTTACTCATCGTAGGAGGGATGATTTTAAAGAGATTAAGCAGCG
TGTATGGCAGACTTTGCAGGGATGGAAAAGAACCATTGGTGGTAAAGAAGTCCTCATCAAGAGTGTGGCCCAAGCAATTCCAACATACTTTATGAGTTGTTTTCGCATCC
CTCAAACACTGTATGATGATTTACACAAGATTATGGCCAGATTTTGGTGGGGATCAACAGATACGAAGAGGAAGAAGATTCATTGGAAGAAATGGGACATGTTGTGTACC
CCTAAGGAGTTAGGTGGTCGAAATTTTCGAGATTTGATGACTTTTAACAAGGCACTTTTGGCAAAGCAGAGGGGTTTTGTTTGGGCACGAGATCTCCTTCAACTTGGGTT
GAGGAAGCAAGTTGGTAATGGGCAGTCTATTCTTTTCTTTAAGGATCCTTGGATTCCTAAAGAAACTACTTTTAAACCTTTGCAATTACAGGGACACGAATTTCAAGATG
ATGTGACGGTGTCAGAATTTATTACACCAACAATGGGTTGGAATTTGGCGAAACTACAATGTGCGGTGACTGAAGATGATGCCCATGTTATTGCATCAATTCCCATTAGT
GTATCCAATGAATCGGATAAATGGATATGGCATTATACTCCTAGTGGAGAATATTCAGTTAAAAGTGGCTATAAAGTAGCTAGGCATACCTCAATGATTCAAGAATGCTC
TCAAAGCTATGACCAAAGGCGTTGGTGGAAAACATTATGGAAAGCTCACATTCCAAATAAGATTAAAATCTTCATTTGGAAAGCTTACCATAATTGTTTACCAACAAAAT
TTTGCCTTTGGAAACGGGGAATGGATGTCTCTCCGAGGTATAATTTTGCAGATAGGATCGTATGTCTAGCAACAAATTTACTTGAGGAGGCATTTGAGTTAGCATGCATT
TTGTTCTGGTCAATTTGGAATGAAAGGAACAACTACCAATCTGGCCGTGGATCTATTCATTGGTCCAAACGATGTGATGGGATTTTGGGATATTGGCAGGAAATTACGAG
GAACAAATTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGAATGTGGGCTCCAGGATCTGGATTATTTGGGTGAGACGTTTACGTGGACAAACAGACAAATGCAACAGTCTCAAATCAATGAGAGATTGGATAGATTCATTGC
AAATGAAACTTTTTACCAAATCTTTCCTTCCCCCTCTGTCACACACTTGGATTGGGCCCGCTCTGACCACTGCCCAATCATGTTGGATGTTTGTCCTGAGGAGGTTCAAG
AGGGATACACCAAGAGGCGGCAGATTTTTTATTTTGAGGAAGTCTGGACTCAGAATCCTGATTGTAAAAACATTATCTCCAAATTGGGGCACAAGTTTGCTCGTGATGGT
GAGGCTCCACGCTTGGAACATTTCCTCAATCGATTTCGTGGCAGTTTGAAATCGTGGGGCAAGGAAAGAAAAGATGTGAAAGCAACCTTTGAATCTTATTTCAGGGAGAT
GTTCCAATCATCTAGTCCTCAAATCGATGTCATGGACTCTGTTCTACAAGAAGTTCAAATAAAGTCTGCTTTTGTACTTGGACGGTCCATTTTTTATAATGTAATTGTAG
GTCATGAGTGTTTGCATTCCATTAAATCACGGAGGACAGGTCGGATGAGTTGGGTAGCTCTTAAGCTGGATATGAGTAAAGCAAACGACGGGGTAGAGTGGTGTTTTTTG
GAGAAACTCATGCTAAAAATTGGGTTCCATCTGAAATGGGTGGATCTAATTATGGATTGTGTGAAAACTCCAACCTTTTCAATCATTGTTAATGGGCTTCCCACTGGAAG
AATCGTTCCTCAACGTGGTTTAAGACAGGGTGACCCTCTTTCACCGTACCTTTTCTTACTTTGTTCAGAAGTCTTATCTTCCTTGATTACGACGACATCATCCCGAGGCA
CGATCACTGGTATAAAACCAGCGTACAAGAGGGCATCAGGTCAGGTGGTTAATACAGACAAATCAACCTTATATTTTTCTCCTAATGTGTCGAAAGATTTTCAGATTGTG
CTTTCAAATATATTGGGTATGCCTTTGGTGGAAGATATTGGACGCTATCTAGGCGTGCCATCTACTTTTACTCATCGTAGGAGGGATGATTTTAAAGAGATTAAGCAGCG
TGTATGGCAGACTTTGCAGGGATGGAAAAGAACCATTGGTGGTAAAGAAGTCCTCATCAAGAGTGTGGCCCAAGCAATTCCAACATACTTTATGAGTTGTTTTCGCATCC
CTCAAACACTGTATGATGATTTACACAAGATTATGGCCAGATTTTGGTGGGGATCAACAGATACGAAGAGGAAGAAGATTCATTGGAAGAAATGGGACATGTTGTGTACC
CCTAAGGAGTTAGGTGGTCGAAATTTTCGAGATTTGATGACTTTTAACAAGGCACTTTTGGCAAAGCAGAGGGGTTTTGTTTGGGCACGAGATCTCCTTCAACTTGGGTT
GAGGAAGCAAGTTGGTAATGGGCAGTCTATTCTTTTCTTTAAGGATCCTTGGATTCCTAAAGAAACTACTTTTAAACCTTTGCAATTACAGGGACACGAATTTCAAGATG
ATGTGACGGTGTCAGAATTTATTACACCAACAATGGGTTGGAATTTGGCGAAACTACAATGTGCGGTGACTGAAGATGATGCCCATGTTATTGCATCAATTCCCATTAGT
GTATCCAATGAATCGGATAAATGGATATGGCATTATACTCCTAGTGGAGAATATTCAGTTAAAAGTGGCTATAAAGTAGCTAGGCATACCTCAATGATTCAAGAATGCTC
TCAAAGCTATGACCAAAGGCGTTGGTGGAAAACATTATGGAAAGCTCACATTCCAAATAAGATTAAAATCTTCATTTGGAAAGCTTACCATAATTGTTTACCAACAAAAT
TTTGCCTTTGGAAACGGGGAATGGATGTCTCTCCGAGGTATAATTTTGCAGATAGGATCGTATGTCTAGCAACAAATTTACTTGAGGAGGCATTTGAGTTAGCATGCATT
TTGTTCTGGTCAATTTGGAATGAAAGGAACAACTACCAATCTGGCCGTGGATCTATTCATTGGTCCAAACGATGTGATGGGATTTTGGGATATTGGCAGGAAATTACGAG
GAACAAATTTTGA
Protein sequenceShow/hide protein sequence
MDECGLQDLDYLGETFTWTNRQMQQSQINERLDRFIANETFYQIFPSPSVTHLDWARSDHCPIMLDVCPEEVQEGYTKRRQIFYFEEVWTQNPDCKNIISKLGHKFARDG
EAPRLEHFLNRFRGSLKSWGKERKDVKATFESYFREMFQSSSPQIDVMDSVLQEVQIKSAFVLGRSIFYNVIVGHECLHSIKSRRTGRMSWVALKLDMSKANDGVEWCFL
EKLMLKIGFHLKWVDLIMDCVKTPTFSIIVNGLPTGRIVPQRGLRQGDPLSPYLFLLCSEVLSSLITTTSSRGTITGIKPAYKRASGQVVNTDKSTLYFSPNVSKDFQIV
LSNILGMPLVEDIGRYLGVPSTFTHRRRDDFKEIKQRVWQTLQGWKRTIGGKEVLIKSVAQAIPTYFMSCFRIPQTLYDDLHKIMARFWWGSTDTKRKKIHWKKWDMLCT
PKELGGRNFRDLMTFNKALLAKQRGFVWARDLLQLGLRKQVGNGQSILFFKDPWIPKETTFKPLQLQGHEFQDDVTVSEFITPTMGWNLAKLQCAVTEDDAHVIASIPIS
VSNESDKWIWHYTPSGEYSVKSGYKVARHTSMIQECSQSYDQRRWWKTLWKAHIPNKIKIFIWKAYHNCLPTKFCLWKRGMDVSPRYNFADRIVCLATNLLEEAFELACI
LFWSIWNERNNYQSGRGSIHWSKRCDGILGYWQEITRNKF