; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0002546 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0002546
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr4:43721873..43723720
RNA-Seq ExpressionLag0002546
SyntenyLag0002546
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR005135 - Endonuclease/exonuclease/phosphatase
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA3453480.1 reverse transcriptase [Gossypium australe]3.1e-12640.45Show/hide
Query:  MKILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHITVNSYSKGHIDVTVKEP--EWW
        MK  CWN RG+G+PRAVR LR+ +++H+P +VFL ETK      D+++R  GF N INV + G+ GGL L W++   +T+ SYSK HIDV +KE   +  
Subjt:  MKILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHITVNSYSKGHIDVTVKEP--EWW

Query:  WRFTGFYGNPDQGKRKDSWRLLERLHDSINLPWIIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLDRF
        WRFTGFYG+P    +K  W LLERL    + PW++ GDFNEIMF+ EK+GG  +    +  F D ++ C L+DVGFSG  +TW R    +   +ERLDR 
Subjt:  WRFTGFYGNPDQGKRKDSWRLLERLHDSINLPWIIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLDRF

Query:  FFNPTMEQKSKFRRVEHLNFHHSDHRPIVLEIGWENYGQQRLYTKKSIRFEDWWTQNEGSKLSFKEAWDQSDEASTINFNSKIQEGLMAMSTWNK--ERL
          N          R++HL F  SDH P++L    +N     +   +   FE WWT  E  +   +  W+ S E        K++   + +  W K  +R 
Subjt:  FFNPTMEQKSKFRRVEHLNFHHSDHRPIVLEIGWENYGQQRLYTKKSIRFEDWWTQNEGSKLSFKEAWDQSDEASTINFNSKIQEGLMAMSTWNK--ERL

Query:  KGSIKNAIIRKEREINTLVKSKSPDNFTNLIQAEKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGRI
        KG +K  +      +  L K +  +    LI  +  L   +++EE YW+ RSR +WL+  DRNT +FH  A+ RK+ N I  ++   G  +  +  +   
Subjt:  KGSIKNAIIRKEREINTLVKSKSPDNFTNLIQAEKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGRI

Query:  ATKFIHELLNSNHPSRESIQGSIEAIDSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMNK
        A  +   L  S   +    +  +E I+S IS+E    L  PF ++E+   +K M  LKAPG DG  A  +Q  W  +G++ ++ CLGILN+ + +E  N 
Subjt:  ATKFIHELLNSNHPSRESIQGSIEAIDSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMNK

Query:  TLITLIPKKNDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVEW
        T I LIPK + P ++  +RPISLC V YK+V K +ANR++ V+   I + QSAF+PG  I+DNVI+ +E LHT   KR GK G++A+KLDMSKAYDRVEW
Subjt:  TLITLIPKKNDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVEW

Query:  SFVEEIMRKMNFSERWTR
         FV ++MRKM FS  W R
Subjt:  SFVEEIMRKMNFSERWTR

XP_030936391.1 uncharacterized protein LOC115961572 [Quercus lobata]4.6e-12240.29Show/hide
Query:  MKILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHITVNSYSKGHIDVTVKEPEW---
        M IL WN RG+GN R V++L   + K  P +VFL ETK+ +   +K+K +    + + VSS G+ GGLALLW+    + +N+Y++ HID  + E  W   
Subjt:  MKILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHITVNSYSKGHIDVTVKEPEW---

Query:  WWRFTGFYGNPDQGKRKDSWRLLERLHDSINLPWIIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLDR
         W FTGFYGNPD  +R +SW  L+ L  + ++PW+  GDFNEI    EK+GG  +P   + +F DAI+YC   +V F G +YTW  ++ +    +ERLDR
Subjt:  WWRFTGFYGNPDQGKRKDSWRLLERLHDSINLPWIIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLDR

Query:  FFFNPTMEQKSKFRRVEHLNFHHSDHRPIVLEIGWENYGQQRLYTKKSIRFEDWWTQNEGSKLSFKEAWDQSDEASTIN-FNSKIQEGLMAMSTWNKERL
           N          ++ HL+   SDH P+ L +  +   +++   +KS RFE  W ++   +   K AW+  + +       S ++     +  WNKE  
Subjt:  FFFNPTMEQKSKFRRVEHLNFHHSDHRPIVLEIGWENYGQQRLYTKKSIRFEDWWTQNEGSKLSFKEAWDQSDEASTIN-FNSKIQEGLMAMSTWNKERL

Query:  KGSIKNAIIRKEREINTLVKSKSPDNFTNLIQAEK-ELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGR
         G +   I   ++++  L    S       ++  +  L K LE+E+  W+ RSR +W +  DRNT +FHAKAS R +KN I GI+D  G+W E + +I  
Subjt:  KGSIKNAIIRKEREINTLVKSKSPDNFTNLIQAEK-ELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGR

Query:  IATKFIHELLNSNHPSRESIQGSIEAIDSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMN
        +A  +  +L  S+ P  E     + A+  K++ +    L   +T +E+   +K M  LKAPGPDG     +Q  W+T GE      L  LN+  S    N
Subjt:  IATKFIHELLNSNHPSRESIQGSIEAIDSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMN

Query:  KTLITLIPKKNDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVE
        +T I LIPK N+PK +S+YRPISLCNV+YKI +KA+ANR+KK L SIIS  QSAFV GR ITDNV++ FE +H ++ K+ GK G +AIKLDMSKAYDRVE
Subjt:  KTLITLIPKKNDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVE

Query:  WSFVEEIMRKMNF
        W FVE+IM K+ F
Subjt:  WSFVEEIMRKMNF

XP_042956310.1 uncharacterized protein LOC122292152 [Carya illinoinensis]5.3e-12640.23Show/hide
Query:  MKILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHITVNSYSKGHIDVTV----KEPE
        M+ L WN RG+GNPR+V  L   ++   P +VFL ETK  K   ++I + L F +C+ + S G+SGGL L+W N+ ++++ +YS+ HI   +    + P 
Subjt:  MKILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHITVNSYSKGHIDVTV----KEPE

Query:  WWWRFTGFYGNPDQGKRKDSWRLLERLHDSINLPWIIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLD
          W FTGFYG+P+  KR  SW LL  L    ++ W+  GDFNE++   EK+GG P+P   +  F  AI  C+L  +   G  +TWS N+ E E TKERLD
Subjt:  WWWRFTGFYGNPDQGKRKDSWRLLERLHDSINLPWIIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLD

Query:  RFFFNPTMEQKSKFRRVEHLNFHHSDHRPIVLEIGWENYGQQRLYTKKSIRFEDWWTQNEGSKLSFKEAWDQ---SDEASTINFNSKIQEGLMAMSTWNK
        R   N            + +    SDH P+++ +  EN   +R       RFE  W   +G     +EAW +    D+A T + + +I    +A+  W K
Subjt:  RFFFNPTMEQKSKFRRVEHLNFHHSDHRPIVLEIGWENYGQQRLYTKKSIRFEDWWTQNEGSKLSFKEAWDQ---SDEASTINFNSKIQEGLMAMSTWNK

Query:  ERLKGSIKNAIIRKEREINTLVKSKSPDNFTNLIQAEKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEI
                  I RK +E+  L  +   D    +   +KE+E+ L EEE  WK R+++ WL++ DRNTK++H  ASQR++ N++  I D+N   V + ++I
Subjt:  ERLKGSIKNAIIRKEREINTLVKSKSPDNFTNLIQAEKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEI

Query:  GRIATKFIHELLNSNHPSRESIQGSIEAIDSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEP
        G + T    +L  S++P    I   +E I +K++    + L  PF++EE++  V  M  L +PGPDG  A  YQS W+TIG++     L ++N+  SLE 
Subjt:  GRIATKFIHELLNSNHPSRESIQGSIEAIDSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEP

Query:  MNKTLITLIPKKNDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDR
        +N+T ITLIPK  DPK + ++RPISLCNV YKIVAK L+NR+K VL  IIS NQSAFVPGR ITDN+++ +E LHT++ + +GK+G++A+KLDMSKAYDR
Subjt:  MNKTLITLIPKKNDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDR

Query:  VEWSFVEEIMRKMNFSERW
        VEWSF+  +M ++ F + W
Subjt:  VEWSFVEEIMRKMNFSERW

XP_042958247.1 uncharacterized protein LOC122293873 [Carya illinoinensis]6.9e-12640.23Show/hide
Query:  MKILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHITVNSYSKGHIDVTV----KEPE
        M+ L WN RG+GNPR+V  L   ++   P +VFL ETK  K   ++I + L F +C+ + S G+SGGL L+W N+ ++++ +YS+ HI   +    + P 
Subjt:  MKILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHITVNSYSKGHIDVTV----KEPE

Query:  WWWRFTGFYGNPDQGKRKDSWRLLERLHDSINLPWIIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLD
          W FTGFYG+P+  KR  SW LL  L    ++ W+  GDFNE++   EK+GG P+P   +  F  AI  C+L  +   G  +TWS N+ E E TKERLD
Subjt:  WWWRFTGFYGNPDQGKRKDSWRLLERLHDSINLPWIIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLD

Query:  RFFFNPTMEQKSKFRRVEHLNFHHSDHRPIVLEIGWENYGQQRLYTKKSIRFEDWWTQNEGSKLSFKEAW---DQSDEASTINFNSKIQEGLMAMSTWNK
        R   N            + +    SDH P+++ +  EN   +R       RFE  W   +G     +EAW      D+A+T + + +I    +A+  W K
Subjt:  RFFFNPTMEQKSKFRRVEHLNFHHSDHRPIVLEIGWENYGQQRLYTKKSIRFEDWWTQNEGSKLSFKEAW---DQSDEASTINFNSKIQEGLMAMSTWNK

Query:  ERLKGSIKNAIIRKEREINTLVKSKSPDNFTNLIQAEKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEI
                  I RK +E+  L  +   D    +   +KE+E+ L EEE  WK R+++ WL++ DRNTK++H  ASQR++ N++  I D+N   V + ++I
Subjt:  ERLKGSIKNAIIRKEREINTLVKSKSPDNFTNLIQAEKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEI

Query:  GRIATKFIHELLNSNHPSRESIQGSIEAIDSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEP
        G + T    +L  S++P    I   +E I +K++    + L  PF++EE++  V  M  L +PGPDG  A  YQS W+TIG++     L ++N+  SLE 
Subjt:  GRIATKFIHELLNSNHPSRESIQGSIEAIDSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEP

Query:  MNKTLITLIPKKNDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDR
        +N+T ITLIPK  DPK + ++RPISLCNV YKIVAK L+NR+K VL  IIS NQSAFVPGR ITDN+++ +E LHT++ + +GK+G++A+KLDMSKAYDR
Subjt:  MNKTLITLIPKKNDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDR

Query:  VEWSFVEEIMRKMNFSERW
        VEWSF+  +M ++ F + W
Subjt:  VEWSFVEEIMRKMNFSERW

XP_042974832.1 uncharacterized protein LOC122306468 [Carya illinoinensis]9.3e-12338.21Show/hide
Query:  MKILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHITVNSYSKGHIDVTV-KEPEWWW
        MKIL WN RG+GNPR VR L   +++ +PT++FL ETK  K+  +++   LG++ C+ V SRG+SGGL  LW+ +  +++ +YS+ HI   V    E  W
Subjt:  MKILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHITVNSYSKGHIDVTV-KEPEWWW

Query:  RFTGFYGNPDQGKRKDSWRLLERLHDSINLPWIIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLDRFF
         FTGFYGNP+  KR  SW  L+ +    + PW+  GDFNEI+   EK GG P+P + +  F + + +C+L  +   G  +TW+ N+ +    KERLDR  
Subjt:  RFTGFYGNPDQGKRKDSWRLLERLHDSINLPWIIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLDRFF

Query:  FNPTMEQKSKFRRVEHLNFHHSDHRPIVLEIGWENYGQQRLYTKKSIRFEDWWTQNEGSKLSFKEAWDQ--SDEASTINFNSKIQEGLMAMSTWNKERLK
         NP   ++ +      L    SDH P+++++ ++   Q R + K+  R+E  W+  +  + + KEAW +   + A+     S++      +  WN++ ++
Subjt:  FNPTMEQKSKFRRVEHLNFHHSDHRPIVLEIGWENYGQQRLYTKKSIRFEDWWTQNEGSKLSFKEAWDQ--SDEASTINFNSKIQEGLMAMSTWNKERLK

Query:  GSIKNAIIRKEREINTLVKSKSPDNFTNLIQAEKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGRIA
         + K  I +K + +  L +    D   ++ + + ++++ L  E+  WK R+++ WL++ DRNT ++H  AS R++ N+I  I+D     + + +++GR+ 
Subjt:  GSIKNAIIRKEREINTLVKSKSPDNFTNLIQAEKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGRIA

Query:  TKFIHELLNSNHPSRESIQGSIEAIDSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMNKT
        T++  +L  S+ PS   I+  I+ + +K+S    + L   FT+EE++  V  M  + +PGPDG  A  +Q+ W+T G++  +  L ILN   SLE +N T
Subjt:  TKFIHELLNSNHPSRESIQGSIEAIDSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMNKT

Query:  LITLIPKKNDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVEWS
         ITLIPK  +P  +S+YRPISLCNV YK+VAK L+NR+K +L  IIS NQSAFVPGR ITDN+++ +E LH+++ + +GK+ ++A+KLDMSKAYDRVEW 
Subjt:  LITLIPKKNDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVEWS

Query:  FVEEIMRKMNFSERW
        F+E +M KM F  RW
Subjt:  FVEEIMRKMNFSERW

TrEMBL top hitse value%identityAlignment
A0A2N9IPS8 Reverse transcriptase domain-containing protein3.6e-12840.23Show/hide
Query:  MKILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHITVNSYSKGHIDVTV--KEPEWW
        M++L WN +G+GN   VR L   I++ +PT++FL+ET+  K+  ++++  + F+    V  RG  GGLA+LW  +  + + +YS+ HID  +  KE    
Subjt:  MKILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHITVNSYSKGHIDVTV--KEPEWW

Query:  WRFTGFYGNPDQGKRKDSWRLLERLHDSINLPWIIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLDRF
        +R TGFYGNP+  KRK+SW LL+ L    + PW+  GDFNEI+ N E+ G   +P   I DF +A+ +C L D+G+ G+ YTW R +        RLDR 
Subjt:  WRFTGFYGNPDQGKRKDSWRLLERLHDSINLPWIIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLDRF

Query:  FFNPTMEQKSKFRRVEHLNFHHSDHRPIVLEIGWENYGQQRLYTKKSIRFEDWWTQNEGSKLSFKEAWDQSDEASTINFN--SKIQEGLMAMSTWNKERL
          + +         V HL   +SDH PI+L+I     G      KK  RFE  W ++E  +     AW       +  F    K++    ++  W++ER 
Subjt:  FFNPTMEQKSKFRRVEHLNFHHSDHRPIVLEIGWENYGQQRLYTKKSIRFEDWWTQNEGSKLSFKEAWDQSDEASTINFN--SKIQEGLMAMSTWNKERL

Query:  KGSIKNAIIRKEREINTLVKSKSPDNF-TNLIQAEKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGR
         GS+ ++I RK  ++  L+ +++P  F T +++ + +L  LLE+EE +W+ RSR  W+   D+NTK+FHA+ ++R+R N I G+ D +G W  +  +I  
Subjt:  KGSIKNAIIRKEREINTLVKSKSPDNF-TNLIQAEKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGR

Query:  IATKFIHELLNSNHPSRESIQGSIEAIDSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMN
        IA  +   +  S++PS ESI   ++ ++S ++     +L   FTK+E+   +K M   KAPGPDG  A  YQ+ WD +G +  +  L IL++   L  +N
Subjt:  IATKFIHELLNSNHPSRESIQGSIEAIDSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMN

Query:  KTLITLIPKKNDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVE
         T I LIPK  +P++++++RPISLCNV YKIV+K LANR+KKVL  +IS+ QSAFVPGR ITDNV++ FE +H+++ KR+GK G +A+KLDMSKAYDRVE
Subjt:  KTLITLIPKKNDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVE

Query:  WSFVEEIMRKMNFSERWTR
        W F+E IMR M F++ W R
Subjt:  WSFVEEIMRKMNFSERWTR

A0A7N2LIH6 Uncharacterized protein2.7e-12841.79Show/hide
Query:  MKILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHITVNSYSKGHIDVTVKEPEWW--
        M IL WN RG+G   AVR+L  E++K NP +VFL ETK         + +LGF   I V S G SGGLALLW+    I   S S  HIDV V        
Subjt:  MKILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHITVNSYSKGHIDVTVKEPEWW--

Query:  WRFTGFYGNPDQGKRKDSWRLLERLHDSINLPWIIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLDRF
        WR TGFYG+PD GKR  SW+LLE L+    +PW++ GDFNEI+   EK G   +  + ++ F + +S C LID+GF G R+TW   ++  + T  RLDR 
Subjt:  WRFTGFYGNPDQGKRKDSWRLLERLHDSINLPWIIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLDRF

Query:  FFNPTMEQKSKFRRVEHLNFHHSDHRPIVLEIGWENYGQQRLYTKKSIRFEDWWTQNEGSKLSFKEAWDQSDEASTINFNSKIQEGLMAMSTWNKERLKG
          N          +V H++   SDH  + L +   N  Q+R   KK   FE+ WT+ E  K   + AWD   E S +    +++     +  WN+    G
Subjt:  FFNPTMEQKSKFRRVEHLNFHHSDHRPIVLEIGWENYGQQRLYTKKSIRFEDWWTQNEGSKLSFKEAWDQSDEASTINFNSKIQEGLMAMSTWNKERLKG

Query:  SIKNAIIRKEREINTLVKSKSPDNFTNLIQA-EKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGRIA
        ++   I +K+  +  L            IQ  +KE+ +L   EE  WK RSR  WL+  D+N+K+FHA ASQR++KN I G++D  G W E  E   ++ 
Subjt:  SIKNAIIRKEREINTLVKSKSPDNFTNLIQA-EKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGRIA

Query:  TKFIHELLNSNHPSRESIQGSIEAIDSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMNKT
          +  ++ +SN P+  S   S+EA+D +++ E    L   F   E+ Q ++ M+  KAPGPDG     YQ  WD +G       L  LN+    + +NKT
Subjt:  TKFIHELLNSNHPSRESIQGSIEAIDSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMNKT

Query:  LITLIPKKNDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVEWS
         I LIPK  +P+ ++E+RPISLCNV YKI++K LANR+KKVL  +I + QSAFVPGR ITDNVI+ FE +H++N +R+GK G +AIKLDMSKAYDRVEW+
Subjt:  LITLIPKKNDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVEWS

Query:  FVEEIMRKMNFSERW
        ++E +M+KM F +RW
Subjt:  FVEEIMRKMNFSERW

A0A803PBM9 Uncharacterized protein7.4e-13442.88Show/hide
Query:  MKILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHITVNSYSKGHIDVTV-KEPEWWW
        MK+L WNV+G+GNP  VR+L+  + + +P +VF++E++  K  A+ ++  LG++ C  V + G SGGL LLW N     + S+S  HID  + KE   WW
Subjt:  MKILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHITVNSYSKGHIDVTV-KEPEWWW

Query:  RFTGFYGNPDQGKRKDSWRLLERLHDSINLPWIIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLDRFF
        RFTGFYG+PD  +R +SW+LL R+    + PW+IGGDFNEI+ NKEK GG PKP  LIN+F  A++  NL +V + G  YTW  N  + E   ERLDR  
Subjt:  RFTGFYGNPDQGKRKDSWRLLERLHDSINLPWIIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLDRFF

Query:  FNPTMEQKSKFRRVEHLNFHHSDHRPIVLEIGWENYGQQR-LYTKKSIRFEDWWTQNEGSKLSFKEAWDQSDEAST-INFNSKIQEGLMAMSTWNKERLK
         NP         +V HL+   SDH P++L    +N+  ++ +       FE  W   E      KE+WD+    +T +    K+     A+  WNK R K
Subjt:  FNPTMEQKSKFRRVEHLNFHHSDHRPIVLEIGWENYGQQR-LYTKKSIRFEDWWTQNEGSKLSFKEAWDQSDEAST-INFNSKIQEGLMAMSTWNKERLK

Query:  GSIKNAIIRKEREINTLVKSKSPDNFTNLIQAEKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGRIA
          +K  +   E +I  L +S +  ++  L   E++   LL++EE +W+ RSR  WLK  DRNTK+FH KA+ RKRKN I G+LD+N KWV  ++ +G++A
Subjt:  GSIKNAIIRKEREINTLVKSKSPDNFTNLIQAEKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGRIA

Query:  TKFIHELLNSNHPSRESIQGSIEAIDSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMNKT
          +  ++  SN  S   ++     + +KIS E  + L  PFTKE++   ++N++  KAPG DG     Y+  W  IGE+  K+CLGILN    L  +N T
Subjt:  TKFIHELLNSNHPSRESIQGSIEAIDSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMNKT

Query:  LITLIPKKNDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVEWS
        LI LIPK   P  M+ +RPISLCNV YKIVAK LA R K  L   IS+ QSAFV GR I DN I+GFE LH +  +R G    +A+KLDMSKAYDRVEW 
Subjt:  LITLIPKKNDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVEWS

Query:  FVEEIMRKMNFSERWTRK
        F+  +MR + + E W  K
Subjt:  FVEEIMRKMNFSERWTRK

A0A803PCN1 Uncharacterized protein2.4e-13242.04Show/hide
Query:  GMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHITVNSYSKGHIDVTVKEP-EWWWRFTGFYGNP
        G+GNP  ++SL   ++ H+P ++FLAET+  +   ++I+   GF++C  V+++G SGGLALLW++   IT+NS++  HID  V+    ++WRFTGFYG+P
Subjt:  GMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHITVNSYSKGHIDVTVKEP-EWWWRFTGFYGNP

Query:  DQGKRKDSWRLLERLHDSINLPWIIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLDRFFFNPTMEQKS
        D G RK SW L+ERL D    PWI GGDFNEIM  KEKKGG  K  S I +F  AISYCN  ++   G+ +TW  N  +     E+LDR F NP   +K 
Subjt:  DQGKRKDSWRLLERLHDSINLPWIIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLDRFFFNPTMEQKS

Query:  KFRRVEHLNFHHSDHRPIVLEI-GWENYGQQRLYTKKSIRFEDWWTQNEGSKLSFKEAW-DQSDEASTINFNSKIQEGLMAMSTWNKERLKGSIKNAIIR
           +V  L + +SDHRP++L         +  L  K    +E  W   E         W D S+  +      +I      ++ WNK + K  +     +
Subjt:  KFRRVEHLNFHHSDHRPIVLEI-GWENYGQQRLYTKKSIRFEDWWTQNEGSKLSFKEAW-DQSDEASTINFNSKIQEGLMAMSTWNKERLKGSIKNAIIR

Query:  KEREINTLVKSKSPDNFTNLIQAEKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGRIATKFIHELLN
         ++E+N L  S    N+ N  + EKEL     +EE  WK RSR  WL   DRNTK+FH KASQRK+KN+I G+ D N KW  K+EEI  I      +L +
Subjt:  KEREINTLVKSKSPDNFTNLIQAEKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGRIATKFIHELLN

Query:  SNHPSRESIQGSIEAIDSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMNKTLITLIPKKN
        S+ P++  +      + +++S +    L   FTKEE+++ +  ++ LKAPG DG     Y + W+ +G + +  CL +LNNN     +N TL+ LIPK  
Subjt:  SNHPSRESIQGSIEAIDSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMNKTLITLIPKKN

Query:  DPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVEWSFVEEIMRKM
        DP  +S++RP+SLCNV YK ++K LANRMK  ++ +IS+NQSAF+ GRQI DN I+GFE LH + N R G    +A+KLDMSKAYDRVEW F+EE+MR +
Subjt:  DPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVEWSFVEEIMRKM

Query:  NFSERWTRK
         + E+W  K
Subjt:  NFSERWTRK

A0A803QQ69 Uncharacterized protein1.0e-12740.13Show/hide
Query:  MKILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHITVNSYSKGHIDVTVKEP-EWWW
        M IL WNV+G+GNP  +++L   ++ ++P ++FL+ET+   +  ++I+  LGF+ C  V+++G SGGLALLW+    + V S++  HID  V+    + W
Subjt:  MKILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHITVNSYSKGHIDVTVKEP-EWWW

Query:  RFTGFYGNPDQGKRKDSWRLLERLHDSINLPWIIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLDRFF
        RFTGFYG+PD G RKDSW LLERL D +   W+ GGDFNEI+  KEKKGG  K  +L+ DF  AISYCN  ++  +G  +TW  N  +     E+LDR  
Subjt:  RFTGFYGNPDQGKRKDSWRLLERLHDSINLPWIIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLDRFF

Query:  FNPTMEQKSKFRRVEHLNFHHSDHRPIVLEIGWENYGQQRLYTKKS-IRFEDWWTQNEGSKLSFKEAW-DQSDEASTINFNSKIQEGLMAMSTWNKERLK
         NPT  +      V  L +  SDHRP++L+         R+ T +S   +E  W + E      +  W D ++  S      +I      +   NK++ K
Subjt:  FNPTMEQKSKFRRVEHLNFHHSDHRPIVLEIGWENYGQQRLYTKKS-IRFEDWWTQNEGSKLSFKEAW-DQSDEASTINFNSKIQEGLMAMSTWNKERLK

Query:  GSIKNAIIRKEREINTLVKSKSPDNFTNLIQAEKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGRIA
          ++    R + E+N L KS    ++    + E EL     ++E  WK RSR  WL   DRNTK+FH KASQRK+KN IKG+ D + +W ++D EI  I 
Subjt:  GSIKNAIIRKEREINTLVKSKSPDNFTNLIQAEKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGRIA

Query:  TKFIHELLNSNHPSRESIQGSIEAIDSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMNKT
         K+  +L  ++ P  +        + +++S +  + L   FT EE+++ +  ++ LKAPG DG     Y + W  +G++ + +CL +LN N+    +N T
Subjt:  TKFIHELLNSNHPSRESIQGSIEAIDSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMNKT

Query:  LITLIPKKNDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVEWS
        L+ LIPK  +P  + +YRP+SLCNV YK+++K LANRMK  +D +IS+NQSAF+ GRQI DN I+GFE LH L   R G    +A+KLDMSKAYDRVEW 
Subjt:  LITLIPKKNDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVEWS

Query:  FVEEIMRKMNFSERWTRK
        F+ E+M+ + + +RW  K
Subjt:  FVEEIMRKMNFSERWTRK

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.2e-2722.22Show/hide
Query:  ILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHI---TVNSYSKGH---IDVTVKEPE
        IL  NV G+ +P     L   I+  +P++  + ET        ++K + G+      + +    G+A+L  ++       +    +GH   +  ++++ E
Subjt:  ILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHI---TVNSYSKGH---IDVTVKEPE

Query:  WWWRFTGFYGNPDQGKRKDSWRLLERLHDSINLPWIIGGDFNE--IMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFS----GDRYTWSRNKYEKEA
                Y  P+ G  +   ++L  L   ++   +I GDFN    + ++  +    K    +N    A+   +LID+  +       YT+    +    
Subjt:  WWWRFTGFYGNPDQGKRKDSWRLLERLHDSINLPWIIGGDFNE--IMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFS----GDRYTWSRNKYEKEA

Query:  TKERLDRFFFNPTMEQKSKFRRVEHLNFHHSDHRPIVLEIGWENYGQQRLYTKK--SIRFEDWWTQNEGSKLSFKEAWDQSDEASTI------NFNSKIQ
        T  ++D    +  +   SK +R E +  + SDH  I LE+  +N  Q R  T K  ++   D+W  NE  K   K  ++ ++   T        F +  +
Subjt:  TKERLDRFFFNPTMEQKSKFRRVEHLNFHHSDHRPIVLEIGWENYGQQRLYTKK--SIRFEDWWTQNEGSKLSFKEAWDQSDEASTI------NFNSKIQ

Query:  EGLMAMSTWNKERLKGSI---KNAIIRKEREINTLVKSKSPDNFTNLIQAEKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKR-KNEIKG
           +A++ + +++ +  I    + +   E++  T  K+      T +    KE    +E ++   KI     W            A+  ++KR KN+I  
Subjt:  EGLMAMSTWNKERLKGSI---KNAIIRKEREINTLVKSKSPDNFTNLIQAEKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKR-KNEIKG

Query:  ILDTNGKWVEKDEEIGRIATKFIHELLNSNHPSRESIQGSIEAID-SKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDT
        I +  G       EI     ++   L  +   + E +   ++     ++++E+ + L+ P T  EI  ++ ++   K+PGPDG  A+ YQ   + +    
Subjt:  ILDTNGKWVEKDEEIGRIATKFIHELLNSNHPSRESIQGSIEAID-SKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDT

Query:  VKICLGILNNNESLEPMNKTLITLIPKK-NDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRG
        +K+   I           +  I LIPK   D      +RPISL N+  KI+ K LANR+++ +  +I  +Q  F+PG Q   N+      +  +N  R  
Subjt:  VKICLGILNNNESLEPMNKTLITLIPKK-NDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRG

Query:  KTGHVAIKLDMSKAYDRVEWSFVEEIMRKM
           HV I +D  KA+D+++  F+ + + K+
Subjt:  KTGHVAIKLDMSKAYDRVEWSFVEEIMRKM

P08548 LINE-1 reverse transcriptase homolog1.3e-2623.41Show/hide
Query:  MKILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQL-GFNNCINVSSRGNSGGLALLWQNQHHITVNSYSK---GHIDVTVKEPE
        + I   NV G+  P     L   I+K  P I  + E  ++    DK + ++ G+++    + +    G+A+L+ +          K   GH  + VK   
Subjt:  MKILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQL-GFNNCINVSSRGNSGGLALLWQNQHHITVNSYSK---GHIDVTVKEPE

Query:  WWWRFT--GFYGNPDQGKRKDSWRLLERLHDSINLPWIIGGDFN---EIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYE----
         +   +    Y  P+    +     L  + + I+   I+ GDFN    ++    KK    K    I D    I + +L D+       T+  NK E    
Subjt:  WWWRFT--GFYGNPDQGKRKDSWRLLERLHDSINLPWIIGGDFN---EIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYE----

Query:  --KEATKERLDRFFFNPTMEQKSKFRRVEHLNFHHSDHRPIVLEIGWENYGQQRLYTK----KSIRFEDWWT-------------QNEGSKLSFKEAWDQ
             T  ++D    + +    SKF+++E +    SDH  I +E+   N      +TK     ++  +D W              QN     +++  WD 
Subjt:  --KEATKERLDRFFFNPTMEQKSKFRRVEHLNFHHSDHRPIVLEIGWENYGQQRLYTK----KSIRFEDWWT-------------QNEGSKLSFKEAWDQ

Query:  SDEASTINFNSKIQEGLMAMSTWNKERLKGSIKNAIIRKEREINTLVKSKSPDNFTNLIQAEKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKAS
        +       F + +Q  L          L G +K   + KE   N       P     + +   EL + +E +    +I   + W   +        A  +
Subjt:  SDEASTINFNSKIQEGLMAMSTWNKERLKGSIKNAIIRKEREINTLVKSKSPDNFTNLIQAEKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKAS

Query:  QRKR-KNEIKGILDTNGKWVEKDEEIGRIATKFIHELLNSNHPSRESIQGSIEAID-SKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLY
        ++KR K+ I  I + N +      EI +I  ++  +L +  + + + I   +EA    ++S+++ + L+ P +  EI   ++N+   K+PGPDG  ++ Y
Subjt:  QRKR-KNEIKGILDTNGKWVEKDEEIGRIATKFIHELLNSNHPSRESIQGSIEAID-SKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLY

Query:  QSLWDTIGEDTVKICLGILNNNESLEPMNKTL----ITLIPKK-NDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVI
        Q    T  E+ V I L +  N E    +  T     ITLIPK   DP     YRPISL N+  KI+ K L NR+++ +  II  +Q  F+PG Q   N+ 
Subjt:  QSLWDTIGEDTVKICLGILNNNESLEPMNKTL----ITLIPKK-NDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVI

Query:  MGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVEWSFVEEIMRKM
             +  +N  +     H+ + +D  KA+D ++  F+   ++K+
Subjt:  MGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVEWSFVEEIMRKM

P11369 LINE-1 retrotransposable element ORF2 protein1.8e-2025.13Show/hide
Query:  KKSIRFEDWWTQNEGSKLSFKEAWDQSDEASTINFNSKIQEGLMAMSTWNKERLKGSIKNAIIR----KEREINTLVKSKSPD-----NFTNLIQAEKEL
        KK I+  D+   NE    ++   WD           + ++  L+A+S   K+R      +        +++E N+  +S+  +        N ++  + +
Subjt:  KKSIRFEDWWTQNEGSKLSFKEAWDQSDEASTINFNSKIQEGLMAMSTWNKERLKGSIKNAIIR----KEREINTLVKSKSPD-----NFTNLIQAEKEL

Query:  EKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGRIATKFIHELLNSNHPSRESIQGSIEAID-SKISEEQKQ
        +++ +    +++  ++ D  K   R TK    K    K +NE KG + T+       EEI      F   L ++   + + +   ++     K++++Q  
Subjt:  EKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGRIATKFIHELLNSNHPSRESIQGSIEAID-SKISEEQKQ

Query:  RLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMNKTL----ITLIPK-KNDPKSMSEYRPISLCNVSYKIV
         L+ P + +EIE V+ ++   K+PGPDG  A+ YQ    T  ED + I   + +  E    +  +     ITLIPK + DP  +  +RPISL N+  KI+
Subjt:  RLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMNKTL----ITLIPK-KNDPKSMSEYRPISLCNVSYKIV

Query:  AKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVEWSFVEEIMRK
         K LANR+++ + +II  +Q  F+PG Q   N+      +H +N  +     H+ I LD  KA+D+++  F+ +++ +
Subjt:  AKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVEWSFVEEIMRK

P14381 Transposon TX1 uncharacterized 149 kDa protein2.5e-3024.9Show/hide
Query:  IIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGD----RYTWSRNKYEKEATKERLDRFFFNPTMEQKSKFRRVEHLNFHHSDHRPIV
        IIGGDFN  + +   +    K  S  +   + I++ +L+DV    +     +T+ R + +   ++ R+DR + +  +  +++   +    F  SDH  + 
Subjt:  IIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGD----RYTWSRNKYEKEATKERLDRFFFNPTMEQKSKFRRVEHLNFHHSDHRPIV

Query:  LEIGWENYGQQRLYTKKSIRFEDWWTQNEGSKLSFKEAWDQ----SDEASTINFNSKIQEGLMAMSTWNKERLKGSIKNAIIRKEREINTLVKSKSPDNF
        L +       +  Y      F +   ++EG   S ++ W       DE +T+N    + +  + +      +     +NA I         ++ +   + 
Subjt:  LEIGWENYGQQRLYTKKSIRFEDWWTQNEGSKLSFKEAWDQ----SDEASTINFNSKIQEGLMAMSTWNKERLKGSIKNAIIRKEREINTLVKSKSPDNF

Query:  TNLIQAE----KELEKLLEEEECYWK-IRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGRIATKFIHELLNSNHPSRESIQGS
           +Q E    KE  + +E+ +     +RSR   L   DR +++F+A   ++  + +I  +   +G  +E  E I   A  F   L + +  S ++ +  
Subjt:  TNLIQAE----KELEKLLEEEECYWK-IRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGRIATKFIHELLNSNHPSRESIQGS

Query:  IEAIDSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMNKTLITLIPKKNDPKSMSEYRPIS
         + +   +SE +K+RL+ P T +E+ Q ++ M   K+PG DG   + +Q  WDT+G D  ++        E      + +++L+PKK D + +  +RP+S
Subjt:  IEAIDSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMNKTLITLIPKKNDPKSMSEYRPIS

Query:  LCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVEWSFVEEIMRKMNFSERW
        L +  YKIVAKA++ R+K VL  +I  +QS  VPGR I DNV +  + LH     RR       + LD  KA+DRV+  ++   ++  +F  ++
Subjt:  LCNVSYKIVAKALANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVEWSFVEEIMRKMNFSERW

Arabidopsis top hitse value%identityAlignment
AT1G40390.1 DNAse I-like superfamily protein4.9e-0529.9Show/hide
Query:  QGKRKDSWRLLERLHDS---INLPWIIGGDFNEIMFNKEKKGGTPKPVSL--INDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLDRFFFN
        + +R+  W  + RL  S    N PW++ GDFN+I    E     P  +SL  + D    +   +L+D+   G  YTWS ++ +    + +LDR   N
Subjt:  QGKRKDSWRLLERLHDS---INLPWIIGGDFNEIMFNKEKKGGTPKPVSL--INDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLDRFFFN

AT1G43760.1 DNAse I-like superfamily protein1.0e-1823.7Show/hide
Query:  IIGGDFNEIMFNKEKKG--GTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLDRFFFNPTMEQKSKFRRVEHLNFHHSDHRPIVLE
        I+ GDF++I    +      T  P+  + +F + +   +L+D+   G  YTWS N  +      +LDR   N                   SDH P ++ 
Subjt:  IIGGDFNEIMFNKEKKG--GTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLDRFFFNPTMEQKSKFRRVEHLNFHHSDHRPIVLE

Query:  IGWENYGQQRLYTKKSIRFEDWWTQNEGSKLSFKEAWDQSDEASTINFN--SKIQEGLMAMSTWNKE---RLKGSIKNAIIRKEREINTLVKSKSPDNFT
        +  EN  ++   +KK  R+  + + +    +S   AW++     +  F+    ++         N++    ++   K A+   E   + L+ + S   F 
Subjt:  IGWENYGQQRLYTKKSIRFEDWWTQNEGSKLSFKEAWDQSDEASTINFN--SKIQEGLMAMSTWNKE---RLKGSIKNAIIRKEREINTLVKSKSPDNFT

Query:  NLIQAEKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGRIATKFIHELLNSNHP--SRESIQGSIEAI
            A K+        E +++ +SR  WL+  D NT++FH      + KN IK +   +   VE   ++  +   +   LL S+    + +S+Q   +  
Subjt:  NLIQAEKELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGRIATKFIHELLNSNHP--SRESIQGSIEAI

Query:  DSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMNKTLITLIPKKNDPKSMSEYRPISLCNV
          + ++    RL    + +EI   V  M   KAPGPD   A+ +   W  + + T+            L+  N T ITLIPK      +S +RP+S C V
Subjt:  DSKISEEQKQRLDGPFTKEEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMNKTLITLIPKKNDPKSMSEYRPISLCNV

Query:  SYKIV
         YKI+
Subjt:  SYKIV

AT4G20520.1 RNA binding;RNA-directed DNA polymerases4.1e-1235.37Show/hide
Query:  LANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVEWSFVEEIMRKMNFSERW
        +  R+K ++ ++I   Q++F+PGR  TDN++   E +H++  K +G  G + +KLD+ KAYDR+ W ++E+ +    F E W
Subjt:  LANRMKKVLDSIISQNQSAFVPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVEWSFVEEIMRKMNFSERW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATCCTATGCTGGAACGTCCGAGGGATGGGGAATCCTCGGGCGGTCCGATCTTTGCGGCATGAGATCCGCAAGCACAACCCCACAATAGTGTTTTTGGCTGAAAC
AAAAAATTACAAGCTACCAGCAGACAAGATCAAGAGACAACTGGGCTTCAACAATTGTATTAATGTCAGCAGTAGAGGAAACAGTGGAGGACTAGCATTACTTTGGCAAA
ACCAGCACCATATCACGGTCAACTCTTATTCTAAGGGACACATTGATGTCACTGTCAAAGAGCCCGAATGGTGGTGGCGTTTCACCGGATTCTACGGGAACCCAGACCAA
GGCAAGAGAAAGGACTCCTGGAGATTGCTCGAAAGATTGCACGACTCAATCAACCTTCCATGGATCATTGGAGGAGATTTTAATGAAATCATGTTCAACAAAGAGAAAAA
AGGGGGAACTCCTAAACCTGTCTCTTTAATTAATGATTTTTGTGATGCTATTAGCTATTGCAATCTTATTGACGTTGGTTTTTCCGGTGACAGGTACACGTGGTCTAGAA
ATAAATACGAAAAGGAGGCCACAAAGGAAAGGCTAGACCGTTTCTTCTTTAATCCCACTATGGAGCAAAAATCCAAATTCAGGAGAGTGGAACACCTAAACTTCCACCAC
TCTGACCATAGGCCCATTGTGTTGGAGATTGGTTGGGAGAATTATGGCCAGCAAAGATTGTACACCAAGAAAAGTATACGGTTTGAAGATTGGTGGACTCAAAACGAAGG
AAGCAAGTTGTCCTTCAAGGAAGCTTGGGACCAATCGGATGAAGCCTCCACCATTAATTTCAACAGCAAAATCCAAGAAGGCCTTATGGCTATGAGCACGTGGAACAAAG
AAAGATTAAAAGGGTCAATCAAGAATGCCATAATAAGAAAGGAAAGAGAAATCAACACCTTGGTGAAATCAAAGAGCCCAGACAACTTCACCAACCTCATCCAGGCAGAG
AAGGAGTTGGAAAAGCTCTTGGAAGAGGAGGAATGCTACTGGAAAATTCGTTCCAGGGAAGATTGGCTCAAAAGCGAGGATCGAAACACTAAGTGGTTCCATGCCAAGGC
GTCACAGAGAAAGAGGAAGAATGAGATAAAAGGGATCCTAGACACGAATGGTAAATGGGTAGAAAAAGATGAAGAGATTGGTAGAATCGCTACCAAGTTTATCCATGAGC
TCCTAAACTCAAACCATCCAAGCAGAGAAAGCATCCAAGGGTCAATTGAAGCCATAGACTCAAAGATATCGGAAGAGCAAAAACAAAGGTTGGATGGCCCATTCACCAAG
GAGGAGATAGAGCAGGTCGTAAAAAATATGAACCTCCTAAAAGCTCCAGGTCCTGATGGAGCCCATGCCAAGCTCTATCAAAGCCTTTGGGATACGATTGGCGAAGACAC
AGTGAAAATCTGTCTGGGGATCTTAAACAACAACGAAAGCCTGGAACCAATGAACAAGACTCTCATAACTCTTATCCCTAAAAAGAACGACCCAAAATCTATGAGCGAGT
ATCGTCCTATAAGCTTATGCAATGTCAGTTACAAAATAGTGGCAAAAGCTCTTGCGAACAGGATGAAAAAGGTGTTAGACTCCATCATATCTCAGAATCAATCGGCTTTT
GTCCCGGGCAGACAAATAACTGACAACGTCATTATGGGATTTGAATGCCTGCACACGCTCAATAACAAAAGGAGAGGAAAGACAGGGCATGTGGCAATTAAATTAGACAT
GAGTAAGGCGTACGATAGAGTTGAATGGAGTTTTGTGGAGGAAATCATGAGGAAAATGAACTTTAGTGAAAGGTGGACTCGAAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAATCCTATGCTGGAACGTCCGAGGGATGGGGAATCCTCGGGCGGTCCGATCTTTGCGGCATGAGATCCGCAAGCACAACCCCACAATAGTGTTTTTGGCTGAAAC
AAAAAATTACAAGCTACCAGCAGACAAGATCAAGAGACAACTGGGCTTCAACAATTGTATTAATGTCAGCAGTAGAGGAAACAGTGGAGGACTAGCATTACTTTGGCAAA
ACCAGCACCATATCACGGTCAACTCTTATTCTAAGGGACACATTGATGTCACTGTCAAAGAGCCCGAATGGTGGTGGCGTTTCACCGGATTCTACGGGAACCCAGACCAA
GGCAAGAGAAAGGACTCCTGGAGATTGCTCGAAAGATTGCACGACTCAATCAACCTTCCATGGATCATTGGAGGAGATTTTAATGAAATCATGTTCAACAAAGAGAAAAA
AGGGGGAACTCCTAAACCTGTCTCTTTAATTAATGATTTTTGTGATGCTATTAGCTATTGCAATCTTATTGACGTTGGTTTTTCCGGTGACAGGTACACGTGGTCTAGAA
ATAAATACGAAAAGGAGGCCACAAAGGAAAGGCTAGACCGTTTCTTCTTTAATCCCACTATGGAGCAAAAATCCAAATTCAGGAGAGTGGAACACCTAAACTTCCACCAC
TCTGACCATAGGCCCATTGTGTTGGAGATTGGTTGGGAGAATTATGGCCAGCAAAGATTGTACACCAAGAAAAGTATACGGTTTGAAGATTGGTGGACTCAAAACGAAGG
AAGCAAGTTGTCCTTCAAGGAAGCTTGGGACCAATCGGATGAAGCCTCCACCATTAATTTCAACAGCAAAATCCAAGAAGGCCTTATGGCTATGAGCACGTGGAACAAAG
AAAGATTAAAAGGGTCAATCAAGAATGCCATAATAAGAAAGGAAAGAGAAATCAACACCTTGGTGAAATCAAAGAGCCCAGACAACTTCACCAACCTCATCCAGGCAGAG
AAGGAGTTGGAAAAGCTCTTGGAAGAGGAGGAATGCTACTGGAAAATTCGTTCCAGGGAAGATTGGCTCAAAAGCGAGGATCGAAACACTAAGTGGTTCCATGCCAAGGC
GTCACAGAGAAAGAGGAAGAATGAGATAAAAGGGATCCTAGACACGAATGGTAAATGGGTAGAAAAAGATGAAGAGATTGGTAGAATCGCTACCAAGTTTATCCATGAGC
TCCTAAACTCAAACCATCCAAGCAGAGAAAGCATCCAAGGGTCAATTGAAGCCATAGACTCAAAGATATCGGAAGAGCAAAAACAAAGGTTGGATGGCCCATTCACCAAG
GAGGAGATAGAGCAGGTCGTAAAAAATATGAACCTCCTAAAAGCTCCAGGTCCTGATGGAGCCCATGCCAAGCTCTATCAAAGCCTTTGGGATACGATTGGCGAAGACAC
AGTGAAAATCTGTCTGGGGATCTTAAACAACAACGAAAGCCTGGAACCAATGAACAAGACTCTCATAACTCTTATCCCTAAAAAGAACGACCCAAAATCTATGAGCGAGT
ATCGTCCTATAAGCTTATGCAATGTCAGTTACAAAATAGTGGCAAAAGCTCTTGCGAACAGGATGAAAAAGGTGTTAGACTCCATCATATCTCAGAATCAATCGGCTTTT
GTCCCGGGCAGACAAATAACTGACAACGTCATTATGGGATTTGAATGCCTGCACACGCTCAATAACAAAAGGAGAGGAAAGACAGGGCATGTGGCAATTAAATTAGACAT
GAGTAAGGCGTACGATAGAGTTGAATGGAGTTTTGTGGAGGAAATCATGAGGAAAATGAACTTTAGTGAAAGGTGGACTCGAAAGTGA
Protein sequenceShow/hide protein sequence
MKILCWNVRGMGNPRAVRSLRHEIRKHNPTIVFLAETKNYKLPADKIKRQLGFNNCINVSSRGNSGGLALLWQNQHHITVNSYSKGHIDVTVKEPEWWWRFTGFYGNPDQ
GKRKDSWRLLERLHDSINLPWIIGGDFNEIMFNKEKKGGTPKPVSLINDFCDAISYCNLIDVGFSGDRYTWSRNKYEKEATKERLDRFFFNPTMEQKSKFRRVEHLNFHH
SDHRPIVLEIGWENYGQQRLYTKKSIRFEDWWTQNEGSKLSFKEAWDQSDEASTINFNSKIQEGLMAMSTWNKERLKGSIKNAIIRKEREINTLVKSKSPDNFTNLIQAE
KELEKLLEEEECYWKIRSREDWLKSEDRNTKWFHAKASQRKRKNEIKGILDTNGKWVEKDEEIGRIATKFIHELLNSNHPSRESIQGSIEAIDSKISEEQKQRLDGPFTK
EEIEQVVKNMNLLKAPGPDGAHAKLYQSLWDTIGEDTVKICLGILNNNESLEPMNKTLITLIPKKNDPKSMSEYRPISLCNVSYKIVAKALANRMKKVLDSIISQNQSAF
VPGRQITDNVIMGFECLHTLNNKRRGKTGHVAIKLDMSKAYDRVEWSFVEEIMRKMNFSERWTRK