; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032165 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032165
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr11:26475985..26478556
RNA-Seq ExpressionLag0032165
SyntenyLag0032165
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR005135 - Endonuclease/exonuclease/phosphatase
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
MCH80348.1 hypothetical protein [Trifolium medium]2.2e-4825.91Show/hide
Query:  WNVVDRVTIKKAGENLKWAEALGNVVGVFEKVDFQELLRRGTVIR----------------IGKNAEEEWIDIKFEKLPDFCYACGILGHLARECGAPEM
        W  V  + +K   E +  A+ LGN VG FE++DF+E+ R G  +R                +    +E W+D K+E+LP+FC+ACG +GH  R+C   E 
Subjt:  WNVVDRVTIKKAGENLKWAEALGNVVGVFEKVDFQELLRRGTVIR----------------IGKNAEEEWIDIKFEKLPDFCYACGILGHLARECGAPEM

Query:  NN--------KEKLPYGPWLRRESIPKGKISNENRATQSSPVKDQRERSTRGESSWEIPARMKFQRWEGNKTNRTGSWRRSSPERWRKPPEKEESRTKKT
        ++        +++  +GPWLR   +PK           S  VK         ESS    ++  F     +K   +G+ +    E  ++    ++   +K 
Subjt:  NN--------KEKLPYGPWLRRESIPKGKISNENRATQSSPVKDQRERSTRGESSWEIPARMKFQRWEGNKTNRTGSWRRSSPERWRKPPEKEESRTKKT

Query:  ASLEWAKENKDKKKEDQHLSREGQ---------PKGPKSDIGQGKDLGQNYQKGKGQNNTNYPDMVQALSQRDVFSYVIGIARTQEQNNEIRDLATKTVD
             +K+ +    + Q++ +E +             ++ IG+G    Q  ++GKG+         + + Q+        + + + Q N       KTV 
Subjt:  ASLEWAKENKDKKKEDQHLSREGQ---------PKGPKSDIGQGKDLGQNYQKGKGQNNTNYPDMVQALSQRDVFSYVIGIARTQEQNNEIRDLATKTVD

Query:  RKSGKSWKRRAREQQMQDKKNEELSQGTPSDAMKLLIWN-VRGVGNPRTIRSLRHVVRKHNPTIVFLSETKDRNPSSDSLKRKLGFSNSFNVGSEGM---
           GK  KR   +  + D   E +  G       +++ + V   G+PR +R+L  + R  NP +VFL ET+ +    ++++ KLGF N   V   G    
Subjt:  RKSGKSWKRRAREQQMQDKKNEELSQGTPSDAMKLLIWN-VRGVGNPRTIRSLRHVVRKHNPTIVFLSETKDRNPSSDSLKRKLGFSNSFNVGSEGM---

Query:  -SGGLSLLWQNNHAINISSFSKGHIDVIIKEVDGW--WRFTGFYGNPNQNRRKESWQLLERLKESSKLPWIVGGDFNEIM--------------------
         +GGL+L+W  + ++NISSFS  HI    ++ +    W  TG YG P ++ ++++W L+  L   ++  W+  GDFN+I+                    
Subjt:  -SGGLSLLWQNNHAINISSFSKGHIDVIIKEVDGW--WRFTGFYGNPNQNRRKESWQLLERLKESSKLPWIVGGDFNEIM--------------------

Query:  -----------------FTWSRNKNNFEATKERLDRYFINSEMMSKATRSKVEHLKFHHSDHRSILLDISWEKFQKQLTASKRIIKFEESWVAHEGSKKA
                         FTWS  +   E  + RLDR   N+E +++ +  KV HL+   SDH ++++ +            +R+ +FEESW      ++ 
Subjt:  -----------------FTWSRNKNNFEATKERLDRYFINSEMMSKATRSKVEHLKFHHSDHRSILLDISWEKFQKQLTASKRIIKFEESWVAHEGSKKA

Query:  LADAWNDTAEATNFNFNMKMQEGITAMNRWNRVRLKGSLKEAIRVKEREINNLSNLSSQEAFTKMVKAERDLERLLEEKESYWKIRSRENWLKGGDINTK
        +   W      +  +   K++   +  N +    L    KE IR+++R  +      S E   +  + E++   LL+ +E+ W+ RSR  WLK GD NTK
Subjt:  LADAWNDTAEATNFNFNMKMQEGITAMNRWNRVRLKGSLKEAIRVKEREINNLSNLSSQEAFTKMVKAERDLERLLEEKESYWKIRSRENWLKGGDINTK

Query:  WFHSKASHRRKRSEIKGI
        +FH+KAS R K + IK I
Subjt:  WFHSKASHRRKRSEIKGI

XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia]1.7e-5342.12Show/hide
Query:  MKLLIWNVRGVGNPRTIRSLRHVVRKHNPTIVFLSETKDRNPSSDSLKRKLGFSNSFNVGSEGMSGGLSLLWQNNHAINISSFSKGHIDVIIKEVDGWWR
        MK L WNV G+GNP T R+LR++VR+  P +VFLSETK         KR+L F    +V S G SGGL LLW ++  + I S S GHID II +  G WR
Subjt:  MKLLIWNVRGVGNPRTIRSLRHVVRKHNPTIVFLSETKDRNPSSDSLKRKLGFSNSFNVGSEGMSGGLSLLWQNNHAINISSFSKGHIDVIIKEVDGWWR

Query:  FTGFYGNPNQNRRKESWQLLERLKESSKLPWIVGGDFNEI------MFTWSRNKNNFEATK--ERLDRYFINSEMMSKATRSKVEHLKFHHSDHRSILLD
        FTGFYGNP   +R  SW+LLERL     LPWI+GGDFNEI      M    RN++        ERLDR+ IN  M++K    KV HL+   SDHR IL  
Subjt:  FTGFYGNPNQNRRKESWQLLERLKESSKLPWIVGGDFNEI------MFTWSRNKNNFEATK--ERLDRYFINSEMMSKATRSKVEHLKFHHSDHRSILLD

Query:  ISWEKFQKQLTASKRIIKFEESWVAHEGSKKALADAWNDTAEATNFNFNMKMQEGITAMNRWNRVRLKGSLKEAIRVKEREINNLSNL--SSQEAFTKMV
          +E  +      +R I+FEESW+  +G +  +   W          F  K+   ++ +N WN++RL  SLK AI  KE+E+  L  L   SQ A     
Subjt:  ISWEKFQKQLTASKRIIKFEESWVAHEGSKKALADAWNDTAEATNFNFNMKMQEGITAMNRWNRVRLKGSLKEAIRVKEREINNLSNL--SSQEAFTKMV

Query:  KAERDLERLLE
            ++E  L+
Subjt:  KAERDLERLLE

XP_028090832.1 uncharacterized protein LOC114291041 [Camellia sinensis]7.8e-5434.72Show/hide
Query:  MKLLIWNVRGVGNPRTIRSLRHVVRKHNPTIVFLSETKDRNPSSDSLKRKLGFSNSFNVGSEGMSGGLSLLWQNNHAINISSFSKGHIDVIIKEVDGW--
        M +L WN RG+GNPR++RSL+ +++     +VFL ETK  +   D +++KLGF   F V   G++GGL+LLW     +++ S S GHIDV +    G   
Subjt:  MKLLIWNVRGVGNPRTIRSLRHVVRKHNPTIVFLSETKDRNPSSDSLKRKLGFSNSFNVGSEGMSGGLSLLWQNNHAINISSFSKGHIDVIIKEVDGW--

Query:  WRFTGFYGNPNQNRRKESWQLLERLKESSKLPWIVGGDFNEIM-------------------------------------FTWSRNKNNFEATKERLDRY
        WRF GFYG+P   +RK SW LL RL    ++PW+  GDFNEI+                                     FTWS N+      +ERLDR 
Subjt:  WRFTGFYGNPNQNRRKESWQLLERLKESSKLPWIVGGDFNEIM-------------------------------------FTWSRNKNNFEATKERLDRY

Query:  FINSEMMSKATRSKVEHLKFHHSDHRSILLDISWEKFQKQ-LTASKRIIKFEESWVAHEGSKKALADAWNDTAEATNFNFNMKMQEGITAMNRWNRVRLK
        F N         +KV  +    S+H  I +DI   K Q+Q L +  R+ +FE  W+ H   ++ +AD W  ++ A   +    +    +++ RW+R  + 
Subjt:  FINSEMMSKATRSKVEHLKFHHSDHRSILLDISWEKFQKQ-LTASKRIIKFEESWVAHEGSKKALADAWNDTAEATNFNFNMKMQEGITAMNRWNRVRLK

Query:  GSLKEAIRVKEREINNLSNLSSQ--EAFTKMVKAERDLERLLEEKESYWKIRSRENWLKGGDINTKWFHSKASHRRKRSEIKGIFN
        G +K  ++ K  ++  L  LSS   +   +M K   D++ LLE++  +W  R+R NWLK GD NT +FHSKA+ R K+ +I GI +
Subjt:  GSLKEAIRVKEREINNLSNLSSQ--EAFTKMVKAERDLERLLEEKESYWKIRSRENWLKGGDINTKWFHSKASHRRKRSEIKGIFN

XP_030495126.1 uncharacterized protein LOC115710915 [Cannabis sativa]9.5e-5235.06Show/hide
Query:  MKLLIWNVRGVGNPRTIRSLRHVVRKHNPTIVFLSETKDRNPSSDSLKRKLGFSNSFNVGSEGMSGGLSLLWQNNHAINISSFSKGHIDV-IIKEVDGWW
        M  L WNV+G+GNP T  +L  +V+ H P +VFLSET+ ++   +S++ +LG+   F V ++G SGGL+LLW+    ++++SF+  HID  I KE D  W
Subjt:  MKLLIWNVRGVGNPRTIRSLRHVVRKHNPTIVFLSETKDRNPSSDSLKRKLGFSNSFNVGSEGMSGGLSLLWQNNHAINISSFSKGHIDV-IIKEVDGWW

Query:  RFTGFYGNPNQNRRKESWQLLERLKESSKLPWIVGGDFNEI-------------------------------------MFTWSRNKNNFEATKERLDRYF
        RFTGFY +P+ ++RK SWQLL+R+  +   PW+ GGDFNEI                                      FTW  N        ERLDR  
Subjt:  RFTGFYGNPNQNRRKESWQLLERLKESSKLPWIVGGDFNEI-------------------------------------MFTWSRNKNNFEATKERLDRYF

Query:  INSEMMSKATRSKVEHLKFHHSDHRSILLDISWEKF--QKQLTASKRIIKFEESWVAHEGSKKALADAWNDTAEATNFNFNMKMQEGI-TAMNRWNRVRL
        +N+      + +KV+HL    SDH  +LL  +      QK+     R   +E++W   E  ++ + D W +    T+     ++     T +++WN+V+ 
Subjt:  INSEMMSKATRSKVEHLKFHHSDHRSILLDISWEKF--QKQLTASKRIIKFEESWVAHEGSKKALADAWNDTAEATNFNFNMKMQEGI-TAMNRWNRVRL

Query:  KGSLKEAIRVKEREINNLSNLSSQEAFTKMVKAERDLERLLEEKESYWKIRSRENWLKGGDINTKWFHSKASHRRKRSEIKGIFN
        K +L + I+  ++E+   SN  S + FTK+   E+DL   L ++E +WK RSR  WL  GD NT++FH KA+ RRK++ I G+F+
Subjt:  KGSLKEAIRVKEREINNLSNLSSQEAFTKMVKAERDLERLLEEKESYWKIRSRENWLKGGDINTKWFHSKASHRRKRSEIKGIFN

XP_030967653.1 uncharacterized protein LOC115988147 [Quercus lobata]2.6e-4934.11Show/hide
Query:  MKLLIWNVRGVGNPRTIRSLRHVVRKHNPTIVFLSETKDRNPSSDSLKRKLGFSNSFNVGSEGMSGGLSLLWQNNHAINISSFSKGHIDVIIKEVDG--W
        M  L WN +G+GNPRT+ +LR  +R+ NP +VFLSETK  +   + +K KLGFSN   V S G  GGL+LLW ++  + I S+S  HID +I E      
Subjt:  MKLLIWNVRGVGNPRTIRSLRHVVRKHNPTIVFLSETKDRNPSSDSLKRKLGFSNSFNVGSEGMSGGLSLLWQNNHAINISSFSKGHIDVIIKEVDG--W

Query:  WRFTGFYGNPNQNRRKESWQLLERLKESSKLPWIVGGDFNEIM-------------------------------------FTWSRNKNNFEATKERLDRY
        WRFTGFYG+P  + R++SW+LL  L      PW   GDFNEI+                                     FTW   K        RLDR 
Subjt:  WRFTGFYGNPNQNRRKESWQLLERLKESSKLPWIVGGDFNEIM-------------------------------------FTWSRNKNNFEATKERLDRY

Query:  FINSEMMSKATRSKVEHLKFHHSDHRSILLDISWEKFQKQLTASKRIIKFEESWVAHEGSKKALADAWND-TAEATNFNFNMKMQEGITAMNRWNRVRLK
        F NSE ++      V HL    SDH  ++         K+     R   FE  W   +  ++ +  AWN  T   T       +Q    A++ WN+  + 
Subjt:  FINSEMMSKATRSKVEHLKFHHSDHRSILLDISWEKFQKQLTASKRIIKFEESWVAHEGSKKALADAWND-TAEATNFNFNMKMQEGITAMNRWNRVRLK

Query:  GSLKEAIRVKEREINNLSNLSSQEAFTKMVKAERDLERLLEEKESYWKIRSRENWLKGGDINTKWFHSKASHRRKRSEIKGIFN
        G++ + I+ K+R +++LS         ++ +  R++  LL+ +E  W+ RS+ +W K GD NTK+FH+ AS RRK++ I G++N
Subjt:  GSLKEAIRVKEREINNLSNLSSQEAFTKMVKAERDLERLLEEKESYWKIRSRENWLKGGDINTKWFHSKASHRRKRSEIKGIFN

TrEMBL top hitse value%identityAlignment
A0A2N9ESV7 Uncharacterized protein2.9e-5426.49Show/hide
Query:  KIWNVVDRVTIKKAGENLKWAEALGNVVGVFEKVD--------------------FQELLRRGTVIRIGKNAEEEWIDIKFEKLPDFCYACGILGHLARE
        ++W V   +  ++ G      EA+G  +G F KVD                     + LLR G V  +  + +  W+  K+E+L  FC+ CG++GH   +
Subjt:  KIWNVVDRVTIKKAGENLKWAEALGNVVGVFEKVD--------------------FQELLRRGTVIRIGKNAEEEWIDIKFEKLPDFCYACGILGHLARE

Query:  CG-APEMNNKEKLPYGPWLRRESIPKGKISNENRATQSSPVKDQRERSTRGESSWEIPARMKFQRWEGNKTNRTGSWRRSSPERWRKPPEKEESRTKKTA
        C   P+  +   LPYG WL        K  +  R  Q  P      R T   ++   P            +N       +SPE    PP  + +     +
Subjt:  CG-APEMNNKEKLPYGPWLRRESIPKGKISNENRATQSSPVKDQRERSTRGESSWEIPARMKFQRWEGNKTNRTGSWRRSSPERWRKPPEKEESRTKKTA

Query:  SLEWAKENKDKKKEDQH-------LSREGQP-----KGPKSDIGQGK-DLGQNYQKGKGQNNTNYPDMVQALSQRDVFSYVIGIARTQEQNNEIRDLATK
          +   +  +    + H       L++ GQP      GP   I   +  LG       G    + P  + +  +         I       N+  ++ TK
Subjt:  SLEWAKENKDKKKEDQH-------LSREGQP-----KGPKSDIGQGK-DLGQNYQKGKGQNNTNYPDMVQALSQRDVFSYVIGIARTQEQNNEIRDLATK

Query:  TVDRKSGKSWKRRAREQQMQDKKNEELSQGTPSDAMKLLIWNVRGVGNPRTIRSLRHVVRKHNPTIVFLSETKDRNPSSDSLKRKLGFSNSFNVGSEGMS
        T   K+ KSWKR        +K+   ++   P   M ++ WN +G+GNP+ IR LR++ ++ +P ++FL ETK      + ++  +GF   F V S+G S
Subjt:  TVDRKSGKSWKRRAREQQMQDKKNEELSQGTPSDAMKLLIWNVRGVGNPRTIRSLRHVVRKHNPTIVFLSETKDRNPSSDSLKRKLGFSNSFNVGSEGMS

Query:  GGLSLLWQNNHAINISSFSKGHIDV-IIKEVDGWWRFTGFYGNPNQNRRKESWQLLERLKESSKLPWIVGGDFNEIM-----------------------
        GGL+L+W+++  + + +FS+ H+DV +  + + WWR TGFYG+P  ++R E+W+LL  L   ++ PW+  GDFNEI+                       
Subjt:  GGLSLLWQNNHAINISSFSKGHIDV-IIKEVDGWWRFTGFYGNPNQNRRKESWQLLERLKESSKLPWIVGGDFNEIM-----------------------

Query:  --------------FTWSRNKNNFEATKERLDRYFINSEMMSKATRSKVEHLKFHHSDHRSILLDISWEKFQKQLTASKRIIKFEESWVAHEGSKKALAD
                      +TW+ N++     +ERLDR    SE ++     +  H+    SDH +I +D  + K   Q    K + +FEE W      ++ +A 
Subjt:  --------------FTWSRNKNNFEATKERLDRYFINSEMMSKATRSKVEHLKFHHSDHRSILLDISWEKFQKQLTASKRIIKFEESWVAHEGSKKALAD

Query:  AWNDTAEATN---FNFNMKMQEGITAMNRWNRVRLKGSLKEAIRVKEREINNLSNLSSQEAFTKMV-KAERDLERLLEEKESYWKIRSRENWLKGGDINT
        AW  TA+ T    F    K++    A+ +W      GS+K+ +  K+ E+ +L + +   A  + + + + ++  LL + E +W+ RSR  WL+ GD NT
Subjt:  AWNDTAEATN---FNFNMKMQEGITAMNRWNRVRLKGSLKEAIRVKEREINNLSNLSSQEAFTKMV-KAERDLERLLEEKESYWKIRSRENWLKGGDINT

Query:  KWFHSKASHRRKRSEIKGIFN
        K+FH KA+ RR+ + I G+ +
Subjt:  KWFHSKASHRRKRSEIKGIFN

A0A2N9IIR5 Uncharacterized protein4.9e-5428.3Show/hide
Query:  GNVVGVFEKVDFQELLRRGTVIRIGKNAEEEWIDIKFEKLPDFCYACGILGHLARECG----APEMNNKEKLPYGPWLRRESIPKGKISNENRATQSSPV
        G+ V +   +D  + L RG  IR+G   +  W+  KFE+LP+FCY CG L H  ++C     +  +   E   YG WLR  +   GK       T+   V
Subjt:  GNVVGVFEKVDFQELLRRGTVIRIGKNAEEEWIDIKFEKLPDFCYACGILGHLARECG----APEMNNKEKLPYGPWLRRESIPKGKISNENRATQSSPV

Query:  KDQRERSTRGESSWEIPARMKFQRWEGN--KTNRTGSWRRSSPERWRKPPEKEESRTKKTASLEWAKENKDKKKEDQ---HLSREGQPKGPKSDIGQGKD
           R  + R +             W+ N  KT+       SSP          +  T         KEN  K+K D    +         P + + Q ++
Subjt:  KDQRERSTRGESSWEIPARMKFQRWEGN--KTNRTGSWRRSSPERWRKPPEKEESRTKKTASLEWAKENKDKKKEDQ---HLSREGQPKGPKSDIGQGKD

Query:  LGQNYQKGKGQNNTNYPDMVQALSQRDVFSYVIGIARTQEQNNEIRDLATKTVDRKSGKSWKRRAREQQMQDKKNEELSQGTPS---DAMKLLIWNVRGV
        + ++ QK K  N+    + + ++   D  S   G  + Q Q   +    + T  + +  +WK+   +Q      N  +  G  +    AM  ++WN RG+
Subjt:  LGQNYQKGKGQNNTNYPDMVQALSQRDVFSYVIGIARTQEQNNEIRDLATKTVDRKSGKSWKRRAREQQMQDKKNEELSQGTPS---DAMKLLIWNVRGV

Query:  GNPRTIRSLRHVVRKHNPTIVFLSETKDRNPSSDSLKRKLGFSNSFNVGSEGMSGGLSLLWQNNHAINISSFSKGHIDVIIKE-VDGWWRFTGFYGNPNQ
        GNPRT++ L  +V   +P  VF+ ET       + ++ KL F+N   V      GG+ L W+   A+ I SFS  HID II E     WRFTGFYG P  
Subjt:  GNPRTIRSLRHVVRKHNPTIVFLSETKDRNPSSDSLKRKLGFSNSFNVGSEGMSGGLSLLWQNNHAINISSFSKGHIDVIIKE-VDGWWRFTGFYGNPNQ

Query:  NRRKESWQLLERLKESSKLPWIVGGDFNEIM-------------------------------------FTWSRNKNNFEATKERLDRYFINSEMMSKATR
        + R  SW +L  L     LPW   GDFNE++                                     FTW  N+ N     ERLDR  +NSE + +   
Subjt:  NRRKESWQLLERLKESSKLPWIVGGDFNEIM-------------------------------------FTWSRNKNNFEATKERLDRYFINSEMMSKATR

Query:  SKVEHLKFHHSDHRSILLDISWEKFQKQLTASKRIIKFEESWVAHEGSKKALADAWNDTAEATNFNFNMKMQEGITAMNRWNRVRL------------KG
        + V H++   SDH  + L  S       +T  K++ +FE  W+  EG +  +  AW   +            +G   +  WNRV L             G
Subjt:  SKVEHLKFHHSDHRSILLDISWEKFQKQLTASKRIIKFEESWVAHEGSKKALADAWNDTAEATNFNFNMKMQEGITAMNRWNRVRL------------KG

Query:  SLKEAIRVKEREINNLSNLSSQEA-FTKMVKAERDLERLLEEKESYWKIRSRENWLKGGDINTKWFHSKASHRRKRSEIKGI
        S++  +  K  ++     LS Q     ++V    +L +LL ++E+ W  RSR +WLK GD NT++FHS+AS RR+R+ I G+
Subjt:  SLKEAIRVKEREINNLSNLSSQEA-FTKMVKAERDLERLLEEKESYWKIRSRENWLKGGDINTKWFHSKASHRRKRSEIKGI

A0A6J1DUG8 uncharacterized protein LOC1110241358.4e-5442.12Show/hide
Query:  MKLLIWNVRGVGNPRTIRSLRHVVRKHNPTIVFLSETKDRNPSSDSLKRKLGFSNSFNVGSEGMSGGLSLLWQNNHAINISSFSKGHIDVIIKEVDGWWR
        MK L WNV G+GNP T R+LR++VR+  P +VFLSETK         KR+L F    +V S G SGGL LLW ++  + I S S GHID II +  G WR
Subjt:  MKLLIWNVRGVGNPRTIRSLRHVVRKHNPTIVFLSETKDRNPSSDSLKRKLGFSNSFNVGSEGMSGGLSLLWQNNHAINISSFSKGHIDVIIKEVDGWWR

Query:  FTGFYGNPNQNRRKESWQLLERLKESSKLPWIVGGDFNEI------MFTWSRNKNNFEATK--ERLDRYFINSEMMSKATRSKVEHLKFHHSDHRSILLD
        FTGFYGNP   +R  SW+LLERL     LPWI+GGDFNEI      M    RN++        ERLDR+ IN  M++K    KV HL+   SDHR IL  
Subjt:  FTGFYGNPNQNRRKESWQLLERLKESSKLPWIVGGDFNEI------MFTWSRNKNNFEATK--ERLDRYFINSEMMSKATRSKVEHLKFHHSDHRSILLD

Query:  ISWEKFQKQLTASKRIIKFEESWVAHEGSKKALADAWNDTAEATNFNFNMKMQEGITAMNRWNRVRLKGSLKEAIRVKEREINNLSNL--SSQEAFTKMV
          +E  +      +R I+FEESW+  +G +  +   W          F  K+   ++ +N WN++RL  SLK AI  KE+E+  L  L   SQ A     
Subjt:  ISWEKFQKQLTASKRIIKFEESWVAHEGSKKALADAWNDTAEATNFNFNMKMQEGITAMNRWNRVRLKGSLKEAIRVKEREINNLSNL--SSQEAFTKMV

Query:  KAERDLERLLE
            ++E  L+
Subjt:  KAERDLERLLE

A0A7N2R0C3 Reverse transcriptase domain-containing protein1.2e-5526.29Show/hide
Query:  KIWNVVDRVTIKKAGENLKWAEALGNVVGVFEKVDFQEL-LRRGTVIRIG-----------------KNAEEEWIDIKFEKLPDFCYACGILGHLAREC-
        +I+N+  +   K+ G      +A+G  +G F +VD +E  ++ GT +R+                  +  E  W+  K+E+LP+FCY CG+L H  ++C 
Subjt:  KIWNVVDRVTIKKAGENLKWAEALGNVVGVFEKVDFQEL-LRRGTVIRIG-----------------KNAEEEWIDIKFEKLPDFCYACGILGHLAREC-

Query:  ---GAPEMNNKEKLPYGPWLRRESIPKG---------KISNENRATQSSPVKDQRERSTRGE----SSWEIPARMKFQRWEGNKTNRTGSWRRSSPERWR
           G  +   +  L YG WLR E I KG         K+  E +  +++   +++ R    E     + E+ A       +    +  G    ++ ++  
Subjt:  ---GAPEMNNKEKLPYGPWLRRESIPKG---------KISNENRATQSSPVKDQRERSTRGE----SSWEIPARMKFQRWEGNKTNRTGSWRRSSPERWR

Query:  KPPEKEESRTKKTASLEWAKENKDKKKEDQHLSREGQPKGPKSDIGQGKDLGQNYQKGKGQN-NTNYPDMVQALSQRDV-FSYVIGIARTQEQNNE----
            +   R+     +E  +EN+                    ++G G    +  QKG+ +N N   P+    L    V    V+G+    ++N +    
Subjt:  KPPEKEESRTKKTASLEWAKENKDKKKEDQHLSREGQPKGPKSDIGQGKDLGQNYQKGKGQN-NTNYPDMVQALSQRDV-FSYVIGIARTQEQNNE----

Query:  -IRDLATKTVDRKSGKS---WKRRARE----------QQMQDKKNEELS-------------QGTPSDAMKLLIWNVRGVGNPRTIRSLRHVVRKHNPTI
           D     V  K G S   WKR  R             +Q K++ +L+             + TP   M  L WN RG+G+   +R+L   V+  +P +
Subjt:  -IRDLATKTVDRKSGKS---WKRRARE----------QQMQDKKNEELS-------------QGTPSDAMKLLIWNVRGVGNPRTIRSLRHVVRKHNPTI

Query:  VFLSETKDRNPSSDSLKRKLGFSNSFNVGSEGMSGGLSLLWQNNHAINISSFSKGHIDVIIKEVDGW--WRFTGFYGNPNQNRRKESWQLLERLKESSKL
        VFL+ETK        L+RKLG +    V S+G SGGL++LW+    +++ S S  HIDV++   +G   WR TGFYG+P+   R  SW+LLE L     +
Subjt:  VFLSETKDRNPSSDSLKRKLGFSNSFNVGSEGMSGGLSLLWQNNHAINISSFSKGHIDVIIKEVDGW--WRFTGFYGNPNQNRRKESWQLLERLKESSKL

Query:  PWIVGGDFNEIM-------------------------------------FTWSRNKNNFEATKERLDRYFINSEMMSKATRSKVEHLKFHHSDHRSILLD
        PW+V GDFNEI+                                     FTW   +   + T  RLDR   N E M+    +KV H     SDH   LL 
Subjt:  PWIVGGDFNEIM-------------------------------------FTWSRNKNNFEATKERLDRYFINSEMMSKATRSKVEHLKFHHSDHRSILLD

Query:  ISWEKFQKQLTASKRIIKFEESWVAHEGSKKALADAWNDTAEATNFNFNMKMQEGITAMNRWNRVRLKGSLKEAIRVKEREINNLSNLS-SQEAFTKMVK
        +S  + + +  A +R + FEE W   EG ++ +  AW+            +++     +  WNR R+ G++ + ++ K+  +  L  L+   E+  ++ K
Subjt:  ISWEKFQKQLTASKRIIKFEESWVAHEGSKKALADAWNDTAEATNFNFNMKMQEGITAMNRWNRVRLKGSLKEAIRVKEREINNLSNLS-SQEAFTKMVK

Query:  AERDLERLLEEKESYWKIRSRENWLKGGDINTKWFHSKASHRRKRSEIKGIFN
         ++++  ++  +E  W  RSR  W+K GD NT++FH+ A++RR++++I+GI +
Subjt:  AERDLERLLEEKESYWKIRSRENWLKGGDINTKWFHSKASHRRKRSEIKGIFN

A0A803PBM9 Uncharacterized protein3.5e-5228.66Show/hide
Query:  WIDIKFEKLPDFCYACGILGHLAREC--------GAPEMNNKEKLPYGPWLR----------------RESIPKG-----KISNENR----ATQSSPVKD
        W+  K+E+LP  C+ CG +GH  +EC        GA  +  K    YG WL+                R  + +G     KIS  N+    A   +P+ D
Subjt:  WIDIKFEKLPDFCYACGILGHLAREC--------GAPEMNNKEKLPYGPWLR----------------RESIPKG-----KISNENR----ATQSSPVKD

Query:  QRERSTRGESSWEIPARMKFQRWEGNKTN-RTGSWRRSSPERWRKPPEKEESRTKKTASLEWAKENKDKKKEDQHLSREGQPKGPKSDIGQGKDLGQNYQ
        +RE          +    +  R E +KTN + G+   +     R   E++E+            ++K K++  +  +  G  K  K+DI     LGQ   
Subjt:  QRERSTRGESSWEIPARMKFQRWEGNKTN-RTGSWRRSSPERWRKPPEKEESRTKKTASLEWAKENKDKKKEDQHLSREGQPKGPKSDIGQGKDLGQNYQ

Query:  KGKGQNNTNYPDMVQALSQRDVFSYVIGIARTQEQNNEIRDLATKTVDRKSGKSWKRRA--REQQMQDKKNEELSQGTPSDAMKLLIWNVRGVGNPRTIR
                  P +           + + IA  +E +N              G S+   A  R     ++   EL        MKLL+WNV+G+GNP T+R
Subjt:  KGKGQNNTNYPDMVQALSQRDVFSYVIGIARTQEQNNEIRDLATKTVDRKSGKSWKRRA--REQQMQDKKNEELSQGTPSDAMKLLIWNVRGVGNPRTIR

Query:  SLRHVVRKHNPTIVFLSETKDRNPSSDSLKRKLGFSNSFNVGSEGMSGGLSLLWQNNHAINISSFSKGHIDVIIKEVDG-WWRFTGFYGNPNQNRRKESW
        +L+ +V + +P +VF+SE++ +   +++L+  LG+   F V + G SGGL LLW N    NI SFS  HID  I++ +G WWRFTGFYG+P+  +R ESW
Subjt:  SLRHVVRKHNPTIVFLSETKDRNPSSDSLKRKLGFSNSFNVGSEGMSGGLSLLWQNNHAINISSFSKGHIDVIIKEVDG-WWRFTGFYGNPNQNRRKESW

Query:  QLLERLKESSKLPWIVGGDFNEIM-------------------------------------FTWSRNKNNFEATKERLDRYFINSEMMSKATRSKVEHLK
        +LL R+      PW++GGDFNEI+                                     +TW   + N E   ERLDR   N E      ++KV HL 
Subjt:  QLLERLKESSKLPWIVGGDFNEIM-------------------------------------FTWSRNKNNFEATKERLDRYFINSEMMSKATRSKVEHLK

Query:  FHHSDHRSILLD--ISWEKFQKQLTASKRIIKFEESWVAHEGSKKALADAWNDTAEATNFNFNMKMQEGIT----AMNRWNRVRLKGSLKEAIRVKEREI
           SDH  +LL   +   + +K +    R   FE +W   E   + + ++W+   +  + N  M +++ +     A+ +WN+ R K  +K+ ++  E +I
Subjt:  FHHSDHRSILLD--ISWEKFQKQLTASKRIIKFEESWVAHEGSKKALADAWNDTAEATNFNFNMKMQEGIT----AMNRWNRVRLKGSLKEAIRVKEREI

Query:  NNLSNLSSQEAFTKMVKAERDLERLLEEKESYWKIRSRENWLKGGDINTKWFHSKASHRRKRSEIKGIFN
          LS  ++ + +  +   E+    LL+++E +W+ RSR  WLK GD NTK+FH KA+ R++++ I G+ +
Subjt:  NNLSNLSSQEAFTKMVKAERDLERLLEEKESYWKIRSRENWLKGGDINTKWFHSKASHRRKRSEIKGIFN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGATGAGGAACTCAACAAGAAAGTACAAGATTTGGAAAATGTGGTAGTGCGTAAGATAGCCACTGAAAAGCACATTAACGTAGAAATATTTAAGAGGATGATTCC
CAAAATATGGAACGTGGTTGATAGAGTCACCATAAAGAAAGCGGGAGAGAATTTGAAGTGGGCGGAAGCCCTGGGCAACGTTGTGGGAGTGTTTGAAAAGGTAGACTTCC
AGGAACTACTGAGAAGAGGAACGGTAATCAGAATAGGGAAAAATGCTGAAGAAGAATGGATAGACATCAAATTCGAAAAGCTCCCTGACTTTTGCTACGCGTGTGGCATA
CTGGGGCACTTAGCGAGGGAGTGCGGTGCACCGGAAATGAACAACAAAGAGAAACTTCCATATGGGCCATGGTTAAGACGAGAGAGCATACCAAAAGGGAAGATATCCAA
TGAAAATAGAGCTACCCAATCAAGTCCTGTAAAGGACCAAAGGGAAAGAAGCACTAGAGGGGAAAGTAGCTGGGAAATCCCAGCAAGAATGAAGTTTCAAAGATGGGAAG
GAAACAAAACGAATCGTACAGGAAGTTGGAGGAGAAGCTCGCCGGAACGATGGAGGAAACCGCCGGAGAAGGAGGAGAGTAGGACAAAAAAAACGGCTAGTTTAGAATGG
GCAAAGGAGAATAAGGACAAGAAAAAAGAAGATCAACATCTCAGCAGAGAAGGACAACCAAAGGGGCCCAAGTCTGACATAGGCCAAGGGAAAGATTTGGGCCAAAATTA
TCAAAAAGGAAAGGGCCAAAATAATACTAACTACCCAGATATGGTACAAGCCCTTAGCCAAAGAGATGTGTTCTCATATGTGATTGGAATAGCGAGAACACAGGAACAGA
ACAATGAAATAAGAGATTTGGCAACCAAAACAGTAGATAGGAAAAGCGGAAAATCTTGGAAAAGACGGGCTAGAGAACAACAAATGCAAGACAAGAAGAATGAAGAGTTG
AGTCAGGGAACCCCGTCGGACGCTATGAAACTTTTAATCTGGAATGTCCGAGGGGTGGGGAATCCTCGGACGATCCGCTCTCTGCGACATGTGGTTCGCAAGCACAACCC
CACGATAGTTTTCTTGAGCGAGACTAAGGACAGGAACCCGTCATCAGATAGCTTGAAGAGAAAACTGGGCTTTAGCAATAGTTTCAACGTTGGCAGTGAAGGAATGAGCG
GGGGATTAAGTCTCCTTTGGCAGAATAACCATGCTATAAACATCAGCTCTTTCTCTAAAGGCCACATAGATGTTATAATAAAAGAAGTTGATGGGTGGTGGAGATTCACG
GGGTTTTATGGAAACCCGAATCAAAACAGGAGAAAAGAATCTTGGCAACTGCTGGAAAGGCTAAAAGAGTCGTCGAAGCTCCCATGGATCGTCGGAGGCGATTTTAATGA
AATTATGTTCACGTGGTCAAGGAACAAGAACAACTTTGAAGCAACCAAAGAGCGTCTCGATAGATACTTTATAAACTCGGAGATGATGTCTAAAGCCACAAGATCGAAAG
TGGAACATCTCAAGTTTCATCACTCAGATCATAGGTCTATATTGCTCGACATTAGCTGGGAGAAGTTCCAAAAACAACTGACAGCCAGCAAAAGAATTATAAAGTTTGAG
GAAAGTTGGGTTGCTCATGAGGGAAGCAAAAAAGCTCTTGCGGATGCTTGGAACGACACAGCTGAAGCTACAAATTTCAACTTCAACATGAAAATGCAAGAAGGAATAAC
AGCCATGAACAGATGGAATAGAGTCAGATTGAAAGGATCCTTAAAAGAAGCTATCAGAGTTAAGGAAAGAGAGATCAACAATCTTTCAAACCTTTCAAGCCAAGAGGCTT
TTACTAAAATGGTCAAAGCCGAGAGGGATCTTGAAAGATTGCTGGAAGAAAAGGAAAGTTATTGGAAAATTCGATCGAGAGAGAATTGGCTCAAAGGGGGAGACATAAAC
ACCAAGTGGTTCCACTCCAAAGCCTCCCATAGAAGGAAAAGAAGCGAAATAAAAGGCATTTTCAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATGATGAGGAACTCAACAAGAAAGTACAAGATTTGGAAAATGTGGTAGTGCGTAAGATAGCCACTGAAAAGCACATTAACGTAGAAATATTTAAGAGGATGATTCC
CAAAATATGGAACGTGGTTGATAGAGTCACCATAAAGAAAGCGGGAGAGAATTTGAAGTGGGCGGAAGCCCTGGGCAACGTTGTGGGAGTGTTTGAAAAGGTAGACTTCC
AGGAACTACTGAGAAGAGGAACGGTAATCAGAATAGGGAAAAATGCTGAAGAAGAATGGATAGACATCAAATTCGAAAAGCTCCCTGACTTTTGCTACGCGTGTGGCATA
CTGGGGCACTTAGCGAGGGAGTGCGGTGCACCGGAAATGAACAACAAAGAGAAACTTCCATATGGGCCATGGTTAAGACGAGAGAGCATACCAAAAGGGAAGATATCCAA
TGAAAATAGAGCTACCCAATCAAGTCCTGTAAAGGACCAAAGGGAAAGAAGCACTAGAGGGGAAAGTAGCTGGGAAATCCCAGCAAGAATGAAGTTTCAAAGATGGGAAG
GAAACAAAACGAATCGTACAGGAAGTTGGAGGAGAAGCTCGCCGGAACGATGGAGGAAACCGCCGGAGAAGGAGGAGAGTAGGACAAAAAAAACGGCTAGTTTAGAATGG
GCAAAGGAGAATAAGGACAAGAAAAAAGAAGATCAACATCTCAGCAGAGAAGGACAACCAAAGGGGCCCAAGTCTGACATAGGCCAAGGGAAAGATTTGGGCCAAAATTA
TCAAAAAGGAAAGGGCCAAAATAATACTAACTACCCAGATATGGTACAAGCCCTTAGCCAAAGAGATGTGTTCTCATATGTGATTGGAATAGCGAGAACACAGGAACAGA
ACAATGAAATAAGAGATTTGGCAACCAAAACAGTAGATAGGAAAAGCGGAAAATCTTGGAAAAGACGGGCTAGAGAACAACAAATGCAAGACAAGAAGAATGAAGAGTTG
AGTCAGGGAACCCCGTCGGACGCTATGAAACTTTTAATCTGGAATGTCCGAGGGGTGGGGAATCCTCGGACGATCCGCTCTCTGCGACATGTGGTTCGCAAGCACAACCC
CACGATAGTTTTCTTGAGCGAGACTAAGGACAGGAACCCGTCATCAGATAGCTTGAAGAGAAAACTGGGCTTTAGCAATAGTTTCAACGTTGGCAGTGAAGGAATGAGCG
GGGGATTAAGTCTCCTTTGGCAGAATAACCATGCTATAAACATCAGCTCTTTCTCTAAAGGCCACATAGATGTTATAATAAAAGAAGTTGATGGGTGGTGGAGATTCACG
GGGTTTTATGGAAACCCGAATCAAAACAGGAGAAAAGAATCTTGGCAACTGCTGGAAAGGCTAAAAGAGTCGTCGAAGCTCCCATGGATCGTCGGAGGCGATTTTAATGA
AATTATGTTCACGTGGTCAAGGAACAAGAACAACTTTGAAGCAACCAAAGAGCGTCTCGATAGATACTTTATAAACTCGGAGATGATGTCTAAAGCCACAAGATCGAAAG
TGGAACATCTCAAGTTTCATCACTCAGATCATAGGTCTATATTGCTCGACATTAGCTGGGAGAAGTTCCAAAAACAACTGACAGCCAGCAAAAGAATTATAAAGTTTGAG
GAAAGTTGGGTTGCTCATGAGGGAAGCAAAAAAGCTCTTGCGGATGCTTGGAACGACACAGCTGAAGCTACAAATTTCAACTTCAACATGAAAATGCAAGAAGGAATAAC
AGCCATGAACAGATGGAATAGAGTCAGATTGAAAGGATCCTTAAAAGAAGCTATCAGAGTTAAGGAAAGAGAGATCAACAATCTTTCAAACCTTTCAAGCCAAGAGGCTT
TTACTAAAATGGTCAAAGCCGAGAGGGATCTTGAAAGATTGCTGGAAGAAAAGGAAAGTTATTGGAAAATTCGATCGAGAGAGAATTGGCTCAAAGGGGGAGACATAAAC
ACCAAGTGGTTCCACTCCAAAGCCTCCCATAGAAGGAAAAGAAGCGAAATAAAAGGCATTTTCAATTAG
Protein sequenceShow/hide protein sequence
MDDEELNKKVQDLENVVVRKIATEKHINVEIFKRMIPKIWNVVDRVTIKKAGENLKWAEALGNVVGVFEKVDFQELLRRGTVIRIGKNAEEEWIDIKFEKLPDFCYACGI
LGHLARECGAPEMNNKEKLPYGPWLRRESIPKGKISNENRATQSSPVKDQRERSTRGESSWEIPARMKFQRWEGNKTNRTGSWRRSSPERWRKPPEKEESRTKKTASLEW
AKENKDKKKEDQHLSREGQPKGPKSDIGQGKDLGQNYQKGKGQNNTNYPDMVQALSQRDVFSYVIGIARTQEQNNEIRDLATKTVDRKSGKSWKRRAREQQMQDKKNEEL
SQGTPSDAMKLLIWNVRGVGNPRTIRSLRHVVRKHNPTIVFLSETKDRNPSSDSLKRKLGFSNSFNVGSEGMSGGLSLLWQNNHAINISSFSKGHIDVIIKEVDGWWRFT
GFYGNPNQNRRKESWQLLERLKESSKLPWIVGGDFNEIMFTWSRNKNNFEATKERLDRYFINSEMMSKATRSKVEHLKFHHSDHRSILLDISWEKFQKQLTASKRIIKFE
ESWVAHEGSKKALADAWNDTAEATNFNFNMKMQEGITAMNRWNRVRLKGSLKEAIRVKEREINNLSNLSSQEAFTKMVKAERDLERLLEEKESYWKIRSRENWLKGGDIN
TKWFHSKASHRRKRSEIKGIFN