; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017781 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017781
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase Ty1/copia-type domain-containing protein
Genome locationchr5:8695245..8699077
RNA-Seq ExpressionLag0017781
SyntenyLag0017781
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050146.1 putative mitochondrial protein [Cucumis melo var. makuwa]5.7e-5630.22Show/hide
Query:  KNVAVSVKIEDEEVLVHTLNGLPSDFNAFRTSIRTSSGTLSLAELHVLLGVEEKTIHQHQAAE--PT-------PYDHGS-SEGKLFEPKFSTQRVARKL
        K  A SV +EDEE+LVHTLNGL + FNAFRTSIRT +  +SL ELH L   EE T+ +  A E  PT       P  H S   G+ F    +    +   
Subjt:  KNVAVSVKIEDEEVLVHTLNGLPSDFNAFRTSIRTSSGTLSLAELHVLLGVEEKTIHQHQAAE--PT-------PYDHGS-SEGKLFEPKFSTQRVARKL

Query:  PATRGTWFST----------------------------------------------------------------------------------WKWMNFSY
         + RGT F++                                                                                  +  MNFSY
Subjt:  PATRGTWFST----------------------------------------------------------------------------------WKWMNFSY

Query:  QGRHPPAQLAAMAVNSMNSQSLNDSSNAFWLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAANLL
        QGRHPP+QLAAM VNSMNSQ   +++N FWL DSGCNVHMTN+LANLNLSNNYNGEE+VTV N QPLNI+NT                            
Subjt:  QGRHPPAQLAAMAVNSMNSQSLNDSSNAFWLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAANLL

Query:  SVHKFCLDNNCIFVFDTDWFLIQDKVSGKILYTGESVNG-LYPIPSPSMLSSDLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSLSRLVFLINRMPSSS
                                      + TG  +N     +P P                             P P+                    
Subjt:  SVHKFCLDNNCIFVFDTDWFLIQDKVSGKILYTGESVNG-LYPIPSPSMLSSDLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSLSRLVFLINRMPSSS

Query:  FNMSSSFEKLYAETSHEHSISQNVHISASTINNHLMQTRAKSGIFKSKAFVSTMTTLVP--TDPTSYFVASKTKYNIDGTVAQYKARLVAKGYHQVEGFD
                                       N H MQTRAKS IFK KAF  T  + +P            + +   +  + ++KARLVAKGYHQV+GFD
Subjt:  FNMSSSFEKLYAETSHEHSISQNVHISASTINNHLMQTRAKSGIFKSKAFVSTMTTLVP--TDPTSYFVASKTKYNIDGTVAQYKARLVAKGYHQVEGFD

Query:  FTETFSPVVKKPTIRVILALVAH-----------------------------------------------------------------------------
        F ETFSPVVKKPTI +ILAL A                                                                              
Subjt:  FTETFSPVVKKPTIRVILALVAH-----------------------------------------------------------------------------

Query:  ---------------------SKYFLGLEVHTTTDDIFVNQAKYLTDLLHTAGMSSAKTCLTPMSTTVDLHISAPLFDNATFYRQ
                              KYFLGLE+ ++ D IFVNQAKYL DLLHT+GM+SAK+C+TPMST++DL+  AP F++ + YR+
Subjt:  ---------------------SKYFLGLEVHTTTDDIFVNQAKYLTDLLHTAGMSSAKTCLTPMSTTVDLHISAPLFDNATFYRQ

KAA0067173.1 retrotransposon protein [Cucumis melo var. makuwa]3.9e-6549.39Show/hide
Query:  KNVAVSVKIEDEEVLVHTLNGLPSDFNAFRTSIRTSSGTLSLAELHVLLGVEEKTIHQHQAAE--PT------PYDHGSSEGK-----------------
        K VA S+ +EDEE+LVHTLNGLP  FNAFRTSIRT SG +SL ELH LL  EE T+ +  A E  PT      P  H SS G+                 
Subjt:  KNVAVSVKIEDEEVLVHTLNGLPSDFNAFRTSIRTSSGTLSLAELHVLLGVEEKTIHQHQAAE--PT------PYDHGSSEGK-----------------

Query:  ------------------------------------------LFEPKFSTQRVARKLPATRGTWF-------------------------STWKWMNFSY
                                                   F P    QR       + G  F                           +  MNFSY
Subjt:  ------------------------------------------LFEPKFSTQRVARKLPATRGTWF-------------------------STWKWMNFSY

Query:  QGRHPPAQLAAMAVNSMNSQSLNDSSNAFWLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAANLL
        Q RH P+QLAAMAVNSMNSQ  ++++N FWLSDSG NVHMTN+LANLNLSNNYNGEE+VTVGNGQPLNI+NT  G L T SHTFNLSKILHAP+LA NLL
Subjt:  QGRHPPAQLAAMAVNSMNSQSLNDSSNAFWLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAANLL

Query:  SVHKFCLDNNCIFVFDTDWFLIQDKVSG
        SVHKFCLDNNC+FVF TD FLIQDKV+G
Subjt:  SVHKFCLDNNCIFVFDTDWFLIQDKVSG

TYK06402.1 putative mitochondrial protein [Cucumis melo var. makuwa]6.9e-4631.45Show/hide
Query:  KNVAVSVKIEDEEVLVHTLNGLPSDFNAFRTSIRTSSGTLSLAELHVLLGVEEKTIHQHQAAE--PT-------PYDHGS-SEGKLFEPKFSTQRVARKL
        K  A SV +EDEE+LVHTLNGL + FNAFRTSIRT +  +SL ELH L   EE T+ +  A E  PT       P  H S   G+ F    +    +   
Subjt:  KNVAVSVKIEDEEVLVHTLNGLPSDFNAFRTSIRTSSGTLSLAELHVLLGVEEKTIHQHQAAE--PT-------PYDHGS-SEGKLFEPKFSTQRVARKL

Query:  PATRGTWFST----------------------------------------------------------------------------------WKWMNFSY
         + RGT F++                                                                                  +  MNFSY
Subjt:  PATRGTWFST----------------------------------------------------------------------------------WKWMNFSY

Query:  QGRHPPAQLAAMAVNSMNSQSLNDSSNAFWLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAANLL
        QGRHPP+QLAAM VNSMNSQ   +++N FWL DSGCNVHMTN+LANLNLSNNYNGEE+VTV N QPLNI+NT  G++ T +     +  L  P    N  
Subjt:  QGRHPPAQLAAMAVNSMNSQSLNDSSNAFWLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAANLL

Query:  SVHKFCLDNNCIFVFDTDWFLIQDKVSGKILYTGESVNGLYPIPSPSMLSSDLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSLSRLVFLINRMPSSSF
        ++       +CIF                                         PK F+     +  L HH      P I    L               
Subjt:  SVHKFCLDNNCIFVFDTDWFLIQDKVSGKILYTGESVNGLYPIPSPSMLSSDLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSLSRLVFLINRMPSSSF

Query:  NMSSSFEKLYAETSHEHSISQNVHISASTINNHL-MQTRAKSGIFKSKAFVSTMTTLVPTDPTSYFVASKTKYNIDGTVAQYKARLVAKGYHQVEGFDFT
         M +S    + ET +   ++Q       T  NH+ +  ++  G+ K               P +  + S     +D  +              V G D+ 
Subjt:  NMSSSFEKLYAETSHEHSISQNVHISASTINNHL-MQTRAKSGIFKSKAFVSTMTTLVPTDPTSYFVASKTKYNIDGTVAQYKARLVAKGYHQVEGFDFT

Query:  ETFSPVVKKPTIRVILALVAHSKYFLGLEVHTTTDDIFVNQAKYLTDLLHTAGMSSAKTCLTPMSTTVDLHISAPLFDNATFYRQ
           S    +  +   ++ +   KYFLGLE+ ++ D IFVNQAKYL DLLHT+GM+SAK+C+TPMST++DL+  AP F++ + YR+
Subjt:  ETFSPVVKKPTIRVILALVAHSKYFLGLEVHTTTDDIFVNQAKYLTDLLHTAGMSSAKTCLTPMSTTVDLHISAPLFDNATFYRQ

XP_016902697.1 PREDICTED: uncharacterized protein LOC107991825 [Cucumis melo]1.6e-6347.65Show/hide
Query:  AVSVKIEDEEVLVHTLNGLPSDFNAFRTSIRTSSGTLSLAELHVLLGVEEKTIHQHQAAE--PT-------PYDHGS-SEGKLFEPKFSTQRVARKLPAT
        A SV +EDEE+LVHTLNGLP+ FNAFRTSIRT +G +SL ELH LL  EE  + +  A E  PT       P  HGS   G+ F    +    +    + 
Subjt:  AVSVKIEDEEVLVHTLNGLPSDFNAFRTSIRTSSGTLSLAELHVLLGVEEKTIHQHQAAE--PT-------PYDHGS-SEGKLFEPKFSTQRVARKLPAT

Query:  RGTWFST----------------------------------------------------------------------------------WKWMNFSYQGR
        RGT F++                                                                                  +  MNFSYQG+
Subjt:  RGTWFST----------------------------------------------------------------------------------WKWMNFSYQGR

Query:  HPPAQLAAMAVNSMNSQSLNDSSNAFWLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAANLLSVH
        HPP+QL AMA+NSMNSQ   +++N F LSDSGCNVHMTN+LANLNLSNNYNGEE+VTVGNGQP+NI+NT  G L T SHTFNLSKILHAP+LA NLLSVH
Subjt:  HPPAQLAAMAVNSMNSQSLNDSSNAFWLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAANLLSVH

Query:  KFCLDNNCIFVFDTDWFLI
        KFCLDNNC+F+FDTDWFLI
Subjt:  KFCLDNNCIFVFDTDWFLI

XP_022158189.1 uncharacterized protein LOC111024722 [Momordica charantia]6.2e-7131.62Show/hide
Query:  MAVNSMNSQSLNDSSNAFWLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAANLLSVHKFCLDNNC
        MAVN+M   S +++ N FWLSDSGCN H+TNDL NLNL ++YNGEE VTVGNGQ LNI +T  G L  SSH F +S +LHAP+LA NLLSVHKFCLDN+C
Subjt:  MAVNSMNSQSLNDSSNAFWLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAANLLSVHKFCLDNNC

Query:  IFVFDTDWFLIQDKVSGKILYTGESVNGLYPIPSPSMLSS---DLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSLSRLVFLIN--------------R
        IFV+D+DWFLIQDKV+   LY G+SVNGLYPIPS S LSS   +LHPKN   +AK    LWHHR GH +PKILR +LS     I+              +
Subjt:  IFVFDTDWFLIQDKVSGKILYTGESVNGLYPIPSPSMLSS---DLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSLSRLVFLIN--------------R

Query:  MPSSSFNM--SSSFEKL-----------------------------------------------------YAE--------TSHEHSISQNVHISAST--
        M   SF M  SSSF  L                                                     +AE        T H +   + V+ S S+  
Subjt:  MPSSSFNM--SSSFEKL-----------------------------------------------------YAE--------TSHEHSISQNVHISAST--

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------INNHLMQTRAKSGIFKSKAFV-------------------------STMT------------TLVPTDPTSYFVASK----T
                          IN HLMQT AKSGIFK +A++                         + M+            +LVP  P    V  K    T
Subjt:  ------------------INNHLMQTRAKSGIFKSKAFV-------------------------STMT------------TLVPTDPTSYFVASK----T

Query:  KYNIDGTVAQYKARLVAKGYHQVEGFDFTETFSPVVKKPTIRVILALVAH--------------------------------------------------
        K+N DG+ A+YKARL+AKGYH++EGFDF ETFSPVVKKPTIRV+L+L AH                                                  
Subjt:  KYNIDGTVAQYKARLVAKGYHQVEGFDFTETFSPVVKKPTIRVILALVAH--------------------------------------------------

Query:  ----------------------------------------------------------------------------SKYFLGLEVHTTTDDIFVNQAKYL
                                                                                     +YFLGLE+ +    IFVNQA+YL
Subjt:  ----------------------------------------------------------------------------SKYFLGLEVHTTTDDIFVNQAKYL

Query:  TDLLHTAGMSSAKTCLTPMSTTVDLHISAPLFDNATFYRQ
         DLL  +GM SAK+C TPMST++DLH S P+F +A+ YRQ
Subjt:  TDLLHTAGMSSAKTCLTPMSTTVDLHISAPLFDNATFYRQ

TrEMBL top hitse value%identityAlignment
A0A1S4E394 uncharacterized protein LOC1079918258.0e-6447.65Show/hide
Query:  AVSVKIEDEEVLVHTLNGLPSDFNAFRTSIRTSSGTLSLAELHVLLGVEEKTIHQHQAAE--PT-------PYDHGS-SEGKLFEPKFSTQRVARKLPAT
        A SV +EDEE+LVHTLNGLP+ FNAFRTSIRT +G +SL ELH LL  EE  + +  A E  PT       P  HGS   G+ F    +    +    + 
Subjt:  AVSVKIEDEEVLVHTLNGLPSDFNAFRTSIRTSSGTLSLAELHVLLGVEEKTIHQHQAAE--PT-------PYDHGS-SEGKLFEPKFSTQRVARKLPAT

Query:  RGTWFST----------------------------------------------------------------------------------WKWMNFSYQGR
        RGT F++                                                                                  +  MNFSYQG+
Subjt:  RGTWFST----------------------------------------------------------------------------------WKWMNFSYQGR

Query:  HPPAQLAAMAVNSMNSQSLNDSSNAFWLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAANLLSVH
        HPP+QL AMA+NSMNSQ   +++N F LSDSGCNVHMTN+LANLNLSNNYNGEE+VTVGNGQP+NI+NT  G L T SHTFNLSKILHAP+LA NLLSVH
Subjt:  HPPAQLAAMAVNSMNSQSLNDSSNAFWLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAANLLSVH

Query:  KFCLDNNCIFVFDTDWFLI
        KFCLDNNC+F+FDTDWFLI
Subjt:  KFCLDNNCIFVFDTDWFLI

A0A2N9EP97 Uncharacterized protein4.5e-5931.17Show/hide
Query:  AVSVKIEDEEVLVHTLNGLPSDFNAFRTSIRTSSGTLSLAELHVLLGVEEKTIHQHQ------------AAEP--------TP-----------------
        +V V+I+DEEVL   L GLP+++++F  S+RT S ++S  ELH LL  EE  +   Q            A +P        TP                 
Subjt:  AVSVKIEDEEVLVHTLNGLPSDFNAFRTSIRTSSGTLSLAELHVLLGVEEKTIHQHQ------------AAEP--------TP-----------------

Query:  ----------------YDHGSSEGKL-----FEPKFSTQ-RVARKLPATRG-TWFSTWKWMNFSYQGRHPPAQLAAM--AVNSMNSQ-SLNDSSNAFWLS
                        Y+ G++ G+      + P  +T  R   ++   +G T    ++ MNF+YQGR PPA+LAAM  A  S N Q S N      W+S
Subjt:  ----------------YDHGSSEGKL-----FEPKFSTQ-RVARKLPATRG-TWFSTWKWMNFSYQGRHPPAQLAAM--AVNSMNSQ-SLNDSSNAFWLS

Query:  DSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAANLLSVHKFCLDNNCIFVFDTDWFLIQDKVSGKILY
        D+G   H T DL NL   ++Y   + V+VGNGQ L I N D   L+TSS  F L KIL+ P +++NLL V+ FC DNNC F FD   F IQD+ +GK LY
Subjt:  DSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAANLLSVHKFCLDNNCIFVFDTDWFLIQDKVSGKILY

Query:  TGESVNGLYPI------PSPSMLSS-------------DLHPKNFNFMAKQEC------SLWHHRFGHP-------------------------------
        TG S +GLYPI      P  S  SS              L P   +   +  C       +WH R GHP                               
Subjt:  TGESVNGLYPI------PSPSMLSS-------------DLHPKNFNFMAKQEC------SLWHHRFGHP-------------------------------

Query:  -----APKILRSSLSRLVFLINRMP-SSSFNMSSS-FEKLYAETSHEHSISQNVHISASTI-----NNHLMQTRAKSGIFKSKAFVSTMTTL-----VPT
             +P   +SS   L    +  P   +F+ SSS         S   S S N+ I ++++     N+H MQTRAKSGI K KAF S +T++     + T
Subjt:  -----APKILRSSLSRLVFLINRMP-SSSFNMSSS-FEKLYAETSHEHSISQNVHISASTI-----NNHLMQTRAKSGIFKSKAFVSTMTTL-----VPT

Query:  DPTSYFVAS----------------------------------------KTKYNIDGTVAQYKARLVAKGYHQVEGFDFTETFSPVVKKPTIRVILALVA
        +P ++ +AS                                        K K + DG+VA+YKARLVAKG+HQ  G D+ ETFSPVVK P +R+IL+L A
Subjt:  DPTSYFVAS----------------------------------------KTKYNIDGTVAQYKARLVAKGYHQVEGFDFTETFSPVVKKPTIRVILALVA

Query:  HSKYFL-GLEVHTT------TDDIFVNQAKYLTDLLHTA----------GMSSAKTCLTPMSTT--VDLHISAPLFDNATFYRQPAPNHLAFCFDVPICL
        H ++ L  L+V          +D+F+ Q +   D  H+A          G+  A        TT  +DL   A + D + F  +        C  V + L
Subjt:  HSKYFL-GLEVHTT------TDDIFVNQAKYLTDLLHTA----------GMSSAKTCLTPMSTT--VDLHISAPLFDNATFYRQPAPNHLAFCFDVPICL

Query:  SLPFVNLI-------GLVTLLTDDPHVASLLFLAPVQY
         L   ++I        ++TL++    V  L  L P++Y
Subjt:  SLPFVNLI-------GLVTLLTDDPHVASLLFLAPVQY

A0A5A7U7J4 Putative mitochondrial protein2.7e-5630.22Show/hide
Query:  KNVAVSVKIEDEEVLVHTLNGLPSDFNAFRTSIRTSSGTLSLAELHVLLGVEEKTIHQHQAAE--PT-------PYDHGS-SEGKLFEPKFSTQRVARKL
        K  A SV +EDEE+LVHTLNGL + FNAFRTSIRT +  +SL ELH L   EE T+ +  A E  PT       P  H S   G+ F    +    +   
Subjt:  KNVAVSVKIEDEEVLVHTLNGLPSDFNAFRTSIRTSSGTLSLAELHVLLGVEEKTIHQHQAAE--PT-------PYDHGS-SEGKLFEPKFSTQRVARKL

Query:  PATRGTWFST----------------------------------------------------------------------------------WKWMNFSY
         + RGT F++                                                                                  +  MNFSY
Subjt:  PATRGTWFST----------------------------------------------------------------------------------WKWMNFSY

Query:  QGRHPPAQLAAMAVNSMNSQSLNDSSNAFWLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAANLL
        QGRHPP+QLAAM VNSMNSQ   +++N FWL DSGCNVHMTN+LANLNLSNNYNGEE+VTV N QPLNI+NT                            
Subjt:  QGRHPPAQLAAMAVNSMNSQSLNDSSNAFWLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAANLL

Query:  SVHKFCLDNNCIFVFDTDWFLIQDKVSGKILYTGESVNG-LYPIPSPSMLSSDLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSLSRLVFLINRMPSSS
                                      + TG  +N     +P P                             P P+                    
Subjt:  SVHKFCLDNNCIFVFDTDWFLIQDKVSGKILYTGESVNG-LYPIPSPSMLSSDLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSLSRLVFLINRMPSSS

Query:  FNMSSSFEKLYAETSHEHSISQNVHISASTINNHLMQTRAKSGIFKSKAFVSTMTTLVP--TDPTSYFVASKTKYNIDGTVAQYKARLVAKGYHQVEGFD
                                       N H MQTRAKS IFK KAF  T  + +P            + +   +  + ++KARLVAKGYHQV+GFD
Subjt:  FNMSSSFEKLYAETSHEHSISQNVHISASTINNHLMQTRAKSGIFKSKAFVSTMTTLVP--TDPTSYFVASKTKYNIDGTVAQYKARLVAKGYHQVEGFD

Query:  FTETFSPVVKKPTIRVILALVAH-----------------------------------------------------------------------------
        F ETFSPVVKKPTI +ILAL A                                                                              
Subjt:  FTETFSPVVKKPTIRVILALVAH-----------------------------------------------------------------------------

Query:  ---------------------SKYFLGLEVHTTTDDIFVNQAKYLTDLLHTAGMSSAKTCLTPMSTTVDLHISAPLFDNATFYRQ
                              KYFLGLE+ ++ D IFVNQAKYL DLLHT+GM+SAK+C+TPMST++DL+  AP F++ + YR+
Subjt:  ---------------------SKYFLGLEVHTTTDDIFVNQAKYLTDLLHTAGMSSAKTCLTPMSTTVDLHISAPLFDNATFYRQ

A0A5A7VGG0 Retrotransposon protein1.9e-6549.39Show/hide
Query:  KNVAVSVKIEDEEVLVHTLNGLPSDFNAFRTSIRTSSGTLSLAELHVLLGVEEKTIHQHQAAE--PT------PYDHGSSEGK-----------------
        K VA S+ +EDEE+LVHTLNGLP  FNAFRTSIRT SG +SL ELH LL  EE T+ +  A E  PT      P  H SS G+                 
Subjt:  KNVAVSVKIEDEEVLVHTLNGLPSDFNAFRTSIRTSSGTLSLAELHVLLGVEEKTIHQHQAAE--PT------PYDHGSSEGK-----------------

Query:  ------------------------------------------LFEPKFSTQRVARKLPATRGTWF-------------------------STWKWMNFSY
                                                   F P    QR       + G  F                           +  MNFSY
Subjt:  ------------------------------------------LFEPKFSTQRVARKLPATRGTWF-------------------------STWKWMNFSY

Query:  QGRHPPAQLAAMAVNSMNSQSLNDSSNAFWLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAANLL
        Q RH P+QLAAMAVNSMNSQ  ++++N FWLSDSG NVHMTN+LANLNLSNNYNGEE+VTVGNGQPLNI+NT  G L T SHTFNLSKILHAP+LA NLL
Subjt:  QGRHPPAQLAAMAVNSMNSQSLNDSSNAFWLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAANLL

Query:  SVHKFCLDNNCIFVFDTDWFLIQDKVSG
        SVHKFCLDNNC+FVF TD FLIQDKV+G
Subjt:  SVHKFCLDNNCIFVFDTDWFLIQDKVSG

A0A6J1DYN6 uncharacterized protein LOC1110247223.0e-7131.62Show/hide
Query:  MAVNSMNSQSLNDSSNAFWLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAANLLSVHKFCLDNNC
        MAVN+M   S +++ N FWLSDSGCN H+TNDL NLNL ++YNGEE VTVGNGQ LNI +T  G L  SSH F +S +LHAP+LA NLLSVHKFCLDN+C
Subjt:  MAVNSMNSQSLNDSSNAFWLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAANLLSVHKFCLDNNC

Query:  IFVFDTDWFLIQDKVSGKILYTGESVNGLYPIPSPSMLSS---DLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSLSRLVFLIN--------------R
        IFV+D+DWFLIQDKV+   LY G+SVNGLYPIPS S LSS   +LHPKN   +AK    LWHHR GH +PKILR +LS     I+              +
Subjt:  IFVFDTDWFLIQDKVSGKILYTGESVNGLYPIPSPSMLSS---DLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSLSRLVFLIN--------------R

Query:  MPSSSFNM--SSSFEKL-----------------------------------------------------YAE--------TSHEHSISQNVHISAST--
        M   SF M  SSSF  L                                                     +AE        T H +   + V+ S S+  
Subjt:  MPSSSFNM--SSSFEKL-----------------------------------------------------YAE--------TSHEHSISQNVHISAST--

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------INNHLMQTRAKSGIFKSKAFV-------------------------STMT------------TLVPTDPTSYFVASK----T
                          IN HLMQT AKSGIFK +A++                         + M+            +LVP  P    V  K    T
Subjt:  ------------------INNHLMQTRAKSGIFKSKAFV-------------------------STMT------------TLVPTDPTSYFVASK----T

Query:  KYNIDGTVAQYKARLVAKGYHQVEGFDFTETFSPVVKKPTIRVILALVAH--------------------------------------------------
        K+N DG+ A+YKARL+AKGYH++EGFDF ETFSPVVKKPTIRV+L+L AH                                                  
Subjt:  KYNIDGTVAQYKARLVAKGYHQVEGFDFTETFSPVVKKPTIRVILALVAH--------------------------------------------------

Query:  ----------------------------------------------------------------------------SKYFLGLEVHTTTDDIFVNQAKYL
                                                                                     +YFLGLE+ +    IFVNQA+YL
Subjt:  ----------------------------------------------------------------------------SKYFLGLEVHTTTDDIFVNQAKYL

Query:  TDLLHTAGMSSAKTCLTPMSTTVDLHISAPLFDNATFYRQ
         DLL  +GM SAK+C TPMST++DLH S P+F +A+ YRQ
Subjt:  TDLLHTAGMSSAKTCLTPMSTTVDLHISAPLFDNATFYRQ

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.0e-0450Show/hide
Query:  KYNIDGTVAQYKARLVAKGYHQVEGFDFTETFSPVVKKPTIRVILALV
        KYN  G   +YKARLVA+G+ Q    D+ ETF+PV +  + R IL+LV
Subjt:  KYNIDGTVAQYKARLVAKGYHQVEGFDFTETFSPVVKKPTIRVILALV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.3e-0654.9Show/hide
Query:  KTKYNIDGTVAQYKARLVAKGYHQVEGFDFTETFSPVVKKPTIRVILALVA
        K K + D  + +YKARLV KG+ Q +G DF E FSPVVK  +IR IL+L A
Subjt:  KTKYNIDGTVAQYKARLVAKGYHQVEGFDFTETFSPVVKKPTIRVILALVA

P92520 Uncharacterized mitochondrial protein AtMg008202.7e-0863.83Show/hide
Query:  KTKYNIDGTVAQYKARLVAKGYHQVEGFDFTETFSPVVKKPTIRVIL
        KTK + DGT+ + KARLVAKG+HQ EG  F ET+SPVV+  TIR IL
Subjt:  KTKYNIDGTVAQYKARLVAKGYHQVEGFDFTETFSPVVKKPTIRVIL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.4e-1734.21Show/hide
Query:  SYQGRHPPAQLAAMAVNSMNSQSLNDSSNAFWLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAAN
        S   + PP+        +  +     SSN  WL DSG   H+T+D  NL+L   Y G + V V +G  + I +T   +L T S   NL  IL+ P +  N
Subjt:  SYQGRHPPAQLAAMAVNSMNSQSLNDSSNAFWLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAAN

Query:  LLSVHKFCLDNNCIFVFDTDWFLIQDKVSGKILYTGESVNGLY--PIPSPSMLSSDLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSLS
        L+SV++ C  N     F    F ++D  +G  L  G++ + LY  PI S   +S    P      +K   S WH R GHPAP IL S +S
Subjt:  LLSVHKFCLDNNCIFVFDTDWFLIQDKVSGKILYTGESVNGLY--PIPSPSMLSSDLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSLS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.4e-0927.97Show/hide
Query:  TGESVNGLYPIPSPSMLSSDLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSLSRLVFLINRMPSSSFNMSSSFEKLYAETSHEH---SISQNV-HISAS
        T    NG  P   P+   +  H       + Q  S   +   + +P  L  SLS      +  PS + + SSS       +   H    ++Q V + + +
Subjt:  TGESVNGLYPIPSPSMLSSDLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSLSRLVFLINRMPSSSFNMSSSFEKLYAETSHEH---SISQNV-HISAS

Query:  TINNHLMQTRAKSGIFKSKAFVSTMTT-------------------------------------LVPTDPTSYFVAS-----KTKYNIDGTVAQYKARLV
         +N H M TRAK+GI K     S   +                                     LVP  P+   +         KYN DG++ +YKARLV
Subjt:  TINNHLMQTRAKSGIFKSKAFVSTMTT-------------------------------------LVPTDPTSYFVAS-----KTKYNIDGTVAQYKARLV

Query:  AKGYHQVEGFDFTETFSPVVKKPTIRVILAL-VAHSKYFLGLEVH------TTTDDIFVNQ
        AKGY+Q  G D+ ETFSPV+K  +IR++L + V  S     L+V+      T TDD++++Q
Subjt:  AKGYHQVEGFDFTETFSPVVKKPTIRVILAL-VAHSKYFLGLEVH------TTTDDIFVNQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.2e-1634.59Show/hide
Query:  WLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAANLLSVHKFCLDNNCIFVFDTDWFLIQDKVSGK
        WL DSG   H+T+D  NL+    Y G + V + +G  + I +T   +L TSS + +L+K+L+ P +  NL+SV++ C  N     F    F ++D  +G 
Subjt:  WLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAANLLSVHKFCLDNNCIFVFDTDWFLIQDKVSGK

Query:  ILYTGESVNGLY--PIPSPSMLSSDLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSLS
         L  G++ + LY  PI S   +S    P      +K   S WH R GHP+  IL S +S
Subjt:  ILYTGESVNGLY--PIPSPSMLSSDLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSLS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.9e-1031.93Show/hide
Query:  INNHLMQTRAKSGIFKSKAFVSTMTT-------------------------------------LVPTDPTSYFVAS-----KTKYNIDGTVAQYKARLVA
        +N H M TRAK GI K     S  T+                                     LVP  P S  +         K+N DG++ +YKARLVA
Subjt:  INNHLMQTRAKSGIFKSKAFVSTMTT-------------------------------------LVPTDPTSYFVAS-----KTKYNIDGTVAQYKARLVA

Query:  KGYHQVEGFDFTETFSPVVKKPTIRVILAL-VAHSKYFLGLEVH------TTTDDIFVNQAKYLTD
        KGY+Q  G D+ ETFSPV+K  +IR++L + V  S     L+V+      T TD+++++Q     D
Subjt:  KGYHQVEGFDFTETFSPVVKKPTIRVILAL-VAHSKYFLGLEVH------TTTDDIFVNQAKYLTD

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 89.1e-1264.71Show/hide
Query:  KTKYNIDGTVAQYKARLVAKGYHQVEGFDFTETFSPVVKKPTIRVILALVA
        K KYN DGT+ +YKARLVAKGY Q EG DF ETFSPV K  ++++ILA+ A
Subjt:  KTKYNIDGTVAQYKARLVAKGYHQVEGFDFTETFSPVVKKPTIRVILALVA

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.9e-0963.83Show/hide
Query:  KTKYNIDGTVAQYKARLVAKGYHQVEGFDFTETFSPVVKKPTIRVIL
        KTK + DGT+ + KARLVAKG+HQ EG  F ET+SPVV+  TIR IL
Subjt:  KTKYNIDGTVAQYKARLVAKGYHQVEGFDFTETFSPVVKKPTIRVIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTCTTCTCCTACAGGAACAACAACAACAGAGCGGTTTGAAAAATGTTGCCGTTTCCGTTAAGATTGAAGATGAAGAGGTACTCGTACATACCTTGAACGGTCTTCC
CTCGGATTTCAACGCCTTCCGAACGTCGATTCGAACGAGCAGTGGCACCTTGTCTTTAGCCGAGCTTCATGTATTATTAGGTGTCGAAGAGAAGACGATTCATCAACACC
AGGCGGCTGAACCAACACCCTACGACCATGGCAGCAGTGAGGGGAAACTGTTCGAGCCCAAATTTTCGACGCAGAGGGTGGCGAGGAAATTACCGGCCACGAGGGGCACG
TGGTTTTCAACGTGGAAATGGATGAATTTTTCCTATCAGGGTCGTCATCCACCGGCTCAATTAGCTGCTATGGCGGTAAATTCTATGAACTCACAATCTCTGAATGATAG
TTCTAACGCATTTTGGTTATCTGACAGTGGCTGCAACGTTCATATGACAAACGACCTAGCAAATCTCAATCTCTCCAACAATTATAATGGGGAAGAATCTGTCACAGTGG
GTAATGGCCAACCTCTAAATATCCAAAACACAGACATTGGTACACTTTATACCTCCTCTCATACCTTTAATCTTTCCAAAATTCTTCATGCTCCTGAACTTGCAGCAAAC
CTTTTATCAGTTCATAAATTTTGCCTTGATAATAACTGTATTTTTGTCTTTGATACTGACTGGTTCCTAATTCAGGATAAAGTCTCTGGAAAAATCTTATACACTGGTGA
AAGCGTCAATGGCCTTTATCCCATCCCTAGTCCATCTATGCTATCTTCTGATCTACACCCCAAAAACTTTAATTTTATGGCAAAACAGGAGTGTTCTCTGTGGCATCATC
GGTTTGGTCATCCCGCACCCAAAATATTACGCTCTAGTTTATCTCGTCTTGTTTTTCTAATCAACCGTATGCCTTCTTCTTCTTTCAACATGTCATCTTCTTTTGAGAAA
CTTTATGCTGAGACATCTCATGAGCATTCAATATCTCAAAATGTTCACATTTCTGCAAGCACTATCAATAATCATTTGATGCAAACACGAGCTAAGTCAGGAATTTTCAA
GTCAAAGGCGTTTGTCTCCACCATGACTACTTTAGTTCCAACAGACCCCACTTCATATTTTGTTGCTTCGAAGACCAAATATAACATTGATGGCACTGTTGCTCAATATA
AAGCCCGATTAGTTGCAAAAGGATATCATCAGGTTGAAGGGTTTGACTTTACTGAAACCTTCAGTCCAGTTGTTAAAAAGCCCACAATTAGAGTTATCCTCGCTCTTGTT
GCTCATTCGAAATACTTCCTTGGTCTGGAAGTCCACACTACCACTGATGACATTTTTGTCAATCAAGCTAAGTATCTCACTGACTTACTTCACACTGCAGGAATGTCCTC
TGCTAAGACATGTTTAACACCCATGTCCACTACAGTTGATCTCCACATCTCTGCACCGTTATTTGACAATGCCACGTTTTATCGTCAACCGGCTCCAAACCACTTGGCAT
TCTGTTTCGACGTGCCAATTTGTCTCTCACTGCCTTTTGTGAATCTGATTGGGCTGGTGACACTTCTAACCGACGATCCACATGTGGCTTCATTACTTTTCTTGGCTCCA
GTCCAATATCCTGGTCAGCCAAAAAGCAACCTACGGTATCACGTTCTTCCACTGAGGTTGAATACTGCTCTCTTGCGACTACAACAGCAGATCTTTATTGACTACGACAG
CTTCTTCCTGATCTCCACGTCATCTTATCCAGTCTGCCTACATTATTGTGTGACAACATTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTTCTTCTCCTACAGGAACAACAACAACAGAGCGGTTTGAAAAATGTTGCCGTTTCCGTTAAGATTGAAGATGAAGAGGTACTCGTACATACCTTGAACGGTCTTCC
CTCGGATTTCAACGCCTTCCGAACGTCGATTCGAACGAGCAGTGGCACCTTGTCTTTAGCCGAGCTTCATGTATTATTAGGTGTCGAAGAGAAGACGATTCATCAACACC
AGGCGGCTGAACCAACACCCTACGACCATGGCAGCAGTGAGGGGAAACTGTTCGAGCCCAAATTTTCGACGCAGAGGGTGGCGAGGAAATTACCGGCCACGAGGGGCACG
TGGTTTTCAACGTGGAAATGGATGAATTTTTCCTATCAGGGTCGTCATCCACCGGCTCAATTAGCTGCTATGGCGGTAAATTCTATGAACTCACAATCTCTGAATGATAG
TTCTAACGCATTTTGGTTATCTGACAGTGGCTGCAACGTTCATATGACAAACGACCTAGCAAATCTCAATCTCTCCAACAATTATAATGGGGAAGAATCTGTCACAGTGG
GTAATGGCCAACCTCTAAATATCCAAAACACAGACATTGGTACACTTTATACCTCCTCTCATACCTTTAATCTTTCCAAAATTCTTCATGCTCCTGAACTTGCAGCAAAC
CTTTTATCAGTTCATAAATTTTGCCTTGATAATAACTGTATTTTTGTCTTTGATACTGACTGGTTCCTAATTCAGGATAAAGTCTCTGGAAAAATCTTATACACTGGTGA
AAGCGTCAATGGCCTTTATCCCATCCCTAGTCCATCTATGCTATCTTCTGATCTACACCCCAAAAACTTTAATTTTATGGCAAAACAGGAGTGTTCTCTGTGGCATCATC
GGTTTGGTCATCCCGCACCCAAAATATTACGCTCTAGTTTATCTCGTCTTGTTTTTCTAATCAACCGTATGCCTTCTTCTTCTTTCAACATGTCATCTTCTTTTGAGAAA
CTTTATGCTGAGACATCTCATGAGCATTCAATATCTCAAAATGTTCACATTTCTGCAAGCACTATCAATAATCATTTGATGCAAACACGAGCTAAGTCAGGAATTTTCAA
GTCAAAGGCGTTTGTCTCCACCATGACTACTTTAGTTCCAACAGACCCCACTTCATATTTTGTTGCTTCGAAGACCAAATATAACATTGATGGCACTGTTGCTCAATATA
AAGCCCGATTAGTTGCAAAAGGATATCATCAGGTTGAAGGGTTTGACTTTACTGAAACCTTCAGTCCAGTTGTTAAAAAGCCCACAATTAGAGTTATCCTCGCTCTTGTT
GCTCATTCGAAATACTTCCTTGGTCTGGAAGTCCACACTACCACTGATGACATTTTTGTCAATCAAGCTAAGTATCTCACTGACTTACTTCACACTGCAGGAATGTCCTC
TGCTAAGACATGTTTAACACCCATGTCCACTACAGTTGATCTCCACATCTCTGCACCGTTATTTGACAATGCCACGTTTTATCGTCAACCGGCTCCAAACCACTTGGCAT
TCTGTTTCGACGTGCCAATTTGTCTCTCACTGCCTTTTGTGAATCTGATTGGGCTGGTGACACTTCTAACCGACGATCCACATGTGGCTTCATTACTTTTCTTGGCTCCA
GTCCAATATCCTGGTCAGCCAAAAAGCAACCTACGGTATCACGTTCTTCCACTGAGGTTGAATACTGCTCTCTTGCGACTACAACAGCAGATCTTTATTGACTACGACAG
CTTCTTCCTGATCTCCACGTCATCTTATCCAGTCTGCCTACATTATTGTGTGACAACATTTTAG
Protein sequenceShow/hide protein sequence
MFLLLQEQQQQSGLKNVAVSVKIEDEEVLVHTLNGLPSDFNAFRTSIRTSSGTLSLAELHVLLGVEEKTIHQHQAAEPTPYDHGSSEGKLFEPKFSTQRVARKLPATRGT
WFSTWKWMNFSYQGRHPPAQLAAMAVNSMNSQSLNDSSNAFWLSDSGCNVHMTNDLANLNLSNNYNGEESVTVGNGQPLNIQNTDIGTLYTSSHTFNLSKILHAPELAAN
LLSVHKFCLDNNCIFVFDTDWFLIQDKVSGKILYTGESVNGLYPIPSPSMLSSDLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSLSRLVFLINRMPSSSFNMSSSFEK
LYAETSHEHSISQNVHISASTINNHLMQTRAKSGIFKSKAFVSTMTTLVPTDPTSYFVASKTKYNIDGTVAQYKARLVAKGYHQVEGFDFTETFSPVVKKPTIRVILALV
AHSKYFLGLEVHTTTDDIFVNQAKYLTDLLHTAGMSSAKTCLTPMSTTVDLHISAPLFDNATFYRQPAPNHLAFCFDVPICLSLPFVNLIGLVTLLTDDPHVASLLFLAP
VQYPGQPKSNLRYHVLPLRLNTALLRLQQQIFIDYDSFFLISTSSYPVCLHYCVTTF