CuGenDBv2

Lag0001804 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0001804
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr4:35581502..35585419
RNA-Seq Expression: Lag0001804
Synteny: Lag0001804
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
                  IPR043502 - DNA/RNA polymerase superfamily
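The genome location above can be turned into a locus length with a few lines of Python. This is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location: str):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr4:35581502..35585419")
print(chrom, span_length(start, end))  # chr4 3918
```

Under that assumption the locus spans 3918 bp on chr4.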


Homology
GenBank top hits (e value, % identity, alignment)
KAA0045362.1 putative mitochondrial protein [Cucumis melo var. makuwa]; e value: 2.0e-89; % identity: 48.22
Query:  ESVTVDLSIDERTHENSLSQNVQIPESTTNNRSMQTRTKSGIFKPKAYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNV
        +S+ V  ++D     N+ + N+  P    N  +MQTR KS IFKPKA+   +TT++PT P SY+ ASKYP+WR+AM EEFNALQ QGTWSLVPRLPS NV
Subjt:  ESVTVDLSIDERTHENSLSQNVQIPESTTNNRSMQTRTKSGIFKPKAYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNV

Query:  VGCKWVFMTKYNTDGTVARYKARLVAKGFHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVC
        VGCKWVF  KYN DGT+AR+KARLVAKG+HQV+GFDF +TFSPVVK PTIR+ILAL AQY WSLTQLDVKN F HG L+E VYM+Q   F DK+ PNHVC
Subjt:  VGCKWVFMTKYNTDGTVARYKARLVAKGFHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVC

Query:  RLYKSI--DSSLFVWSVGSSLTYLLLYVDDIIITGPDSSYLFVLKKQLANEFQISDLG----------ESSLSNL---------------GLCSA-----
         L+KS+  D SLF+ SVGS L+YLLLYVDDII+TG D  Y+FVLK QLA EF+IS+LG          +SS+  +               G+ SA     
Subjt:  RLYKSI--DSSLFVWSVGSSLTYLLLYVDDIIITGPDSSYLFVLKKQLANEFQISDLG----------ESSLSNL---------------GLCSA-----

Query:  -----VQIYLSLP-------------------------------------------------------------------------FVTLNWVGDTSDQR
             + +Y   P                                                                         F   +W  DTSD+R
Subjt:  -----VQIYLSLP-------------------------------------------------------------------------FVTLNWVGDTSDQR

Query:  STSGFIVFLGSSPISWSSKKQ
        STSGFI FLGS+PISWSSKKQ
Subjt:  STSGFIVFLGSSPISWSSKKQ

KAA0050146.1 putative mitochondrial protein [Cucumis melo var. makuwa]; e value: 5.9e-81; % identity: 41.41
Query:  NRMNFSYQGRHPPAQLAAMALNSMNLQTSNDNSNNFWLSNSGCNIHMTNDLANLNLSNNYNGEESVTVDL----------SIDERTHENSLSQNVQIPES
        NRMNFSYQGRHPP+QLAAM +NSMN Q S +N+NNFWL +SGCN+HMTN+LANLNLSNNYNGEE+VTV+           S+D     N+ + N+  P  
Subjt:  NRMNFSYQGRHPPAQLAAMALNSMNLQTSNDNSNNFWLSNSGCNIHMTNDLANLNLSNNYNGEESVTVDL----------SIDERTHENSLSQNVQIPES

Query:  TTNNRSMQTRTKSGIFKPKAYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTVARYKARLVAK
          N  +MQTR KS IFKPKA+  T  + +P          K P  ++    E   ++                             +  + ++KARLVAK
Subjt:  TTNNRSMQTRTKSGIFKPKAYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTVARYKARLVAK

Query:  GFHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVCRLYKSIDSSLFVWSVGSSLTYLLLYVD
        G+HQV+GFDF +TFSPVVK PTI +ILAL AQY WSLTQLDVKNAFLHG L+E VY++QP GF DK+ PNHVC L+KS+     V      L++LLLYVD
Subjt:  GFHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVCRLYKSIDSSLFVWSVGSSLTYLLLYVD

Query:  DIIITGPDSSYLFVLKKQLANEFQISDLG----------ESSLSNL---------------GLCSA----------VQIYLSLP----------------
        DII+TGPD  Y+ V K QLA EF+ISDLG          +SS+  +               G+ SA          + +Y   P                
Subjt:  DIIITGPDSSYLFVLKKQLANEFQISDLG----------ESSLSNL---------------GLCSA----------VQIYLSLP----------------

Query:  --------------------------------------------------------FVTLNWVGDTSDQRSTSGFIVFLGSSPISWSSKKQPTQS
                                                                F   +W GDTSD+RSTSGFI F GS+PISWSSKK+ T S
Subjt:  --------------------------------------------------------FVTLNWVGDTSDQRSTSGFIVFLGSSPISWSSKKQPTQS

TYJ96936.1 putative mitochondrial protein [Cucumis melo var. makuwa]; e value: 2.9e-88; % identity: 60.81
Query:  ESVTVDLSIDERTHENSLSQNVQIPESTTNNRSMQTRTKSGIFKPKAYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNV
        +S+ V  ++D     N+ + N+  P    N  +MQTR KSGIFKPKA+   +TT++PT P SY+ ASKYP+W++AM EEFNALQ QGTWSLVPRLPS NV
Subjt:  ESVTVDLSIDERTHENSLSQNVQIPESTTNNRSMQTRTKSGIFKPKAYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNV

Query:  VGCKWVFMTKYNTDGTVARYKARLVAKGFHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVC
        VGCKWVF  KYN DGT+AR+KARLVAKG+HQV+GFDF +TFSPVVK PTIR+ILAL AQY WSLTQLDVKNAFLHG L+E VYM+QP+ F DK+ PNHVC
Subjt:  VGCKWVFMTKYNTDGTVARYKARLVAKGFHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVC

Query:  RLYKSI--DSSLFVWSVGSSLTYLLLYVDDIIITGPDSSYLFVLKKQLANEFQISDLGESSLSNLGLCSAVQIYLSLPFVTLNWVGDTSDQRSTSG
         L+KS+  D SLF+ SVGS L+YLLLYVDDII+TG D  Y+ VLK QLA EF+IS+LG      L     ++I  S+  + +N     +D   TSG
Subjt:  RLYKSI--DSSLFVWSVGSSLTYLLLYVDDIIITGPDSSYLFVLKKQLANEFQISDLGESSLSNLGLCSAVQIYLSLPFVTLNWVGDTSDQRSTSG

XP_022158189.1 uncharacterized protein LOC111024722 [Momordica charantia]; e value: 1.7e-80; % identity: 45.16
Query:  NNRSMQTRTKSGIFKPKAY-VSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTVARYKARLVAKG
        N   MQT  KSGIFKP+AY V + + ++ T P   + A+++ +WR+AM ++F ALQEQGTWSLVPR P MNVVGCKWVF TK+N+DG+ ARYKARL+AKG
Subjt:  NNRSMQTRTKSGIFKPKAY-VSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTVARYKARLVAKG

Query:  FHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVCRLYKSI----------------------
        +H++EGFDF +TFSPVVK PTIRV+L+L A + WSLTQLDVKN FLHG L+++V+M Q + F+D S P++VC L+KS+                      
Subjt:  FHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVCRLYKSI----------------------

Query:  ------DSSLFVWSVGSSLTYLLLYVDDIIITGPDSSYLFVLKKQLANEFQISDLG-------------------------ESSLSNLGLCSAV------
              D+SLFV SV  SLT+LLLYVDDIIITGPDSSY+ VLKK LA EFQISDLG                         +  L   G+CSA       
Subjt:  ------DSSLFVWSVGSSLTYLLLYVDDIIITGPDSSYLFVLKKQLANEFQISDLG-------------------------ESSLSNLGLCSAV------

Query:  --------------------QIYLSLPFVT-------------------------------------LNWVGDTSDQRSTSGFIVFLGSSPISWSSKKQP
                            Q+ +SL ++T                                     ++W GD  ++RST+GF+ FLGSSPISWS+KKQ 
Subjt:  --------------------QIYLSLPFVT-------------------------------------LNWVGDTSDQRSTSGFIVFLGSSPISWSSKKQP

Query:  TQS
        T S
Subjt:  TQS

XP_022158189.1 uncharacterized protein LOC111024722 [Momordica charantia]; e value: 1.8e-05; % identity: 44.83
Query:  MALNSMNLQTSNDNSNNFWLSNSGCNIHMTNDLANLNLSNNYNGEESVTV----DLSIDE--------RTHENSLSQNVQIPESTTN
        MA+N+M   +S++  NNFWLS+SGCN H+TNDL NLNL ++YNGEE VTV     L+I           +H  ++S  +  P+  TN
Subjt:  MALNSMNLQTSNDNSNNFWLSNSGCNIHMTNDLANLNLSNNYNGEESVTV----DLSIDE--------RTHENSLSQNVQIPESTTN

XP_022158189.1 uncharacterized protein LOC111024722 [Momordica charantia]; e value: 5.6e-71; % identity: 51.07
Query:  MCEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTVARYKARLVAKGFHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLH
        M EEFNALQ +GTWSLVPRLPSMNVVGCKWVF  KYN DGT+AR+KARLVAKG+ QV+GFDF +TFSPVVK  TIR+ILALVAQY WSLT LDVKNAFLH
Subjt:  MCEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTVARYKARLVAKGFHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLH

Query:  GYLKEEVYMSQPLGFLDKS----------------SPNHVCRLYKS-----------IDSSLFVWSVGSSLTYLLLYVDDIIITGPDSSYLFVLKKQLAN
        G L+E VYM+QP GF DK+                +P      + S            D SLF+ SVGSSLTYLLLYVDDIIIT PD  Y+ VLK QLA 
Subjt:  GYLKEEVYMSQPLGFLDKS----------------SPNHVCRLYKS-----------IDSSLFVWSVGSSLTYLLLYVDDIIITGPDSSYLFVLKKQLAN

Query:  EFQISDLG----------ESSLSNL-------------------------GLCSAVQIYL----------------SLPFVT---------LNWVGDTSD
        EF+I DLG          +SS+  +                          + +++ +Y                 SL ++T         +N   DTSD
Subjt:  EFQISDLG----------ESSLSNL-------------------------GLCSAVQIYL----------------SLPFVT---------LNWVGDTSD

Query:  QRSTSGFIVFLGSSPISWSSKKQPTQS
        +RSTSGFI FLGS+PISWSSKKQ T S
Subjt:  QRSTSGFIVFLGSSPISWSSKKQPTQS

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TPR4 Putative mitochondrial protein; e value: 9.9e-90; % identity: 48.22
Query:  ESVTVDLSIDERTHENSLSQNVQIPESTTNNRSMQTRTKSGIFKPKAYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNV
        +S+ V  ++D     N+ + N+  P    N  +MQTR KS IFKPKA+   +TT++PT P SY+ ASKYP+WR+AM EEFNALQ QGTWSLVPRLPS NV
Subjt:  ESVTVDLSIDERTHENSLSQNVQIPESTTNNRSMQTRTKSGIFKPKAYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNV

Query:  VGCKWVFMTKYNTDGTVARYKARLVAKGFHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVC
        VGCKWVF  KYN DGT+AR+KARLVAKG+HQV+GFDF +TFSPVVK PTIR+ILAL AQY WSLTQLDVKN F HG L+E VYM+Q   F DK+ PNHVC
Subjt:  VGCKWVFMTKYNTDGTVARYKARLVAKGFHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVC

Query:  RLYKSI--DSSLFVWSVGSSLTYLLLYVDDIIITGPDSSYLFVLKKQLANEFQISDLG----------ESSLSNL---------------GLCSA-----
         L+KS+  D SLF+ SVGS L+YLLLYVDDII+TG D  Y+FVLK QLA EF+IS+LG          +SS+  +               G+ SA     
Subjt:  RLYKSI--DSSLFVWSVGSSLTYLLLYVDDIIITGPDSSYLFVLKKQLANEFQISDLG----------ESSLSNL---------------GLCSA-----

Query:  -----VQIYLSLP-------------------------------------------------------------------------FVTLNWVGDTSDQR
             + +Y   P                                                                         F   +W  DTSD+R
Subjt:  -----VQIYLSLP-------------------------------------------------------------------------FVTLNWVGDTSDQR

Query:  STSGFIVFLGSSPISWSSKKQ
        STSGFI FLGS+PISWSSKKQ
Subjt:  STSGFIVFLGSSPISWSSKKQ

A0A5A7U7J4 Putative mitochondrial protein; e value: 2.9e-81; % identity: 41.41
Query:  NRMNFSYQGRHPPAQLAAMALNSMNLQTSNDNSNNFWLSNSGCNIHMTNDLANLNLSNNYNGEESVTVDL----------SIDERTHENSLSQNVQIPES
        NRMNFSYQGRHPP+QLAAM +NSMN Q S +N+NNFWL +SGCN+HMTN+LANLNLSNNYNGEE+VTV+           S+D     N+ + N+  P  
Subjt:  NRMNFSYQGRHPPAQLAAMALNSMNLQTSNDNSNNFWLSNSGCNIHMTNDLANLNLSNNYNGEESVTVDL----------SIDERTHENSLSQNVQIPES

Query:  TTNNRSMQTRTKSGIFKPKAYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTVARYKARLVAK
          N  +MQTR KS IFKPKA+  T  + +P          K P  ++    E   ++                             +  + ++KARLVAK
Subjt:  TTNNRSMQTRTKSGIFKPKAYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTVARYKARLVAK

Query:  GFHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVCRLYKSIDSSLFVWSVGSSLTYLLLYVD
        G+HQV+GFDF +TFSPVVK PTI +ILAL AQY WSLTQLDVKNAFLHG L+E VY++QP GF DK+ PNHVC L+KS+     V      L++LLLYVD
Subjt:  GFHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVCRLYKSIDSSLFVWSVGSSLTYLLLYVD

Query:  DIIITGPDSSYLFVLKKQLANEFQISDLG----------ESSLSNL---------------GLCSA----------VQIYLSLP----------------
        DII+TGPD  Y+ V K QLA EF+ISDLG          +SS+  +               G+ SA          + +Y   P                
Subjt:  DIIITGPDSSYLFVLKKQLANEFQISDLG----------ESSLSNL---------------GLCSA----------VQIYLSLP----------------

Query:  --------------------------------------------------------FVTLNWVGDTSDQRSTSGFIVFLGSSPISWSSKKQPTQS
                                                                F   +W GDTSD+RSTSGFI F GS+PISWSSKK+ T S
Subjt:  --------------------------------------------------------FVTLNWVGDTSDQRSTSGFIVFLGSSPISWSSKKQPTQS

A0A5D3BD76 Putative mitochondrial protein; e value: 1.4e-88; % identity: 60.81
Query:  ESVTVDLSIDERTHENSLSQNVQIPESTTNNRSMQTRTKSGIFKPKAYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNV
        +S+ V  ++D     N+ + N+  P    N  +MQTR KSGIFKPKA+   +TT++PT P SY+ ASKYP+W++AM EEFNALQ QGTWSLVPRLPS NV
Subjt:  ESVTVDLSIDERTHENSLSQNVQIPESTTNNRSMQTRTKSGIFKPKAYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNV

Query:  VGCKWVFMTKYNTDGTVARYKARLVAKGFHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVC
        VGCKWVF  KYN DGT+AR+KARLVAKG+HQV+GFDF +TFSPVVK PTIR+ILAL AQY WSLTQLDVKNAFLHG L+E VYM+QP+ F DK+ PNHVC
Subjt:  VGCKWVFMTKYNTDGTVARYKARLVAKGFHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVC

Query:  RLYKSI--DSSLFVWSVGSSLTYLLLYVDDIIITGPDSSYLFVLKKQLANEFQISDLGESSLSNLGLCSAVQIYLSLPFVTLNWVGDTSDQRSTSG
         L+KS+  D SLF+ SVGS L+YLLLYVDDII+TG D  Y+ VLK QLA EF+IS+LG      L     ++I  S+  + +N     +D   TSG
Subjt:  RLYKSI--DSSLFVWSVGSSLTYLLLYVDDIIITGPDSSYLFVLKKQLANEFQISDLGESSLSNLGLCSAVQIYLSLPFVTLNWVGDTSDQRSTSG

A0A6J1DYN6 uncharacterized protein LOC111024722; e value: 8.4e-81; % identity: 45.16
Query:  NNRSMQTRTKSGIFKPKAY-VSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTVARYKARLVAKG
        N   MQT  KSGIFKP+AY V + + ++ T P   + A+++ +WR+AM ++F ALQEQGTWSLVPR P MNVVGCKWVF TK+N+DG+ ARYKARL+AKG
Subjt:  NNRSMQTRTKSGIFKPKAY-VSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTVARYKARLVAKG

Query:  FHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVCRLYKSI----------------------
        +H++EGFDF +TFSPVVK PTIRV+L+L A + WSLTQLDVKN FLHG L+++V+M Q + F+D S P++VC L+KS+                      
Subjt:  FHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVCRLYKSI----------------------

Query:  ------DSSLFVWSVGSSLTYLLLYVDDIIITGPDSSYLFVLKKQLANEFQISDLG-------------------------ESSLSNLGLCSAV------
              D+SLFV SV  SLT+LLLYVDDIIITGPDSSY+ VLKK LA EFQISDLG                         +  L   G+CSA       
Subjt:  ------DSSLFVWSVGSSLTYLLLYVDDIIITGPDSSYLFVLKKQLANEFQISDLG-------------------------ESSLSNLGLCSAV------

Query:  --------------------QIYLSLPFVT-------------------------------------LNWVGDTSDQRSTSGFIVFLGSSPISWSSKKQP
                            Q+ +SL ++T                                     ++W GD  ++RST+GF+ FLGSSPISWS+KKQ 
Subjt:  --------------------QIYLSLPFVT-------------------------------------LNWVGDTSDQRSTSGFIVFLGSSPISWSSKKQP

Query:  TQS
        T S
Subjt:  TQS

A0A6J1DYN6 uncharacterized protein LOC111024722; e value: 8.8e-06; % identity: 44.83
Query:  MALNSMNLQTSNDNSNNFWLSNSGCNIHMTNDLANLNLSNNYNGEESVTV----DLSIDE--------RTHENSLSQNVQIPESTTN
        MA+N+M   +S++  NNFWLS+SGCN H+TNDL NLNL ++YNGEE VTV     L+I           +H  ++S  +  P+  TN
Subjt:  MALNSMNLQTSNDNSNNFWLSNSGCNIHMTNDLANLNLSNNYNGEESVTV----DLSIDE--------RTHENSLSQNVQIPESTTN

A0A6J1DYN6 uncharacterized protein LOC111024722; e value: 3.3e-77; % identity: 45.6
Query:  NVQIPESTTNNRSMQTRTKSGIFKPKAYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTVARY
        ++ +P    N   MQTR+K+GIFKPK   +  T    T+P +Y+ ASK+P+W +AM EEF ALQ+QGTW+LVP   + N+VGCKWV+  KYN+DGT++RY
Subjt:  NVQIPESTTNNRSMQTRTKSGIFKPKAYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTVARY

Query:  KARLVAKGFHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVCRLYKSI--------------
        KARLVAKGFHQ  G DF +TFSPVVK PT+R+IL+L     W L QLDVKNAFLHG LKEEVYM QP G+ D S P+HVC+L KSI              
Subjt:  KARLVAKGFHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVCRLYKSI--------------

Query:  --------------DSSLFVWSVGSSLTYLLLYVDDIIITGPDSSYLFVLKKQLANEFQISDLG--------ESSLSNLGL-------------------
                      DSSLFV+   S + YLLLYVDDI++T    SYL  L  QL+  F + DLG        + + S+ GL                   
Subjt:  --------------DSSLFVWSVGSSLTYLLLYVDDIIITGPDSSYLFVLKKQLANEFQISDLG--------ESSLSNLGL-------------------

Query:  --------CSAVQIYLS------LPFVTLNWVGDTSDQRSTSGFIVFLGSSPISWSSKKQPTQS
                C   ++ L         +   +W GD  D+RSTSG++V++GS+PI+WS+KKQPT S
Subjt:  --------CSAVQIYLS------LPFVTLNWVGDTSDQRSTSGFIVFLGSSPISWSSKKQPTQS

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein; e value: 7.9e-28; % identity: 29.53
Query:  NLQTSNDN--SNNFWLSNS-----GCNIHMTNDLANLNLSNNYNGEESVTVDLSIDERTHENSLSQNVQIPESTTNNRSMQTRTK---------SGIFKP
        N+Q   D+  SN ++L+ S       +++ +    N N S      E +  ++ ID  T  + +           N RS + +TK         + + K 
Subjt:  NLQTSNDN--SNNFWLSNS-----GCNIHMTNDLANLNLSNNYNGEESVTVDLSIDERTHENSLSQNVQIPESTTNNRSMQTRTK---------SGIFKP

Query:  KAYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTVARYKARLVAKGFHQVEGFDFTKTFSPVV
             T+   VP              W  A+  E NA +   TW++  R  + N+V  +WVF  KYN  G   RYKARLVA+GF Q    D+ +TF+PV 
Subjt:  KAYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTVARYKARLVAKGFHQVEGFDFTKTFSPVV

Query:  KNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVCRLYK----------------------------SIDSSLFVWSVG-
        +  + R IL+LV QY   + Q+DVK AFL+G LKEE+YM  P G     + ++VC+L K                            S+D  +++   G 
Subjt:  KNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVCRLYK----------------------------SIDSSLFVWSVG-

Query:  -SSLTYLLLYVDDIIITGPDSSYLFVLKKQLANEFQISDLGE
         +   Y+LLYVDD++I   D + +   K+ L  +F+++DL E
Subjt:  -SSLTYLLLYVDDIIITGPDSSYLFVLKKQLANEFQISDLGE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 1.1e-32; % identity: 34.3
Query:  SNNYNGEESVTVDLSIDERTHENSLSQNVQIPESTTN--------------NRSMQTRTKSGIFKPKAYVSTMTTSVPTDPPSYSVASKYP---KWRSAM
        SNN    ES T ++S         + Q  Q+ E                   RS + R +S  +    YV     S   +P S      +P   +   AM
Subjt:  SNNYNGEESVTVDLSIDERTHENSLSQNVQIPESTTN--------------NRSMQTRTKSGIFKPKAYVSTMTTSVPTDPPSYSVASKYP---KWRSAM

Query:  CEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTVARYKARLVAKGFHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHG
         EE  +LQ+ GT+ LV        + CKWVF  K + D  + RYKARLV KGF Q +G DF + FSPVVK  +IR IL+L A     + QLDVK AFLHG
Subjt:  CEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTVARYKARLVAKGFHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHG

Query:  YLKEEVYMSQPLGFLDKSSPNHVCRLYKSI------------------DSSLFVWSVGSSLTY-----------LLLYVDDIIITGPDSSYLFVLKKQLA
         L+EE+YM QP GF      + VC+L KS+                   S  ++ +      Y           LLLYVDD++I G D   +  LK  L+
Subjt:  YLKEEVYMSQPLGFLDKSSPNHVCRLYKSI------------------DSSLFVWSVGSSLTY-----------LLLYVDDIIITGPDSSYLFVLKKQLA

Query:  NEFQISDLG
          F + DLG
Subjt:  NEFQISDLG

P92520 Uncharacterized mitochondrial protein AtMg00820; e value: 1.4e-27; % identity: 52.31
Query:  MQTRTKSGIFK--PKAYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTVARYKARLVAKGFHQ
        M TR+K+GI K  PK Y  T+TT++  +P S   A K P W  AM EE +AL    TW LVP   + N++GCKWVF TK ++DGT+ R KARLVAKGFHQ
Subjt:  MQTRTKSGIFK--PKAYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTVARYKARLVAKGFHQ

Query:  VEGFDFTKTFSPVVKNPTIRVILALVAQYQ
         EG  F +T+SPVV+  TIR IL +  Q +
Subjt:  VEGFDFTKTFSPVVKNPTIRVILALVAQYQ

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value: 1.3e-51; % identity: 42.08
Query:  ESTTNNRSMQTRTKSGIFKPK-AYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPS-MNVVGCKWVFMTKYNTDGTVARYKAR
        ++  N  SM TR K+GI KP   Y   ++ +  ++P +   A K  +WR+AM  E NA     TW LVP  PS + +VGC+W+F  KYN+DG++ RYKAR
Subjt:  ESTTNNRSMQTRTKSGIFKPK-AYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPS-MNVVGCKWVFMTKYNTDGTVARYKAR

Query:  LVAKGFHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVCRLYKSI-----------------
        LVAKG++Q  G D+ +TFSPV+K+ +IR++L +     W + QLDV NAFL G L ++VYMSQP GF+DK  PN+VC+L K++                 
Subjt:  LVAKGFHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVCRLYKSI-----------------

Query:  -----------DSSLFVWSVGSSLTYLLLYVDDIIITGPDSSYLFVLKKQLANEFQISD
                   D+SLFV   G S+ Y+L+YVDDI+ITG D + L      L+  F + D
Subjt:  -----------DSSLFVWSVGSSLTYLLLYVDDIIITGPDSSYLFVLKKQLANEFQISD

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value: 4.7e-04; % identity: 64.52
Query:  NWVGDTSDQRSTSGFIVFLGSSPISWSSKKQ
        +W GD  D  ST+G+IV+LG  PISWSSKKQ
Subjt:  NWVGDTSDQRSTSGFIVFLGSSPISWSSKKQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value: 6.9e-48; % identity: 38.28
Query:  QTSNDNSNNFWLSNSGCNIHMTND-LANLNLSNN-YNGEESVTVDLSIDERTHENSLSQN-------------VQI-PESTTNNRSMQTRTKSGIFKPKA
        QT N NSN+  L+N   N    N    N  L  +  +     T   SI E    +S S +             +Q+  ++  N  SM TR K GI KP  
Subjt:  QTSNDNSNNFWLSNSGCNIHMTND-LANLNLSNN-YNGEESVTVDLSIDERTHENSLSQN-------------VQI-PESTTNNRSMQTRTKSGIFKPKA

Query:  YVSTMTTSVPTDPPSYSV-ASKYPKWRSAMCEEFNALQEQGTWSLV-PRLPSMNVVGCKWVFMTKYNTDGTVARYKARLVAKGFHQVEGFDFTKTFSPVV
          S  T+      P  ++ A K  +WR AM  E NA     TW LV P  PS+ +VGC+W+F  K+N+DG++ RYKARLVAKG++Q  G D+ +TFSPV+
Subjt:  YVSTMTTSVPTDPPSYSV-ASKYPKWRSAMCEEFNALQEQGTWSLV-PRLPSMNVVGCKWVFMTKYNTDGTVARYKARLVAKGFHQVEGFDFTKTFSPVV

Query:  KNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVCRLYKSI----------------------------DSSLFVWSVGS
        K+ +IR++L +     W + QLDV NAFL G L +EVYMSQP GF+DK  P++VCRL K+I                            D+SLFV   G 
Subjt:  KNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVCRLYKSI----------------------------DSSLFVWSVGS

Query:  SLTYLLLYVDDIIITGPDSSYLFVLKKQLANEFQISD
        S+ Y+L+YVDDI+ITG D+  L      L+  F + +
Subjt:  SLTYLLLYVDDIIITGPDSSYLFVLKKQLANEFQISD

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value: 9.4e-05; % identity: 67.74
Query:  NWVGDTSDQRSTSGFIVFLGSSPISWSSKKQ
        +W GDT D  ST+G+IV+LG  PISWSSKKQ
Subjt:  NWVGDTSDQRSTSGFIVFLGSSPISWSSKKQ

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8; e value: 1.9e-40; % identity: 39.57
Query:  DPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTVARYKARLVAKGFHQVEGFDFTKTFSPVVKNPTIRVILALVA
        +P +Y+ A ++  W  AM +E  A++   TW +    P+   +GCKWV+  KYN+DGT+ RYKARLVAKG+ Q EG DF +TFSPV K  ++++ILA+ A
Subjt:  DPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTVARYKARLVAKGFHQVEGFDFTKTFSPVVKNPTIRVILALVA

Query:  QYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFL----DKSSPNHVCRLYKSI----------------------------DSSLFVWSVGSSLTYLLLYV
         Y ++L QLD+ NAFL+G L EE+YM  P G+     D   PN VC L KSI                            D + F+    +    +L+YV
Subjt:  QYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFL----DKSSPNHVCRLYKSI----------------------------DSSLFVWSVGSSLTYLLLYV

Query:  DDIIITGPDSSYLFVLKKQLANEFQISDLG
        DDIII   + + +  LK QL + F++ DLG
Subjt:  DDIIITGPDSSYLFVLKKQLANEFQISDLG

ATMG00810.1 DNA/RNA polymerases superfamily protein; e value: 1.5e-05; % identity: 56.41
Query:  FVTLNWVGDTSDQRSTSGFIVFLGSSPISWSSKKQPTQS
        F   +W G TS +RST+GF  FLG + ISWS+K+QPT S
Subjt:  FVTLNWVGDTSDQRSTSGFIVFLGSSPISWSSKKQPTQS

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase); e value: 9.6e-29; % identity: 52.31
Query:  MQTRTKSGIFK--PKAYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTVARYKARLVAKGFHQ
        M TR+K+GI K  PK Y  T+TT++  +P S   A K P W  AM EE +AL    TW LVP   + N++GCKWVF TK ++DGT+ R KARLVAKGFHQ
Subjt:  MQTRTKSGIFK--PKAYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTVARYKARLVAKGFHQ

Query:  VEGFDFTKTFSPVVKNPTIRVILALVAQYQ
         EG  F +T+SPVV+  TIR IL +  Q +
Subjt:  VEGFDFTKTFSPVVKNPTIRVILALVAQYQ


Sequences
CDS sequence
ATGGCCAACACAACGACATCTTCTTCTTCTACTGCCATCCCTGTCACTACCTCTACCACCATTCCTGCCACCACACCTCTGGCTCATTCTCTTTTCGGCCACATTGATGA
ATCTCTGTCCAAACCTGATCAATATACATTTGTTTCCTACAAGGGAGTTAACAACACAGATGCTGAAGAGAAAACGTTGCAACGCCATCCGATTGCTGAACCTACTCCAA
CAACCATGGTCGCCGTTCGCAACACCCAGCAAATCCCCAATTTTCGAGGAATAGGAAGGAGAAACGCATGTGGTTGTGGATTTTACCAAGGATCTGGAAACTTCAAAGAC
CCAAGCCCATTTCACGCTACTGGTCTGGTTCTTCAATTTCTGATTCTGGACTCAACAGTGGTCGAATTTTTTGTCAGATTTGCTCAAAATCTGGACATGGAGCACTTGAT
TGTTATAATAAATAGAATGAATTTCTCCTACCAGGGTCGTCATCCACCGGCACAATTGGCTGCTATGGCGCTAAATTCTATGAATTTACAGACTTCAAATGATAATTCTA
ACAATTTTTGGTTATCTAACAGTGGCTGCAATATTCATATGACCAATGATCTTGCAAATCTCAATCTCTCCAACAATTACAATGGAGAAGAATCTGTCACAGTGGATCTG
TCAATAGATGAGCGAACTCATGAGAATTCACTATCTCAAAATGTTCAAATTCCCGAAAGCACTACCAATAATCGTTCCATGCAAACACGGACTAAGTCAGGAATTTTCAA
ACCAAAGGCATATGTTTCGACCATGACCACTTCAGTTCCCACAGATCCTCCTTCATACTCTGTTGCTTCCAAGTATCCAAAATGGAGATCCGCTATGTGTGAGGAATTTA
ATGCTCTTCAAGAACAAGGTACGTGGTCCTTAGTACCTCGTTTACCTTCCATGAATGTTGTAGGTTGCAAATGGGTCTTTATGACCAAATATAACACTGATGGAACTGTC
GCTCGATATAAAGCTCGTTTAGTTGCCAAAGGATTTCATCAGGTTGAGGGATTTGATTTTACTAAAACCTTCAGTCCAGTTGTTAAAAATCCTACAATCAGGGTTATATT
AGCTCTTGTTGCTCAGTATCAGTGGTCTCTAACCCAATTGGATGTCAAGAATGCATTTTTGCATGGTTACTTAAAGGAGGAAGTTTACATGTCTCAACCTCTTGGTTTTC
TTGACAAGAGTAGCCCAAATCATGTTTGTCGACTTTACAAATCTATTGATTCATCCTTATTTGTATGGTCAGTTGGATCATCTCTGACATATCTGCTACTTTATGTTGAT
GATATAATTATCACTGGACCAGATTCATCATATCTATTTGTCTTGAAGAAGCAATTGGCAAATGAATTTCAGATATCAGATCTTGGTGAATCGTCTTTAAGCAACTTGGG
ATTATGTTCCGCCGTGCAAATTTATCTCTCACTGCCTTTTGTGACTCTGAATTGGGTTGGTGATACTTCTGATCAACGATCCACATCAGGATTCATTGTTTTCCTAGGAT
CCAGTCCTATATCATGGTCATCCAAAAAGCAACCTACACAGTCTCTCGCTCTTCCATTGAAGCTGAATATCGCTCTCTTGCAACTACAATAG
mRNA sequence
ATGGCCAACACAACGACATCTTCTTCTTCTACTGCCATCCCTGTCACTACCTCTACCACCATTCCTGCCACCACACCTCTGGCTCATTCTCTTTTCGGCCACATTGATGA
ATCTCTGTCCAAACCTGATCAATATACATTTGTTTCCTACAAGGGAGTTAACAACACAGATGCTGAAGAGAAAACGTTGCAACGCCATCCGATTGCTGAACCTACTCCAA
CAACCATGGTCGCCGTTCGCAACACCCAGCAAATCCCCAATTTTCGAGGAATAGGAAGGAGAAACGCATGTGGTTGTGGATTTTACCAAGGATCTGGAAACTTCAAAGAC
CCAAGCCCATTTCACGCTACTGGTCTGGTTCTTCAATTTCTGATTCTGGACTCAACAGTGGTCGAATTTTTTGTCAGATTTGCTCAAAATCTGGACATGGAGCACTTGAT
TGTTATAATAAATAGAATGAATTTCTCCTACCAGGGTCGTCATCCACCGGCACAATTGGCTGCTATGGCGCTAAATTCTATGAATTTACAGACTTCAAATGATAATTCTA
ACAATTTTTGGTTATCTAACAGTGGCTGCAATATTCATATGACCAATGATCTTGCAAATCTCAATCTCTCCAACAATTACAATGGAGAAGAATCTGTCACAGTGGATCTG
TCAATAGATGAGCGAACTCATGAGAATTCACTATCTCAAAATGTTCAAATTCCCGAAAGCACTACCAATAATCGTTCCATGCAAACACGGACTAAGTCAGGAATTTTCAA
ACCAAAGGCATATGTTTCGACCATGACCACTTCAGTTCCCACAGATCCTCCTTCATACTCTGTTGCTTCCAAGTATCCAAAATGGAGATCCGCTATGTGTGAGGAATTTA
ATGCTCTTCAAGAACAAGGTACGTGGTCCTTAGTACCTCGTTTACCTTCCATGAATGTTGTAGGTTGCAAATGGGTCTTTATGACCAAATATAACACTGATGGAACTGTC
GCTCGATATAAAGCTCGTTTAGTTGCCAAAGGATTTCATCAGGTTGAGGGATTTGATTTTACTAAAACCTTCAGTCCAGTTGTTAAAAATCCTACAATCAGGGTTATATT
AGCTCTTGTTGCTCAGTATCAGTGGTCTCTAACCCAATTGGATGTCAAGAATGCATTTTTGCATGGTTACTTAAAGGAGGAAGTTTACATGTCTCAACCTCTTGGTTTTC
TTGACAAGAGTAGCCCAAATCATGTTTGTCGACTTTACAAATCTATTGATTCATCCTTATTTGTATGGTCAGTTGGATCATCTCTGACATATCTGCTACTTTATGTTGAT
GATATAATTATCACTGGACCAGATTCATCATATCTATTTGTCTTGAAGAAGCAATTGGCAAATGAATTTCAGATATCAGATCTTGGTGAATCGTCTTTAAGCAACTTGGG
ATTATGTTCCGCCGTGCAAATTTATCTCTCACTGCCTTTTGTGACTCTGAATTGGGTTGGTGATACTTCTGATCAACGATCCACATCAGGATTCATTGTTTTCCTAGGAT
CCAGTCCTATATCATGGTCATCCAAAAAGCAACCTACACAGTCTCTCGCTCTTCCATTGAAGCTGAATATCGCTCTCTTGCAACTACAATAG
Protein sequence
MANTTTSSSSTAIPVTTSTTIPATTPLAHSLFGHIDESLSKPDQYTFVSYKGVNNTDAEEKTLQRHPIAEPTPTTMVAVRNTQQIPNFRGIGRRNACGCGFYQGSGNFKD
PSPFHATGLVLQFLILDSTVVEFFVRFAQNLDMEHLIVIINRMNFSYQGRHPPAQLAAMALNSMNLQTSNDNSNNFWLSNSGCNIHMTNDLANLNLSNNYNGEESVTVDL
SIDERTHENSLSQNVQIPESTTNNRSMQTRTKSGIFKPKAYVSTMTTSVPTDPPSYSVASKYPKWRSAMCEEFNALQEQGTWSLVPRLPSMNVVGCKWVFMTKYNTDGTV
ARYKARLVAKGFHQVEGFDFTKTFSPVVKNPTIRVILALVAQYQWSLTQLDVKNAFLHGYLKEEVYMSQPLGFLDKSSPNHVCRLYKSIDSSLFVWSVGSSLTYLLLYVD
DIIITGPDSSYLFVLKKQLANEFQISDLGESSLSNLGLCSAVQIYLSLPFVTLNWVGDTSDQRSTSGFIVFLGSSPISWSSKKQPTQSLALPLKLNIALLQLQ
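The protein sequence above appears to be the translation of the listed CDS. The following is a minimal sketch for checking that correspondence, assuming the standard nuclear genetic code (the record does not state which code was used). The `translate` helper and the `CODON_TABLE` name are hypothetical and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). Using this table is an ASSUMPTION.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five codons of the CDS listed above.
print(translate("ATGGCCAACACAACG"))  # MANTT
```

To check the full record, paste the complete CDS into `translate` and compare the result with the protein sequence after removing its line breaks.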