; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031729 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031729
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr11:12931560..12935304
RNA-Seq ExpressionLag0031729
SyntenyLag0031729
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN16590.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]8.9e-11049.44Show/hide
Query:  GGTLCDFGARINLMPLSLYRKLGIGEAMPTTVTLQLADMSIEYPEGKIKDVLVKVDKFIFPVDFIILDYEVDKDVPIILGRPFLATGRAFIDVQKGELTM
        G  LCD GA INLMP S+YR LG+GEA PT++TLQLAD S+ YP G I+D+LVKVDKFIFP DF++LD EVD +VPIILGRPFLATGR  IDVQKGELTM
Subjt:  GGTLCDFGARINLMPLSLYRKLGIGEAMPTTVTLQLADMSIEYPEGKIKDVLVKVDKFIFPVDFIILDYEVDKDVPIILGRPFLATGRAFIDVQKGELTM

Query:  RVYDEKVKFNVFKTMKYLDEVEDCSFIMIMEK-----TIVETTMEDL-----------------------ANKHLEDHGEISV------RPIKPALIEAP
        RV D+++ FNVFK MK+ +E ++C  + + +K     +I E  ++ L                       A+K+ +  G  S+      + +KP++ E P
Subjt:  RVYDEKVKFNVFKTMKYLDEVEDCSFIMIMEK-----TIVETTMEDL-----------------------ANKHLEDHGEISV------RPIKPALIEAP

Query:  TLDLKPLPDHLKYVYLGE----------------------------------------------------------------------------------
        TL+LKPLP HL Y YLGE                                                                                  
Subjt:  TLDLKPLPDHLKYVYLGE----------------------------------------------------------------------------------

Query:  ---------DSNWVSPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQ
                 DS+WVSP+QCVPKKGG+TVV N  NELIPTRTVT WRVCMDYRKLNKAT KD+F LPFIDQMLDRL GK +Y FLDGYSGYNQI I PEDQ
Subjt:  ---------DSNWVSPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQ

Query:  EKTTFTYPYGMFAFRRMPFGLCNAPTTFQWCMLAIFSDMIESTVE
        EKTTFT PYG F FR+MPFGLCNAP TFQ CM+AIF+DM+E+ +E
Subjt:  EKTTFTYPYGMFAFRRMPFGLCNAPTTFQWCMLAIFSDMIESTVE

PIN16590.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]5.0e-0432.67Show/hide
Query:  ILLVANDRARAIRAYAFPMFDELNPGIPRPQIKAANFEMKSVMFQMLQTMGQFHGFSSEDPHLYLKSFIGVSDSFVIQGVSKDTLRLTLFPCSLRDGAKT
        I++  N     +R  A P   E    +  P++  A  +++  M +M+Q   QF G S E+P+ ++ +F+ + D+   +GVSKD LRL LF  SL   A  
Subjt:  ILLVANDRARAIRAYAFPMFDELNPGIPRPQIKAANFEMKSVMFQMLQTMGQFHGFSSEDPHLYLKSFIGVSDSFVIQGVSKDTLRLTLFPCSLRDGAKT

Query:  W
        W
Subjt:  W

PIN16590.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]3.4e-10949.77Show/hide
Query:  GGKGLGGTLCDFGARINLMPLSLYRKLGIGEAMPTTVTLQLADMSIEYPEGKIKDVLVKVDKFIFPVDFIILDYEVDKDVPIILGRPFLATGRAFIDVQK
        GGK LG  LCD G+ INLMPLS+Y+KLGIGEA PTTVTLQLAD S  YPEGKI+D+L++VDKFIFP DFIILDYE D DVPIILGRPFL TGR  +DV K
Subjt:  GGKGLGGTLCDFGARINLMPLSLYRKLGIGEAMPTTVTLQLADMSIEYPEGKIKDVLVKVDKFIFPVDFIILDYEVDKDVPIILGRPFLATGRAFIDVQK

Query:  GELTMRVYDEKVKFNVFKTMKYLDEVEDCSFIMIMEKTIVETTMEDLANKHLEDH------------GEIS------------VRPIKPALIEAPTLDLK
        G +T+R+ D+KV+FN+  +MKY    E+CS +  + +       +D  +   ED             GE S              P++P++ EAP LDLK
Subjt:  GELTMRVYDEKVKFNVFKTMKYLDEVEDCSFIMIMEKTIVETTMEDLANKHLEDH------------GEIS------------VRPIKPALIEAPTLDLK

Query:  PLPDHLKYVYLGE---------------------------------------------------------------------------------------
        PLP +LKY YLG+                                                                                       
Subjt:  PLPDHLKYVYLGE---------------------------------------------------------------------------------------

Query:  ----DSNWVSPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQEKTTF
            +S+ VSPIQCVPKKGG+TV+ N +NELIPTR V  WR+CMDYR+LNKAT KD+F LPFIDQMLDRL GK++Y FLDGYSGYNQITI+PEDQEKTTF
Subjt:  ----DSNWVSPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQEKTTF

Query:  TYPYGMFAFRRMPFGLCNAPTTFQWCMLAIFSDMIESTVE
        T PYG+FAFRRMPFGLCNAP TFQ CM+AIF+DM+E+ +E
Subjt:  TYPYGMFAFRRMPFGLCNAPTTFQWCMLAIFSDMIESTVE

PIN22487.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]1.4e-11049.44Show/hide
Query:  GGTLCDFGARINLMPLSLYRKLGIGEAMPTTVTLQLADMSIEYPEGKIKDVLVKVDKFIFPVDFIILDYEVDKDVPIILGRPFLATGRAFIDVQKGELTM
        G  LCD GA INLMP S+YR LG+GEA PT++TLQLAD S+ YP+G I+D+LVKVDKFIFP DF++LD EVD +VPIILGRPFLATGR  IDVQKGELTM
Subjt:  GGTLCDFGARINLMPLSLYRKLGIGEAMPTTVTLQLADMSIEYPEGKIKDVLVKVDKFIFPVDFIILDYEVDKDVPIILGRPFLATGRAFIDVQKGELTM

Query:  RVYDEKVKFNVFKTMKYLDEVEDCSFIMIMEKTI------------VETTMEDLANKHLEDHGEI----------------------SVRPIKPALIEAP
        RV D+++ FNVFK MK+ +E ++C  + + +K              +E  + DL ++  E+  E+                        + +KP++ + P
Subjt:  RVYDEKVKFNVFKTMKYLDEVEDCSFIMIMEKTI------------VETTMEDLANKHLEDHGEI----------------------SVRPIKPALIEAP

Query:  TLDLKPLPDHLKYVYLGE----------------------------------------------------------------------------------
        TL+LKPLP HL Y YLGE                                                                                  
Subjt:  TLDLKPLPDHLKYVYLGE----------------------------------------------------------------------------------

Query:  ---------DSNWVSPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQ
                 DS+WVSP+QCVPKKGG+TVV N  NELIPTRTVT WRVCMDYRKLNKAT KD+F LPFIDQMLDRL GK +Y FLDGYSGYNQI IAPEDQ
Subjt:  ---------DSNWVSPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQ

Query:  EKTTFTYPYGMFAFRRMPFGLCNAPTTFQWCMLAIFSDMIESTVE
        EKTTFT PYG FAFRRMPFGLCNAP TFQ CM+AIF+DM+E+ +E
Subjt:  EKTTFTYPYGMFAFRRMPFGLCNAPTTFQWCMLAIFSDMIESTVE

PIN22518.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]3.6e-11150.34Show/hide
Query:  GGTLCDFGARINLMPLSLYRKLGIGEAMPTTVTLQLADMSIEYPEGKIKDVLVKVDKFIFPVDFIILDYEVDKDVPIILGRPFLATGRAFIDVQKGELTM
        G  LCD GA INLMP S+YR LG+GEA PT++TLQLAD S+ YP+G I+D+LVKVDKFIFP DF++LD EVD +VPIILGRPFLATGR  IDVQKGELTM
Subjt:  GGTLCDFGARINLMPLSLYRKLGIGEAMPTTVTLQLADMSIEYPEGKIKDVLVKVDKFIFPVDFIILDYEVDKDVPIILGRPFLATGRAFIDVQKGELTM

Query:  RVYDEKVKFNVFKTMKYLDEVEDCSFIMIMEK---------------------TIVETTMEDL-------ANKHLEDHGEISV------RPIKPALIEAP
        RV D+++ FNVFK MK+ +E ++C  + + +                       I E   EDL       A+K  +  G  S+      + +KP++ + P
Subjt:  RVYDEKVKFNVFKTMKYLDEVEDCSFIMIMEK---------------------TIVETTMEDL-------ANKHLEDHGEISV------RPIKPALIEAP

Query:  TLDLKPLPDHLKYVYLGE----------------------------------------------------------------------------------
        TL+LKPLP+HL YVYLGE                                                                                  
Subjt:  TLDLKPLPDHLKYVYLGE----------------------------------------------------------------------------------

Query:  ---------DSNWVSPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQ
                 DS+WVSP+QCVPKKGG+TVV N  NELIPTRTVT WRVCMDYRKLNKAT KD+F LPFIDQMLDRL GK +Y FLDGYSGYNQI IAPEDQ
Subjt:  ---------DSNWVSPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQ

Query:  EKTTFTYPYGMFAFRRMPFGLCNAPTTFQWCMLAIFSDMIESTVE
        EKTTFT PYG FAFRRMPFGLCNAP TFQ CM+AIF+DM+E+ +E
Subjt:  EKTTFTYPYGMFAFRRMPFGLCNAPTTFQWCMLAIFSDMIESTVE

XP_022156989.1 uncharacterized protein LOC111023818 [Momordica charantia]2.1e-11152.13Show/hide
Query:  GGKGLGGTLCDFGARINLMPLSLYRKLGIGEAMPTTVTLQLADMSIEYPEGKIKDVLVKVDKFIFPVDFIILDYEVDKDVPIILGRPFLATGRAFIDVQK
        GGK +G  LCD GA INLMPLS+Y+KLGIGEA P TVTLQLAD SI Y EGKI+DVLV+VDKFIFP DFIILDYE DK++PIILGRPFL+TGRA IDV  
Subjt:  GGKGLGGTLCDFGARINLMPLSLYRKLGIGEAMPTTVTLQLADMSIEYPEGKIKDVLVKVDKFIFPVDFIILDYEVDKDVPIILGRPFLATGRAFIDVQK

Query:  GELTMRVYDEKVKFNVFKTMKYLDEVEDCSFIMIMEKTIV-ETTMEDLANKHLEDHGEISVR-----PIKPALIEAPTLDLKPLPDHLKYVYLGE-----
        GELT+RV D++V  ++F ++KY  +VE+CS++ I +  +  E   E+L N+ LED     ++     P++P++++AP L+LK LP HLKY YLGE     
Subjt:  GELTMRVYDEKVKFNVFKTMKYLDEVEDCSFIMIMEKTIV-ETTMEDLANKHLEDHGEISVR-----PIKPALIEAPTLDLKPLPDHLKYVYLGE-----

Query:  --------------------------------------------------------------------------------------DSNWVSPIQCVPKK
                                                                                              D + +SP+QCVPKK
Subjt:  --------------------------------------------------------------------------------------DSNWVSPIQCVPKK

Query:  GGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQEKTTFTYPYGMFAFRRMPFGLCN
        GG+TVV N +NELIPTRT+T W +CMDYRKLNKAT KD+F LPFIDQMLD LVG+ YYY LDGY+GYNQITI P+DQ+KTTFT PYG F+FRRMPFGLCN
Subjt:  GGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQEKTTFTYPYGMFAFRRMPFGLCN

Query:  APTTFQWCMLAIFSDMIESTVE
        APTTFQ CM+AIF D+IE+ VE
Subjt:  APTTFQWCMLAIFSDMIESTVE

TrEMBL top hitse value%identityAlignment
A0A2G9HH15 Reverse transcriptase4.3e-11049.44Show/hide
Query:  GGTLCDFGARINLMPLSLYRKLGIGEAMPTTVTLQLADMSIEYPEGKIKDVLVKVDKFIFPVDFIILDYEVDKDVPIILGRPFLATGRAFIDVQKGELTM
        G  LCD GA INLMP S+YR LG+GEA PT++TLQLAD S+ YP G I+D+LVKVDKFIFP DF++LD EVD +VPIILGRPFLATGR  IDVQKGELTM
Subjt:  GGTLCDFGARINLMPLSLYRKLGIGEAMPTTVTLQLADMSIEYPEGKIKDVLVKVDKFIFPVDFIILDYEVDKDVPIILGRPFLATGRAFIDVQKGELTM

Query:  RVYDEKVKFNVFKTMKYLDEVEDCSFIMIMEK-----TIVETTMEDL-----------------------ANKHLEDHGEISV------RPIKPALIEAP
        RV D+++ FNVFK MK+ +E ++C  + + +K     +I E  ++ L                       A+K+ +  G  S+      + +KP++ E P
Subjt:  RVYDEKVKFNVFKTMKYLDEVEDCSFIMIMEK-----TIVETTMEDL-----------------------ANKHLEDHGEISV------RPIKPALIEAP

Query:  TLDLKPLPDHLKYVYLGE----------------------------------------------------------------------------------
        TL+LKPLP HL Y YLGE                                                                                  
Subjt:  TLDLKPLPDHLKYVYLGE----------------------------------------------------------------------------------

Query:  ---------DSNWVSPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQ
                 DS+WVSP+QCVPKKGG+TVV N  NELIPTRTVT WRVCMDYRKLNKAT KD+F LPFIDQMLDRL GK +Y FLDGYSGYNQI I PEDQ
Subjt:  ---------DSNWVSPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQ

Query:  EKTTFTYPYGMFAFRRMPFGLCNAPTTFQWCMLAIFSDMIESTVE
        EKTTFT PYG F FR+MPFGLCNAP TFQ CM+AIF+DM+E+ +E
Subjt:  EKTTFTYPYGMFAFRRMPFGLCNAPTTFQWCMLAIFSDMIESTVE

A0A2G9HH15 Reverse transcriptase2.4e-0432.67Show/hide
Query:  ILLVANDRARAIRAYAFPMFDELNPGIPRPQIKAANFEMKSVMFQMLQTMGQFHGFSSEDPHLYLKSFIGVSDSFVIQGVSKDTLRLTLFPCSLRDGAKT
        I++  N     +R  A P   E    +  P++  A  +++  M +M+Q   QF G S E+P+ ++ +F+ + D+   +GVSKD LRL LF  SL   A  
Subjt:  ILLVANDRARAIRAYAFPMFDELNPGIPRPQIKAANFEMKSVMFQMLQTMGQFHGFSSEDPHLYLKSFIGVSDSFVIQGVSKDTLRLTLFPCSLRDGAKT

Query:  W
        W
Subjt:  W

A0A2G9HH15 Reverse transcriptase1.4e-10848.99Show/hide
Query:  GGTLCDFGARINLMPLSLYRKLGIGEAMPTTVTLQLADMSIEYPEGKIKDVLVKVDKFIFPVDFIILDYEVDKDVPIILGRPFLATGRAFIDVQKGELTM
        G  LCD GA INLMP S+YR LG+GEA PT++TLQLAD S+ YP+G I+D+LVKVDKFIFP D ++LD EVD ++ IILGRPFLATGR  IDVQKGELTM
Subjt:  GGTLCDFGARINLMPLSLYRKLGIGEAMPTTVTLQLADMSIEYPEGKIKDVLVKVDKFIFPVDFIILDYEVDKDVPIILGRPFLATGRAFIDVQKGELTM

Query:  RVYDEKVKFNVFKTMKYLDEVEDCSFIMIME-----KTIVETTMEDL-----------------------ANKHLEDHGEISV------RPIKPALIEAP
        RV D+++ FNVFK MK+ +E ++C  + + +     ++I E  ++ L                       A+K  +  G  S+      + +KP++ E P
Subjt:  RVYDEKVKFNVFKTMKYLDEVEDCSFIMIME-----KTIVETTMEDL-----------------------ANKHLEDHGEISV------RPIKPALIEAP

Query:  TLDLKPLPDHLKYVYLGE----------------------------------------------------------------------------------
        TL+LKPLP HL YVYLGE                                                                                  
Subjt:  TLDLKPLPDHLKYVYLGE----------------------------------------------------------------------------------

Query:  ---------DSNWVSPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQ
                 DS+WVSP+QCVPKKGG+TVV N  NELIPTRTVT WR CMDYRKLNKAT KD+F LPFIDQMLDRL GK +Y FLDGYSGYNQI IAPEDQ
Subjt:  ---------DSNWVSPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQ

Query:  EKTTFTYPYGMFAFRRMPFGLCNAPTTFQWCMLAIFSDMIESTVE
        EK TFT PYG FAFRRMPFGLCNAP TFQ CM+AIF+DM+E+ +E
Subjt:  EKTTFTYPYGMFAFRRMPFGLCNAPTTFQWCMLAIFSDMIESTVE

A0A2G9HYA0 Reverse transcriptase6.6e-11149.44Show/hide
Query:  GGTLCDFGARINLMPLSLYRKLGIGEAMPTTVTLQLADMSIEYPEGKIKDVLVKVDKFIFPVDFIILDYEVDKDVPIILGRPFLATGRAFIDVQKGELTM
        G  LCD GA INLMP S+YR LG+GEA PT++TLQLAD S+ YP+G I+D+LVKVDKFIFP DF++LD EVD +VPIILGRPFLATGR  IDVQKGELTM
Subjt:  GGTLCDFGARINLMPLSLYRKLGIGEAMPTTVTLQLADMSIEYPEGKIKDVLVKVDKFIFPVDFIILDYEVDKDVPIILGRPFLATGRAFIDVQKGELTM

Query:  RVYDEKVKFNVFKTMKYLDEVEDCSFIMIMEKTI------------VETTMEDLANKHLEDHGEI----------------------SVRPIKPALIEAP
        RV D+++ FNVFK MK+ +E ++C  + + +K              +E  + DL ++  E+  E+                        + +KP++ + P
Subjt:  RVYDEKVKFNVFKTMKYLDEVEDCSFIMIMEKTI------------VETTMEDLANKHLEDHGEI----------------------SVRPIKPALIEAP

Query:  TLDLKPLPDHLKYVYLGE----------------------------------------------------------------------------------
        TL+LKPLP HL Y YLGE                                                                                  
Subjt:  TLDLKPLPDHLKYVYLGE----------------------------------------------------------------------------------

Query:  ---------DSNWVSPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQ
                 DS+WVSP+QCVPKKGG+TVV N  NELIPTRTVT WRVCMDYRKLNKAT KD+F LPFIDQMLDRL GK +Y FLDGYSGYNQI IAPEDQ
Subjt:  ---------DSNWVSPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQ

Query:  EKTTFTYPYGMFAFRRMPFGLCNAPTTFQWCMLAIFSDMIESTVE
        EKTTFT PYG FAFRRMPFGLCNAP TFQ CM+AIF+DM+E+ +E
Subjt:  EKTTFTYPYGMFAFRRMPFGLCNAPTTFQWCMLAIFSDMIESTVE

A0A2G9HYD8 Reverse transcriptase1.7e-11150.34Show/hide
Query:  GGTLCDFGARINLMPLSLYRKLGIGEAMPTTVTLQLADMSIEYPEGKIKDVLVKVDKFIFPVDFIILDYEVDKDVPIILGRPFLATGRAFIDVQKGELTM
        G  LCD GA INLMP S+YR LG+GEA PT++TLQLAD S+ YP+G I+D+LVKVDKFIFP DF++LD EVD +VPIILGRPFLATGR  IDVQKGELTM
Subjt:  GGTLCDFGARINLMPLSLYRKLGIGEAMPTTVTLQLADMSIEYPEGKIKDVLVKVDKFIFPVDFIILDYEVDKDVPIILGRPFLATGRAFIDVQKGELTM

Query:  RVYDEKVKFNVFKTMKYLDEVEDCSFIMIMEK---------------------TIVETTMEDL-------ANKHLEDHGEISV------RPIKPALIEAP
        RV D+++ FNVFK MK+ +E ++C  + + +                       I E   EDL       A+K  +  G  S+      + +KP++ + P
Subjt:  RVYDEKVKFNVFKTMKYLDEVEDCSFIMIMEK---------------------TIVETTMEDL-------ANKHLEDHGEISV------RPIKPALIEAP

Query:  TLDLKPLPDHLKYVYLGE----------------------------------------------------------------------------------
        TL+LKPLP+HL YVYLGE                                                                                  
Subjt:  TLDLKPLPDHLKYVYLGE----------------------------------------------------------------------------------

Query:  ---------DSNWVSPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQ
                 DS+WVSP+QCVPKKGG+TVV N  NELIPTRTVT WRVCMDYRKLNKAT KD+F LPFIDQMLDRL GK +Y FLDGYSGYNQI IAPEDQ
Subjt:  ---------DSNWVSPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQ

Query:  EKTTFTYPYGMFAFRRMPFGLCNAPTTFQWCMLAIFSDMIESTVE
        EKTTFT PYG FAFRRMPFGLCNAP TFQ CM+AIF+DM+E+ +E
Subjt:  EKTTFTYPYGMFAFRRMPFGLCNAPTTFQWCMLAIFSDMIESTVE

A0A6J1DV77 uncharacterized protein LOC1110238181.0e-11152.13Show/hide
Query:  GGKGLGGTLCDFGARINLMPLSLYRKLGIGEAMPTTVTLQLADMSIEYPEGKIKDVLVKVDKFIFPVDFIILDYEVDKDVPIILGRPFLATGRAFIDVQK
        GGK +G  LCD GA INLMPLS+Y+KLGIGEA P TVTLQLAD SI Y EGKI+DVLV+VDKFIFP DFIILDYE DK++PIILGRPFL+TGRA IDV  
Subjt:  GGKGLGGTLCDFGARINLMPLSLYRKLGIGEAMPTTVTLQLADMSIEYPEGKIKDVLVKVDKFIFPVDFIILDYEVDKDVPIILGRPFLATGRAFIDVQK

Query:  GELTMRVYDEKVKFNVFKTMKYLDEVEDCSFIMIMEKTIV-ETTMEDLANKHLEDHGEISVR-----PIKPALIEAPTLDLKPLPDHLKYVYLGE-----
        GELT+RV D++V  ++F ++KY  +VE+CS++ I +  +  E   E+L N+ LED     ++     P++P++++AP L+LK LP HLKY YLGE     
Subjt:  GELTMRVYDEKVKFNVFKTMKYLDEVEDCSFIMIMEKTIV-ETTMEDLANKHLEDHGEISVR-----PIKPALIEAPTLDLKPLPDHLKYVYLGE-----

Query:  --------------------------------------------------------------------------------------DSNWVSPIQCVPKK
                                                                                              D + +SP+QCVPKK
Subjt:  --------------------------------------------------------------------------------------DSNWVSPIQCVPKK

Query:  GGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQEKTTFTYPYGMFAFRRMPFGLCN
        GG+TVV N +NELIPTRT+T W +CMDYRKLNKAT KD+F LPFIDQMLD LVG+ YYY LDGY+GYNQITI P+DQ+KTTFT PYG F+FRRMPFGLCN
Subjt:  GGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQEKTTFTYPYGMFAFRRMPFGLCN

Query:  APTTFQWCMLAIFSDMIESTVE
        APTTFQ CM+AIF D+IE+ VE
Subjt:  APTTFQWCMLAIFSDMIESTVE

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.65.0e-1534.29Show/hide
Query:  DSNWVSPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQEKTTFTYPY
        +S + SPI  VPKK   +                 +R+ +DYRKLN+ T  D   +P +D++L +L    Y+  +D   G++QI + PE   KT F+  +
Subjt:  DSNWVSPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQEKTTFTYPY

Query:  GMFAFRRMPFGLCNAPTTFQWCMLAIFSDMIESTVEQACV
        G + + RMPFGL NAP TFQ CM    +D++   + + C+
Subjt:  GMFAFRRMPFGLCNAPTTFQWCMLAIFSDMIESTVEQACV

P10394 Retrovirus-related Pol polyprotein from transposon 4128.6e-1537.01Show/hide
Query:  SNWVSPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQEKTTFTYPYG
        S + SP+  VPKK              P      WR+ +DYR++NK    D F LP ID +LD+L    Y+  LD  SG++QI +    ++ T+F+   G
Subjt:  SNWVSPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQEKTTFTYPYG

Query:  MFAFRRMPFGLCNAPTTFQWCMLAIFS
         + F R+PFGL  AP +FQ  M   FS
Subjt:  MFAFRRMPFGLCNAPTTFQWCMLAIFS

P31843 RNA-directed DNA polymerase homolog8.6e-1539.02Show/hide
Query:  RVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQEKTTFTYPYGMFAFRRMPFGLCNAPTTFQWCMLAIFSDMIESTV-
        R+C+DYR L K T K+ + +P +D + DRL    ++  LD  SGY Q+ IA  D+ KTT    YG F FR MPFGL NA  TF   M  +  + ++  V 
Subjt:  RVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQEKTTFTYPYGMFAFRRMPFGLCNAPTTFQWCMLAIFSDMIESTV-

Query:  ---EQACVETLALERLDAHIPFL
           +   V T+    L  HI  L
Subjt:  ---EQACVETLALERLDAHIPFL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein7.5e-1932.22Show/hide
Query:  SPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQEKTTFTYPYGMFAF
        SP+  VPKK G                   +R+C+DYR LNKAT  D F LP ID +L R+     +  LD +SGY+QI + P+D+ KT F  P G + +
Subjt:  SPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQEKTTFTYPYGMFAF

Query:  RRMPFGLCNAPTTFQWCMLAIFSDM--IESTVEQACVETLALERLDAHIPFLIRRVK----VTASRRCDLSVSMLCYFAY
          MPFGL NAP+TF   M   F D+  +   ++   + + + E    H+  ++ R+K    +   ++C  +     +  Y
Subjt:  RRMPFGLCNAPTTFQWCMLAIFSDM--IESTVEQACVETLALERLDAHIPFLIRRVK----VTASRRCDLSVSMLCYFAY

Q99315 Transposon Ty3-G Gag-Pol polyprotein7.5e-1932.22Show/hide
Query:  SPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQEKTTFTYPYGMFAF
        SP+  VPKK G                   +R+C+DYR LNKAT  D F LP ID +L R+     +  LD +SGY+QI + P+D+ KT F  P G + +
Subjt:  SPIQCVPKKGGVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQEKTTFTYPYGMFAF

Query:  RRMPFGLCNAPTTFQWCMLAIFSDM--IESTVEQACVETLALERLDAHIPFLIRRVK----VTASRRCDLSVSMLCYFAY
          MPFGL NAP+TF   M   F D+  +   ++   + + + E    H+  ++ R+K    +   ++C  +     +  Y
Subjt:  RRMPFGLCNAPTTFQWCMLAIFSDM--IESTVEQACVETLALERLDAHIPFLIRRVK----VTASRRCDLSVSMLCYFAY

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAGCAAAATAATCAGGCTGAGAATCCTCTTGGTAGCGAACGATAGGGCTAGAGCCATTCGAGCTTACGCTTTTCCAATGTTTGATGAGTTAAATCCAGGAATTCC
ACGTCCTCAAATTAAGGCAGCAAATTTTGAAATGAAATCGGTAATGTTTCAGATGTTGCAAACCATGGGTCAATTCCATGGTTTTTCATCTGAAGACCCTCATTTATATC
TTAAGTCTTTTATAGGAGTTAGTGATTCGTTTGTAATTCAGGGAGTGTCTAAGGATACCCTGAGATTAACTTTGTTCCCGTGTTCTCTTAGAGATGGAGCAAAGACATGG
GCTGATATTGCAATGTTAGCTAACACTCTTAAAAATGTGACAGTGGTTAGTCATCAACAGCCGCCAGTAGTGGAGCCTGCTGCAGTGGTGAACCAAGTTGGCGCAACCAC
CCCAACTTTGCATGGGGAGGTCAAGGAAGCAATTTGGAAGCCCTCAAGCAAAGCAAAAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTACAAGCAGTGACTC
TAAGGACTGAGTTGGAGTCTGGTAAAGGTTCTGGAGCCAGCAATAATGATGTTGGAGCATCTGGTTCTGTTCCAGATGTGGAACCACCTTATGTGCCACCCCTACCTTAT
GTACCACCCCTACCTTTTCCACAAAGGCAAAGGCCTAACAATCAGGATGGTGGAAAAGGGTTAGGTGGAACACTTTGTGATTTTGGCGCAAGAATTAACCTTATGCCTCT
TTCGCTCTATCGAAAGCTAGGTATTGGTGAAGCTATGCCTACCACAGTCACACTCCAATTAGCTGACATGTCTATTGAATATCCAGAGGGTAAAATTAAGGATGTCTTAG
TGAAAGTAGATAAATTCATATTTCCTGTCGATTTTATTATCTTAGATTATGAGGTAGATAAAGATGTCCCAATTATTCTTGGTCGTCCATTTTTGGCTACTGGTAGGGCA
TTCATAGATGTTCAAAAAGGAGAACTAACAATGAGGGTTTATGATGAGAAAGTAAAATTTAATGTGTTTAAGACCATGAAGTATCTAGACGAAGTGGAAGATTGTTCATT
CATTATGATTATGGAGAAAACAATTGTTGAGACAACAATGGAAGATTTGGCAAACAAGCATTTGGAAGATCATGGAGAGATTAGTGTTCGTCCTATTAAGCCAGCCCTGA
TTGAGGCACCCACTTTAGATTTGAAGCCCTTGCCGGATCATCTAAAGTACGTGTATCTTGGGGAGGATAGCAATTGGGTAAGCCCTATCCAATGTGTTCCTAAGAAAGGA
GGTGTCACAGTAGTGACTAATACGGACAATGAATTGATCCCAACCAGGACAGTAACTGCCTGGAGGGTTTGCATGGATTACAGGAAGCTTAACAAAGCCACCCATAAGGA
CAATTTCGCTCTGCCATTTATTGACCAAATGTTGGATAGATTGGTTGGTAAGGCCTACTACTATTTCTTAGATGGTTATTCTGGATATAATCAGATTACCATTGCTCCTG
AGGATCAGGAAAAAACCACTTTCACCTACCCCTATGGGATGTTTGCCTTTAGGCGGATGCCTTTTGGTCTGTGCAATGCTCCAACCACATTTCAGTGGTGTATGTTAGCA
ATATTTTCTGATATGATTGAGTCCACTGTTGAGCAGGCTTGCGTCGAGACGCTAGCTCTTGAGCGTCTCGATGCTCACATTCCATTTCTGATTAGGCGCGTAAAGGTCAC
AGCGTCTCGACGCTGCGACCTTAGCGTCTCGATGCTGTGTTATTTCGCTTACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACAGCAAAATAATCAGGCTGAGAATCCTCTTGGTAGCGAACGATAGGGCTAGAGCCATTCGAGCTTACGCTTTTCCAATGTTTGATGAGTTAAATCCAGGAATTCC
ACGTCCTCAAATTAAGGCAGCAAATTTTGAAATGAAATCGGTAATGTTTCAGATGTTGCAAACCATGGGTCAATTCCATGGTTTTTCATCTGAAGACCCTCATTTATATC
TTAAGTCTTTTATAGGAGTTAGTGATTCGTTTGTAATTCAGGGAGTGTCTAAGGATACCCTGAGATTAACTTTGTTCCCGTGTTCTCTTAGAGATGGAGCAAAGACATGG
GCTGATATTGCAATGTTAGCTAACACTCTTAAAAATGTGACAGTGGTTAGTCATCAACAGCCGCCAGTAGTGGAGCCTGCTGCAGTGGTGAACCAAGTTGGCGCAACCAC
CCCAACTTTGCATGGGGAGGTCAAGGAAGCAATTTGGAAGCCCTCAAGCAAAGCAAAAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTACAAGCAGTGACTC
TAAGGACTGAGTTGGAGTCTGGTAAAGGTTCTGGAGCCAGCAATAATGATGTTGGAGCATCTGGTTCTGTTCCAGATGTGGAACCACCTTATGTGCCACCCCTACCTTAT
GTACCACCCCTACCTTTTCCACAAAGGCAAAGGCCTAACAATCAGGATGGTGGAAAAGGGTTAGGTGGAACACTTTGTGATTTTGGCGCAAGAATTAACCTTATGCCTCT
TTCGCTCTATCGAAAGCTAGGTATTGGTGAAGCTATGCCTACCACAGTCACACTCCAATTAGCTGACATGTCTATTGAATATCCAGAGGGTAAAATTAAGGATGTCTTAG
TGAAAGTAGATAAATTCATATTTCCTGTCGATTTTATTATCTTAGATTATGAGGTAGATAAAGATGTCCCAATTATTCTTGGTCGTCCATTTTTGGCTACTGGTAGGGCA
TTCATAGATGTTCAAAAAGGAGAACTAACAATGAGGGTTTATGATGAGAAAGTAAAATTTAATGTGTTTAAGACCATGAAGTATCTAGACGAAGTGGAAGATTGTTCATT
CATTATGATTATGGAGAAAACAATTGTTGAGACAACAATGGAAGATTTGGCAAACAAGCATTTGGAAGATCATGGAGAGATTAGTGTTCGTCCTATTAAGCCAGCCCTGA
TTGAGGCACCCACTTTAGATTTGAAGCCCTTGCCGGATCATCTAAAGTACGTGTATCTTGGGGAGGATAGCAATTGGGTAAGCCCTATCCAATGTGTTCCTAAGAAAGGA
GGTGTCACAGTAGTGACTAATACGGACAATGAATTGATCCCAACCAGGACAGTAACTGCCTGGAGGGTTTGCATGGATTACAGGAAGCTTAACAAAGCCACCCATAAGGA
CAATTTCGCTCTGCCATTTATTGACCAAATGTTGGATAGATTGGTTGGTAAGGCCTACTACTATTTCTTAGATGGTTATTCTGGATATAATCAGATTACCATTGCTCCTG
AGGATCAGGAAAAAACCACTTTCACCTACCCCTATGGGATGTTTGCCTTTAGGCGGATGCCTTTTGGTCTGTGCAATGCTCCAACCACATTTCAGTGGTGTATGTTAGCA
ATATTTTCTGATATGATTGAGTCCACTGTTGAGCAGGCTTGCGTCGAGACGCTAGCTCTTGAGCGTCTCGATGCTCACATTCCATTTCTGATTAGGCGCGTAAAGGTCAC
AGCGTCTCGACGCTGCGACCTTAGCGTCTCGATGCTGTGTTATTTCGCTTACTGA
Protein sequenceShow/hide protein sequence
MDSKIIRLRILLVANDRARAIRAYAFPMFDELNPGIPRPQIKAANFEMKSVMFQMLQTMGQFHGFSSEDPHLYLKSFIGVSDSFVIQGVSKDTLRLTLFPCSLRDGAKTW
ADIAMLANTLKNVTVVSHQQPPVVEPAAVVNQVGATTPTLHGEVKEAIWKPSSKAKDTEHPRREGKEQVQAVTLRTELESGKGSGASNNDVGASGSVPDVEPPYVPPLPY
VPPLPFPQRQRPNNQDGGKGLGGTLCDFGARINLMPLSLYRKLGIGEAMPTTVTLQLADMSIEYPEGKIKDVLVKVDKFIFPVDFIILDYEVDKDVPIILGRPFLATGRA
FIDVQKGELTMRVYDEKVKFNVFKTMKYLDEVEDCSFIMIMEKTIVETTMEDLANKHLEDHGEISVRPIKPALIEAPTLDLKPLPDHLKYVYLGEDSNWVSPIQCVPKKG
GVTVVTNTDNELIPTRTVTAWRVCMDYRKLNKATHKDNFALPFIDQMLDRLVGKAYYYFLDGYSGYNQITIAPEDQEKTTFTYPYGMFAFRRMPFGLCNAPTTFQWCMLA
IFSDMIESTVEQACVETLALERLDAHIPFLIRRVKVTASRRCDLSVSMLCYFAY