; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011319 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011319
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr1:21240887..21245126
RNA-Seq ExpressionLag0011319
SyntenyLag0011319
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GEW84611.1 reverse transcriptase domain-containing protein [Tanacetum cinerariifolium]2.2e-6736.79Show/hide
Query:  DLNERKAPPIKPFLIEEPTLDLKPFSDHLKFVYRGEGEMLLIIVASYLMPEDEETLIELLQQYRKAIGWTLANIQGINTSFCMHKITLEEGSFRSIEQQK
        +L   +A  +K  + E P ++LK F+ HL++V+      L II+A  L  ED+  LI+ L+ +++AI W L++IQGIN  FC HKI ++E    +++ Q+
Subjt:  DLNERKAPPIKPFLIEEPTLDLKPFSDHLKFVYRGEGEMLLIIVASYLMPEDEETLIELLQQYRKAIGWTLANIQGINTSFCMHKITLEEGSFRSIEQQK

Query:  MLNLAMKEVVKNEVIKWLDVGIIYPIADSNWDLGVTIIDSKFDLRVIHLLTRLLVRIIRRMPFGLCNAPTTFQRYCRKAFETLKAALISAPILCAPNWSL
         +N  + +V+K EV K +D G+IYPI+DS W   V  +  K    ++      L  I  R+  G  N P  F   C KAF+ LK  L  APIL APNW +
Subjt:  MLNLAMKEVVKNEVIKWLDVGIIYPIADSNWDLGVTIIDSKFDLRVIHLLTRLLVRIIRRMPFGLCNAPTTFQRYCRKAFETLKAALISAPILCAPNWSL

Query:  SFEVMCDASDVAV---------------------------------------------------GDEAKEILGQCHSSPYGGHFSAQRTTMRILQCGFFW
        SFE+MCDASD A+                                                   G EA +IL  CH+ P GGH  A  TT +I   GFFW
Subjt:  SFEVMCDASDVAV---------------------------------------------------GDEAKEILGQCHSSPYGGHFSAQRTTMRILQCGFFW

Query:  PPLFKDAHWFYKQCD--TCQRRRNVSKYVEAIACHQNGTKTISRFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQAKV
        P ++KDAH F K  +         +SK+VEA A   N T+ + +FL+S +FARFG PR+++SD GTHF N+   +++ KY + H ++T YHPQ +GQ +V
Subjt:  PPLFKDAHWFYKQCD--TCQRRRNVSKYVEAIACHQNGTKTISRFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQAKV

Query:  KRMGI
           G+
Subjt:  KRMGI

XP_009591349.1 uncharacterized protein LOC104088396 [Nicotiana tomentosiformis]4.3e-6832.71Show/hide
Query:  FESLDLNERKAPPIKPFLIEEPTLDLKPFSDHLKFVYRGEGEMLLIIVASYLMPEDEETLIELLQQYRKAIGWTLANIQGINTSFCMHKITLEEGSFRSI
        FE LD+     PP KP + E P L+LKP   HL++ Y G  E L +I++S L    EE L+++L++++KAIGWT+A+I+GI+ SFCMHKI LE+G   SI
Subjt:  FESLDLNERKAPPIKPFLIEEPTLDLKPFSDHLKFVYRGEGEMLLIIVASYLMPEDEETLIELLQQYRKAIGWTLANIQGINTSFCMHKITLEEGSFRSI

Query:  EQQKMLNLAMKEVVKNEVIKWLDVGIIYPIADSNWD---LGVTIIDSKFDLRVIHLLTRLLVRIIRRM--PFGLCNAPTTFQRYCRKAFETLKAALISAP
        E Q+ LN  MKEVVK EVIK LD  II+PI++SNW+   LG  +  S  +      + +  V  + ++  P  + +    F     K FE LK  L++AP
Subjt:  EQQKMLNLAMKEVVKNEVIKWLDVGIIYPIADSNWD---LGVTIIDSKFDLRVIHLLTRLLVRIIRRM--PFGLCNAPTTFQRYCRKAFETLKAALISAP

Query:  ILCAPNWSLSFEVMCDASDVAVG-----------------------------------------------------------------------------
        I+ AP+WSL FE+MCDASD A+G                                                                             
Subjt:  ILCAPNWSLSFEVMCDASDVAVG-----------------------------------------------------------------------------

Query:  -------------------------------------------------------------DEAKEILGQCHSSPYGGHFSAQRTTMRILQCGFFWPPLF
                                                                      EA+ +L  CH+SPY  H    RTT ++LQ  FFWP LF
Subjt:  -------------------------------------------------------------DEAKEILGQCHSSPYGGHFSAQRTTMRILQCGFFWPPLF

Query:  KDAHWFYKQCDTCQRRRN--------------------------------------------VSKYVEAIACHQNGTKTISRFLQSHIFARFGTPRALVS
        KDAH F K+CD CQR R                                             +SK+VEAI    N    ++ F++ +IF+RFGTPRAL+S
Subjt:  KDAHWFYKQCDTCQRRRN--------------------------------------------VSKYVEAIACHQNGTKTISRFLQSHIFARFGTPRALVS

Query:  DEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQAKV
        DEGTHF N +L  LLAKY +++R+AT YHP+ +GQA+V
Subjt:  DEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQAKV

XP_016468571.1 PREDICTED: uncharacterized protein LOC107791089 [Nicotiana tabacum]8.7e-6932.59Show/hide
Query:  FESLDLNERKAPPIKPFLIEEPTLDLKPFSDHLKFVYRGEGEMLLIIVASYLMPEDEETLIELLQQYRKAIGWTLANIQGINTSFCMHKITLEEGSFRSI
        FE LD      PP KP + E P L+LKP S HL + Y G  ++LL+I++S L    EE L+ +L++++KAIGWT+ +I+GI  SFCMHKI LE+G   S+
Subjt:  FESLDLNERKAPPIKPFLIEEPTLDLKPFSDHLKFVYRGEGEMLLIIVASYLMPEDEETLIELLQQYRKAIGWTLANIQGINTSFCMHKITLEEGSFRSI

Query:  EQQKMLNLAMKEVVKNEVIKWLDVGIIYPIADSNW---------DLGVTIIDSK----------------FDLRVIHLLT-----------RLLVRII--
        EQQ+ LN  MKEVVK EVIK LD GII+PI+DSNW           G+T+I +K                 D R ++  T           ++L R++  
Subjt:  EQQKMLNLAMKEVVKNEVIKWLDVGIIYPIADSNW---------DLGVTIIDSK----------------FDLRVIHLLT-----------RLLVRII--

Query:  -------------------------RRMPF--------GLCNAPTT--FQRYCRKAFETLKAALISAPILCAPNWSLSFEVMCDASDVAVG---------
                                  +MPF        GL     T  F   C KAFE LK  L++API+ AP+WSL F++MCDASD A+G         
Subjt:  -------------------------RRMPF--------GLCNAPTT--FQRYCRKAFETLKAALISAPILCAPNWSLSFEVMCDASDVAVG---------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -DEAKEILGQCHSSPYGGHFSAQRTTMRILQCGFFWPPLFKDAHWFYKQCDTCQ------RRRN------------------------------------
          E + +L  CH+SPYGGH    RT  ++LQ GFFWP LFKDAH F K+C  CQ      RR                                      
Subjt:  -DEAKEILGQCHSSPYGGHFSAQRTTMRILQCGFFWPPLFKDAHWFYKQCDTCQ------RRRN------------------------------------

Query:  --VSKYVEAIACHQNGTKTISRFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQAKVKRMGI
          +SK+V+AIA   N    ++ F++ +IF RFGTPRAL+SDEGTHF N +L  LLAKY ++HR+ T YHPQ +GQ +V    I
Subjt:  --VSKYVEAIACHQNGTKTISRFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQAKVKRMGI

XP_019229853.1 PREDICTED: uncharacterized protein LOC109210836 [Nicotiana attenuata]3.2e-6336.79Show/hide
Query:  SLDLNERKAPPIKPFLIEEPTLDLKPFSDHLKFVYRGEGEMLLIIVASYLMPEDEETLIELLQQYRKAIGWTLANIQGINTSFCMHKITLEEGSFRSIEQ
        SLDL  R  PP KP +IE P L+LKP   HL++ + G  + L +IV+S L     E L+E+L+ +R+AIGWT+A+I+GI    C HKI LE  +  S+E 
Subjt:  SLDLNERKAPPIKPFLIEEPTLDLKPFSDHLKFVYRGEGEMLLIIVASYLMPEDEETLIELLQQYRKAIGWTLANIQGINTSFCMHKITLEEGSFRSIEQ

Query:  QKMLNLAMKEVVKNEVIKWLDVGIIYPIADSNWDLGVTIIDSKFDLRVIHLLTRLLVRIIRRMPFGLCNAPTTFQRYCRKAFETLKAALISAPILCAPNW
        Q+ LN +M+EVVK E+IKWLDVG++YPIADS+W     ++ SK  +   H   R L+                              A   A   C P  
Subjt:  QKMLNLAMKEVVKNEVIKWLDVGIIYPIADSNWDLGVTIIDSKFDLRVIHLLTRLLVRIIRRMPFGLCNAPTTFQRYCRKAFETLKAALISAPILCAPNW

Query:  SLSFEVMCDASDVAVGDEAKEILGQCHSSPYGGHFSAQRTTMRILQCGFFWPPLFKDAHWFYKQCDTCQRRRN---------------------------
                        +E   IL  CH SP GGH    RT  ++L+CG++WP ++ DA+   K CD CQR+ +                           
Subjt:  SLSFEVMCDASDVAVGDEAKEILGQCHSSPYGGHFSAQRTTMRILQCGFFWPPLFKDAHWFYKQCDTCQRRRN---------------------------

Query:  -----------------VSKYVEAIACHQNGTKTISRFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQ
                         VSK+VEAIA   N  ++++ FL+ +IF RFGTPRA++SD G+HF N   T LL K+ +KH++ATTYHPQ
Subjt:  -----------------VSKYVEAIACHQNGTKTISRFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQ

XP_028954189.1 uncharacterized protein LOC108170821 [Malus domestica]1.3e-6434.92Show/hide
Query:  APPIKPFLIEEPTLDLKPFSDHLKFVYRGEGEMLLIIVASYLMPEDEETLIELLQQYRKAIGWTLANIQGINTSFCMHKITLEEGSFRSIEQQKMLNLAM
        A  + P +++ PTL+LKP   HLK+V+ GE + L +I++S L  ++E+ LI +L+ ++ AI WTLA+I+GI+ + CMH+I LE+G+  S E Q  LN  M
Subjt:  APPIKPFLIEEPTLDLKPFSDHLKFVYRGEGEMLLIIVASYLMPEDEETLIELLQQYRKAIGWTLANIQGINTSFCMHKITLEEGSFRSIEQQKMLNLAM

Query:  KEVVKNEVIKWLDVGIIYPIADSNWDLGVTIIDSK-------------------------FDLRVIHLLTR--------LLVRIIRRMPFGLCNAPTTFQ
         EVVK EVIK LD G+IYPI+DS W   V ++  K                          D R+++ +TR        L   + RRMPFGLCNAP TFQ
Subjt:  KEVVKNEVIKWLDVGIIYPIADSNWDLGVTIIDSK-------------------------FDLRVIHLLTR--------LLVRIIRRMPFGLCNAPTTFQ

Query:  R--------YCRKAFE---------TLKAALI---SAPILCAPNWSLSFE--------------VMCDASDVAVGDEA----------------------
        R        Y  K  E          LK  L    + P L    W L  +              VM D     V +EA                      
Subjt:  R--------YCRKAFE---------TLKAALI---SAPILCAPNWSLSFE--------------VMCDASDVAVGDEA----------------------

Query:  -----KEILGQCHSSPYGGHFSAQRTTMRILQCGFFWPPLFKDAHWFYKQCDTCQRRRN-----------------------------------------
             + IL  CHS   GGHF  QRT +++L+CGF+WP +F+DA  F   CD C+R  N                                         
Subjt:  -----KEILGQCHSSPYGGHFSAQRTTMRILQCGFFWPPLFKDAHWFYKQCDTCQRRRN-----------------------------------------

Query:  ---VSKYVEAIACHQNGTKTISRFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQAKVKRMGI
           VSK+VEA A   N +K ++ F++S+IFARFG PR L+SD G+HF N  +  LL KY + HR++T YHPQ +GQA+V    I
Subjt:  ---VSKYVEAIACHQNGTKTISRFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQAKVKRMGI

TrEMBL top hitse value%identityAlignment
A0A1S3Y468 uncharacterized protein LOC1077719882.9e-6237.99Show/hide
Query:  DHGEVSIEDLE-VCSLERKSEKEVSRCEDVFESLDLNERKAPPIKPFLIEEPTLDLKPFSDHLKFVYRGEGEMLLIIVASYLMPEDEETLIELLQQYRKA
        D  +  IE++E V ++  K      R ED    + L      P K  + E P L+LKP   HL + Y G+ E L I+++S L    EE L+ + ++++KA
Subjt:  DHGEVSIEDLE-VCSLERKSEKEVSRCEDVFESLDLNERKAPPIKPFLIEEPTLDLKPFSDHLKFVYRGEGEMLLIIVASYLMPEDEETLIELLQQYRKA

Query:  IGWTLANIQGINTSFCMHKITLEEGSFRSIEQQKMLNLAMKEVVKNEVIKWLDVGIIYPIADSNWDLGVTIIDSKFDLRVIHLLTRLLVRIIRRMPFGLC
        IGW + +I+GI+ SFCMH I LE+G   SIEQQ+ LN  +KEVVK EVIKWLD GII+PI+DSNWD                                  
Subjt:  IGWTLANIQGINTSFCMHKITLEEGSFRSIEQQKMLNLAMKEVVKNEVIKWLDVGIIYPIADSNWDLGVTIIDSKFDLRVIHLLTRLLVRIIRRMPFGLC

Query:  NAPTTFQRYCRKAFETLKAALISAPILCAPNWSLSFEVMCDASDVAVGDEAKEILGQCHSSPYGGHFSAQRT-------------TMRILQCGFFWPPLF
         A   F   C KAFE  K  L++API+ AP+WSL FE+MCDASD A+G     +LGQ     +   + A +T              + ++   F     +
Subjt:  NAPTTFQRYCRKAFETLKAALISAPILCAPNWSLSFEVMCDASDVAVGDEAKEILGQCHSSPYGGHFSAQRT-------------TMRILQCGFFWPPLF

Query:  KDAHWFYKQCDTCQRRRN---------VSKYVEAIACHQNGTKTISRFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQ
                  D    R N         VSK+VEAI    N  K ++ F++ +IF+RFGTPRAL+SDEG +F N +L  LLAKY ++HR++ TYHPQ + Q
Subjt:  KDAHWFYKQCDTCQRRRN---------VSKYVEAIACHQNGTKTISRFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQ

Query:  AKVKRMGI
         KV    I
Subjt:  AKVKRMGI

A0A1S3ZVZ8 uncharacterized protein LOC1077910894.2e-6932.59Show/hide
Query:  FESLDLNERKAPPIKPFLIEEPTLDLKPFSDHLKFVYRGEGEMLLIIVASYLMPEDEETLIELLQQYRKAIGWTLANIQGINTSFCMHKITLEEGSFRSI
        FE LD      PP KP + E P L+LKP S HL + Y G  ++LL+I++S L    EE L+ +L++++KAIGWT+ +I+GI  SFCMHKI LE+G   S+
Subjt:  FESLDLNERKAPPIKPFLIEEPTLDLKPFSDHLKFVYRGEGEMLLIIVASYLMPEDEETLIELLQQYRKAIGWTLANIQGINTSFCMHKITLEEGSFRSI

Query:  EQQKMLNLAMKEVVKNEVIKWLDVGIIYPIADSNW---------DLGVTIIDSK----------------FDLRVIHLLT-----------RLLVRII--
        EQQ+ LN  MKEVVK EVIK LD GII+PI+DSNW           G+T+I +K                 D R ++  T           ++L R++  
Subjt:  EQQKMLNLAMKEVVKNEVIKWLDVGIIYPIADSNW---------DLGVTIIDSK----------------FDLRVIHLLT-----------RLLVRII--

Query:  -------------------------RRMPF--------GLCNAPTT--FQRYCRKAFETLKAALISAPILCAPNWSLSFEVMCDASDVAVG---------
                                  +MPF        GL     T  F   C KAFE LK  L++API+ AP+WSL F++MCDASD A+G         
Subjt:  -------------------------RRMPF--------GLCNAPTT--FQRYCRKAFETLKAALISAPILCAPNWSLSFEVMCDASDVAVG---------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -DEAKEILGQCHSSPYGGHFSAQRTTMRILQCGFFWPPLFKDAHWFYKQCDTCQ------RRRN------------------------------------
          E + +L  CH+SPYGGH    RT  ++LQ GFFWP LFKDAH F K+C  CQ      RR                                      
Subjt:  -DEAKEILGQCHSSPYGGHFSAQRTTMRILQCGFFWPPLFKDAHWFYKQCDTCQ------RRRN------------------------------------

Query:  --VSKYVEAIACHQNGTKTISRFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQAKVKRMGI
          +SK+V+AIA   N    ++ F++ +IF RFGTPRAL+SDEGTHF N +L  LLAKY ++HR+ T YHPQ +GQ +V    I
Subjt:  --VSKYVEAIACHQNGTKTISRFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQAKVKRMGI

A0A1U7XC36 uncharacterized protein LOC1042329161.1e-6128.78Show/hide
Query:  LNERKAPPIKPFLIEEPTLDLKPFSDHLKFVYRGEGEMLLIIVASYLMPEDEETLIELLQQYRKAIGWTLANIQGINTSFCMHKITLEEGSFRSIEQQKM
        LN  + P  +  + E P L+LKP   HL++ Y G+ + L +IV+  L    EE L+ +L+++++A+GWT+ +I+GI+ +FCMHKI +E+G    +EQQ+ 
Subjt:  LNERKAPPIKPFLIEEPTLDLKPFSDHLKFVYRGEGEMLLIIVASYLMPEDEETLIELLQQYRKAIGWTLANIQGINTSFCMHKITLEEGSFRSIEQQKM

Query:  LNLAMKEVVKNEVIKWLDVGIIYPIADSNWDLGVTIIDSKFDLRVI------HLLTRLLVRIIRRMPFGLCNAPTTFQRYCRKAFETLKAALISAPILCA
        LN  MKEVV+ EVIKWL+ GI++PI+DS W   V  +  K  + V+       + TR +  ++ +      +    F   C KAFE LK  L+ API+ A
Subjt:  LNLAMKEVVKNEVIKWLDVGIIYPIADSNWDLGVTIIDSKFDLRVI------HLLTRLLVRIIRRMPFGLCNAPTTFQRYCRKAFETLKAALISAPILCA

Query:  PNWSLSFEVMCDASDVAVG---------------------------------------------------------------------------------
        P+W   FE+MCDASD+AVG                                                                                 
Subjt:  PNWSLSFEVMCDASDVAVG---------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------------DEAKEILGQCHSSPYGGHFSA
                                                                                       +E   IL  CH+S YGGH   
Subjt:  -------------------------------------------------------------------------------DEAKEILGQCHSSPYGGHFSA

Query:  QRTTMRILQCGFFWPPLFKDAHWFYKQCDTCQRRRN--------------------------------------------VSKYVEAIACHQNGTKTISR
         RT  ++LQ GF+WP  F+DAH F K CD CQR                                               VSK+VE IA   N  K +  
Subjt:  QRTTMRILQCGFFWPPLFKDAHWFYKQCDTCQRRRN--------------------------------------------VSKYVEAIACHQNGTKTISR

Query:  FLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQAKV
        F++ HIF  FGTPR L+ D GTHF N +L  +LAKY +KH+++T YHPQ +GQ KV
Subjt:  FLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQAKV

A0A2N9IZ94 Integrase catalytic domain-containing protein5.9e-6334.7Show/hide
Query:  HGEVSIEDLEVCSLERKSEKEVSRCEDVFESLDLNERKAPPIKPFLIEEPTLDLKPFSDHLKFVYRGEGEMLLIIVASYLMPEDEETLIELLQQYRKAIG
        H     + LE C +      E+S     FE L  +E K  P+ P  IE P L+LK     LK+ +   G+   ++++S L  E E +L++LL++++ AIG
Subjt:  HGEVSIEDLEVCSLERKSEKEVSRCEDVFESLDLNERKAPPIKPFLIEEPTLDLKPFSDHLKFVYRGEGEMLLIIVASYLMPEDEETLIELLQQYRKAIG

Query:  WTLANIQGINTSFCMHKITLEEGSFRSIEQQKMLNLAMKEVVKNEVIKWLDVGIIYPIADSNW---------DLGVTIIDSK----------------FD
        WT+A+I+GI+   C HK+  EE    S E Q+ LN  MKEVVK+EV+K LD GIIYPI DS W           GVT+++++                 D
Subjt:  WTLANIQGINTSFCMHKITLEEGSFRSIEQQKMLNLAMKEVVKNEVIKWLDVGIIYPIADSNW---------DLGVTIIDSK----------------FD

Query:  LRVIHLLTR-------LLVRIIRRMP---------FGLCNAPTTFQRYCRKAFETLKAALISAPILCAPNWSLSFEVMCDASDVAVGDEAKEILGQCHSS
         R ++  TR        + +++ R+          +   +AP  +   C++AF  L   L SAPI+ +P+WSL FE+MCDASD A+G  A EI  +    
Subjt:  LRVIHLLTR-------LLVRIIRRMP---------FGLCNAPTTFQRYCRKAFETLKAALISAPILCAPNWSLSFEVMCDASDVAVGDEAKEILGQCHSS

Query:  PYGGHFSAQRTTMRILQCGFFWPPLFKDAHWFYKQCDTCQR-----RRN---------------------------------------VSKYVEAIACHQ
             +SAQ     +++CGF+WP LFKD + F + C+ CQ+     RRN                                       VSK+VEA+AC +
Subjt:  PYGGHFSAQRTTMRILQCGFFWPPLFKDAHWFYKQCDTCQR-----RRN---------------------------------------VSKYVEAIACHQ

Query:  NGTKTISRFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQAKV
        N  +T+ +FL+ ++ +RFGTPRA++SD+GTHF N     L+ KY + H++AT+YHPQ +GQ ++
Subjt:  NGTKTISRFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQAKV

A0A6P5M866 uncharacterized protein LOC1074935904.0e-5933.26Show/hide
Query:  PTLDLKPFSDHLKFVYRGEGEMLLIIVASYLMPEDEETLIELLQQYRKAIGWTLANIQGINTSFCMHKITLEEGSFRSIEQQKMLNLAMKE-----VVKN
        P L+LK     LK+ Y G+     +I+ S L  E EE LI++L+Q++ AIGWTL +++GI+ S CMHKI LEEG+    +  + LN A ++        +
Subjt:  PTLDLKPFSDHLKFVYRGEGEMLLIIVASYLMPEDEETLIELLQQYRKAIGWTLANIQGINTSFCMHKITLEEGSFRSIEQQKMLNLAMKE-----VVKN

Query:  EVIKWLDVGIIYPIADSNWDLGVTIIDSKFDLRVIHLLTRLLVRIIRRMPFGLCNAPTTFQR---------------------------YCRKAFETLKA
        ++++ L     Y   +        ++D   D           V   RRMPFGLC+AP TFQR                            C  AFE LK 
Subjt:  EVIKWLDVGIIYPIADSNWDLGVTIIDSKFDLRVIHLLTRLLVRIIRRMPFGLCNAPTTFQR---------------------------YCRKAFETLKA

Query:  ALISAPILCAPNWSLSFEVMCDASDVAVG-----------------------------------------------------------------------
         L S PI+  P W L FE+MCDASD AV                                                                        
Subjt:  ALISAPILCAPNWSLSFEVMCDASDVAVG-----------------------------------------------------------------------

Query:  -------DEAKEILGQCHSSPYGGHFSAQRTTMRILQCGFFWPPLFKDAHWFYKQCDTCQRRRNVSK-----------------YVEAIACHQNGTKTIS
                E +E+L QCH S YGGHFS + T  ++LQCGF+WP +F+DA     +CD CQR  N+ +                 +VEAIA   N  K + 
Subjt:  -------DEAKEILGQCHSSPYGGHFSAQRTTMRILQCGFFWPPLFKDAHWFYKQCDTCQRRRNVSK-----------------YVEAIACHQNGTKTIS

Query:  RFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQAKV
         FL+ +IF RFG PRAL+SD GTHF N  L  LL +Y +KH++AT YHPQ NGQA++
Subjt:  RFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQAKV

SwissProt top hitse value%identityAlignment
O92815 Gag-Pol polyprotein1.6e-0430.99Show/hide
Query:  SKYVEAIACHQNGTKTISRFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANG
        SK+ E I C++   KT+   L   I  R+G P  + SD+GTHF   +  +L     +  ++    HP+++G
Subjt:  SKYVEAIACHQNGTKTISRFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANG

P10272 Gag-Pol polyprotein3.5e-0434.62Show/hide
Query:  SKYVEAIACHQNGTKTISRFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQAKVKRM
        S +VEA    Q     +++ +   IF RFG P+ + SD G  FV+ V   L     I  ++   Y PQ++GQ  V+RM
Subjt:  SKYVEAIACHQNGTKTISRFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQAKVKRM

P31792 Pol polyprotein (Fragment)2.7e-0434.62Show/hide
Query:  SKYVEAIACHQNGTKTISRFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQAKVKRM
        S +VEA    Q     +++ +   IF RFG P+ + SD G  FV+ V   L     I  ++   Y PQ++GQ  V+RM
Subjt:  SKYVEAIACHQNGTKTISRFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQAKVKRM

P92516 Uncharacterized mitochondrial protein AtMg007503.2e-0557.58Show/hide
Query:  ILQCGFFWPPLFKDAHWFYKQCDTCQRRRNVSK
        +LQ GF+WP  FKDAH F   CD CQR+ N +K
Subjt:  ILQCGFFWPPLFKDAHWFYKQCDTCQRRRNVSK

Arabidopsis top hitse value%identityAlignment
ATMG00750.1 GAG/POL/ENV polyprotein2.3e-0657.58Show/hide
Query:  ILQCGFFWPPLFKDAHWFYKQCDTCQRRRNVSK
        +LQ GF+WP  FKDAH F   CD CQR+ N +K
Subjt:  ILQCGFFWPPLFKDAHWFYKQCDTCQRRRNVSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGATTTGACAAACAAGCATTTGGGAGATCATGGAGAGGTTAGTATAGAGGATTTAGAAGTTTGTTCTTTAGAAAGAAAAAGTGAAAAAGAAGTGTCTAGGTGTGA
GGATGTTTTTGAGTCTTTAGATTTGAATGAAAGGAAGGCTCCTCCTATTAAGCCATTCCTGATTGAGGAACCCACTTTAGATTTGAAGCCCTTTTCGGATCATCTAAAGT
TTGTGTATCGAGGGGAGGGTGAGATGTTGCTCATTATTGTTGCATCATATTTAATGCCAGAGGATGAAGAGACCTTAATAGAGTTGCTGCAGCAATATCGAAAAGCAATA
GGTTGGACATTGGCTAACATACAGGGAATTAACACGTCTTTCTGTATGCATAAAATCACTCTAGAGGAGGGATCCTTTAGAAGTATTGAGCAACAGAAAATGCTTAACCT
TGCAATGAAAGAGGTTGTTAAAAATGAGGTGATTAAATGGTTGGATGTTGGGATTATCTATCCAATTGCAGATAGCAATTGGGATCTAGGAGTGACCATCATAGATTCTA
AGTTTGATCTAAGAGTGATTCATCTTCTAACTAGACTCTTGGTTCGCATCATTCGGCGAATGCCTTTTGGCCTATGCAATGCTCCAACAACATTTCAGAGGTATTGTAGG
AAGGCTTTTGAGACTTTAAAGGCTGCTTTAATCTCAGCCCCCATTCTTTGTGCACCTAATTGGAGTTTATCATTTGAGGTAATGTGTGATGCAAGTGATGTTGCAGTAGG
TGATGAAGCAAAGGAAATCCTGGGGCAATGTCACTCCTCACCGTATGGAGGTCATTTTAGCGCTCAGAGGACAACTATGAGGATTTTGCAATGTGGATTTTTCTGGCCTC
CTTTATTTAAGGATGCCCATTGGTTCTACAAGCAATGTGATACTTGCCAAAGGAGAAGAAATGTGTCCAAGTATGTGGAAGCCATTGCATGCCATCAGAATGGTACTAAG
ACAATATCAAGGTTTCTCCAATCGCACATTTTTGCGCGGTTTGGGACACCTAGAGCTTTAGTGAGTGATGAGGGTACACACTTTGTTAATAATGTTTTAACTAAGCTTTT
AGCTAAGTATGAAATTAAGCATAGGATAGCTACCACTTATCACCCTCAAGCAAATGGTCAAGCTAAAGTAAAACGGATGGGCATCTGGTGCTTCAACGTGTCACGCAGGA
GAGATCTAACGGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGATTTGACAAACAAGCATTTGGGAGATCATGGAGAGGTTAGTATAGAGGATTTAGAAGTTTGTTCTTTAGAAAGAAAAAGTGAAAAAGAAGTGTCTAGGTGTGA
GGATGTTTTTGAGTCTTTAGATTTGAATGAAAGGAAGGCTCCTCCTATTAAGCCATTCCTGATTGAGGAACCCACTTTAGATTTGAAGCCCTTTTCGGATCATCTAAAGT
TTGTGTATCGAGGGGAGGGTGAGATGTTGCTCATTATTGTTGCATCATATTTAATGCCAGAGGATGAAGAGACCTTAATAGAGTTGCTGCAGCAATATCGAAAAGCAATA
GGTTGGACATTGGCTAACATACAGGGAATTAACACGTCTTTCTGTATGCATAAAATCACTCTAGAGGAGGGATCCTTTAGAAGTATTGAGCAACAGAAAATGCTTAACCT
TGCAATGAAAGAGGTTGTTAAAAATGAGGTGATTAAATGGTTGGATGTTGGGATTATCTATCCAATTGCAGATAGCAATTGGGATCTAGGAGTGACCATCATAGATTCTA
AGTTTGATCTAAGAGTGATTCATCTTCTAACTAGACTCTTGGTTCGCATCATTCGGCGAATGCCTTTTGGCCTATGCAATGCTCCAACAACATTTCAGAGGTATTGTAGG
AAGGCTTTTGAGACTTTAAAGGCTGCTTTAATCTCAGCCCCCATTCTTTGTGCACCTAATTGGAGTTTATCATTTGAGGTAATGTGTGATGCAAGTGATGTTGCAGTAGG
TGATGAAGCAAAGGAAATCCTGGGGCAATGTCACTCCTCACCGTATGGAGGTCATTTTAGCGCTCAGAGGACAACTATGAGGATTTTGCAATGTGGATTTTTCTGGCCTC
CTTTATTTAAGGATGCCCATTGGTTCTACAAGCAATGTGATACTTGCCAAAGGAGAAGAAATGTGTCCAAGTATGTGGAAGCCATTGCATGCCATCAGAATGGTACTAAG
ACAATATCAAGGTTTCTCCAATCGCACATTTTTGCGCGGTTTGGGACACCTAGAGCTTTAGTGAGTGATGAGGGTACACACTTTGTTAATAATGTTTTAACTAAGCTTTT
AGCTAAGTATGAAATTAAGCATAGGATAGCTACCACTTATCACCCTCAAGCAAATGGTCAAGCTAAAGTAAAACGGATGGGCATCTGGTGCTTCAACGTGTCACGCAGGA
GAGATCTAACGGTCTAG
Protein sequenceShow/hide protein sequence
MEDLTNKHLGDHGEVSIEDLEVCSLERKSEKEVSRCEDVFESLDLNERKAPPIKPFLIEEPTLDLKPFSDHLKFVYRGEGEMLLIIVASYLMPEDEETLIELLQQYRKAI
GWTLANIQGINTSFCMHKITLEEGSFRSIEQQKMLNLAMKEVVKNEVIKWLDVGIIYPIADSNWDLGVTIIDSKFDLRVIHLLTRLLVRIIRRMPFGLCNAPTTFQRYCR
KAFETLKAALISAPILCAPNWSLSFEVMCDASDVAVGDEAKEILGQCHSSPYGGHFSAQRTTMRILQCGFFWPPLFKDAHWFYKQCDTCQRRRNVSKYVEAIACHQNGTK
TISRFLQSHIFARFGTPRALVSDEGTHFVNNVLTKLLAKYEIKHRIATTYHPQANGQAKVKRMGIWCFNVSRRRDLTV