; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041366 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041366
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr13:16626136..16636061
RNA-Seq ExpressionLag0041366
SyntenyLag0041366
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158803.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111025268 [Momordica charantia]2.1e-10443.39Show/hide
Query:  MYFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWPKEFVIHIDHQSLQHLKGETKLNRRHAK------------------------------------
        MYFSEKL GA+L YPTYDKEL+ALVR LQ WQHYLWP+EFVIH DH+SL+HLKG++KLNRRHAK                                    
Subjt:  MYFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWPKEFVIHIDHQSLQHLKGETKLNRRHAK------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------CKMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWRVLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAII
                    KMAHFIACNKT+DAKHVADLFF++VV+LHG+P++IVSDRDVKFLSHFW+VLWGK G KL++STTCHPQTDGQTE VN T+  +LR II
Subjt:  -----------CKMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWRVLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAII

Query:  NKNTKTWEDCLPFIEFAYNRVVHSVTKCTPFEIVYGFNPLSPIDLLPFPSNEVVNLQAENKATIIQELHKEVKERTERQNSKGDGPFQVMKKVNDNAYQI
        ++N  +WE+ LPF+EFAYNR +HS T C+PFE+VYGFNPL+P+DLLP PSNE V +    +        +   +R  + + +G GP+QV++++N+NAY+I
Subjt:  NKNTKTWEDCLPFIEFAYNRVVHSVTKCTPFEIVYGFNPLSPIDLLPFPSNEVVNLQAENKATIIQELHKEVKERTERQNSKGDGPFQVMKKVNDNAYQI

Query:  DLKGKYGVSATFNVSDLTPFDVGAEF-DSRTNPSKGGEDDMNQDIIVLPPGPITRSRAKQLQLAFDSHIQTMVDSIKEAFAILE
        DL G + VS+TFNV+DL+PFDVG +  DSRTN  + GEDD N  +  +  GPITR++AK+LQ AF  H+Q  + S++  F +L+
Subjt:  DLKGKYGVSATFNVSDLTPFDVGAEF-DSRTNPSKGGEDDMNQDIIVLPPGPITRSRAKQLQLAFDSHIQTMVDSIKEAFAILE

XP_022932136.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111438459, partial [Cucurbita moschata]2.5e-10244.19Show/hide
Query:  MYFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWPKEFVIHIDHQSLQHLKGETKLNRRHAK------------------------------------
        M+FSEKL GA L YPTYDKELYALVRALQ WQHYLWPKEF+IH DH+SL+HL+ +TKLNRRHAK                                    
Subjt:  MYFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWPKEFVIHIDHQSLQHLKGETKLNRRHAK------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------CKMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWRVLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAII
                    KMAHFI CNKT+DA ++A+LFFK+VV+LHG+PK+I+SDRDVKFLSHFW+VLWGK GTK ++STTCHPQTDGQTE VNRT+ T+LR++I
Subjt:  -----------CKMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWRVLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAII

Query:  NKNTKTWEDCLPFIEFAYNRVVHSVTKCTPFEIVYGFNPLSPIDLLPFPSNEVVNLQAENKATIIQELHKE--VKERTERQNSKGDGPFQVMKKVNDNAY
        +KN K+WE+ LPF+EFAYNR +HS T+C+PFE+VYGFNPL+P+DL P P              +   L KE    +R  +   +GDGPFQV++++NDNAY
Subjt:  NKNTKTWEDCLPFIEFAYNRVVHSVTKCTPFEIVYGFNPLSPIDLLPFPSNEVVNLQAENKATIIQELHKE--VKERTERQNSKGDGPFQVMKKVNDNAY

Query:  QIDLKGKYGVSATFNVSDLTPFDVGAE-FDSRTNPSKGGEDDMNQ--DIIVLPPGPITRSRAKQLQLAFDSHI
        ++DL+G+Y VS+TFNV+DLTPFDVG E  D RTNP K  EDDMN   D + +P GPITRS+ K++Q A+  H+
Subjt:  QIDLKGKYGVSATFNVSDLTPFDVGAE-FDSRTNPSKGGEDDMNQ--DIIVLPPGPITRSRAKQLQLAFDSHI

XP_023541047.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111801285 [Cucurbita pepo subsp. pepo]1.2e-10749.01Show/hide
Query:  MYFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWPKEFVIHIDHQSLQHLKGETKLNRRHAK------------------------------------
        M+FSEKLTGA+L+YPTYDKELYALVRALQTWQHYLWPKEF+IH DH+SL+HL+ + KLNRRHAK                                    
Subjt:  MYFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWPKEFVIHIDHQSLQHLKGETKLNRRHAK------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------CKMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWRVLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAII
                    KMAHFI C+KT+DAKH+ADLFF++VV+LHG+PKSIVSDRDVKFLSHFWRVLWGK GTKLVYSTTCHPQTDGQTE VNRTM TMLRAII
Subjt:  -----------CKMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWRVLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAII

Query:  NKNTKTWEDCLPFIEFAYNRVVHSVTKCTPFEIVYGFNPLSPIDLLPFPSNEV------VNLQAENKATIIQELHKEVKERTERQNSKGDGPFQVMKKVN
        +KN KTWEDCLPFIEFAYNRVVHS TKCTPFEIVYGFNPL+PIDLLP PS E       V+ + E   T          +R  +   +GDGPFQV++++N
Subjt:  NKNTKTWEDCLPFIEFAYNRVVHSVTKCTPFEIVYGFNPLSPIDLLPFPSNEV------VNLQAENKATIIQELHKEVKERTERQNSKGDGPFQVMKKVN

Query:  DNAYQIDLKGKYGVSATFNVSDLTPFDVGAEFDSRTNPSKGGEDDMNQDIIVL
        DNAY+IDL+ KYGVSA FN  DL+PFDVG   DSR +PS+ GE+DMN  + VL
Subjt:  DNAYQIDLKGKYGVSATFNVSDLTPFDVGAEFDSRTNPSKGGEDDMNQDIIVL

XP_023923864.1 uncharacterized protein LOC112035274 [Quercus suber]7.4e-10244.7Show/hide
Query:  YFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWPKEFVIHIDHQSLQHLKGETKLNRRHAK-------------------------------------
        YFSEKL GA L YPTYDKELYALVRAL+TWQHYLWPKEFVIH DH+SL+HLKG+ KLNRRHAK                                     
Subjt:  YFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWPKEFVIHIDHQSLQHLKGETKLNRRHAK-------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------CKMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWRVLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAIIN
                   KMAHFI+C+KT+DA H+ADLFF+++V+LHG+P+SIVSDRDVKFLS+FW+VLWGK GTKL++STTCHPQTDGQTE VNRT+ T+LR II 
Subjt:  ----------CKMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWRVLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAIIN

Query:  KNTKTWEDCLPFIEFAYNRVVHSVTKCTPFEIVYGFNPLSPIDLLPFPSNEVVNLQAENKATIIQELHKEVKERTERQNSKGDGPFQVMKKVNDNAYQID
        KN K WEDCLPFIEFAYNR VHS T  +PFEIVYGFNPL+P+DLLP P NE+ +    +   +     +    R  + + +GDGPFQV++++N+NAY++D
Subjt:  KNTKTWEDCLPFIEFAYNRVVHSVTKCTPFEIVYGFNPLSPIDLLPFPSNEVVNLQAENKATIIQELHKEVKERTERQNSKGDGPFQVMKKVNDNAYQID

Query:  LKGKYGVSATFNVSDLTPFDVGAEFDSRTNPSKGGEDDMNQ----DIIVLPPGPITRSRAKQLQLAFDSHIQ
        L G+Y +SATFNVSDL+PFDVG   DSRTNP +   +D NQ    D + +P GPIT++R+K+++ A +  IQ
Subjt:  LKGKYGVSATFNVSDLTPFDVGAEFDSRTNPSKGGEDDMNQ----DIIVLPPGPITRSRAKQLQLAFDSHIQ

XP_024948175.1 uncharacterized protein LOC112495731, partial [Citrus sinensis]1.9e-11052.76Show/hide
Query:  YFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWPKEFVIHIDHQSLQHLKGETKLNRRHAK------------------------------CKMAHFI
        YFSEKL GA L YPTYDKE+YALVRAL+TWQHYL PKEFVIH DH+SL+HLKG+ KL++RHAK                               KMAHFI
Subjt:  YFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWPKEFVIHIDHQSLQHLKGETKLNRRHAK------------------------------CKMAHFI

Query:  ACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWRVLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAIINKNTKTWEDCLPFIEFAY
         C+KT+DA  +A+LFFK++V+LHG+P+SIVSDRD KFLSHFW+ LWGK GTKL++STTCHPQTDGQTE VNRT+ T+LRAII KN KTWE+CLP +EFAY
Subjt:  ACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWRVLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAIINKNTKTWEDCLPFIEFAY

Query:  NRVVHSVTKCTPFEIVYGFNPLSPIDLLPFPSNEVVNLQAENKATIIQELHKEVKERTERQNS-------------------------------------
        NR VHS TK +PFEIVYGFNPL+P+DLLP P +E  ++  + KA  +++LH+  ++  E++                                       
Subjt:  NRVVHSVTKCTPFEIVYGFNPLSPIDLLPFPSNEVVNLQAENKATIIQELHKEVKERTERQNS-------------------------------------

Query:  ---KGDGPFQVMKKVNDNAYQIDLKGKYGVSATFNVSDLTPFDVGAEFDSRTNP-SKGGEDDMNQ-------DIIVLPPGPITRSRAKQLQLAFDSHI
           +GDGPFQV+ ++NDNAY++DL G+Y V ATFNVSDL+PFDVG   DSRTNP  + G D+ +Q       D + +  GPITR+RAK++Q A +  I
Subjt:  ---KGDGPFQVMKKVNDNAYQIDLKGKYGVSATFNVSDLTPFDVGAEFDSRTNP-SKGGEDDMNQ-------DIIVLPPGPITRSRAKQLQLAFDSHI

TrEMBL top hitse value%identityAlignment
A0A2N9GXJ1 Reverse transcriptase1.5e-10849.29Show/hide
Query:  YFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWPKEFVIHIDHQSLQHLKGETKLNRRHAK-------------------------------------
        YFSEKL+GA L Y TYDKE+YALVRAL  WQHYLWPKEFVIH DH+SL+HLKG+ +LN+RHAK                                     
Subjt:  YFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWPKEFVIHIDHQSLQHLKGETKLNRRHAK-------------------------------------

Query:  -------CKMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWRVLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAIINKNT
                KMAHFIAC+KT+DA H+ADLFFK++V+LHG+P++IVSDRD KFLS+FW+ LWGK GTKL++ST CHPQTDGQ E VNRT+ ++LRAII +N 
Subjt:  -------CKMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWRVLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAIINKNT

Query:  KTWEDCLPFIEFAYNRVVHSVTKCTPFEIVYGFNPLSPIDLLPFPSNEVVNLQAENKATIIQELHKEVKERTERQNS-----------------------
        +TWEDCL  +EFAYNR +HS TK +PFE+VYGFNPLSP+DL   P +E VNL  + K   ++ +H++ +   ER+                         
Subjt:  KTWEDCLPFIEFAYNRVVHSVTKCTPFEIVYGFNPLSPIDLLPFPSNEVVNLQAENKATIIQELHKEVKERTERQNS-----------------------

Query:  -----------------KGDGPFQVMKKVNDNAYQIDLKGKYGVSATFNVSDLTPFDVGAEFDSRTNPSKGGEDDMN-----------------QDIIVL
                         +GDGPFQV++++NDNAY++DL G+YGVSA+FNV+DL+PFDVG   D RTNPS+ GE+D N                 QD + L
Subjt:  -----------------KGDGPFQVMKKVNDNAYQIDLKGKYGVSATFNVSDLTPFDVGAEFDSRTNPSKGGEDDMN-----------------QDIIVL

Query:  PPGPITRSRAKQLQLAFDSHIQ
        P GPITR RAK+ + A +  IQ
Subjt:  PPGPITRSRAKQLQLAFDSHIQ

A0A2N9HUK1 Integrase catalytic domain-containing protein1.1e-11150.73Show/hide
Query:  YFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWPKEFVIHIDHQSLQHLKGETKLNRRHAK-------------------------------------
        YFSEKL+GA L YPTYDKELYALVRAL+TWQHYLWPKEFVIH DH+SL+HLKG+ KLNRRHA+                                     
Subjt:  YFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWPKEFVIHIDHQSLQHLKGETKLNRRHAK-------------------------------------

Query:  -------CKMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWRVLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAIINKNT
                KMAHFI C+KT+DA HVADLFF+++V+LHG+P++IVSDRD KFLS+FW+ LW K GTKL++STTCHPQTDGQTE VNRT+ T+LRAII KN 
Subjt:  -------CKMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWRVLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAIINKNT

Query:  KTWEDCLPFIEFAYNRVVHSVTKCTPFEIVYGFNPLSPIDLLPFPSNEVVNLQAENKATIIQELHKEVKERTERQNS-----------------------
        KTWE+CLP +EFAYNR VHS TK +PFEIVYGFNPL+P+DL P P  E VNL  + KA  ++++H++ +   ER+                         
Subjt:  KTWEDCLPFIEFAYNRVVHSVTKCTPFEIVYGFNPLSPIDLLPFPSNEVVNLQAENKATIIQELHKEVKERTERQNS-----------------------

Query:  -----------------KGDGPFQVMKKVNDNAYQIDLKGKYGVSATFNVSDLTPFDVGAEFDSRTNP-----SKGGEDDMNQDIIVLPPGPITRSRAKQ
                         +GDGPFQV++++NDNAY++DL G+Y VSATFNV+DL+PFDVG   D R NP     + G +   ++D++ +P GP+TR+RAK+
Subjt:  -----------------KGDGPFQVMKKVNDNAYQIDLKGKYGVSATFNVSDLTPFDVGAEFDSRTNP-----SKGGEDDMNQDIIVLPPGPITRSRAKQ

Query:  LQLAFDSHIQTM
         +   +  IQ +
Subjt:  LQLAFDSHIQTM

A0A2N9HVZ7 Reverse transcriptase1.8e-10645.71Show/hide
Query:  YFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWPKEFVIHIDHQSLQHLKGETKLNRRHAK-------------------------------------
        YFSEKL+GA L YPTYDKELYALVRAL+TWQHYLWPKEFVIH DH+SL+HLKG+ KLN+RHA+                                     
Subjt:  YFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWPKEFVIHIDHQSLQHLKGETKLNRRHAK-------------------------------------

Query:  --------------------------------------------------CKMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWR
                                                           KMAHFI C+KT+DA HVADLFF+++V+LHG+P++IVSDRD KFLS+FW+
Subjt:  --------------------------------------------------CKMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWR

Query:  VLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAIINKNTKTWEDCLPFIEFAYNRVVHSVTKCTPFEIVYGFNPLSPIDLLPFPSNEVVNLQAENK
         LW K GTKL++STTCHPQTDGQTE VNRT+ T+LRAII KN KTWE+CLP +EFAYNR VHS TK +PFEIVYGFNPL+P+DL P P  E VNL  + K
Subjt:  VLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAIINKNTKTWEDCLPFIEFAYNRVVHSVTKCTPFEIVYGFNPLSPIDLLPFPSNEVVNLQAENK

Query:  ATIIQELHKEVKERTERQNS----------------------------------------KGDGPFQVMKKVNDNAYQIDLKGKYGVSATFNVSDLTPFD
        A  ++++H++ +   ER+                                          +GDGPFQV++++NDNAY++DL G+Y VSATFNV+DL+PFD
Subjt:  ATIIQELHKEVKERTERQNS----------------------------------------KGDGPFQVMKKVNDNAYQIDLKGKYGVSATFNVSDLTPFD

Query:  VGAEFDSRTNP-----SKGGEDDMNQDIIVLPPGPITRSRAKQLQLAFDSHIQTM
        VG   D R NP     + G +   ++D++ +P GP+TR+RAK+ +   +  IQ +
Subjt:  VGAEFDSRTNP-----SKGGEDDMNQDIIVLPPGPITRSRAKQLQLAFDSHIQTM

A0A2N9IKQ6 Reverse transcriptase1.8e-10645.71Show/hide
Query:  YFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWPKEFVIHIDHQSLQHLKGETKLNRRHAK-------------------------------------
        YFSEKL+GA L YPTYDKELYALVRAL+TWQHYLWPKEFVIH DH+SL+HLKG+ KLN+RHA+                                     
Subjt:  YFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWPKEFVIHIDHQSLQHLKGETKLNRRHAK-------------------------------------

Query:  --------------------------------------------------CKMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWR
                                                           KMAHFI C+KT+DA HVADLFF+++V+LHG+P++IVSDRD KFLS+FW+
Subjt:  --------------------------------------------------CKMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWR

Query:  VLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAIINKNTKTWEDCLPFIEFAYNRVVHSVTKCTPFEIVYGFNPLSPIDLLPFPSNEVVNLQAENK
         LW K GTKL++STTCHPQTDGQTE VNRT+ T+LRAII KN KTWE+CLP +EFAYNR VHS TK +PFEIVYGFNPL+P+DL P P  E VNL  + K
Subjt:  VLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAIINKNTKTWEDCLPFIEFAYNRVVHSVTKCTPFEIVYGFNPLSPIDLLPFPSNEVVNLQAENK

Query:  ATIIQELHKEVKERTERQNS----------------------------------------KGDGPFQVMKKVNDNAYQIDLKGKYGVSATFNVSDLTPFD
        A  ++++H++ +   ER+                                          +GDGPFQV++++NDNAY++DL G+Y VSATFNV+DL+PFD
Subjt:  ATIIQELHKEVKERTERQNS----------------------------------------KGDGPFQVMKKVNDNAYQIDLKGKYGVSATFNVSDLTPFD

Query:  VGAEFDSRTNP-----SKGGEDDMNQDIIVLPPGPITRSRAKQLQLAFDSHIQTM
        VG   D R NP     + G +   ++D++ +P GP+TR+RAK+ +   +  IQ +
Subjt:  VGAEFDSRTNP-----SKGGEDDMNQDIIVLPPGPITRSRAKQLQLAFDSHIQTM

A0A2N9IM79 Reverse transcriptase1.8e-10645.71Show/hide
Query:  YFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWPKEFVIHIDHQSLQHLKGETKLNRRHAK-------------------------------------
        YFSEKL+GA L YPTYDKELYALVRAL+TWQHYLWPKEFVIH DH+SL+HLKG+ KLN+RHA+                                     
Subjt:  YFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWPKEFVIHIDHQSLQHLKGETKLNRRHAK-------------------------------------

Query:  --------------------------------------------------CKMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWR
                                                           KMAHFI C+KT+DA HVADLFF+++V+LHG+P++IVSDRD KFLS+FW+
Subjt:  --------------------------------------------------CKMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWR

Query:  VLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAIINKNTKTWEDCLPFIEFAYNRVVHSVTKCTPFEIVYGFNPLSPIDLLPFPSNEVVNLQAENK
         LW K GTKL++STTCHPQTDGQTE VNRT+ T+LRAII KN KTWE+CLP +EFAYNR VHS TK +PFEIVYGFNPL+P+DL P P  E VNL  + K
Subjt:  VLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAIINKNTKTWEDCLPFIEFAYNRVVHSVTKCTPFEIVYGFNPLSPIDLLPFPSNEVVNLQAENK

Query:  ATIIQELHKEVKERTERQNS----------------------------------------KGDGPFQVMKKVNDNAYQIDLKGKYGVSATFNVSDLTPFD
        A  ++++H++ +   ER+                                          +GDGPFQV++++NDNAY++DL G+Y VSATFNV+DL+PFD
Subjt:  ATIIQELHKEVKERTERQNS----------------------------------------KGDGPFQVMKKVNDNAYQIDLKGKYGVSATFNVSDLTPFD

Query:  VGAEFDSRTNP-----SKGGEDDMNQDIIVLPPGPITRSRAKQLQLAFDSHIQTM
        VG   D R NP     + G +   ++D++ +P GP+TR+RAK+ +   +  IQ +
Subjt:  VGAEFDSRTNP-----SKGGEDDMNQDIIVLPPGPITRSRAKQLQLAFDSHIQTM

SwissProt top hitse value%identityAlignment
O92815 Gag-Pol polyprotein1.6e-0630.4Show/hide
Query:  KMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWRVLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVT-MLRAIINKNTKTWEDCL
        K    I CNK  DAK V D+  KD++   GLP  I SD+   F +   + L    G         HP++ G  E  NRT+ + +++A        W + L
Subjt:  KMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWRVLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVT-MLRAIINKNTKTWEDCL

Query:  PFIEFAYNRVVHSVTKCTPFEIVYG
        P++     R        +P EIV G
Subjt:  PFIEFAYNRVVHSVTKCTPFEIVYG

P0CT41 Transposon Tf2-12 polyprotein3.3e-0429.06Show/hide
Query:  YFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWP--KEFVIHIDHQSL-QHLKGETK-LNRRHAKCKM----AHFIACNKTNDAKHVADLFFKDVVKL
        Y+S K++ A L Y   DKE+ A++++L+ W+HYL    + F I  DH++L   +  E++  N+R A+ ++     +F    +   A H+AD   + V + 
Subjt:  YFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWP--KEFVIHIDHQSL-QHLKGETK-LNRRHAKCKM----AHFIACNKTNDAKHVADLFFKDVVKL

Query:  HGLPKSIVSDRDVKFLS
          +PK    D  + F++
Subjt:  HGLPKSIVSDRDVKFLS

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein5.2e-2633.74Show/hide
Query:  KMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWRVLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAIINKNTKTWEDCLP
        K AHFIA  KT DA  + DL F+ +   HG P++I SDRDV+  +  ++ L  + G K   S+  HPQTDGQ+E   +T+  +LRA ++ N + W   LP
Subjt:  KMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWRVLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAIINKNTKTWEDCLP

Query:  FIEFAYNRVVHSVTKCTPFEIVYGFNPLSPI----DLLPFPSNEVVNLQAENKATIIQEL----HKEVKERTERQNSK-------GD-------------
         IEF YN         +PFEI  G+ P +P     D +   S   V L    KA  IQ      H +++  T     +       GD             
Subjt:  FIEFAYNRVVHSVTKCTPFEIVYGFNPLSPI----DLLPFPSNEVVNLQAENKATIIQEL----HKEVKERTERQNSK-------GD-------------

Query:  ----------GPFQVMKKVNDNAYQIDLKGKYGVSATFNVSDL
                  GPF+V+KK+NDNAY++DL          NV  L
Subjt:  ----------GPFQVMKKVNDNAYQIDLKGKYGVSATFNVSDL

Q99315 Transposon Ty3-G Gag-Pol polyprotein4.0e-2633.74Show/hide
Query:  KMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWRVLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAIINKNTKTWEDCLP
        K AHFIA  KT DA  + DL F+ +   HG P++I SDRDV+  +  ++ L  + G K   S+  HPQTDGQ+E   +T+  +LRA  + N + W   LP
Subjt:  KMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLSHFWRVLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAIINKNTKTWEDCLP

Query:  FIEFAYNRVVHSVTKCTPFEIVYGFNPLSPI----DLLPFPSNEVVNLQAENKATIIQEL----HKEVKERTERQNSK-------GD-------------
         IEF YN         +PFEI  G+ P +P     D +   S   V L    KA  IQ      H +++  T     +       GD             
Subjt:  FIEFAYNRVVHSVTKCTPFEIVYGFNPLSPI----DLLPFPSNEVVNLQAENKATIIQEL----HKEVKERTERQNSK-------GD-------------

Query:  ----------GPFQVMKKVNDNAYQIDLKGKYGVSATFNVSDLTPF
                  GPF+V+KK+NDNAY++DL          NV  L  F
Subjt:  ----------GPFQVMKKVNDNAYQIDLKGKYGVSATFNVSDLTPF

Q9UR07 Transposon Tf2-11 polyprotein3.3e-0429.06Show/hide
Query:  YFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWP--KEFVIHIDHQSL-QHLKGETK-LNRRHAKCKM----AHFIACNKTNDAKHVADLFFKDVVKL
        Y+S K++ A L Y   DKE+ A++++L+ W+HYL    + F I  DH++L   +  E++  N+R A+ ++     +F    +   A H+AD   + V + 
Subjt:  YFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWP--KEFVIHIDHQSL-QHLKGETK-LNRRHAKCKM----AHFIACNKTNDAKHVADLFFKDVVKL

Query:  HGLPKSIVSDRDVKFLS
          +PK    D  + F++
Subjt:  HGLPKSIVSDRDVKFLS

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATTTTAGTGAGAAGCTGACTGGAGCAACCTTGAAGTATCCTACATATGACAAGGAACTTTATGCCTTGGTGAGAGCGCTACAAACATGGCAGCATTACTTGTGGCC
AAAAGAATTCGTCATTCACATAGATCATCAGAGTTTGCAACATTTGAAGGGCGAAACCAAGCTGAATAGAAGACATGCTAAATGCAAAATGGCACACTTCATTGCATGTA
ATAAAACTAATGATGCTAAACATGTGGCTGATTTGTTCTTTAAAGATGTGGTTAAATTGCATGGCCTTCCTAAGAGTATTGTCAGTGATAGGGATGTAAAGTTTTTAAGC
CATTTTTGGAGAGTCCTTTGGGGAAAACATGGTACTAAATTAGTTTACTCCACTACATGTCATCCTCAAACTGATGGACAAACTGAGGCAGTTAATAGGACAATGGTGAC
TATGCTTAGGGCTATCATAAATAAAAACACCAAAACTTGGGAGGATTGTCTACCATTCATTGAATTTGCATACAATAGGGTTGTCCATAGTGTCACTAAATGTACTCCAT
TTGAGATTGTGTATGGGTTTAATCCTTTATCACCTATTGACTTGTTACCTTTTCCTTCAAACGAGGTTGTGAATCTTCAAGCTGAGAACAAGGCGACAATCATACAAGAG
TTACATAAAGAGGTTAAAGAAAGAACTGAAAGACAAAATTCCAAGGGGGATGGACCTTTCCAAGTGATGAAGAAAGTCAATGATAATGCCTATCAAATTGACCTAAAAGG
TAAATATGGTGTGAGTGCTACTTTTAATGTTTCTGACTTGACTCCTTTCGATGTAGGTGCAGAATTTGATTCGAGGACGAATCCTTCAAAAGGGGGAGAAGATGATATGA
ACCAAGACATCATTGTTTTACCTCCAGGACCAATCACACGCTCAAGGGCTAAGCAACTACAATTGGCCTTTGATTCCCACATACAAACAATGGTGGACTCAATTAAGGAA
GCCTTTGCAATTCTTGAATTAAGAGTGGCAAGCATGAACATTCAGATTGAGTTGCTCATTGTTTTGCCGCCACCGCTCACCACATCGTCGCCGCCGTTGAAGATCTACGC
CGCCGCCACTCCGACGCCTGTAGCCACCTTGAACTCCAACCATCTAGCAGTAGCTCCGACCCACATTCTAGTGAATTTCCATGGAAGAAAAGGAAAAAAGAAAGAAGAAA
ATCGTGGTCATCATGCATTTCACACAATCACAGCTCTATACCCGGGCTATACCCTTAGGATTAGAATCGAGGATGTAGTAGTGTCGACCTTCGAAGGGACATCATTGCCC
TCGTGTAGGCGCACGAAAAATCCTCACTCAGTGACAGATGTCTCTCAATCTGACATCCCAGTCAATAATCTTGGTGACTCACCTCATTCGCACAAGTCAGAAATATCTCT
TTGTCAGCCAACTAGAGATTTCACTCAACCACAGTCATCACCCCCCTTCAAAAAGGTAAAAAGTTATTACCAAGAAAATGCTGAAATGCTTAAGGAAAGATTAGAGGCGA
TTAAAGGAACTGACATCTTTGGTAATGTTGGTCCTACCCAATTATGTCTGGTGTCGAACTTGGTTATTCCACCCAAATTCAAAGTACCTAAGTTTGGAAAATATGATGGG
ACGTCTTGTCTCAAAAGCCAGCTCGTTTTGTATTGCAGAAAGATGTCAGCACATTCTCAAAATGACAAGCTCCTCATACACTATTTTCAAGATAATTTAACTGGTCTAGC
ATCTCATTGGTATATCCAATTAGACAGTAGACACATCCGTTCTTGGAGGAGTTTGGTTGATTCCTTTTTGAAGAAATACATGCATAATGTAGACATGGCTCCTGATCGAT
TATACTTGCAGAACATGGGGAGGAAGGAAGAGCAGAGAGCTTTAAAGAATATGCTCAAAGGTGGAGTGATACATTTGCATAAGTACACCCACCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTATTTTAGTGAGAAGCTGACTGGAGCAACCTTGAAGTATCCTACATATGACAAGGAACTTTATGCCTTGGTGAGAGCGCTACAAACATGGCAGCATTACTTGTGGCC
AAAAGAATTCGTCATTCACATAGATCATCAGAGTTTGCAACATTTGAAGGGCGAAACCAAGCTGAATAGAAGACATGCTAAATGCAAAATGGCACACTTCATTGCATGTA
ATAAAACTAATGATGCTAAACATGTGGCTGATTTGTTCTTTAAAGATGTGGTTAAATTGCATGGCCTTCCTAAGAGTATTGTCAGTGATAGGGATGTAAAGTTTTTAAGC
CATTTTTGGAGAGTCCTTTGGGGAAAACATGGTACTAAATTAGTTTACTCCACTACATGTCATCCTCAAACTGATGGACAAACTGAGGCAGTTAATAGGACAATGGTGAC
TATGCTTAGGGCTATCATAAATAAAAACACCAAAACTTGGGAGGATTGTCTACCATTCATTGAATTTGCATACAATAGGGTTGTCCATAGTGTCACTAAATGTACTCCAT
TTGAGATTGTGTATGGGTTTAATCCTTTATCACCTATTGACTTGTTACCTTTTCCTTCAAACGAGGTTGTGAATCTTCAAGCTGAGAACAAGGCGACAATCATACAAGAG
TTACATAAAGAGGTTAAAGAAAGAACTGAAAGACAAAATTCCAAGGGGGATGGACCTTTCCAAGTGATGAAGAAAGTCAATGATAATGCCTATCAAATTGACCTAAAAGG
TAAATATGGTGTGAGTGCTACTTTTAATGTTTCTGACTTGACTCCTTTCGATGTAGGTGCAGAATTTGATTCGAGGACGAATCCTTCAAAAGGGGGAGAAGATGATATGA
ACCAAGACATCATTGTTTTACCTCCAGGACCAATCACACGCTCAAGGGCTAAGCAACTACAATTGGCCTTTGATTCCCACATACAAACAATGGTGGACTCAATTAAGGAA
GCCTTTGCAATTCTTGAATTAAGAGTGGCAAGCATGAACATTCAGATTGAGTTGCTCATTGTTTTGCCGCCACCGCTCACCACATCGTCGCCGCCGTTGAAGATCTACGC
CGCCGCCACTCCGACGCCTGTAGCCACCTTGAACTCCAACCATCTAGCAGTAGCTCCGACCCACATTCTAGTGAATTTCCATGGAAGAAAAGGAAAAAAGAAAGAAGAAA
ATCGTGGTCATCATGCATTTCACACAATCACAGCTCTATACCCGGGCTATACCCTTAGGATTAGAATCGAGGATGTAGTAGTGTCGACCTTCGAAGGGACATCATTGCCC
TCGTGTAGGCGCACGAAAAATCCTCACTCAGTGACAGATGTCTCTCAATCTGACATCCCAGTCAATAATCTTGGTGACTCACCTCATTCGCACAAGTCAGAAATATCTCT
TTGTCAGCCAACTAGAGATTTCACTCAACCACAGTCATCACCCCCCTTCAAAAAGGTAAAAAGTTATTACCAAGAAAATGCTGAAATGCTTAAGGAAAGATTAGAGGCGA
TTAAAGGAACTGACATCTTTGGTAATGTTGGTCCTACCCAATTATGTCTGGTGTCGAACTTGGTTATTCCACCCAAATTCAAAGTACCTAAGTTTGGAAAATATGATGGG
ACGTCTTGTCTCAAAAGCCAGCTCGTTTTGTATTGCAGAAAGATGTCAGCACATTCTCAAAATGACAAGCTCCTCATACACTATTTTCAAGATAATTTAACTGGTCTAGC
ATCTCATTGGTATATCCAATTAGACAGTAGACACATCCGTTCTTGGAGGAGTTTGGTTGATTCCTTTTTGAAGAAATACATGCATAATGTAGACATGGCTCCTGATCGAT
TATACTTGCAGAACATGGGGAGGAAGGAAGAGCAGAGAGCTTTAAAGAATATGCTCAAAGGTGGAGTGATACATTTGCATAAGTACACCCACCTTTAA
Protein sequenceShow/hide protein sequence
MYFSEKLTGATLKYPTYDKELYALVRALQTWQHYLWPKEFVIHIDHQSLQHLKGETKLNRRHAKCKMAHFIACNKTNDAKHVADLFFKDVVKLHGLPKSIVSDRDVKFLS
HFWRVLWGKHGTKLVYSTTCHPQTDGQTEAVNRTMVTMLRAIINKNTKTWEDCLPFIEFAYNRVVHSVTKCTPFEIVYGFNPLSPIDLLPFPSNEVVNLQAENKATIIQE
LHKEVKERTERQNSKGDGPFQVMKKVNDNAYQIDLKGKYGVSATFNVSDLTPFDVGAEFDSRTNPSKGGEDDMNQDIIVLPPGPITRSRAKQLQLAFDSHIQTMVDSIKE
AFAILELRVASMNIQIELLIVLPPPLTTSSPPLKIYAAATPTPVATLNSNHLAVAPTHILVNFHGRKGKKKEENRGHHAFHTITALYPGYTLRIRIEDVVVSTFEGTSLP
SCRRTKNPHSVTDVSQSDIPVNNLGDSPHSHKSEISLCQPTRDFTQPQSSPPFKKVKSYYQENAEMLKERLEAIKGTDIFGNVGPTQLCLVSNLVIPPKFKVPKFGKYDG
TSCLKSQLVLYCRKMSAHSQNDKLLIHYFQDNLTGLASHWYIQLDSRHIRSWRSLVDSFLKKYMHNVDMAPDRLYLQNMGRKEEQRALKNMLKGGVIHLHKYTHL