; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001040 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001040
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr4:22685375..22694174
RNA-Seq ExpressionLag0001040
SyntenyLag0001040
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]7.0e-4126.57Show/hide
Query:  ISDKARDWLQSTTPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPHMVTP---------IGFRG-----------
        + DKAR WLQS  PGSI +W  + + FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CP    P          G  G           
Subjt:  ISDKARDWLQSTTPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPHMVTP---------IGFRG-----------

Query:  -----------------MATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKF--------------------------------------
                         MA+N+YQWP+ER+  KK+ AG+ +++ ++AL AQ+ +L++                                           
Subjt:  -----------------MATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKF--------------------------------------

Query:  --------------------------------------SESSNRTTKLEEAVIAINSTVNGH------------------SAAIKNIETQLGQLVNVVST
                                              S+ S R   LE+A+++     N                     AAIKNIE Q+GQL   ++ 
Subjt:  --------------------------------------SESSNRTTKLEEAVIAINSTVNGH------------------SAAIKNIETQLGQLVNVVST

Query:  MNKGG-----------------------IEEEPESEDYETPT----GEAEEDTSSDEAEKPNLE-------------PPIPSPTLLVPKEKKKKKKKKN-
          +G                        IE  P  E   TPT    G+++     DE     LE             PPI +P L  P+  +K+K  K  
Subjt:  MNKGG-----------------------IEEEPESEDYETPT----GEAEEDTSSDEAEKPNLE-------------PPIPSPTLLVPKEKKKKKKKKN-

Query:  -------NQVH-------------------EGVVSKERKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS-------------------------
                ++H                   + ++SK+R+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF++PC+                         
Subjt:  -------NQVH-------------------EGVVSKERKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS-------------------------

Query:  ----------------------------FENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFK
                                     E+VL++V +F  P D  V+DM E+  +P+ILGRPFLATGRA+ID+++ ELT+RV  E+ +FK
Subjt:  ----------------------------FENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFK

XP_015385502.1 uncharacterized protein LOC107176892 [Citrus sinensis]1.0e-3927.55Show/hide
Query:  ISDKARDWLQSTTPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPHMVTPIGFR---------------------
        + D A++WL S   G+ITTWD L Q FL K+FPPAKT KLR +I TF Q   E L+EAWER+K+LLRKCPH   P+  +                     
Subjt:  ISDKARDWLQSTTPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPHMVTPIGFR---------------------

Query:  ----------------GMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLAN-------------------------------------------
                         MA+N+YQW SERS P+KI  G   VD V+AL AQMT+L+N                                           
Subjt:  ----------------GMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLAN-------------------------------------------

Query:  ---------------------------------------AFMKFSESSN-------RTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGG
                                                F    + SN        TT + + +    +T    +A+I+N+E Q+GQ+ N++S+   G 
Subjt:  ---------------------------------------AFMKFSESSN-------RTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGG

Query:  IEEEPESEDYETPTG---------------------EAEEDT------SSDEAEKPNLEPPIPS--PTLLVPKEKKKK-------------KKKKNN---
        +    E+   E                         E  EDT      +S E  +P L  P+ +  P +  P+  +K              KK   N   
Subjt:  IEEEPESEDYETPTG---------------------EAEEDT------SSDEAEKPNLEPPIPS--PTLLVPKEKKKK-------------KKKKNN---

Query:  -----------QVHEGVVSKERKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSF--------------------------------------EN
                   +  + ++S +RK ++ +TV L   C+  +Q K+P K+ DPGSF++PC+                                       E+
Subjt:  -----------QVHEGVVSKERKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSF--------------------------------------EN

Query:  VLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIF
        VL++V +F  P D  V+DM E+  +P+ILGRPFLATGRA+ID++  +L +RV+NE+ IF
Subjt:  VLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIF

XP_022157708.1 uncharacterized protein LOC111024361 [Momordica charantia]1.3e-3931.15Show/hide
Query:  LVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPH---------------------------------MVTP----IGFRGMATNS
        +VQAFL  FFPPAKT +LRTEI +F++   EQLFE WER+KELLRKCP                                    TP    I  + MA NS
Subjt:  LVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPH---------------------------------MVTP----IGFRGMATNS

Query:  YQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFS-------------------------------------------------ESSNRTTKLE
        +QWPSERS  KK+ AG++E+D++S+L+AQ+ +L NA  K S                                                 E  +R +++E
Subjt:  YQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFS-------------------------------------------------ESSNRTTKLE

Query:  EAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKG---------------------GIE-EEPESEDYETPTGEAEEDTSSDEAEK---PNLEPPIP-
          V  +   + G++ +IKN+E Q+GQ+   ++TM KG                     G E +EPE +  E P    EE  + +E  K   P L+   P 
Subjt:  EAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKG---------------------GIE-EEPESEDYETPTGEAEEDTSSDEAEK---PNLEPPIP-

Query:  -----SPTLLVPKEKKKKKKKKNN-QVHEGVVSKERKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS------------------------FEN
             SP   +P  +   ++  N  +  + +++ +RK +  +TV L   CS  +Q+K+P+K+ DPGSF++PC+                         E+
Subjt:  -----SPTLLVPKEKKKKKKKKNN-QVHEGVVSKERKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS------------------------FEN

Query:  VLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIF
        VL++V R   P D  V+   E+  +P+ILGR FLATG A+ID++   LT+RV  E  +F
Subjt:  VLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIF

XP_030964936.1 uncharacterized protein LOC115986224 [Quercus lobata]1.2e-4027.98Show/hide
Query:  ISDKARDWLQSTTPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPHMVTP---------IGFRG-----------
        + DKAR WLQS  PGSIT+W  + + FL K FPPAKT +LR++IG F+Q   E L+EAWER+K+L+R+CP    P          G  G           
Subjt:  ISDKARDWLQSTTPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPHMVTP---------IGFRG-----------

Query:  -----------------MATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKF--------------------------------------
                         MA+N+YQW +ER+  KK+ AG+ E+D  + L AQ+ SL++                                           
Subjt:  -----------------MATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKF--------------------------------------

Query:  ----SESSNRTTKLEEAVIA------------------INSTVNGHSAAIKNIETQLGQLVNVVSTMNKG-----------------------GIEEEPE
            S+ S +   LE+A+I+                  I +  +   A +KN+E Q+GQL   ++   +G                        IE  P 
Subjt:  ----SESSNRTTKLEEAVIA------------------INSTVNGHSAAIKNIETQLGQLVNVVSTMNKG-----------------------GIEEEPE

Query:  SEDYETPTG-------------EAEEDTSSDEAEKPNL----EPPIPSPTLLVPKEKKKKKKKKNNQVHEGVVSKERKEKKVDTVYLASTCSTRVQQKVP
         E   TPT              E  +DT  +    P++     PPI S  L  P+   ++       + + ++SK+R+ ++ +TV L+  CS  +Q+K+P
Subjt:  SEDYETPTG-------------EAEEDTSSDEAEKPNL----EPPIPSPTLLVPKEKKKKKKKKNNQVHEGVVSKERKEKKVDTVYLASTCSTRVQQKVP

Query:  EKVADPGSFSVPCS-----------------------------------------------------FENVLIRVGRFFLPIDLYVMDMIENPSMPVILG
        +K+ DPGSF++PC+                                                      E+VL++V +F  P D  V+DM E+  +P+ILG
Subjt:  EKVADPGSFSVPCS-----------------------------------------------------FENVLIRVGRFFLPIDLYVMDMIENPSMPVILG

Query:  RPFLATGRAIIDIERRELTIRVKNEKEIF
        RPFLATGRA+ID+++ ELT+RV  E+ +F
Subjt:  RPFLATGRAIIDIERRELTIRVKNEKEIF

XP_038973113.1 uncharacterized protein LOC120105094 [Phoenix dactylifera]5.9e-4029.13Show/hide
Query:  ISDKARDWLQSTTPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPHMV-----TPIG--------------FRGM
        + DKA+ WL S  P S T W+AL QAFL K+FPP KT KLR +I +F Q   E L+EAWERFK+L RKCPH V        G                 M
Subjt:  ISDKARDWLQSTTPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPHMV-----TPIG--------------FRGM

Query:  ATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSESS---NRTTKLEEAVIAINSTVNGHSAAIKNIE----------TQLGQLVNVVS
        A+N+YQW +ER  PKK+  G+++VD ++ L A++ SL   F   S ++     TT+    +   +  +     A+ ++            Q G L +   
Subjt:  ATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSESS---NRTTKLEEAVIAINSTVNGHSAAIKNIE----------TQLGQLVNVVS

Query:  ----------TMNKGG----------IEEEPESEDYETPTGEAEEDTSSDEAEKPNLE---PPIPSPTLLVPK---EKKKKKKKKNNQVH----------
                  T+  G           + +E + ++      E  ED +   +  P ++   PPIP P  L      ++ +K  K   Q+H          
Subjt:  ----------TMNKGG----------IEEEPESEDYETPTGEAEEDTSSDEAEKPNLE---PPIPSPTLLVPK---EKKKKKKKKNNQVH----------

Query:  ---------EGVVSKERKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS----------------------------------------------
                 + ++SK+RK +  +T+ L   CS  +Q K+P K+ DPGSFS+PC+                                              
Subjt:  ---------EGVVSKERKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS----------------------------------------------

Query:  -------FENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSKMKCFTWATGKVQERAPLLDSQNKS
                ENVLI+V +F +P+D  V++M E+  +P+ILGRPFLAT  AIIDI+   LT++V  E+  F   E +K   FT    +V     ++D   + 
Subjt:  -------FENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSKMKCFTWATGKVQERAPLLDSQNKS

Query:  LLEARATR
        +  A  T+
Subjt:  LLEARATR

TrEMBL top hitse value%identityAlignment
A0A1U7Z951 uncharacterized protein LOC1045905686.6e-2924.42Show/hide
Query:  ISDKARDWLQSTTPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPHMVTPIGFR---------------------
        + DK + WL S    SI+TWD +   FL K+FPP+K  K+R +I TF QQ  E L+E+WER+KELLRK PH   P+  +                     
Subjt:  ISDKARDWLQSTTPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPHMVTPIGFR---------------------

Query:  ----------------GMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLA---------------------------------NAFMK------
                         M  N+YQW SER+  ++    +  VD  + L AQ+ +L+                                 N FM+      
Subjt:  ----------------GMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLA---------------------------------NAFMK------

Query:  -----------------------------------------FSESSNRTTKLEEA----VIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGGIEEEP
                                                 F++  N+ + LEE     + +  +      A+IKN+ETQ+GQL  ++S+  +G +    
Subjt:  -----------------------------------------FSESSNRTTKLEEA----VIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGGIEEEP

Query:  ESEDYETPTGEAEEDTSSDEAEKPNLEPPIPSPTLLVPKEKKKKKKKKNNQVHEGVVSKERKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS--
        E+     P  + +  T     E   +E        L       K  K+       +++ + K   V TV +   CS  +  K+P+K+ DPGSF++PC+  
Subjt:  ESEDYETPTGEAEEDTSSDEAEKPNLEPPIPSPTLLVPKEKKKKKKKKNNQVHEGVVSKERKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS--

Query:  ---------------------------------------------------FENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERR
                                                            E+VL++V +F  P+D  V+DM E+  +P+ILGRPFLATG+A +D+++ 
Subjt:  ---------------------------------------------------FENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERR

Query:  ELTIRVKNEKEIFKAVEDSK
        +L++++++E+ IFK  +  K
Subjt:  ELTIRVKNEKEIFKAVEDSK

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129451.5e-3324.72Show/hide
Query:  ISDKARDWLQSTTPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPHMVTP------------IG-----------
        + DKA+ WL S   GSITTW+ L Q FL KFFPPAKT K+R +I +F Q   E L+EAWERFKELLR+CPH   P            +G           
Subjt:  ISDKARDWLQSTTPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPHMVTP------------IG-----------

Query:  --------------FRGMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAF-----------------------------------------
                         MA+N+YQWPSERS  +K A G +E+D +  L  Q+ +L+                                            
Subjt:  --------------FRGMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAF-----------------------------------------

Query:  ------------------MKFSESSN------------------------RTTKLEEAVI----AINSTVNGHSAAIKNIETQLGQLVNVVST-------
                            FS S+N                        + ++LEE ++      ++ +    A+++N+ETQ+GQL N ++        
Subjt:  ------------------MKFSESSN------------------------RTTKLEEAVI----AINSTVNGHSAAIKNIETQLGQLVNVVST-------

Query:  --------------------------MNKGGIEEEPESED------YETPTGEAEEDTSSDEAEKPNLEPPIPSPTLLVPKEKKKKKKKKNN---QVH--
                                  +N+  +E E E  D       E    + ++D + ++     + PP P P  L  ++ +K+ +K  N   ++H  
Subjt:  --------------------------MNKGGIEEEPESED------YETPTGEAEEDTSSDEAEKPNLEPPIPSPTLLVPKEKKKKKKKKNN---QVH--

Query:  -----------------EGVVSKERKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS--------------------------------------
                         + ++SK+RK  + +TV+L   CS  +Q K+P K+ DPGSF++PC+                                      
Subjt:  -----------------EGVVSKERKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS--------------------------------------

Query:  ---------------FENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSKMKCFTWATGKVQERAP
                        E+VL++V +F  P+D  ++DM E+  +P+ILGRPFLAT  AIID+   +++ +V  E   F     SK    T       +R  
Subjt:  ---------------FENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSKMKCFTWATGKVQERAP

Query:  LLDSQNKSLLEARATRRAN
        L+D     L+    + +A+
Subjt:  LLDSQNKSLLEARATRRAN

A0A6J1DU19 uncharacterized protein LOC1110243616.4e-4031.15Show/hide
Query:  LVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPH---------------------------------MVTP----IGFRGMATNS
        +VQAFL  FFPPAKT +LRTEI +F++   EQLFE WER+KELLRKCP                                    TP    I  + MA NS
Subjt:  LVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPH---------------------------------MVTP----IGFRGMATNS

Query:  YQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFS-------------------------------------------------ESSNRTTKLE
        +QWPSERS  KK+ AG++E+D++S+L+AQ+ +L NA  K S                                                 E  +R +++E
Subjt:  YQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFS-------------------------------------------------ESSNRTTKLE

Query:  EAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKG---------------------GIE-EEPESEDYETPTGEAEEDTSSDEAEK---PNLEPPIP-
          V  +   + G++ +IKN+E Q+GQ+   ++TM KG                     G E +EPE +  E P    EE  + +E  K   P L+   P 
Subjt:  EAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKG---------------------GIE-EEPESEDYETPTGEAEEDTSSDEAEK---PNLEPPIP-

Query:  -----SPTLLVPKEKKKKKKKKNN-QVHEGVVSKERKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS------------------------FEN
             SP   +P  +   ++  N  +  + +++ +RK +  +TV L   CS  +Q+K+P+K+ DPGSF++PC+                         E+
Subjt:  -----SPTLLVPKEKKKKKKKKNN-QVHEGVVSKERKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS------------------------FEN

Query:  VLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIF
        VL++V R   P D  V+   E+  +P+ILGR FLATG A+ID++   LT+RV  E  +F
Subjt:  VLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIF

A0A6P6XAQ1 Reverse transcriptase6.0e-3024.83Show/hide
Query:  ISDKARDWLQSTTPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPH------MVTPIGFRG--------------
        + DKA+ WLQS  P + TTWD L +AFL KFFPP KT KLR +I +F QQ  E L+EAWER++EL R+CPH      +V    + G              
Subjt:  ISDKARDWLQSTTPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPH------MVTPIGFRG--------------

Query:  -----------------MATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLAN-------------------------------------------
                         MA N+YQW +ER   ++  AG+ EVD ++ L A+M ++                                             
Subjt:  -----------------MATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLAN-------------------------------------------

Query:  -------------------------------------------------------AFMKFSESSN-RTTKLEEAVIAINSTVNGHSAAI----KNIETQL
                                                               A  K + +SN +  KL  A       + G    +    +N+E QL
Subjt:  -------------------------------------------------------AFMKFSESSN-RTTKLEEAVIAINSTVNGHSAAI----KNIETQL

Query:  GQLVNVVSTMNKGGIEEEPESEDYETPTGEAEEDTSSDEAEKPNLEPPIP--------------SPTLLVPKEKKKKKKKKNNQVH--------------
        GQ+ N V+  N+G +  + E    E           +  + K  +EPP+               S      KE+K K+K + N++               
Subjt:  GQLVNVVSTMNKGGIEEEPESEDYETPTGEAEEDTSSDEAEKPNLEPPIP--------------SPTLLVPKEKKKKKKKKNNQVH--------------

Query:  ----EGVVSKERKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS---------------------------------------------------
            + +++K+RK    +T+ L   CS  +Q K+P K+ DPGSF+VPC+                                                   
Subjt:  ----EGVVSKERKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS---------------------------------------------------

Query:  --FENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSKMKCFT
           ENVLI+V +F +P+D  V+DM E+ ++P+ILGRPFLAT   IID++R +   ++  E+  F   +  K   FT
Subjt:  --FENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSKMKCFT

A5AZ88 Integrase catalytic domain-containing protein1.7e-2927.85Show/hide
Query:  ISDKARDWLQSTTPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPHMVTPIGFRG--------------------
        +++KA+ WL S  PG+ITTWD LV AFL K+FP AK+ K+R +I  F QQ  E L+EAWERFK+LLRKCPH   PI  +                     
Subjt:  ISDKARDWLQSTTPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPHMVTPIGFRG--------------------

Query:  -----------------MATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLG--
                         MA+N++   ++R+  K+   GV ++D  + L  Q+  L N F K +            V+  N      +    ++E Q+G  
Subjt:  -----------------MATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLG--

Query:  ------QLVNVVSTMNK----------GGIEEEPESEDYETPTGEAEEDTSSDEAEKPNLEPPIPSPTLLVPKEKKKKKKKKNNQVHEGVVSKERKEKKV
              + VN V+   +           G    P          +        +  K NLE      T  + K        K N  ++G       +K +
Subjt:  ------QLVNVVSTMNK----------GGIEEEPESEDYETPTGEAEEDTSSDEAEKPNLEPPIPSPTLLVPKEKKKKKKKKNNQVHEGVVSKERKEKKV

Query:  DT--VYLASTCSTRVQQKVPEKVADPGSFSVPCS-----------------------------------------------------FENVLIRVGRFFL
        D   V L   CS  +Q+++P K+ DPGSF++PC+                                                      E+VL++V +F  
Subjt:  DT--VYLASTCSTRVQQKVPEKVADPGSFSVPCS-----------------------------------------------------FENVLIRVGRFFL

Query:  PIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK
        PID  V+DM E+  +P+ILGRPFLAT R +ID  + +L +RV++E+  F   E  K
Subjt:  PIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAAGGATTATATGGCTCGTAATGATGTCATAATCCTAAGTCAGCAAGCTTCATTGAGAGCCCTAGAGTTGCAAGTGGGTCAGCTAGCTAATGAGTTGAAGGCACG
ACCTCAAGGGAACATTCCTTTAGATATTGAACACCCTATAAGGGAAGGTAAGAAGCAGGTGCAGGCAATGACTTTAGGGAGTGATAGGCCACTAGAAGATAGAAAAGAGC
CTAGTAAACCCCTAGAAGTAGAAAAGAGTAGTGATAGTAATGTTGTCGAAAAAGAATTGGGGTCTGGTCAATATGATGGAGGCAGCAGCAAAGATGCTGGAGCAATTAGT
TCTGTTCCAGATGTAGAACCCCCACCTTATGTACCGCCCCCACCCTATGACCCACCCTTACCTTTTCCACAAAGGCAGAAGTCTAAGAACCAAGATGGTAGTCCATTTTT
GGCAACTGGTAGATCATTGATGGATGTCCAACAAGGGGAGCTTACAATGAAGGTGCATGACCAAAAGGTGAAGTTTAATATGTTTGATGCAACAAAATATCCTAATGATC
TTGAGGATTGCTCGTGCATTCAGCAACCCGAGATTAGGAAATCCTTCATTGATGAGCGATTATTTACTGTAGCTCATATTAAGGAAGTGAAAACACCTTGGTATGATGAC
TTTTCCAATTACCTTGATTTTGGAAATTTGCCTCCTGGTTTATCAAAACAACAAATGAAAGAATTTTTCCATGAGGAAATGGTTGATTGGAGCTATAATGCCATATTAGA
GTTAATCGGGTGCTTGGGGCGTGAAAAGATGCAAAGGAATGAAAAGAGTAAAAGTGAAAAAAGTCAAATCTCGGATAAAGCACGAGATTGGTTGCAGTCTACTACTCCTG
GGAGCATCACCACCTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTCTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAAATTGGGACATTCCAACAACAATAT
GATGAGCAGCTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCACATGGTTACCCCGATTGGCTTCAGAGGAATGGCCACCAACAGCTATCAGTG
GCCATCTGAGCGGTCTACACCTAAAAAGATTGCTGCTGGAGTGTTTGAGGTTGATAAAGTAAGTGCACTCCAGGCCCAGATGACCTCTCTTGCCAATGCTTTTATGAAAT
TTTCAGAGTCTAGTAACAGGACAACCAAATTAGAGGAGGCAGTCATTGCCATCAACTCAACAGTGAATGGCCACAGTGCAGCTATAAAGAACATTGAGACTCAGCTGGGA
CAGTTGGTGAATGTTGTAAGCACCATGAATAAAGGAGGAATTGAAGAGGAACCTGAATCTGAGGACTATGAAACGCCTACAGGGGAAGCTGAGGAGGACACATCATCTGA
TGAGGCTGAAAAGCCTAACCTTGAGCCTCCTATTCCTTCTCCCACACTGTTGGTTCCCAAGGAAAAGAAAAAGAAAAAGAAGAAAAAGAATAATCAGGTTCATGAAGGAG
TGGTTAGCAAAGAACGAAAGGAAAAGAAGGTTGACACTGTTTATCTTGCTTCCACATGCAGCACCAGAGTACAACAGAAAGTACCTGAAAAAGTAGCAGATCCAGGGAGT
TTTTCTGTTCCTTGTAGTTTTGAAAATGTGTTAATCAGAGTAGGTAGATTTTTCCTCCCTATTGACTTGTATGTTATGGACATGATAGAAAATCCTTCAATGCCTGTCAT
ATTAGGACGACCATTCCTCGCTACTGGGCGAGCGATCATAGATATTGAGCGCAGGGAGCTCACTATTAGAGTCAAGAACGAAAAAGAAATCTTTAAAGCAGTTGAAGACT
CAAAGATGAAGTGCTTTACATGGGCTACAGGAAAGGTGCAAGAAAGAGCACCTCTGTTGGATTCACAGAACAAAAGCCTCCTTGAAGCACGTGCAACACGTCGAGCTAAT
GACGTTAAACAAGCGCTTATGGGAGGCAACCCAAGTCTTGTTTTCTTTCTCTCCTTTGCTTTCAAGCTTTCAAGATTCCAAGCTCTCAAGCAAGAAGTCATCATCATTCC
AGGCCAGATAGACTTTCTTGTCAATTTTCAAGCTTGTGATGAGATTTTAGGCAAAAAGAGCTACCCCTGTTGTGATCAGTTAGGCCAACCAGAGAAATCTCAGGAAATGG
TTGATTGGAGCTATAATGCCATATTAGAGTTAATCGGGTGCTCGGGGCGTGAAAAGATGCAAAGGAATGAAAAGAGTAAAAGTGGAAAAAAGTCAAATCTCGGTCAACAG
CAGGCTAGCGTCGAGACGCTAGCTCTTGAGCGTCTCGACGCTCACATTCCATATCAGATTAGGCGCGTAAAGCTCACAGCGTCGAGACGCTATGATAGGAAGCGTCCCGA
CGCTACCGTTTTTCCTTATTCAGAACGCGCGTATAAGAGGAGCGTCGCGACGCTGTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGATGAAGGATTATATGGCTCGTAATGATGTCATAATCCTAAGTCAGCAAGCTTCATTGAGAGCCCTAGAGTTGCAAGTGGGTCAGCTAGCTAATGAGTTGAAGGCACG
ACCTCAAGGGAACATTCCTTTAGATATTGAACACCCTATAAGGGAAGGTAAGAAGCAGGTGCAGGCAATGACTTTAGGGAGTGATAGGCCACTAGAAGATAGAAAAGAGC
CTAGTAAACCCCTAGAAGTAGAAAAGAGTAGTGATAGTAATGTTGTCGAAAAAGAATTGGGGTCTGGTCAATATGATGGAGGCAGCAGCAAAGATGCTGGAGCAATTAGT
TCTGTTCCAGATGTAGAACCCCCACCTTATGTACCGCCCCCACCCTATGACCCACCCTTACCTTTTCCACAAAGGCAGAAGTCTAAGAACCAAGATGGTAGTCCATTTTT
GGCAACTGGTAGATCATTGATGGATGTCCAACAAGGGGAGCTTACAATGAAGGTGCATGACCAAAAGGTGAAGTTTAATATGTTTGATGCAACAAAATATCCTAATGATC
TTGAGGATTGCTCGTGCATTCAGCAACCCGAGATTAGGAAATCCTTCATTGATGAGCGATTATTTACTGTAGCTCATATTAAGGAAGTGAAAACACCTTGGTATGATGAC
TTTTCCAATTACCTTGATTTTGGAAATTTGCCTCCTGGTTTATCAAAACAACAAATGAAAGAATTTTTCCATGAGGAAATGGTTGATTGGAGCTATAATGCCATATTAGA
GTTAATCGGGTGCTTGGGGCGTGAAAAGATGCAAAGGAATGAAAAGAGTAAAAGTGAAAAAAGTCAAATCTCGGATAAAGCACGAGATTGGTTGCAGTCTACTACTCCTG
GGAGCATCACCACCTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTCTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAAATTGGGACATTCCAACAACAATAT
GATGAGCAGCTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCACATGGTTACCCCGATTGGCTTCAGAGGAATGGCCACCAACAGCTATCAGTG
GCCATCTGAGCGGTCTACACCTAAAAAGATTGCTGCTGGAGTGTTTGAGGTTGATAAAGTAAGTGCACTCCAGGCCCAGATGACCTCTCTTGCCAATGCTTTTATGAAAT
TTTCAGAGTCTAGTAACAGGACAACCAAATTAGAGGAGGCAGTCATTGCCATCAACTCAACAGTGAATGGCCACAGTGCAGCTATAAAGAACATTGAGACTCAGCTGGGA
CAGTTGGTGAATGTTGTAAGCACCATGAATAAAGGAGGAATTGAAGAGGAACCTGAATCTGAGGACTATGAAACGCCTACAGGGGAAGCTGAGGAGGACACATCATCTGA
TGAGGCTGAAAAGCCTAACCTTGAGCCTCCTATTCCTTCTCCCACACTGTTGGTTCCCAAGGAAAAGAAAAAGAAAAAGAAGAAAAAGAATAATCAGGTTCATGAAGGAG
TGGTTAGCAAAGAACGAAAGGAAAAGAAGGTTGACACTGTTTATCTTGCTTCCACATGCAGCACCAGAGTACAACAGAAAGTACCTGAAAAAGTAGCAGATCCAGGGAGT
TTTTCTGTTCCTTGTAGTTTTGAAAATGTGTTAATCAGAGTAGGTAGATTTTTCCTCCCTATTGACTTGTATGTTATGGACATGATAGAAAATCCTTCAATGCCTGTCAT
ATTAGGACGACCATTCCTCGCTACTGGGCGAGCGATCATAGATATTGAGCGCAGGGAGCTCACTATTAGAGTCAAGAACGAAAAAGAAATCTTTAAAGCAGTTGAAGACT
CAAAGATGAAGTGCTTTACATGGGCTACAGGAAAGGTGCAAGAAAGAGCACCTCTGTTGGATTCACAGAACAAAAGCCTCCTTGAAGCACGTGCAACACGTCGAGCTAAT
GACGTTAAACAAGCGCTTATGGGAGGCAACCCAAGTCTTGTTTTCTTTCTCTCCTTTGCTTTCAAGCTTTCAAGATTCCAAGCTCTCAAGCAAGAAGTCATCATCATTCC
AGGCCAGATAGACTTTCTTGTCAATTTTCAAGCTTGTGATGAGATTTTAGGCAAAAAGAGCTACCCCTGTTGTGATCAGTTAGGCCAACCAGAGAAATCTCAGGAAATGG
TTGATTGGAGCTATAATGCCATATTAGAGTTAATCGGGTGCTCGGGGCGTGAAAAGATGCAAAGGAATGAAAAGAGTAAAAGTGGAAAAAAGTCAAATCTCGGTCAACAG
CAGGCTAGCGTCGAGACGCTAGCTCTTGAGCGTCTCGACGCTCACATTCCATATCAGATTAGGCGCGTAAAGCTCACAGCGTCGAGACGCTATGATAGGAAGCGTCCCGA
CGCTACCGTTTTTCCTTATTCAGAACGCGCGTATAAGAGGAGCGTCGCGACGCTGTCTTGA
Protein sequenceShow/hide protein sequence
MMKDYMARNDVIILSQQASLRALELQVGQLANELKARPQGNIPLDIEHPIREGKKQVQAMTLGSDRPLEDRKEPSKPLEVEKSSDSNVVEKELGSGQYDGGSSKDAGAIS
SVPDVEPPPYVPPPPYDPPLPFPQRQKSKNQDGSPFLATGRSLMDVQQGELTMKVHDQKVKFNMFDATKYPNDLEDCSCIQQPEIRKSFIDERLFTVAHIKEVKTPWYDD
FSNYLDFGNLPPGLSKQQMKEFFHEEMVDWSYNAILELIGCLGREKMQRNEKSKSEKSQISDKARDWLQSTTPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQY
DEQLFEAWERFKELLRKCPHMVTPIGFRGMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLG
QLVNVVSTMNKGGIEEEPESEDYETPTGEAEEDTSSDEAEKPNLEPPIPSPTLLVPKEKKKKKKKKNNQVHEGVVSKERKEKKVDTVYLASTCSTRVQQKVPEKVADPGS
FSVPCSFENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSKMKCFTWATGKVQERAPLLDSQNKSLLEARATRRAN
DVKQALMGGNPSLVFFLSFAFKLSRFQALKQEVIIIPGQIDFLVNFQACDEILGKKSYPCCDQLGQPEKSQEMVDWSYNAILELIGCSGREKMQRNEKSKSGKKSNLGQQ
QASVETLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERAYKRSVATLS