CuGenDBv2

Lag0018534 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0018534
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr5:29058288..29059764
RNA-Seq Expression: Lag0018534
Synteny: Lag0018534
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA
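The genome location field above packs the chromosome and both coordinates into one string. The following is a minimal sketch of how such a string could be split and the span length computed. It assumes the coordinates are 1-based and inclusive, which this page does not state; `Locus` and `parse_locus` are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a string of the form 'chrom:start..end'."""
    match = re.fullmatch(r"\s*(\S+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"not a 'chrom:start..end' string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is larger than end coordinate")
    return Locus(chrom, start, end)


locus = parse_locus("chr5:29058288..29059764")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption the gene spans 1477 bp on chr5.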


Homology
GenBank top hits (e value, % identity, alignment)
TXG55646.1 hypothetical protein EZV62_020902 [Acer yangbiense] (e value: 3.6e-55; identity: 33.8%)
Query:  SSSSSSSATVSAHVSFVATVVSTSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERMEITTEIGKKLE---PNPLYEEWTTVDQ
        SSSS+ + TV    S  ++  S+ FG+ L+    +KLD  N++LW+ MV  I+ G ++DG++  T+  P E +   T  G        NP YE+W   DQ
Subjt:  SSSSSSSATVSAHVSFVATVVSTSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERMEITTEIGKKLE---PNPLYEEWTTVDQ

Query:  AFSGWLFGSMTSAIVVDIV---NLERFGKALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYVP
           GWL+ SMT  + + ++         KALE  +GA SK++ N +R  +  T+KGS  M EYL  MK  +++L + G+P   + L + +L GLDSEY+P
Subjt:  AFSGWLFGSMTSAIVVDIV---NLERFGKALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYVP

Query:  IVCSIDDKDIKTWQQLSSILITFEGTLARY-SVPTNVNELFDLARILCSIARIRVISRSSIILVE---------------------------RNNNSKPT
        IV  I+ ++  TWQ++   L++++  L    +V    N L   +  L +       + +     +                           RNNNS+PT
Subjt:  IVCSIDDKDIKTWQQLSSILITFEGTLARY-SVPTNVNELFDLARILCSIARIRVISRSSIILVE---------------------------RNNNSKPT

Query:  CQLYGKYGHSAPYCYLWFVESFNNPHVSANGNSQGGNQGNTAAYIATPEILNDPKWLVDNRATNHATGDGGNLAVKFNYSGKETLVVGNGTELSIAHTGS
        CQ+ GK+GHSA  CY  + +++     +AN N+      + + ++ATPE ++D  W  D+ ATNH T D GNL +K NY G E+L+VGNG +L I+H G 
Subjt:  CQLYGKYGHSAPYCYLWFVESFNNPHVSANGNSQGGNQGNTAAYIATPEILNDPKWLVDNRATNHATGDGGNLAVKFNYSGKETLVVGNGTELSIAHTGS

Query:  SSVPCLSGNKLISDMDSINNPLHVPE
         S+P L+ + +I     +   LHVPE
Subjt:  SSVPCLSGNKLISDMDSINNPLHVPE

TXG69253.1 hypothetical protein EZV62_004188 [Acer yangbiense] (e value: 3.4e-53; identity: 32.95%)
Query:  SSSSSSSATVSAHVSFVATVVSTSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERMEITTEIGKKLE-----------PNPLY
        SSSS+++ TV    S  ++  S+ FG+ L+    +KLD  N++LW+ MV  I+ G ++DG++  T+  P E +   T  G                NP Y
Subjt:  SSSSSSSATVSAHVSFVATVVSTSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERMEITTEIGKKLE-----------PNPLY

Query:  EEWTTVDQAFSGWLFGSMTSAIVVDIV---NLERFGKALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLV
        E+W   DQ   GWL+ SMT  + + ++         KALE  +GA SK++ N +R  +  T+KGS  M EYL  MK  +++L + G+P   + L +  L 
Subjt:  EEWTTVDQAFSGWLFGSMTSAIVVDIV---NLERFGKALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLV

Query:  GLDSEYVPIVCSIDDKDIKTWQQLSSILITFEGTLARY-SVPTNVNELFDLARILCSIARIRVISRSSIILVE---------------------------
        GLDSEY+PIV  I+ ++  TWQ++   L++++  L    +V    N L   +  L +       + +     +                           
Subjt:  GLDSEYVPIVCSIDDKDIKTWQQLSSILITFEGTLARY-SVPTNVNELFDLARILCSIARIRVISRSSIILVE---------------------------

Query:  RNNNSKPTCQLYGKYGHSAPYCYLWFVESFNNPHVSANGNSQGGNQGNTAAYIATPEILNDPKWLVDNRATNHATGDGGNLAVKFNYSGKETLVVGNGTE
        RNNNS+PTCQ+ GK+GHSA  CY  + +++     +AN N+      + + ++ATPE ++D  W  D+ ATNH T D GNL +K +Y G E+L+VGNG +
Subjt:  RNNNSKPTCQLYGKYGHSAPYCYLWFVESFNNPHVSANGNSQGGNQGNTAAYIATPEILNDPKWLVDNRATNHATGDGGNLAVKFNYSGKETLVVGNGTE

Query:  LSIAHTGSSSVPCLSGNKLISDMDSINNPLHVPE
        L I+H G  S+P L+ + +I     +   LHVPE
Subjt:  LSIAHTGSSSVPCLSGNKLISDMDSINNPLHVPE

XP_022143579.1 ankyrin repeat-containing protein NPR4-like [Momordica charantia] (e value: 2.4e-54; identity: 57.64%)
Query:  SSSATVSAHVSFVATVVSTSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERMEITTEIGKKLEP--NPLYEEWTTVDQAFSGW
        S S T ++  + V + ++ SFGHPLST LTVKLD+ NY LW+GMVLA+L GQKVDGY+L TK  PS+    T++ G  LEP  NP YEEW+ VDQAF GW
Subjt:  SSSATVSAHVSFVATVVSTSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERMEITTEIGKKLEP--NPLYEEWTTVDQAFSGW

Query:  LFGSMTSAIVVDIVNLERFGK---ALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYVPIVCSI
        LFGSMT +I  D+VNL+   +   ALE  +G+TSKAR+NQLR  L NTKKG+MKM  YLA MKQ SE+L+L G P+ LS L+S +L G ++EY+PI+C+I
Subjt:  LFGSMTSAIVVDIVNLERFGK---ALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYVPIVCSI

Query:  DDK
        +DK
Subjt:  DDK
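In BLAST-style pairwise output, the unlabeled line under each `Query:` line is the match line: a letter marks an identical residue, `+` marks a conservative substitution (a "positive"), and a space marks a mismatch or gap. The sketch below counts these for a single alignment block. It assumes the 8-character label column seen in this record (`Query:  `), and the function names are hypothetical.

```python
LABEL_WIDTH = 8  # assumption: width of the "Query:  " label column in this record


def strip_label(line: str) -> str:
    """Drop the label column so query and match line share the same offsets."""
    return line[LABEL_WIDTH:]


def midline_stats(query_line: str, match_line: str) -> dict:
    """Count identities and positives for one alignment block."""
    query = strip_label(query_line)
    # Trailing spaces of the match line are often lost in plain-text copies.
    midline = strip_label(match_line).ljust(len(query))[: len(query)]
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return {
        "aligned_columns": len(query),
        "identities": identities,
        "positives": positives,
        "query_gaps": query.count("-"),
        "percent_identity": 100.0 * identities / len(query) if query else 0.0,
    }


# The last block of the XP_022143579.1 alignment shown above.
stats = midline_stats("Query:  DDK", "        +DK")
print(stats)
```

The percent identity reported for a hit is computed over the whole alignment, so the per-block counts would need to be summed across all blocks before dividing.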

XP_022157748.1 uncharacterized protein LOC111024384 isoform X1 [Momordica charantia] (e value: 7.5e-77; identity: 47.38%)
Query:  STSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERM--EITTEIGKKLEPNPLYEEWTTVDQAFSGWLFGSMTSAIVVDIVNL-
        +TSFGHPL TVLTVKLD+ NY LWRGMVLA+L GQK DGY+LGT A+P + +    T      L+ NP Y EW  VDQA  GWLFGSMT +I  D+V+  
Subjt:  STSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERM--EITTEIGKKLEPNPLYEEWTTVDQAFSGWLFGSMTSAIVVDIVNL-

Query:  --ERFGKALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYVPIVCSIDDKDIKTWQQLSSILIT
              KALE  YGATSKAR+NQLR VL NTKK S+KM EYL +MKQASE+L+L G P+  + L S VL GL++EY+PIVC I+ KD  +WQ+L + L+T
Subjt:  --ERFGKALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYVPIVCSIDDKDIKTWQQLSSILIT

Query:  FEGTLARYSVPTNV---------------------NELFDLARILCSIARIRVISRSSIILVE----------RNNNSKPTCQLYGKYGHSAPYCYLWFV
        FE TL R ++ +                       N  F  ++      R    S  +   V           R NNSKP+CQL GKYGH A  CY  F 
Subjt:  FEGTLARYSVPTNV---------------------NELFDLARILCSIARIRVISRSSIILVE----------RNNNSKPTCQLYGKYGHSAPYCYLWFV

Query:  ESFNNPHVSANGNSQGGNQGNTAAYIATPEILNDPKWLVDNRATNHATGDGGNLAVKFNYSGK
        E+FN        N    N    +AY+A PEI+ +P WL D+ AT+H T D  NL VK +Y+GK
Subjt:  ESFNNPHVSANGNSQGGNQGNTAAYIATPEILNDPKWLVDNRATNHATGDGGNLAVKFNYSGK

XP_022157750.1 uncharacterized protein LOC111024384 isoform X2 [Momordica charantia] (e value: 2.2e-76; identity: 47.11%)
Query:  STSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERM--EITTEIGKKLEPNPLYEEWTTVDQAFSGWLFGSMTSAIVVDIVNL-
        +TSFGHPL TVLTVKLD+ NY LWRGMVLA+L GQK DGY+LGT A+P + +    T      L+ NP Y EW  VDQA  GWLFGSMT +I  D+V+  
Subjt:  STSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERM--EITTEIGKKLEPNPLYEEWTTVDQAFSGWLFGSMTSAIVVDIVNL-

Query:  --ERFGKALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYVPIVCSIDDKDIKTWQQLSSILIT
              KALE  YGATSKAR+NQLR VL NTKK S+KM EYL +MKQASE+L+L G P+  + L S VL GL++EY+PIVC I+ KD  +WQ+L + L+T
Subjt:  --ERFGKALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYVPIVCSIDDKDIKTWQQLSSILIT

Query:  FEGTLARYSVPTNV---------------------NELFDLARILCSIARIRVISRSSIILVE----------RNNNSKPTCQLYGKYGHSAPYCYLWFV
        FE TL R ++ +                       N  F  ++      R    S  +   V           R NNSKP+CQL GKYGH A  CY  F 
Subjt:  FEGTLARYSVPTNV---------------------NELFDLARILCSIARIRVISRSSIILVE----------RNNNSKPTCQLYGKYGHSAPYCYLWFV

Query:  ESFNNPHVSANGNSQGGNQGNTAAYIATPEILNDPKWLVDNRATNHATGDGGNLAVKFNYSGK
        E+FN        N    N    +AY+A PEI+ +P WL D+ AT+H T D  NL VK +Y+G+
Subjt:  ESFNNPHVSANGNSQGGNQGNTAAYIATPEILNDPKWLVDNRATNHATGDGGNLAVKFNYSGK

TrEMBL top hits (e value, % identity, alignment)
A0A5C7HHE9 Uncharacterized protein (e value: 1.8e-55; identity: 33.8%)
Query:  SSSSSSSATVSAHVSFVATVVSTSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERMEITTEIGKKLE---PNPLYEEWTTVDQ
        SSSS+ + TV    S  ++  S+ FG+ L+    +KLD  N++LW+ MV  I+ G ++DG++  T+  P E +   T  G        NP YE+W   DQ
Subjt:  SSSSSSSATVSAHVSFVATVVSTSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERMEITTEIGKKLE---PNPLYEEWTTVDQ

Query:  AFSGWLFGSMTSAIVVDIV---NLERFGKALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYVP
           GWL+ SMT  + + ++         KALE  +GA SK++ N +R  +  T+KGS  M EYL  MK  +++L + G+P   + L + +L GLDSEY+P
Subjt:  AFSGWLFGSMTSAIVVDIV---NLERFGKALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYVP

Query:  IVCSIDDKDIKTWQQLSSILITFEGTLARY-SVPTNVNELFDLARILCSIARIRVISRSSIILVE---------------------------RNNNSKPT
        IV  I+ ++  TWQ++   L++++  L    +V    N L   +  L +       + +     +                           RNNNS+PT
Subjt:  IVCSIDDKDIKTWQQLSSILITFEGTLARY-SVPTNVNELFDLARILCSIARIRVISRSSIILVE---------------------------RNNNSKPT

Query:  CQLYGKYGHSAPYCYLWFVESFNNPHVSANGNSQGGNQGNTAAYIATPEILNDPKWLVDNRATNHATGDGGNLAVKFNYSGKETLVVGNGTELSIAHTGS
        CQ+ GK+GHSA  CY  + +++     +AN N+      + + ++ATPE ++D  W  D+ ATNH T D GNL +K NY G E+L+VGNG +L I+H G 
Subjt:  CQLYGKYGHSAPYCYLWFVESFNNPHVSANGNSQGGNQGNTAAYIATPEILNDPKWLVDNRATNHATGDGGNLAVKFNYSGKETLVVGNGTELSIAHTGS

Query:  SSVPCLSGNKLISDMDSINNPLHVPE
         S+P L+ + +I     +   LHVPE
Subjt:  SSVPCLSGNKLISDMDSINNPLHVPE

A0A6J1CPQ7 ankyrin repeat-containing protein NPR4-like (e value: 1.1e-54; identity: 57.64%)
Query:  SSSATVSAHVSFVATVVSTSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERMEITTEIGKKLEP--NPLYEEWTTVDQAFSGW
        S S T ++  + V + ++ SFGHPLST LTVKLD+ NY LW+GMVLA+L GQKVDGY+L TK  PS+    T++ G  LEP  NP YEEW+ VDQAF GW
Subjt:  SSSATVSAHVSFVATVVSTSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERMEITTEIGKKLEP--NPLYEEWTTVDQAFSGW

Query:  LFGSMTSAIVVDIVNLERFGK---ALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYVPIVCSI
        LFGSMT +I  D+VNL+   +   ALE  +G+TSKAR+NQLR  L NTKKG+MKM  YLA MKQ SE+L+L G P+ LS L+S +L G ++EY+PI+C+I
Subjt:  LFGSMTSAIVVDIVNLERFGK---ALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYVPIVCSI

Query:  DDK
        +DK
Subjt:  DDK

A0A6J1DTZ7 uncharacterized protein LOC111024384 isoform X2 (e value: 1.1e-76; identity: 47.11%)
Query:  STSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERM--EITTEIGKKLEPNPLYEEWTTVDQAFSGWLFGSMTSAIVVDIVNL-
        +TSFGHPL TVLTVKLD+ NY LWRGMVLA+L GQK DGY+LGT A+P + +    T      L+ NP Y EW  VDQA  GWLFGSMT +I  D+V+  
Subjt:  STSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERM--EITTEIGKKLEPNPLYEEWTTVDQAFSGWLFGSMTSAIVVDIVNL-

Query:  --ERFGKALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYVPIVCSIDDKDIKTWQQLSSILIT
              KALE  YGATSKAR+NQLR VL NTKK S+KM EYL +MKQASE+L+L G P+  + L S VL GL++EY+PIVC I+ KD  +WQ+L + L+T
Subjt:  --ERFGKALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYVPIVCSIDDKDIKTWQQLSSILIT

Query:  FEGTLARYSVPTNV---------------------NELFDLARILCSIARIRVISRSSIILVE----------RNNNSKPTCQLYGKYGHSAPYCYLWFV
        FE TL R ++ +                       N  F  ++      R    S  +   V           R NNSKP+CQL GKYGH A  CY  F 
Subjt:  FEGTLARYSVPTNV---------------------NELFDLARILCSIARIRVISRSSIILVE----------RNNNSKPTCQLYGKYGHSAPYCYLWFV

Query:  ESFNNPHVSANGNSQGGNQGNTAAYIATPEILNDPKWLVDNRATNHATGDGGNLAVKFNYSGK
        E+FN        N    N    +AY+A PEI+ +P WL D+ AT+H T D  NL VK +Y+G+
Subjt:  ESFNNPHVSANGNSQGGNQGNTAAYIATPEILNDPKWLVDNRATNHATGDGGNLAVKFNYSGK

A0A6J1DU77 uncharacterized protein LOC111024384 isoform X1 (e value: 3.6e-77; identity: 47.38%)
Query:  STSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERM--EITTEIGKKLEPNPLYEEWTTVDQAFSGWLFGSMTSAIVVDIVNL-
        +TSFGHPL TVLTVKLD+ NY LWRGMVLA+L GQK DGY+LGT A+P + +    T      L+ NP Y EW  VDQA  GWLFGSMT +I  D+V+  
Subjt:  STSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERM--EITTEIGKKLEPNPLYEEWTTVDQAFSGWLFGSMTSAIVVDIVNL-

Query:  --ERFGKALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYVPIVCSIDDKDIKTWQQLSSILIT
              KALE  YGATSKAR+NQLR VL NTKK S+KM EYL +MKQASE+L+L G P+  + L S VL GL++EY+PIVC I+ KD  +WQ+L + L+T
Subjt:  --ERFGKALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYVPIVCSIDDKDIKTWQQLSSILIT

Query:  FEGTLARYSVPTNV---------------------NELFDLARILCSIARIRVISRSSIILVE----------RNNNSKPTCQLYGKYGHSAPYCYLWFV
        FE TL R ++ +                       N  F  ++      R    S  +   V           R NNSKP+CQL GKYGH A  CY  F 
Subjt:  FEGTLARYSVPTNV---------------------NELFDLARILCSIARIRVISRSSIILVE----------RNNNSKPTCQLYGKYGHSAPYCYLWFV

Query:  ESFNNPHVSANGNSQGGNQGNTAAYIATPEILNDPKWLVDNRATNHATGDGGNLAVKFNYSGK
        E+FN        N    N    +AY+A PEI+ +P WL D+ AT+H T D  NL VK +Y+GK
Subjt:  ESFNNPHVSANGNSQGGNQGNTAAYIATPEILNDPKWLVDNRATNHATGDGGNLAVKFNYSGK

A0A803QD97 Uncharacterized protein (e value: 2.7e-56; identity: 34.99%)
Query:  NSSSSSSSATVSAHVSFVATVVSTSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERM---EITTEIGKKLEPNPLYEEWTTVD
        N +S   + + S++VS         FG  L+    +KLD NN+ LW+ MV AI  G ++DGY+ G +  P E +   +   + G   E NP +E W   D
Subjt:  NSSSSSSSATVSAHVSFVATVVSTSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERM---EITTEIGKKLEPNPLYEEWTTVD

Query:  QAFSGWLFGSMTSAIVVDIVNLE---RFGKALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYV
        Q   GWL+GSMT  I  +I+          +LE  +GA SKA++++ R  +   +KGSM M++YL   KQ S+ L L G+P   S L S VL GLD EY+
Subjt:  QAFSGWLFGSMTSAIVVDIVNLE---RFGKALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYV

Query:  PIVCSIDDKDIKTWQQLSSILITFEGTLARYSVPTNVNELFDLARILCSIARIRVISRSSIILVERNNNS------------------------KPTCQL
        PIV  I+ ++  TWQ L  +L++F+  L R S   +++E    +      A +   S S       NNN                         KPTCQ+
Subjt:  PIVCSIDDKDIKTWQQLSSILITFEGTLARYSVPTNVNELFDLARILCSIARIRVISRSSIILVERNNNS------------------------KPTCQL

Query:  YGKYGHSAPYCYLWFVESFNNPHVSANGNSQGGNQGNTAAYIATPEILNDPKWLVDNRATNHATGDGGNLAVKFNYSGKETLVVGNGTELSIAHTGSSSV
         G+YGHSA YCY  F E+F       N      +  N  A++ATPE+L D  W  ++ A+NH T +  NL  K  Y+GK++L VG+G++L I HTGS  +
Subjt:  YGKYGHSAPYCYLWFVESFNNPHVSANGNSQGGNQGNTAAYIATPEILNDPKWLVDNRATNHATGDGGNLAVKFNYSGKETLVVGNGTELSIAHTGSSSV

Query:  PCLSGNKLISDMDSINNPLHVPE
           + + LI     +   LHVP+
Subjt:  PCLSGNKLISDMDSINNPLHVPE

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 8.0e-21; identity: 25.9%)
Query:  KLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERMEITTEIGKKLEPNPLYEEWTTVDQAFSGWLFGSMTSAI---VVDIVNLERFGKALEKAYGAT
        KL   NYL+W   V A+  G ++ G++ G+   P     I T+   ++  NP Y  W   D+     + G+++ ++   V       +  + L K Y   
Subjt:  KLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERMEITTEIGKKLEPNPLYEEWTTVDQAFSGWLFGSMTSAI---VVDIVNLERFGKALEKAYGAT

Query:  SKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYVPIVCSIDDKDI-KTWQQLSSILITFEGTLARYSVPT---
        S   V QLR  L    KG+  + +Y+  +    + L L G P+   +    VL  L  EY P++  I  KD   T  ++   L+  E  +   S  T   
Subjt:  SKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYVPIVCSIDDKDI-KTWQQLSSILITFEGTLARYSVPT---

Query:  --------------------NVNELFDLARILCSIARIRVISRSSIILVERNNNSKP---TCQLYGKYGHSAPYCYLWFVESFNNPHVSANGNSQGGNQG
                            N N  +D      +    +   +SS      NN SKP    CQ+ G  GHSA  C           H  ++ NSQ     
Subjt:  --------------------NVNELFDLARILCSIARIRVISRSSIILVERNNNSKP---TCQLYGKYGHSAPYCYLWFVESFNNPHVSANGNSQGGNQG

Query:  NT----AAYIATPEILNDPKWLVDNRATNHATGDGGNLAVKFNYSGKETLVVGNGTELSIAHTGSSSVPCLSGNKLISDMDSINNPLHVP
         T     A +A     +   WL+D+ AT+H T D  NL++   Y+G + ++V +G+ + I+HTGS+S+   S         +++N L+VP
Subjt:  NT----AAYIATPEILNDPKWLVDNRATNHATGDGGNLAVKFNYSGKETLVVGNGTELSIAHTGSSSVPCLSGNKLISDMDSINNPLHVP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 7.2e-14; identity: 23.53%)
Query:  VSAHVSFVATVVSTSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERMEITTEIGKKLEP--NPLYEEWTTVDQAFSGWLFGSM
        ++ H   +  V +      +S V   KL   NYL+W   V A+  G ++ G++ G+   P         IG    P  NP Y  W   D+     + G++
Subjt:  VSAHVSFVATVVSTSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERMEITTEIGKKLEP--NPLYEEWTTVDQAFSGWLFGSM

Query:  TSAIVVDIVNLERFGKALEKAYGATSKARV-NQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYVPIVCSIDDKDI-K
        + ++   +               AT+ A++   LR++  N   G +  + ++    Q    L L G P+   +    VL  L  +Y P++  I  KD   
Subjt:  TSAIVVDIVNLERFGKALEKAYGATSKARV-NQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYVPIVCSIDDKDI-K

Query:  TWQQLSSILITFEGTLARYS----VPTNVNELFDLARILCSIARIRVISRSSIILVERNNNSKPT-----------------CQLYGKYGHSAPYC-YLW
        +  ++   LI  E  L   +    VP   N +             R  +R+      R+N+ +P+                 CQ+    GHSA  C  L 
Subjt:  TWQQLSSILITFEGTLARYS----VPTNVNELFDLARILCSIARIRVISRSSIILVERNNNSKPT-----------------CQLYGKYGHSAPYC-YLW

Query:  FVESFNNPHVSANGNSQGGNQGNTAAYIATPEILNDPKWLVDNRATNHATGDGGNLAVKFNYSGKETLVVGNGTELSIAHTGSSSVPCLSGNKLISDMDS
          +S  N   S +  +    + N    +A     N   WL+D+ AT+H T D  NL+    Y+G + +++ +G+ + I HTGS+S+P  S +        
Subjt:  FVESFNNPHVSANGNSQGGNQGNTAAYIATPEILNDPKWLVDNRATNHATGDGGNLAVKFNYSGKETLVVGNGTELSIAHTGSSSVPCLSGNKLISDMDS

Query:  INNPLHVP
        +N  L+VP
Subjt:  INNPLHVP

Arabidopsis top hits (e value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). (e value: 2.4e-04; identity: 25%)
Query:  HPLS-TVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERMEITTEIGKKLEP-NPLYEEWTTVDQAFSGWLFGSMTSAIVVDIVNLE---RF
        HP   ++  +  DE+NY+ W+    + L   K  G+I GT  +P              +P +PLY+ W   +     WL  SMT  ++  ++  E   + 
Subjt:  HPLS-TVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERMEITTEIGKKLEP-NPLYEEWTTVDQAFSGWLFGSMTSAIVVDIVNLE---RF

Query:  GKALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEY
         + L + +      ++ QLRR L   ++G   + EY
Subjt:  GKALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEY


Sequences
CDS sequence
ATGGGTGATGAAAACTCTTCTTCGTCTTCTTCTTCTGCAACAGTCTCAGCTCATGTTTCATTCGTTGCAACTGTCGTAAGCACTTCTTTCGGTCATCCGCTGAGCACTGT
TCTCACAGTGAAGCTTGATGAGAACAACTATTTGCTATGGAGAGGGATGGTTTTAGCCATTCTTTGTGGTCAGAAGGTGGATGGGTACATTCTCGGAACAAAGGCCCAAC
CATCAGAACGCATGGAAATCACCACTGAGATCGGTAAGAAACTCGAACCAAATCCTCTTTATGAGGAATGGACAACAGTCGACCAAGCTTTCTCTGGTTGGCTTTTTGGG
TCTATGACCTCTGCAATAGTTGTGGATATAGTGAACTTAGAGAGGTTTGGAAAAGCTCTAGAGAAGGCCTATGGAGCAACGAGCAAAGCTAGGGTGAATCAACTAAGAAG
AGTCCTTCATAATACTAAGAAGGGGTCGATGAAGATGATAGAATACCTAGCGATAATGAAGCAAGCTTCAGAAAACCTTCAACTGACCGGTAATCCCATCTTTCTCAGTG
ACCTTACTTCTTACGTTTTGGTTGGACTAGATTCAGAATATGTTCCCATTGTTTGTTCGATCGATGACAAGGACATCAAAACTTGGCAACAACTGTCATCAATTTTAATT
ACCTTTGAAGGCACCCTGGCGCGTTATTCTGTGCCTACTAATGTGAATGAGCTATTTGACCTAGCGCGCATCTTGTGTTCAATTGCCAGAATCAGAGTAATTTCTCGAAG
TAGTATAATCTTAGTAGAGCGGAATAACAACTCAAAACCAACCTGCCAACTCTATGGCAAGTATGGTCACTCTGCCCCCTATTGCTATCTTTGGTTTGTAGAATCCTTCA
ACAACCCTCATGTTTCTGCCAATGGTAATAGTCAAGGAGGAAATCAAGGCAACACTGCTGCTTATATTGCAACTCCAGAGATCTTAAATGATCCAAAATGGCTGGTTGAT
AATAGAGCTACAAATCATGCGACAGGTGATGGTGGTAATTTAGCTGTTAAATTTAATTACTCTGGTAAAGAAACCTTGGTTGTTGGAAACGGCACTGAGTTGAGTATAGC
TCATACAGGAAGTAGTTCCGTTCCTTGTTTGTCTGGAAATAAACTCATAAGTGATATGGATTCAATCAATAATCCATTGCATGTACCTGAAGGACCAATCACAAGGAGCA
AGGCAAAGAAGATACAAGAGGCTTTCACACTGCATGTTCAAAAGCTAGCAAATGCACAACGAGAGGCCGAGAATTTTGAACCCAAATTTTTGTATAATGTTAGTTCAGCA
AGTCAAGAAGAGAATGAAGTCAAGATGGCACAGGAAAAGTTGTGTAGTTTGGAAGATGGCACATAG
mRNA sequence
ATGGGTGATGAAAACTCTTCTTCGTCTTCTTCTTCTGCAACAGTCTCAGCTCATGTTTCATTCGTTGCAACTGTCGTAAGCACTTCTTTCGGTCATCCGCTGAGCACTGT
TCTCACAGTGAAGCTTGATGAGAACAACTATTTGCTATGGAGAGGGATGGTTTTAGCCATTCTTTGTGGTCAGAAGGTGGATGGGTACATTCTCGGAACAAAGGCCCAAC
CATCAGAACGCATGGAAATCACCACTGAGATCGGTAAGAAACTCGAACCAAATCCTCTTTATGAGGAATGGACAACAGTCGACCAAGCTTTCTCTGGTTGGCTTTTTGGG
TCTATGACCTCTGCAATAGTTGTGGATATAGTGAACTTAGAGAGGTTTGGAAAAGCTCTAGAGAAGGCCTATGGAGCAACGAGCAAAGCTAGGGTGAATCAACTAAGAAG
AGTCCTTCATAATACTAAGAAGGGGTCGATGAAGATGATAGAATACCTAGCGATAATGAAGCAAGCTTCAGAAAACCTTCAACTGACCGGTAATCCCATCTTTCTCAGTG
ACCTTACTTCTTACGTTTTGGTTGGACTAGATTCAGAATATGTTCCCATTGTTTGTTCGATCGATGACAAGGACATCAAAACTTGGCAACAACTGTCATCAATTTTAATT
ACCTTTGAAGGCACCCTGGCGCGTTATTCTGTGCCTACTAATGTGAATGAGCTATTTGACCTAGCGCGCATCTTGTGTTCAATTGCCAGAATCAGAGTAATTTCTCGAAG
TAGTATAATCTTAGTAGAGCGGAATAACAACTCAAAACCAACCTGCCAACTCTATGGCAAGTATGGTCACTCTGCCCCCTATTGCTATCTTTGGTTTGTAGAATCCTTCA
ACAACCCTCATGTTTCTGCCAATGGTAATAGTCAAGGAGGAAATCAAGGCAACACTGCTGCTTATATTGCAACTCCAGAGATCTTAAATGATCCAAAATGGCTGGTTGAT
AATAGAGCTACAAATCATGCGACAGGTGATGGTGGTAATTTAGCTGTTAAATTTAATTACTCTGGTAAAGAAACCTTGGTTGTTGGAAACGGCACTGAGTTGAGTATAGC
TCATACAGGAAGTAGTTCCGTTCCTTGTTTGTCTGGAAATAAACTCATAAGTGATATGGATTCAATCAATAATCCATTGCATGTACCTGAAGGACCAATCACAAGGAGCA
AGGCAAAGAAGATACAAGAGGCTTTCACACTGCATGTTCAAAAGCTAGCAAATGCACAACGAGAGGCCGAGAATTTTGAACCCAAATTTTTGTATAATGTTAGTTCAGCA
AGTCAAGAAGAGAATGAAGTCAAGATGGCACAGGAAAAGTTGTGTAGTTTGGAAGATGGCACATAG
Protein sequence
MGDENSSSSSSSATVSAHVSFVATVVSTSFGHPLSTVLTVKLDENNYLLWRGMVLAILCGQKVDGYILGTKAQPSERMEITTEIGKKLEPNPLYEEWTTVDQAFSGWLFG
SMTSAIVVDIVNLERFGKALEKAYGATSKARVNQLRRVLHNTKKGSMKMIEYLAIMKQASENLQLTGNPIFLSDLTSYVLVGLDSEYVPIVCSIDDKDIKTWQQLSSILI
TFEGTLARYSVPTNVNELFDLARILCSIARIRVISRSSIILVERNNNSKPTCQLYGKYGHSAPYCYLWFVESFNNPHVSANGNSQGGNQGNTAAYIATPEILNDPKWLVD
NRATNHATGDGGNLAVKFNYSGKETLVVGNGTELSIAHTGSSSVPCLSGNKLISDMDSINNPLHVPEGPITRSKAKKIQEAFTLHVQKLANAQREAENFEPKFLYNVSSA
SQEENEVKMAQEKLCSLEDGT
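The protein sequence above corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that translation, assuming the standard genetic code (the page does not state which table was used); `translate` is a hypothetical helper. The two demo inputs are the first 30 and the last 15 nucleotides of the CDS in this record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence in frame 1; '*' marks a stop codon."""
    seq = "".join(seq.split()).upper()  # tolerate line breaks in pasted sequences
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        # Codons with ambiguous bases (e.g. N) are reported as X.
        protein.append(CODON_TABLE.get(seq[i:i + 3], "X"))
    return "".join(protein)


cds_start = translate("ATGGGTGATGAAAACTCTTCTTCGTCTTCT")
cds_end = translate("GAAGATGGCACATAG")
print(cds_start)
print(cds_end)
```

The two fragments translate to the first ten residues of the protein (`MGDENSSSSS`) and its last four residues followed by the stop codon (`EDGT*`).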