; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0023016 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0023016
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:42944056..42947314
RNA-Seq ExpressionLag0023016
SyntenyLag0023016
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]1.9e-5839.9Show/hide
Query:  AMLMALSGKNKEGFVDGTIKKPTNTKMVKMWKCNNDIIASWIMNSVSKEIAASIVYTGCVKEVWDELFDRFKEN-------------SLNSVNMRLVDVF
        AMLMA+SG+NK GF+ G I+KP++  ++  W CNNDI+ASWI+NSVSKEIAASI+Y G +KE+WDEL  RFK++             +L   N+ +   +
Subjt:  AMLMALSGKNKEGFVDGTIKKPTNTKMVKMWKCNNDIIASWIMNSVSKEIAASIVYTGCVKEVWDELFDRFKEN-------------SLNSVNMRLVDVF

Query:  --LKTVF--------------GFCDPNSENSENNFL---------CFRSIRTQILLMNPIPSISKVFALVIQEERQRNASNVISQSNPVALLATNASKKG
          LKT++              G   P  ++ E+ ++          + ++R QILLM P+PSI+ VF+L+IQEE+QR+A  +    +PVAL    AS   
Subjt:  --LKTVF--------------GFCDPNSENSENNFL---------CFRSIRTQILLMNPIPSISKVFALVIQEERQRNASNVISQSNPVALLATNASKKG

Query:  SSQTNQSRVTKPTVILLGTEPEIKIQIWQ--AIPNQWGPTL---------LLKLHNLL---------PPIFFSSLNASQCSQLMELLNSQLQAAETEPIT
         S     +  +PT    G +  I  + ++    P  + P             K +N+           P FFSSLN+ Q SQLM LLN+ LQAA T PIT
Subjt:  SSQTNQSRVTKPTVILLGTEPEIKIQIWQ--AIPNQWGPTL---------LLKLHNLL---------PPIFFSSLNASQCSQLMELLNSQLQAAETEPIT

Query:  TATFVVHTSGICSITSSPIDS-NVWIVDSSASQHISHCHHMFHNWRRVCGMSVILPTSQQVHVDYIGDIRLSQDLTLHDVLYVPRF
        TAT + HTSGI ++TS    S + WI+DS AS+HI H   +F NW     M V+LP   ++ VD IGDI+++  LTL DVL+V +F
Subjt:  TATFVVHTSGICSITSSPIDS-NVWIVDSSASQHISHCHHMFHNWRRVCGMSVILPTSQQVHVDYIGDIRLSQDLTLHDVLYVPRF

XP_010673168.1 PREDICTED: uncharacterized protein LOC104889608 [Beta vulgaris subsp. vulgaris]4.7e-5740Show/hide
Query:  MNPSKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKIIAKVLANRLK-------------
        M+PSKAPG DG+ A+FYQ +WH+VG++ T++   I++G    + LN T IAL+PKV SP  + +FRPISLCNV +K++ KVLANRLK             
Subjt:  MNPSKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKIIAKVLANRLK-------------

Query:  --------DNVVIGFECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPL
                DN +I  E  H++  R KG +G VAMKLDMSKAYDRVEWSF+R  ++K+GF ++W+  VM CV +V+YS ++NG   GS + +RG+RQGDP+
Subjt:  --------DNVVIGFECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPL

Query:  S-----------SAL------DK------DCRSIREILKVYEWASGQTINLEKSVFMASKKLRPGRFESLVRFLIAM----------LMALSGKNKE---
        S           SAL      DK      +C  I +IL  YE ASGQ IN+EKS    SK +   + + LV FL             +  L+G++K+   
Subjt:  S-----------SAL------DK------DCRSIREILKVYEWASGQTINLEKSVFMASKKLRPGRFESLVRFLIAM----------LMALSGKNKE---

Query:  -GFVDGTIKKPTNTKMVKMWKCNNDIIASWIMNSV
         G +D   KK    K   + +   +++   ++ ++
Subjt:  -GFVDGTIKKPTNTKMVKMWKCNNDIIASWIMNSV

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]4.4e-5540.74Show/hide
Query:  MNPSKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKIIAKVLANRLK-------------
        M+P+KAPG DG+ A+F+Q YW++VGN+   + L +LN    +  +N+T I LVPK+ +P KM DFRPISLCNV YK+I+KVLANRLK             
Subjt:  MNPSKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKIIAKVLANRLK-------------

Query:  --------DNVVIGFECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPL
                DNV++ FE +H +  +K+G++G  A+KLDMSKAYDRVEW F+++ M K+GF E WI +VM C+ +V YSIL+NG +YGS    RG+RQGDP+
Subjt:  --------DNVVIGFECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPL

Query:  S-------------------------------------------------SALDKDCRSIREILKVYEWASGQTINLEK-SVFMASKKLRPGRFESL
        S                                                  A  ++C+++ +IL++YE ASGQ IN++K SVF ++      R E L
Subjt:  S-------------------------------------------------SALDKDCRSIREILKVYEWASGQTINLEK-SVFMASKKLRPGRFESL

XP_023908235.1 uncharacterized protein LOC112019924 [Quercus suber]1.3e-5444.64Show/hide
Query:  MNPSKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKIIAKVLANRLK-------------
        M P+KAPG DG+ ALFYQ +WHVVG+   +  L  LN       +N T I L+PKV +P+KM DFRPISLCNV YKII+KVL NRLK             
Subjt:  MNPSKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKIIAKVLANRLK-------------

Query:  --------DNVVIGFECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPL
                DNV++ +E +H ++SRKKG+KG +A+KLD+SKAYDRVEW F++  M +LGF E WI  VMSCV T  +S+L+NG  +G+   +RGIRQGDPL
Subjt:  --------DNVVIGFECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPL

Query:  S------------SALDK-------------------------------------DCRSIREILKVYEWASGQTINLEKS
        S            S LD+                                     +  +I EIL+VY  ASGQ+INLEKS
Subjt:  S------------SALDK-------------------------------------DCRSIREILKVYEWASGQTINLEKS

XP_030939512.1 uncharacterized protein LOC115964316 [Quercus lobata]7.5e-5545.36Show/hide
Query:  MNPSKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKIIAKVLANRLK-------------
        M P+KAPG DG+ ALFYQ +WHVVG+   +  L  LN    +  +N T I L+PKV +P+KM DFRPISLCNV YKII+KVLANRLK             
Subjt:  MNPSKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKIIAKVLANRLK-------------

Query:  --------DNVVIGFECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPL
                DNV+I +E +H ++ RKKG+KG +A+KLD+SKAYDRVEW F++  + +LGF E WI  VMSCV T  +S+L+NG  +G+ + +RGIRQGDPL
Subjt:  --------DNVVIGFECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPL

Query:  S------------SALDKD-----------CRS--------------------------IREILKVYEWASGQTINLEKS
        S            S LD+            CRS                          I +IL+VY  ASGQ INLEKS
Subjt:  S------------SALDKD-----------CRS--------------------------IREILKVYEWASGQTINLEKS

TrEMBL top hitse value%identityAlignment
A0A2N9EK17 Reverse transcriptase domain-containing protein3.6e-5542.46Show/hide
Query:  MNPSKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKIIAKVLANRLK-------------
        M+P+KAPG DG+ ALF+Q YWH+VGN+     L  L     +  LN T IAL+PKV   + +  FRPISLCNV YKII+KVLANR+K             
Subjt:  MNPSKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKIIAKVLANRLK-------------

Query:  --------DNVVIGFECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPL
                DN++I FE +H +N++++G+   +A KLDMSKAYDRVEW++++  M K+GF+  W+N++M C+ TV YS+LING  +G    +RG+RQGDPL
Subjt:  --------DNVVIGFECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPL

Query:  S------------------------SALDKDCRSIREILKVYEWASGQTINLEKSVFMASKKLRPGRFESLVRFLIAMLMALSGK
        S                         AL  D  +++EIL  YE ASGQ +N EKS F  SK       E +   L   L +  GK
Subjt:  S------------------------SALDKDCRSIREILKVYEWASGQTINLEKSVFMASKKLRPGRFESLVRFLIAMLMALSGK

A0A2N9HDH5 Uncharacterized protein1.5e-5347.19Show/hide
Query:  MNPSKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKIIAKVLANRLK-------------
        M+PSKAPG DG+ + F+Q YWH+VG   TN  L ILN  + + ++N T ++L+PK  +P+K+ D+RPISLCNV YKII+KVLANRLK             
Subjt:  MNPSKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKIIAKVLANRLK-------------

Query:  --------DNVVIGFECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPL
                DNV + FE +H + ++++G++G +A+KLDMSKAYDRVEWSF+   M KLGF E WI M+M C+ TVQYSI+++G   G  + +RGIRQGDP+
Subjt:  --------DNVVIGFECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPL

Query:  SSALDKDCRSIREILKVYEWASGQTINLEKS
        S  L   C      L       GQ   L+ S
Subjt:  SSALDKDCRSIREILKVYEWASGQTINLEKS

A0A2N9J109 Uncharacterized protein1.8e-5445.06Show/hide
Query:  MNPSKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKIIAKVLANRLK-------------
        M P  APG DGL  +FYQN+WH++G +     L  LN  + +  +N T I L+PKV +P+K+ +FRPISLCNV YKII+KV+ANRLK             
Subjt:  MNPSKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKIIAKVLANRLK-------------

Query:  --------DNVVIGFECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPL
                DN+++ FE +H + S ++G++G +A+KLDMSKAYDRVEWSF+   M KLGF   +I+++M C+ TV YSILING  +G+   +RGIRQGDPL
Subjt:  --------DNVVIGFECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPL

Query:  S----------------SALDKDCRSIREILKVYEWASGQTINLEKSVFMASK
        S                 A   +C++I+ IL +YE ASGQ +N +K+    SK
Subjt:  S----------------SALDKDCRSIREILKVYEWASGQTINLEKSVFMASK

A0A2N9J6K4 Reverse transcriptase domain-containing protein1.6e-5544.53Show/hide
Query:  MNPSKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKIIAKVLANRLK-------------
        M+PSKAPG DG+ ALF+Q +WH+VG + T+  L  LN    +  LN T IAL+PKV SP+ M  FRPISLCNV YKII+KVL NR+K             
Subjt:  MNPSKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKIIAKVLANRLK-------------

Query:  --------DNVVIGFECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPL
                DN++I FE +H + +++ G+   +A+KLDMSKAYDRVEW +++K M KLGF   W+ ++M CV +V YSIL+NG   G    +RG+RQGDPL
Subjt:  --------DNVVIGFECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPL

Query:  S---------------SALDKDCRSIREILKVYEWASGQTINLEKSVFMASKKLRPGRFESLVRF
        S               +A D +C+++++IL +YE ASGQ IN  K+    S+   P    S++ F
Subjt:  S---------------SALDKDCRSIREILKVYEWASGQTINLEKSVFMASKKLRPGRFESLVRF

A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 89.3e-5939.9Show/hide
Query:  AMLMALSGKNKEGFVDGTIKKPTNTKMVKMWKCNNDIIASWIMNSVSKEIAASIVYTGCVKEVWDELFDRFKEN-------------SLNSVNMRLVDVF
        AMLMA+SG+NK GF+ G I+KP++  ++  W CNNDI+ASWI+NSVSKEIAASI+Y G +KE+WDEL  RFK++             +L   N+ +   +
Subjt:  AMLMALSGKNKEGFVDGTIKKPTNTKMVKMWKCNNDIIASWIMNSVSKEIAASIVYTGCVKEVWDELFDRFKEN-------------SLNSVNMRLVDVF

Query:  --LKTVF--------------GFCDPNSENSENNFL---------CFRSIRTQILLMNPIPSISKVFALVIQEERQRNASNVISQSNPVALLATNASKKG
          LKT++              G   P  ++ E+ ++          + ++R QILLM P+PSI+ VF+L+IQEE+QR+A  +    +PVAL    AS   
Subjt:  --LKTVF--------------GFCDPNSENSENNFL---------CFRSIRTQILLMNPIPSISKVFALVIQEERQRNASNVISQSNPVALLATNASKKG

Query:  SSQTNQSRVTKPTVILLGTEPEIKIQIWQ--AIPNQWGPTL---------LLKLHNLL---------PPIFFSSLNASQCSQLMELLNSQLQAAETEPIT
         S     +  +PT    G +  I  + ++    P  + P             K +N+           P FFSSLN+ Q SQLM LLN+ LQAA T PIT
Subjt:  SSQTNQSRVTKPTVILLGTEPEIKIQIWQ--AIPNQWGPTL---------LLKLHNLL---------PPIFFSSLNASQCSQLMELLNSQLQAAETEPIT

Query:  TATFVVHTSGICSITSSPIDS-NVWIVDSSASQHISHCHHMFHNWRRVCGMSVILPTSQQVHVDYIGDIRLSQDLTLHDVLYVPRF
        TAT + HTSGI ++TS    S + WI+DS AS+HI H   +F NW     M V+LP   ++ VD IGDI+++  LTL DVL+V +F
Subjt:  TATFVVHTSGICSITSSPIDS-NVWIVDSSASQHISHCHHMFHNWRRVCGMSVILPTSQQVHVDYIGDIRLSQDLTLHDVLYVPRF

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein2.5e-1631.71Show/hide
Query:  KAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEER----INRLNRTIIALVPKV-HSPKKMEDFRPISLCNVSYKIIAKVLANRLKDNV-------VI
        K+PG DG  A FYQ Y      E     L++    E+     N      I L+PK      K E+FRPISL N+  KI+ K+LANR++ ++        +
Subjt:  KAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEER----INRLNRTIIALVPKV-HSPKKMEDFRPISLCNVSYKIIAKVLANRLKDNV-------VI

Query:  GF--------------ECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDP
        GF                I  IN  K   K  V + +D  KA+D+++  FM K +NKLG    ++ ++ +  +    +I++NG    +  L  G RQG P
Subjt:  GF--------------ECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDP

Query:  LSSAL
        LS  L
Subjt:  LSSAL

P08548 LINE-1 reverse transcriptase homolog1.8e-1428.78Show/hide
Query:  KAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTI----IALVPKV-HSPKKMEDFRPISLCNVSYKIIAKVLANRLKDNV-------VI
        K+PG DG  + FYQ +      E   + L +    E+   L  T     I L+PK    P + E++RPISL N+  KI+ K+L NR++ ++        +
Subjt:  KAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTI----IALVPKV-HSPKKMEDFRPISLCNVSYKIIAKVLANRLKDNV-------VI

Query:  GF--------------ECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDP
        GF                I  IN  K   K  + + +D  KA+D ++  FM + + K+G +  ++ ++ +       +I++NG    S  L  G RQG P
Subjt:  GF--------------ECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDP

Query:  LSSAL
        LS  L
Subjt:  LSSAL

P11369 LINE-1 retrotransposable element ORF2 protein4.9e-1732.34Show/hide
Query:  KAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVH-SPKKMEDFRPISLCNVSYKIIAKVLANRLKDNV-------VIGF--
        K+PG DG  A FYQ +   +      L  +I       N      I L+PK    P K+E+FRPISL N+  KI+ K+LANR+++++        +GF  
Subjt:  KAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVH-SPKKMEDFRPISLCNVSYKIIAKVLANRLKDNV-------VIGF--

Query:  ------------ECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPLSSA
                      IH IN  K   K  + + LD  KA+D+++  FM K + + G Q  ++NM+ +       +I +NG    +  L  G RQG PLS  
Subjt:  ------------ECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPLSSA

Query:  L
        L
Subjt:  L

P14381 Transposon TX1 uncharacterized 149 kDa protein6.0e-1526.47Show/hide
Query:  MNPSKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKIIAKVLANRLK-------------
        M  +K+PG+DGL   F+Q +W  +G +   +  +     E      R +++L+PK    + ++++RP+SL +  YKI+AK ++ RLK             
Subjt:  MNPSKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKIIAKVLANRLK-------------

Query:  --------DNVVIGFECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPL
                DNV +  + +H   +R+ G   L  + LD  KA+DRV+  ++   +    F   ++  + +   + +  + IN          RG+RQG PL
Subjt:  --------DNVVIGFECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPL

Query:  SSAL
        S  L
Subjt:  SSAL

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM6.1e-0727.55Show/hide
Query:  SKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKIIAKVLANRLKDNV-----VIGF----
        S +PG DG+     +     +     NL L   N    I RL RT+   +PK  + K+ +DFRPIS+ +V  + +  +LA RL  ++       GF    
Subjt:  SKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKIIAKVLANRLKDNV-----VIGF----

Query:  ECIH-------AINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPLSSAL
         C          +    K  +      LD+SKA+D +  + +   +   G  + +++ V +  E    S+  +G+S    V  RG++QGDPLS  L
Subjt:  ECIH-------AINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPLSSAL

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.7e-0427.63Show/hide
Query:  ESLVRFLIAMLMALSGKNKEGFVDGTIKKPTN-TKMVKMWKCNNDIIASWIMNSVSKEIAASIVYTGCVKEVWDEL
        ++ V + I     L    K GF+DGT+ KP   + + + W+  N ++  W+MNS++ ++  S++Y     ++W++L
Subjt:  ESLVRFLIAMLMALSGKNKEGFVDGTIKKPTN-TKMVKMWKCNNDIIASWIMNSVSKEIAASIVYTGCVKEVWDEL

AT1G43760.1 DNAse I-like superfamily protein1.9e-0838.46Show/hide
Query:  MNPSKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKII
        M  +KAPG D   A F+   W VV + T     +       + R N T I L+PKV    ++  FRP+S C V YKII
Subjt:  MNPSKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKII

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.6e-0742.86Show/hide
Query:  DNVVIGFECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWI
        DN+V   E +H++  RKKG KG + +KLD+ KAYDR+ W ++   +   GF E W+
Subjt:  DNVVIGFECIHAINSRKKGQKGLVAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCCTTCGAAAGCGCCCGGTATGGATGGTCTTCAAGCCCTTTTTTACCAAAATTACTGGCATGTGGTTGGCAATGAGACAACAAATCTATGTTTACAAATCCTCAA
TGGGGAAGAAAGAATAAACAGATTAAACAGAACGATCATAGCTCTTGTTCCCAAGGTTCACAGTCCAAAGAAGATGGAGGATTTCAGACCCATAAGCCTTTGCAACGTCA
GTTATAAAATTATTGCAAAGGTCCTAGCCAACAGACTTAAAGACAACGTGGTTATTGGCTTTGAATGCATTCACGCAATCAACTCCAGAAAAAAGGGTCAGAAGGGCCTC
GTGGCCATGAAACTCGACATGAGCAAAGCTTATGATCGAGTCGAATGGTCTTTTATGAGAAAATTCATGAACAAGCTTGGTTTCCAGGAGAATTGGATCAACATGGTGAT
GAGTTGTGTTGAAACAGTGCAATACTCTATTCTCATCAACGGTTTCTCTTATGGATCCTCGGTCCTGAATCGAGGCATAAGACAAGGCGACCCATTATCCTCGGCGCTGG
ATAAGGACTGCAGGAGTATCAGAGAAATTCTTAAGGTGTATGAGTGGGCGTCGGGCCAAACAATTAACCTGGAGAAATCAGTGTTCATGGCTAGCAAGAAGTTGAGGCCG
GGAAGATTCGAGAGTTTAGTGAGATTCCTTATAGCTATGTTAATGGCCCTCTCCGGTAAGAATAAGGAAGGCTTTGTCGATGGAACCATCAAGAAGCCGACCAACACAAA
AATGGTGAAAATGTGGAAGTGCAACAATGACATCATAGCTTCTTGGATCATGAATTCTGTTTCCAAGGAGATTGCTGCCAGCATTGTATATACTGGATGTGTAAAAGAAG
TATGGGATGAGCTTTTCGATCGTTTCAAGGAAAATTCACTGAATTCAGTGAATATGCGTTTGGTAGACGTGTTTCTGAAAACTGTTTTCGGATTCTGTGATCCAAATAGT
GAAAATTCTGAAAACAACTTTTTATGTTTTCGTTCGATTAGAACTCAAATCTTATTGATGAATCCTATCCCTTCCATTAGTAAGGTATTTGCATTAGTGATTCAAGAAGA
ACGTCAAAGAAATGCTAGCAATGTTATTTCACAGTCTAATCCAGTTGCGTTACTTGCTACCAATGCATCTAAGAAAGGATCATCACAAACTAATCAATCTCGTGTTACAA
AACCCACTGTTATCCTCTTGGGTACCGAACCAGAAATCAAAATTCAAATATGGCAAGCAATTCCAAACCAATGGGGACCAACACTGTTACTCAAGCTCCACAACCTGCTG
CCACCAATTTTTTTTTCTAGCCTGAATGCTAGTCAATGTTCTCAGTTGATGGAACTTCTAAATTCACAGTTGCAAGCAGCAGAAACTGAGCCTATCACAACAGCCACATT
TGTTGTTCATACATCAGGTATCTGCTCTATTACTTCTTCACCAATTGATTCAAACGTATGGATAGTTGATTCTAGTGCGTCTCAACATATATCTCATTGCCATCATATGT
TTCATAATTGGCGTCGTGTATGTGGTATGTCTGTTATTCTGCCAACTTCACAACAAGTACATGTTGATTATATTGGTGACATAAGGCTTTCTCAAGACTTGACTCTTCAT
GACGTGCTATATGTTCCACGGTTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATCCTTCGAAAGCGCCCGGTATGGATGGTCTTCAAGCCCTTTTTTACCAAAATTACTGGCATGTGGTTGGCAATGAGACAACAAATCTATGTTTACAAATCCTCAA
TGGGGAAGAAAGAATAAACAGATTAAACAGAACGATCATAGCTCTTGTTCCCAAGGTTCACAGTCCAAAGAAGATGGAGGATTTCAGACCCATAAGCCTTTGCAACGTCA
GTTATAAAATTATTGCAAAGGTCCTAGCCAACAGACTTAAAGACAACGTGGTTATTGGCTTTGAATGCATTCACGCAATCAACTCCAGAAAAAAGGGTCAGAAGGGCCTC
GTGGCCATGAAACTCGACATGAGCAAAGCTTATGATCGAGTCGAATGGTCTTTTATGAGAAAATTCATGAACAAGCTTGGTTTCCAGGAGAATTGGATCAACATGGTGAT
GAGTTGTGTTGAAACAGTGCAATACTCTATTCTCATCAACGGTTTCTCTTATGGATCCTCGGTCCTGAATCGAGGCATAAGACAAGGCGACCCATTATCCTCGGCGCTGG
ATAAGGACTGCAGGAGTATCAGAGAAATTCTTAAGGTGTATGAGTGGGCGTCGGGCCAAACAATTAACCTGGAGAAATCAGTGTTCATGGCTAGCAAGAAGTTGAGGCCG
GGAAGATTCGAGAGTTTAGTGAGATTCCTTATAGCTATGTTAATGGCCCTCTCCGGTAAGAATAAGGAAGGCTTTGTCGATGGAACCATCAAGAAGCCGACCAACACAAA
AATGGTGAAAATGTGGAAGTGCAACAATGACATCATAGCTTCTTGGATCATGAATTCTGTTTCCAAGGAGATTGCTGCCAGCATTGTATATACTGGATGTGTAAAAGAAG
TATGGGATGAGCTTTTCGATCGTTTCAAGGAAAATTCACTGAATTCAGTGAATATGCGTTTGGTAGACGTGTTTCTGAAAACTGTTTTCGGATTCTGTGATCCAAATAGT
GAAAATTCTGAAAACAACTTTTTATGTTTTCGTTCGATTAGAACTCAAATCTTATTGATGAATCCTATCCCTTCCATTAGTAAGGTATTTGCATTAGTGATTCAAGAAGA
ACGTCAAAGAAATGCTAGCAATGTTATTTCACAGTCTAATCCAGTTGCGTTACTTGCTACCAATGCATCTAAGAAAGGATCATCACAAACTAATCAATCTCGTGTTACAA
AACCCACTGTTATCCTCTTGGGTACCGAACCAGAAATCAAAATTCAAATATGGCAAGCAATTCCAAACCAATGGGGACCAACACTGTTACTCAAGCTCCACAACCTGCTG
CCACCAATTTTTTTTTCTAGCCTGAATGCTAGTCAATGTTCTCAGTTGATGGAACTTCTAAATTCACAGTTGCAAGCAGCAGAAACTGAGCCTATCACAACAGCCACATT
TGTTGTTCATACATCAGGTATCTGCTCTATTACTTCTTCACCAATTGATTCAAACGTATGGATAGTTGATTCTAGTGCGTCTCAACATATATCTCATTGCCATCATATGT
TTCATAATTGGCGTCGTGTATGTGGTATGTCTGTTATTCTGCCAACTTCACAACAAGTACATGTTGATTATATTGGTGACATAAGGCTTTCTCAAGACTTGACTCTTCAT
GACGTGCTATATGTTCCACGGTTCTGA
Protein sequenceShow/hide protein sequence
MNPSKAPGMDGLQALFYQNYWHVVGNETTNLCLQILNGEERINRLNRTIIALVPKVHSPKKMEDFRPISLCNVSYKIIAKVLANRLKDNVVIGFECIHAINSRKKGQKGL
VAMKLDMSKAYDRVEWSFMRKFMNKLGFQENWINMVMSCVETVQYSILINGFSYGSSVLNRGIRQGDPLSSALDKDCRSIREILKVYEWASGQTINLEKSVFMASKKLRP
GRFESLVRFLIAMLMALSGKNKEGFVDGTIKKPTNTKMVKMWKCNNDIIASWIMNSVSKEIAASIVYTGCVKEVWDELFDRFKENSLNSVNMRLVDVFLKTVFGFCDPNS
ENSENNFLCFRSIRTQILLMNPIPSISKVFALVIQEERQRNASNVISQSNPVALLATNASKKGSSQTNQSRVTKPTVILLGTEPEIKIQIWQAIPNQWGPTLLLKLHNLL
PPIFFSSLNASQCSQLMELLNSQLQAAETEPITTATFVVHTSGICSITSSPIDSNVWIVDSSASQHISHCHHMFHNWRRVCGMSVILPTSQQVHVDYIGDIRLSQDLTLH
DVLYVPRF