; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc03G04695 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc03G04695
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationClcChr03:4581328..4583694
RNA-Seq ExpressionClc03G04695
SyntenyClc03G04695
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011652531.1 protein ENL [Cucumis sativus]4.8e-4038.87Show/hide
Query:  MEGRGILSPQRSRFSPRKPKKNTPDGKPGRKVDKNGSKQKVTPIPKAVGLPSNLKPSEPSTSLNPNN--NYKNKSTSEPSSPAPLLTPTTPSPSPNTNLG
        M+ RGI+ P+RS+FSP KP +   D +  +   KN  K            PS+ KP   + + NPN+  N  N ++  PSS  P   P TP  +P     
Subjt:  MEGRGILSPQRSRFSPRKPKKNTPDGKPGRKVDKNGSKQKVTPIPKAVGLPSNLKPSEPSTSLNPNN--NYKNKSTSEPSSPAPLLTPTTPSPSPNTNLG

Query:  KANASKSGSLDPQKDVQKSTDHCDRNQYHRFSRVVSDTAALPPPDFSDRFLQRLSFDEFDSSFLIFWKSFVQKNLGKITYDLNLCPGRDSTINDKKLSII
            +KS    P  D      H + +   R     +  ++    +F  R     S   F  +         QK++ K T D +  PG   + +D    I 
Subjt:  KANASKSGSLDPQKDVQKSTDHCDRNQYHRFSRVVSDTAALPPPDFSDRFLQRLSFDEFDSSFLIFWKSFVQKNLGKITYDLNLCPGRDSTINDKKLSII

Query:  WIFLQKKKNGIYVGKDLADILQGNSIYDLISPNKKKESSSQSNDRSI-IFQIYKKIASHRQENSSVETYFTKLEELWVELVHYTADLVQFSNDGAPIGNP
           LQ+  +    GKDL DIL+GN+I DL+  N +KE SS  N  S+ I QIY+KIASHRQ N SVE YF KL++LW ++  Y+++ V+       I   
Subjt:  WIFLQKKKNGIYVGKDLADILQGNSIYDLISPNKKKESSSQSNDRSI-IFQIYKKIASHRQENSSVETYFTKLEELWVELVHYTADLVQFSNDGAPIGNP

Query:  MGLTEKEKVMQFLMGLDDSYATICSQILFMNPFPTVEKAYSEISREEKRRELVVALETMAAKVIQTNWLLRNNGRSNNNGDTNHGSDKEVD
          LTE++KV+QF +GL+D Y+ ICSQIL   PFPTVE+AYSEI REEKRREL VAL T+AA+VIQ+++    NG SNN  + N G D+E+D
Subjt:  MGLTEKEKVMQFLMGLDDSYATICSQILFMNPFPTVEKAYSEISREEKRRELVVALETMAAKVIQTNWLLRNNGRSNNNGDTNHGSDKEVD

XP_038895148.1 proline-rich receptor-like protein kinase PERK2 isoform X1 [Benincasa hispida]1.7e-6145.63Show/hide
Query:  MEGRGILSPQRSRFSPRKPKKNTPD-GKPGR-KVDKN--GSKQKVTPIPKAVGLPSNLKPSEPSTSLNPNNNY-KNKSTSEPSS--PAPLLTPTTPSPSP
        MEGRGI+SP+RSRFSP+  KK TP   KP + K  +N  GSK +VTPIP          P+ P  + NP+ N+ K +S SEPSS  P P  TPTT S   
Subjt:  MEGRGILSPQRSRFSPRKPKKNTPD-GKPGR-KVDKN--GSKQKVTPIPKAVGLPSNLKPSEPSTSLNPNNNY-KNKSTSEPSS--PAPLLTPTTPSPSP

Query:  NTNLGKANASKSGSLDPQKDVQKSTDHCDRNQYHRFS-----RVVSDTAALPPP----------DFSDRFLQRLSFDEFDSSFLIFWKSFVQKNLGKITY
          +    N  + G L         + H  ++ YH+ S      +  D    P P          D SDR LQRLSFD                       
Subjt:  NTNLGKANASKSGSLDPQKDVQKSTDHCDRNQYHRFS-----RVVSDTAALPPP----------DFSDRFLQRLSFDEFDSSFLIFWKSFVQKNLGKITY

Query:  DLNLCPGRDSTINDKKLSIIWIFLQKKKNGIYVGKDLADILQGNSIYDLISPNKKKESSSQSNDRSIIFQIYKKIASHRQENSSVETYFTKLEELWVELV
                                         GKD+ADILQG SIYD++  NKK+E +SQS D   + QIYK+IASHRQEN  VE YF KL  LW EL 
Subjt:  DLNLCPGRDSTINDKKLSIIWIFLQKKKNGIYVGKDLADILQGNSIYDLISPNKKKESSSQSNDRSIIFQIYKKIASHRQENSSVETYFTKLEELWVELV

Query:  HYTADLVQFSNDGAPIGNPMGLTEKEKVMQFLMGLDDSYATICSQILFMNPFPTVEKAYSEISREEKRRELVVALETMAAKVIQTNWLLRN-NGRSNNNG
         Y  DL Q S+ GA I N   LTE++KV+QFL+GL+DSYATIC QIL   PFPTVE+AYSEI  EEKRRELV ALET+AAKVIQ+NWLL+N N RSNN  
Subjt:  HYTADLVQFSNDGAPIGNPMGLTEKEKVMQFLMGLDDSYATICSQILFMNPFPTVEKAYSEISREEKRRELVVALETMAAKVIQTNWLLRN-NGRSNNNG

Query:  DTNHGSDKEVDN
          N G D+EVDN
Subjt:  DTNHGSDKEVDN

XP_038895149.1 proline-rich receptor-like protein kinase PERK2 isoform X2 [Benincasa hispida]5.4e-6045.15Show/hide
Query:  MEGRGILSPQRSRFSPRKPKKNTPD-GKPGR-KVDKN--GSKQKVTPIPKAVGLPSNLKPSEPSTSLNPNNNY-KNKSTSEPSS--PAPLLTPTTPSPSP
        MEGRGI+SP+RSRFSP+  KK TP   KP + K  +N  GSK +VTPIP          P+ P  + NP+ N+ K +S SEPSS  P P  TPTT S   
Subjt:  MEGRGILSPQRSRFSPRKPKKNTPD-GKPGR-KVDKN--GSKQKVTPIPKAVGLPSNLKPSEPSTSLNPNNNY-KNKSTSEPSS--PAPLLTPTTPSPSP

Query:  NTNLGKANASKSGSLDPQKDVQKSTDHCDRNQYHRFS-----RVVSDTAALPPP----------DFSDRFLQRLSFDEFDSSFLIFWKSFVQKNLGKITY
          +    N  + G L         + H  ++ YH+ S      +  D    P P          D SDR LQRLSFD                       
Subjt:  NTNLGKANASKSGSLDPQKDVQKSTDHCDRNQYHRFS-----RVVSDTAALPPP----------DFSDRFLQRLSFDEFDSSFLIFWKSFVQKNLGKITY

Query:  DLNLCPGRDSTINDKKLSIIWIFLQKKKNGIYVGKDLADILQGNSIYDLISPNKKKESSSQSNDRSIIFQIYKKIASHRQENSSVETYFTKLEELWVELV
                                           D+ADILQG SIYD++  NKK+E +SQS D   + QIYK+IASHRQEN  VE YF KL  LW EL 
Subjt:  DLNLCPGRDSTINDKKLSIIWIFLQKKKNGIYVGKDLADILQGNSIYDLISPNKKKESSSQSNDRSIIFQIYKKIASHRQENSSVETYFTKLEELWVELV

Query:  HYTADLVQFSNDGAPIGNPMGLTEKEKVMQFLMGLDDSYATICSQILFMNPFPTVEKAYSEISREEKRRELVVALETMAAKVIQTNWLLRN-NGRSNNNG
         Y  DL Q S+ GA I N   LTE++KV+QFL+GL+DSYATIC QIL   PFPTVE+AYSEI  EEKRRELV ALET+AAKVIQ+NWLL+N N RSNN  
Subjt:  HYTADLVQFSNDGAPIGNPMGLTEKEKVMQFLMGLDDSYATICSQILFMNPFPTVEKAYSEISREEKRRELVVALETMAAKVIQTNWLLRN-NGRSNNNG

Query:  DTNHGSDKEVDN
          N G D+EVDN
Subjt:  DTNHGSDKEVDN

XP_038895285.1 GATA zinc finger domain-containing protein 11-like isoform X1 [Benincasa hispida]1.0e-4238.46Show/hide
Query:  RGILSPQRSRFSPRKPKKNTPDGKP-----------------------GRKVDKNGSKQKVTPIPKAVGLPSNLKPSEPSTSL-NPNNNY---KNKST--
        RGILSP R++FS  +   N    KP                       G     N +  ++ P        S L  S P T+  NPN      KN +T  
Subjt:  RGILSPQRSRFSPRKPKKNTPDGKP-----------------------GRKVDKNGSKQKVTPIPKAVGLPSNLKPSEPSTSL-NPNNNY---KNKST--

Query:  ----SEPSSPAPLLTPTTPSPSPNTNLGKANASKSGSLDPQKDVQKSTDHCDRN-------QYHRFSRVVSDTAALPPPDFSDRFLQRLSFDEFDSSFLI
            S P +P P+   T P+P+   N+    AS SGS        K+     RN         HR+S      A L     SD+ LQRLS DEF      
Subjt:  ----SEPSSPAPLLTPTTPSPSPNTNLGKANASKSGSLDPQKDVQKSTDHCDRN-------QYHRFSRVVSDTAALPPPDFSDRFLQRLSFDEFDSSFLI

Query:  FWKSFVQKNLGKITYDLNLCPGRDSTINDKKLSIIWIFLQKKKNGIYVGKDLAD-ILQGNSIYDLISPNKKKESSSQSNDRSIIFQIYKKIASHRQENSS
                                                       VGKDLA  +L  NSIY+ +  + K+ SSSQSND S +FQIYK+IA HRQENSS
Subjt:  FWKSFVQKNLGKITYDLNLCPGRDSTINDKKLSIIWIFLQKKKNGIYVGKDLAD-ILQGNSIYDLISPNKKKESSSQSNDRSIIFQIYKKIASHRQENSS

Query:  VETYFTKLEELWVELVHYTADLVQFSNDGAPIGNPMGLTEKEKVMQFLMGLDDSYATICSQILFMNPFPTVEKAYSEISREEKRRELVVALETMAAKVIQ
        + +YFTKLE LW EL  +  DL+Q S  GA         E+EKVMQFL+GL+DSY+ IC+QIL   PFPT+EKAYS + REEK RELVV LE++A KVIQ
Subjt:  VETYFTKLEELWVELVHYTADLVQFSNDGAPIGNPMGLTEKEKVMQFLMGLDDSYATICSQILFMNPFPTVEKAYSEISREEKRRELVVALETMAAKVIQ

Query:  TNWL-LRN-NGRSNNNGDTNHGSDKEVDN
         NWL L+N N  S+NNGD N G  + VD+
Subjt:  TNWL-LRN-NGRSNNNGDTNHGSDKEVDN

XP_038895286.1 hybrid signal transduction histidine kinase L-like isoform X2 [Benincasa hispida]8.1e-4039.4Show/hide
Query:  RGILSPQRSRFSPRKPKKNTPDGKPGRKVDKNGSKQKVTPIPKAVGLPSNLKPSEPSTS-LNPNNNYKNKSTSEPSSPAPLLTPTTPSPSPNTNLGKANA
        RGILSP R++FS  +   N    KP         + K T I        N + S  +TS +NP     +KSTS+  S  P     T +P+    L K + 
Subjt:  RGILSPQRSRFSPRKPKKNTPDGKPGRKVDKNGSKQKVTPIPKAVGLPSNLKPSEPSTS-LNPNNNYKNKSTSEPSSPAPLLTPTTPSPSPNTNLGKANA

Query:  SKSGSLDPQKDVQKSTDHCDRNQYHRFSRVVSDTAALPPPDFSDRFLQRLSFDEFDSSFLIFWKSFVQKNLGKITYDLNLCP-----------GRDSTIN
        +K  S        K   H  R                P P+       R++      S L    S   KN  K  +  N  P           G  +T++
Subjt:  SKSGSLDPQKDVQKSTDHCDRNQYHRFSRVVSDTAALPPPDFSDRFLQRLSFDEFDSSFLIFWKSFVQKNLGKITYDLNLCP-----------GRDSTIN

Query:  DKKLSIIWIFLQKKKNGIYVGKDLAD-ILQGNSIYDLISPNKKKESSSQSNDRSIIFQIYKKIASHRQENSSVETYFTKLEELWVELVHYTADLVQFSND
        D  +S       + +     GKDLA  +L  NSIY+ +  + K+ SSSQSND S +FQIYK+IA HRQENSS+ +YFTKLE LW EL  +  DL+Q S  
Subjt:  DKKLSIIWIFLQKKKNGIYVGKDLAD-ILQGNSIYDLISPNKKKESSSQSNDRSIIFQIYKKIASHRQENSSVETYFTKLEELWVELVHYTADLVQFSND

Query:  GAPIGNPMGLTEKEKVMQFLMGLDDSYATICSQILFMNPFPTVEKAYSEISREEKRRELVVALETMAAKVIQTNWL-LRN-NGRSNNNGDTNHGSDKEVD
        GA         E+EKVMQFL+GL+DSY+ IC+QIL   PFPT+EKAYS + REEK RELVV LE++A KVIQ NWL L+N N  S+NNGD N G  + VD
Subjt:  GAPIGNPMGLTEKEKVMQFLMGLDDSYATICSQILFMNPFPTVEKAYSEISREEKRRELVVALETMAAKVIQTNWL-LRN-NGRSNNNGDTNHGSDKEVD

Query:  N
        +
Subjt:  N

TrEMBL top hitse value%identityAlignment
A0A0A0LRE6 Uncharacterized protein2.3e-4038.87Show/hide
Query:  MEGRGILSPQRSRFSPRKPKKNTPDGKPGRKVDKNGSKQKVTPIPKAVGLPSNLKPSEPSTSLNPNN--NYKNKSTSEPSSPAPLLTPTTPSPSPNTNLG
        M+ RGI+ P+RS+FSP KP +   D +  +   KN  K            PS+ KP   + + NPN+  N  N ++  PSS  P   P TP  +P     
Subjt:  MEGRGILSPQRSRFSPRKPKKNTPDGKPGRKVDKNGSKQKVTPIPKAVGLPSNLKPSEPSTSLNPNN--NYKNKSTSEPSSPAPLLTPTTPSPSPNTNLG

Query:  KANASKSGSLDPQKDVQKSTDHCDRNQYHRFSRVVSDTAALPPPDFSDRFLQRLSFDEFDSSFLIFWKSFVQKNLGKITYDLNLCPGRDSTINDKKLSII
            +KS    P  D      H + +   R     +  ++    +F  R     S   F  +         QK++ K T D +  PG   + +D    I 
Subjt:  KANASKSGSLDPQKDVQKSTDHCDRNQYHRFSRVVSDTAALPPPDFSDRFLQRLSFDEFDSSFLIFWKSFVQKNLGKITYDLNLCPGRDSTINDKKLSII

Query:  WIFLQKKKNGIYVGKDLADILQGNSIYDLISPNKKKESSSQSNDRSI-IFQIYKKIASHRQENSSVETYFTKLEELWVELVHYTADLVQFSNDGAPIGNP
           LQ+  +    GKDL DIL+GN+I DL+  N +KE SS  N  S+ I QIY+KIASHRQ N SVE YF KL++LW ++  Y+++ V+       I   
Subjt:  WIFLQKKKNGIYVGKDLADILQGNSIYDLISPNKKKESSSQSNDRSI-IFQIYKKIASHRQENSSVETYFTKLEELWVELVHYTADLVQFSNDGAPIGNP

Query:  MGLTEKEKVMQFLMGLDDSYATICSQILFMNPFPTVEKAYSEISREEKRRELVVALETMAAKVIQTNWLLRNNGRSNNNGDTNHGSDKEVD
          LTE++KV+QF +GL+D Y+ ICSQIL   PFPTVE+AYSEI REEKRREL VAL T+AA+VIQ+++    NG SNN  + N G D+E+D
Subjt:  MGLTEKEKVMQFLMGLDDSYATICSQILFMNPFPTVEKAYSEISREEKRRELVVALETMAAKVIQTNWLLRNNGRSNNNGDTNHGSDKEVD

A0A0A0LU31 Uncharacterized protein1.7e-3837.5Show/hide
Query:  RGILSPQRSRFSPRKPKKNTPDG--KPGRKVDKNGSKQKVTPIPKAVGLPSNLKPSEPSTSLNPNNNYKNKSTSEPSSPAPLLT-----------PTTPS
        RGI  P       R+   ++P+G  KP     KN + +     P     PS+ KP   + + N N N    S S PSSPAP  T           P TPS
Subjt:  RGILSPQRSRFSPRKPKKNTPDG--KPGRKVDKNGSKQKVTPIPKAVGLPSNLKPSEPSTSLNPNNNYKNKSTSEPSSPAPLLT-----------PTTPS

Query:  PS-PNTNLGKANASKSGSLDPQKDVQKSTDHCDRNQYHRFSRVVSDTAALPPPDFSDRFLQRLSFDEFDSSFLIFWKSFVQKNLGKITYDLNLCPGRDST
        P+ P T     + ++     P      S ++  R  Y+  S   S    + P   SD   + +    + +S                       P     
Subjt:  PS-PNTNLGKANASKSGSLDPQKDVQKSTDHCDRNQYHRFSRVVSDTAALPPPDFSDRFLQRLSFDEFDSSFLIFWKSFVQKNLGKITYDLNLCPGRDST

Query:  INDKKLSIIWIFLQKKKNGIYVGKDLADILQGNSIYDLISPNKKKESSSQSNDRSI-IFQIYKKIASHRQENSSVETYFTKLEELWVELVHYTADLVQFS
         ND       I  +  +   + GKDL DIL+GNSI DL+  N KKE SS  N  S+ I QIY+KIASHRQ N SVE YF KL++LW ++  Y++D  Q  
Subjt:  INDKKLSIIWIFLQKKKNGIYVGKDLADILQGNSIYDLISPNKKKESSSQSNDRSI-IFQIYKKIASHRQENSSVETYFTKLEELWVELVHYTADLVQFS

Query:  NDGAPIGNPMGLTEKEKVMQFLMGLDDSYATICSQILFMNPFPTVEKAYSEISREEKRRELVVALETMAAKVIQTNWLLRNNGRSNNNGDTNHGSDKEVD
        +    I     LTE++KVMQF +GL+D Y+ ICSQIL   PFPTVE+AYSEI REEKRREL VAL  MAA+VIQ+++     G SNN  + N G D+E+D
Subjt:  NDGAPIGNPMGLTEKEKVMQFLMGLDDSYATICSQILFMNPFPTVEKAYSEISREEKRRELVVALETMAAKVIQTNWLLRNNGRSNNNGDTNHGSDKEVD

A0A6J1C5Z8 uncharacterized protein LOC1110085883.8e-3536.84Show/hide
Query:  SPQRSRFSPRKPKKNTPDGKPGRKVDKNGSKQKVTPIPKAVGLPSNLKPSEPSTSLNPNNNYKNKSTSEPSSPAPLLTPTTPSPSPNTNLGKANASKSGS
        SP R R SP+      P  +P      N  K      PK  G     KPS P T      N        P  P P+   TTP P P   +  A+ S S +
Subjt:  SPQRSRFSPRKPKKNTPDGKPGRKVDKNGSKQKVTPIPKAVGLPSNLKPSEPSTSLNPNNNYKNKSTSEPSSPAPLLTPTTPSPSPNTNLGKANASKSGS

Query:  LDPQKDVQK-----STDHCDRNQYHRFSRVVSDTAALPPPDFSDRFLQRLSFDEFDSSFLIFWKSFVQKNLGKITYDLNLCPGRDSTINDKKLSIIWIFL
         +     +      S  H D   +  +S       A P     +  LQRLS D                                               
Subjt:  LDPQKDVQK-----STDHCDRNQYHRFSRVVSDTAALPPPDFSDRFLQRLSFDEFDSSFLIFWKSFVQKNLGKITYDLNLCPGRDSTINDKKLSIIWIFL

Query:  QKKKNGIYVGKDLAD-ILQGNSIYDLISPNKKKESSSQSNDRSIIFQIYKKIASHRQENSSVETYFTKLEELWVELVHYTADLVQFSNDGAPIGNPMGLT
                 GKDLA  IL  NSIY+ I  +  +ES   +  R  IFQIYK IASHRQENSSV +YFTKL+ LW EL  Y+ D+ Q  + GA +    G  
Subjt:  QKKKNGIYVGKDLAD-ILQGNSIYDLISPNKKKESSSQSNDRSIIFQIYKKIASHRQENSSVETYFTKLEELWVELVHYTADLVQFSNDGAPIGNPMGLT

Query:  EKEKVMQFLMGLDDSYATICSQILFMNPFPTVEKAYSEISREEKRRELVVALETMAAKVIQTNWLLRNNGRSNNNGDTNH
        E+EKVMQFLMGL++SY+TIC QIL + PFPT+EKAYS I REEKR ELV +LE +AAKV++  WLL+N+  SN   D  H
Subjt:  EKEKVMQFLMGLDDSYATICSQILFMNPFPTVEKAYSEISREEKRRELVVALETMAAKVIQTNWLLRNNGRSNNNGDTNH

A0A6J1C7L7 uncharacterized protein LOC1110089865.5e-2655.91Show/hide
Query:  ILQGNSIYDLISPNKKKESSSQSNDRSIIFQIYKKIASHRQENSSVETYFTKLEELWVELVHYTADLVQFSNDGAPIGNPMGLTEKEKVMQFLMGLDDSY
        + + NS+ + + P  K+E S QSN   I+ +IYK IASHRQ NSS+ +YFTKLE LW EL  Y +DL Q  +  A    P  L E+EKVMQFL+GL+DSY
Subjt:  ILQGNSIYDLISPNKKKESSSQSNDRSIIFQIYKKIASHRQENSSVETYFTKLEELWVELVHYTADLVQFSNDGAPIGNPMGLTEKEKVMQFLMGLDDSY

Query:  ATICSQILFMNPFPTVEKAYSEISREE
        +TICSQIL + PFPTVEKAYS I  +E
Subjt:  ATICSQILFMNPFPTVEKAYSEISREE

A0A6J1GTG4 serine/arginine repetitive matrix protein 1-like1.5e-3435.89Show/hide
Query:  RGILSPQRSRFSPRKPK---KNTPDGKPGRKVDKNGSKQKVTPIPKAVGLPSNLKPSEPST---SLNPNNNYKNKSTSEPSSPAPLLTPTTP-SPSPN--
        RGI+SP RSR SPR+ +    N     P R    +  ++  TPI     L ++ K  +P+       P +   N    +PS P   L P+ P +PSPN  
Subjt:  RGILSPQRSRFSPRKPK---KNTPDGKPGRKVDKNGSKQKVTPIPKAVGLPSNLKPSEPST---SLNPNNNYKNKSTSEPSSPAPLLTPTTP-SPSPN--

Query:  ---TNLGKANASKSGSLDPQKDVQKSTDHCDRNQYHRFSRVVSDTAALPPPDFSDRFLQRLSF----DEFDSSFLIFWKSFVQKNLGKITY-DLNLCPGR
           T       ++  S  P K +   +    +      SR  SD +   P D      + L      D+ D   +   +S+   + G  T  D ++    
Subjt:  ---TNLGKANASKSGSLDPQKDVQKSTDHCDRNQYHRFSRVVSDTAALPPPDFSDRFLQRLSF----DEFDSSFLIFWKSFVQKNLGKITY-DLNLCPGR

Query:  DSTINDKKLSIIWIFLQKKKNGIYVGKDLADILQGNSIYD-LISPNKKKESSSQSNDRSIIFQIYKKIASHRQENSSVETYFTKLEELWVELVHYTADLV
          +++DK L+ I                   +L  N +Y+ L S  K++E SSQ N+ S +FQIYK+IASH Q NSS+ +Y TKL+ LW EL  Y  D  
Subjt:  DSTINDKKLSIIWIFLQKKKNGIYVGKDLADILQGNSIYD-LISPNKKKESSSQSNDRSIIFQIYKKIASHRQENSSVETYFTKLEELWVELVHYTADLV

Query:  QFSNDGAPIGNPMGLTEKEKVMQFLMGLDDSYATICSQILFMNPFPTVEKAYSEISREEKRRELVVALETMAAKVIQTNWLLRNNGRSNNNGDTNHGSDK
        + S       +     E+EKVMQFL+GL+DSY+TIC+QIL M PFPTVEKA   I REEKRRELV++LE +AAKVIQ NWLL+       NG + +G ++
Subjt:  QFSNDGAPIGNPMGLTEKEKVMQFLMGLDDSYATICSQILFMNPFPTVEKAYSEISREEKRRELVVALETMAAKVIQTNWLLRNNGRSNNNGDTNHGSDK

Query:  EVDN
        EVD+
Subjt:  EVDN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).7.4e-0730.77Show/hide
Query:  IFQIYKKIASHRQENSSVETYFTKLEELWVELVHYTADLVQFSNDGAP---IGNPMGLTEKEKVMQFLMG--LDDSYATICSQILFMNPFPTVEKAYSEI
        I+Q+ +++A+ RQ   SVE YF KL ++W+EL  Y A + +    G             EKE+  +FLMG  L+  +  + ++I+F  P P++ +A++ +
Subjt:  IFQIYKKIASHRQENSSVETYFTKLEELWVELVHYTADLVQFSNDGAP---IGNPMGLTEKEKVMQFLMG--LDDSYATICSQILFMNPFPTVEKAYSEI

Query:  SREE
           E
Subjt:  SREE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGAAGAGGAATTTTGAGTCCCCAGAGATCCCGATTTTCTCCAAGAAAGCCAAAGAAGAATACTCCAGACGGAAAACCAGGCAGGAAAGTTGATAAAAATGGATC
AAAACAGAAAGTAACTCCAATTCCAAAGGCTGTGGGTTTACCCTCAAACCTAAAACCATCAGAACCATCCACAAGTCTTAATCCTAACAACAACTACAAAAATAAATCCA
CCTCTGAACCATCTTCCCCAGCTCCACTTCTAACTCCCACTACGCCATCGCCTTCTCCAAACACCAATCTTGGTAAAGCTAATGCTTCCAAATCTGGGTCGTTGGATCCT
CAGAAAGATGTCCAAAAAAGTACTGACCACTGTGATCGGAATCAGTACCACCGTTTTTCGAGGGTAGTTTCCGACACTGCTGCCCTTCCTCCTCCTGATTTTAGTGATCG
CTTCTTACAACGCCTTTCTTTTGACGAATTTGATTCATCTTTTTTAATCTTTTGGAAATCTTTTGTTCAAAAGAATTTGGGAAAGATTACATACGATCTCAATCTTTGTC
CTGGCCGAGATTCCACCATTAATGACAAAAAACTATCGATAATTTGGATATTCTTGCAAAAAAAAAAAAATGGCATATATGTAGGTAAAGATCTTGCGGACATCCTCCAA
GGAAACTCGATATATGATTTAATCAGCCCAAATAAGAAGAAAGAATCTTCTTCTCAAAGCAATGATCGTTCCATAATATTTCAAATTTACAAGAAAATTGCGTCTCATCG
ACAAGAAAACTCATCCGTTGAAACTTACTTCACAAAGCTGGAGGAATTATGGGTTGAGCTTGTACACTACACTGCTGATTTGGTTCAATTTTCCAACGATGGTGCACCAA
TTGGAAATCCCATGGGGCTTACAGAGAAAGAAAAAGTTATGCAATTTCTTATGGGACTAGATGATTCTTATGCCACAATTTGCTCCCAAATCCTTTTTATGAACCCATTT
CCAACCGTGGAGAAAGCTTATTCTGAAATATCTCGAGAAGAAAAACGTAGGGAATTGGTTGTTGCATTAGAAACTATGGCTGCAAAAGTAATCCAAACCAACTGGCTTCT
TAGAAATAATGGTCGATCCAACAATAATGGTGATACAAATCATGGAAGTGATAAAGAAGTTGATAACAAGTAA
mRNA sequenceShow/hide mRNA sequence
AAGACCCTCTTCATTTATTTTATTTAGTTTTTCCCAAACTGGGGAGATGGATGGGAAGAGGTGAATTGTAATGGAGGGAAGAGGAATTTTGAGTCCCCAGAGATCCCGAT
TTTCTCCAAGAAAGCCAAAGAAGAATACTCCAGACGGAAAACCAGGCAGGAAAGTTGATAAAAATGGATCAAAACAGAAAGTAACTCCAATTCCAAAGGCTGTGGGTTTA
CCCTCAAACCTAAAACCATCAGAACCATCCACAAGTCTTAATCCTAACAACAACTACAAAAATAAATCCACCTCTGAACCATCTTCCCCAGCTCCACTTCTAACTCCCAC
TACGCCATCGCCTTCTCCAAACACCAATCTTGGTAAAGCTAATGCTTCCAAATCTGGGTCGTTGGATCCTCAGAAAGATGTCCAAAAAAGTACTGACCACTGTGATCGGA
ATCAGTACCACCGTTTTTCGAGGGTAGTTTCCGACACTGCTGCCCTTCCTCCTCCTGATTTTAGTGATCGCTTCTTACAACGCCTTTCTTTTGACGAATTTGATTCATCT
TTTTTAATCTTTTGGAAATCTTTTGTTCAAAAGAATTTGGGAAAGATTACATACGATCTCAATCTTTGTCCTGGCCGAGATTCCACCATTAATGACAAAAAACTATCGAT
AATTTGGATATTCTTGCAAAAAAAAAAAAATGGCATATATGTAGGTAAAGATCTTGCGGACATCCTCCAAGGAAACTCGATATATGATTTAATCAGCCCAAATAAGAAGA
AAGAATCTTCTTCTCAAAGCAATGATCGTTCCATAATATTTCAAATTTACAAGAAAATTGCGTCTCATCGACAAGAAAACTCATCCGTTGAAACTTACTTCACAAAGCTG
GAGGAATTATGGGTTGAGCTTGTACACTACACTGCTGATTTGGTTCAATTTTCCAACGATGGTGCACCAATTGGAAATCCCATGGGGCTTACAGAGAAAGAAAAAGTTAT
GCAATTTCTTATGGGACTAGATGATTCTTATGCCACAATTTGCTCCCAAATCCTTTTTATGAACCCATTTCCAACCGTGGAGAAAGCTTATTCTGAAATATCTCGAGAAG
AAAAACGTAGGGAATTGGTTGTTGCATTAGAAACTATGGCTGCAAAAGTAATCCAAACCAACTGGCTTCTTAGAAATAATGGTCGATCCAACAATAATGGTGATACAAAT
CATGGAAGTGATAAAGAAGTTGATAACAAGTAA
Protein sequenceShow/hide protein sequence
MEGRGILSPQRSRFSPRKPKKNTPDGKPGRKVDKNGSKQKVTPIPKAVGLPSNLKPSEPSTSLNPNNNYKNKSTSEPSSPAPLLTPTTPSPSPNTNLGKANASKSGSLDP
QKDVQKSTDHCDRNQYHRFSRVVSDTAALPPPDFSDRFLQRLSFDEFDSSFLIFWKSFVQKNLGKITYDLNLCPGRDSTINDKKLSIIWIFLQKKKNGIYVGKDLADILQ
GNSIYDLISPNKKKESSSQSNDRSIIFQIYKKIASHRQENSSVETYFTKLEELWVELVHYTADLVQFSNDGAPIGNPMGLTEKEKVMQFLMGLDDSYATICSQILFMNPF
PTVEKAYSEISREEKRRELVVALETMAAKVIQTNWLLRNNGRSNNNGDTNHGSDKEVDNK