CuGenDBv2

Lag0008339 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0008339
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr9:18223160..18226406
RNA-Seq Expression: Lag0008339
Synteny: Lag0008339
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
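
The genome location field follows a `chromosome:start..end` pattern. As a minimal sketch, the string can be split into its parts and the locus span computed as below. The function names `parse_location` and `span_length` are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this page does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr9:18223160..18226406")
print(chrom, start, end, span_length(start, end))
```

Under that inclusive-coordinate assumption the locus spans 3,247 bp.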


Homology
GenBank top hits (e value, % identity, alignment)
XP_022926214.1 uncharacterized protein LOC111433394 [Cucurbita moschata] | e value: 2.4e-76 | % identity: 38.64
Query:  QNPPLEQNEQQNNQAKNHILVANDRARAIRAYVFPMFGELNRGIARPQIEATNFEMKPTMFQMLQTVGQFHGLSSKDPHLHLKSFLGVNDRGVKKSGRVG
        +NP +  N  Q     N I +A+DR RAIRAY  P   ELN  I RP+++AT FE+KP MFQMLQT+GQFHGL S+DPHLHLKSFLG             
Subjt:  QNPPLEQNEQQNNQAKNHILVANDRARAIRAYVFPMFGELNRGIARPQIEATNFEMKPTMFQMLQTVGQFHGLSSKDPHLHLKSFLGVNDRGVKKSGRVG

Query:  CTDDPTRPDTSRLNERRQLTTLATANKKPYDGVDVDKEQASDDNGEEQKASDDKMFIVREEEEAKGMRCRRRSFYRKPRTEPNQIGSVQFGFDLMKNLNQ
                                                                                                            
Subjt:  CTDDPTRPDTSRLNERRQLTTLATANKKPYDGVDVDKEQASDDNGEEQKASDDKMFIVREEEEAKGMRCRRRSFYRKPRTEPNQIGSVQFGFDLMKNLNQ

Query:  PEPNRFGSVRFSDSVRVLTPLVNDSFVIQGVPRDALRLTLFPYSLRDGAKAW----------------------------NAKLRSEIVGFRQLEDETFC
                             V+DSF  Q V +D +RL+LFPYSLRDGAK+W                            NA+ R+EIV F+Q ED+T  
Subjt:  PEPNRFGSVRFSDSVRVLTPLVNDSFVIQGVPRDALRLTLFPYSLRDGAKAW----------------------------NAKLRSEIVGFRQLEDETFC

Query:  EAWERFKELLRKCPHHGLPHCIQMEIFYNGLNGATQGMIDASAGGALLTKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLA
        EAWERFKE+LRKCPHHGLPHCIQME FYNGLN AT+ ++DASA GA+L+KT+NEA+EILERI++N+CQW+DVR    +K + VLEVD +S+I A +A + 
Subjt:  EAWERFKELLRKCPHHGLPHCIQMEIFYNGLNGATQGMIDASAGGALLTKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLA

Query:  NALKNVTMFSHQQPPAVVF-AAMVKQVAEEACVYCGEEHNYEFCPNNPASVFFVLPQQNKQALPQQNSESS
        N L+N+ +       A V   A++ Q A E+CVYCGEEH ++ CP+NPAS+F+V   Q  Q  P+ N  S+
Subjt:  NALKNVTMFSHQQPPAVVF-AAMVKQVAEEACVYCGEEHNYEFCPNNPASVFFVLPQQNKQALPQQNSESS

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata] | e value: 6.2e-80 | % identity: 34.63
Query:  MSDSPGVRFELDPKIERTFKRRREQRRQQNPMADVLCLPQGPGDPLDPQNRLLQQNPPLEQNEQQNNQAKNHILVANDRARAIRAYVFPMFGELNRGIAR
        M+   G+ F LDP+IERTF+RR +++++     ++  +  G       Q     +NP +  N  Q     N I +A+DR RAIRAY  P   ELN  I R
Subjt:  MSDSPGVRFELDPKIERTFKRRREQRRQQNPMADVLCLPQGPGDPLDPQNRLLQQNPPLEQNEQQNNQAKNHILVANDRARAIRAYVFPMFGELNRGIAR

Query:  PQIEATNFEMKPTMFQMLQTVGQFHGLSSKDPHLHLKSFLGVNDRGVKKSGRVGCTDDPTRPDTSRLNERRQLTTLATANKKPYDGVDVDKEQASDDNGE
        P+I+ T FE+KP MFQMLQT+GQFHGL  +DPHLHLKSFLGV+D                                                        
Subjt:  PQIEATNFEMKPTMFQMLQTVGQFHGLSSKDPHLHLKSFLGVNDRGVKKSGRVGCTDDPTRPDTSRLNERRQLTTLATANKKPYDGVDVDKEQASDDNGE

Query:  EQKASDDKMFIVREEEEAKGMRCRRRSFYRKPRTEPNQIGSVQFGFDLMKNLNQPEPNRFGSVRFSDSVRVLTPLVNDSFVIQGVPRDALRLTLFPYSLR
                                                                     S RF           +DSF  QGV +D +RL+LFPY LR
Subjt:  EQKASDDKMFIVREEEEAKGMRCRRRSFYRKPRTEPNQIGSVQFGFDLMKNLNQPEPNRFGSVRFSDSVRVLTPLVNDSFVIQGVPRDALRLTLFPYSLR

Query:  DGAKAW----------------------------NAKLRSEIVGFRQLEDETFCEAWERFKELLRKCPHHGLPHCIQMEIFYNGLNGATQGMIDASAGGA
        DGAK+W                            NA+ ++EIV F+Q EDET  EA ERFKE+LRKCPHHGLPHCIQME FYNGLN  T+ ++DASA GA
Subjt:  DGAKAW----------------------------NAKLRSEIVGFRQLEDETFCEAWERFKELLRKCPHHGLPHCIQMEIFYNGLNGATQGMIDASAGGA

Query:  LLTKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANALKNVTMFSHQQPPAVVF-AAMVKQVAEEACVYCGEEHNYEFCPN
        +L+KT+NEA+EILERI++N+CQW+DVR    +K + VLEVD +S+I A +A + N L+N+ +       A V  AA + Q A E+CVYCGEEH ++ CP+
Subjt:  LLTKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANALKNVTMFSHQQPPAVVF-AAMVKQVAEEACVYCGEEHNYEFCPN

Query:  NPASVFFVLPQ--------------------------------QNKQALPQQN-------------------------------SESSLEAMMKEYMART
        NPAS+F+V  Q                                 N+Q  P+ N                               SE+S+E+++KEYMA+ 
Subjt:  NPASVFFVLPQ--------------------------------QNKQALPQQN-------------------------------SESSLEAMMKEYMART

Query:  DAAIQNTEHPRREGKEQV
        DA IQ+ +   R  + Q+
Subjt:  DAAIQNTEHPRREGKEQV

XP_022947838.1 uncharacterized protein LOC111451598 [Cucurbita moschata] | e value: 1.1e-73 | % identity: 38.39
Query:  QNNQAKNHILVANDRARAIRAYVFPMFGELNRGIARPQIEATNFEMKPTMFQMLQTVGQFHGLSSKDPHLHLKSFLGVNDRGVKKSGRVGCTDDPTRPDT
        Q     N I VA+DR RAIRAY  P   ELN  I RP+++AT FE+KP MFQMLQT+GQFHGLSSKDPHLHLKSFLG                       
Subjt:  QNNQAKNHILVANDRARAIRAYVFPMFGELNRGIARPQIEATNFEMKPTMFQMLQTVGQFHGLSSKDPHLHLKSFLGVNDRGVKKSGRVGCTDDPTRPDT

Query:  SRLNERRQLTTLATANKKPYDGVDVDKEQASDDNGEEQKASDDKMFIVREEEEAKGMRCRRRSFYRKPRTEPNQIGSVQFGFDLMKNLNQPEPNRFGSVR
                                                                                                            
Subjt:  SRLNERRQLTTLATANKKPYDGVDVDKEQASDDNGEEQKASDDKMFIVREEEEAKGMRCRRRSFYRKPRTEPNQIGSVQFGFDLMKNLNQPEPNRFGSVR

Query:  FSDSVRVLTPLVNDSFVIQGVPRDALRLTLFPYSLRDGAKAW----------------------------NAKLRSEIVGFRQLEDETFCEAWERFKELL
                   V+DSF  QGV +D +RL+ F YSLRDGAK+W                            +A+ R+EIV F++ E+ET  EAWERFKE L
Subjt:  FSDSVRVLTPLVNDSFVIQGVPRDALRLTLFPYSLRDGAKAW----------------------------NAKLRSEIVGFRQLEDETFCEAWERFKELL

Query:  RKCPHHGLPHCIQMEIFYNGLNGATQGMIDASAGGALLTKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANALKNVTMFS
        RKCPHHGLPHCIQ+E FYNGLN AT+ ++DASA G +L+KT+NEA+EILERI++N+CQW DVR    KK + VLEVD +S+I A +A + N L+N+    
Subjt:  RKCPHHGLPHCIQMEIFYNGLNGATQGMIDASAGGALLTKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANALKNVTMFS

Query:  HQQPPAVVFAAMVK-QVAEEACVYCGEEHNYEFCPNNPASVFFVLPQQNKQALPQQNSESS
             A    A V  Q A E+CVYCGE+H ++ CP+NPAS+F+V   Q  Q  P+ N  S+
Subjt:  HQQPPAVVFAAMVK-QVAEEACVYCGEEHNYEFCPNNPASVFFVLPQQNKQALPQQNSESS

XP_022960432.1 uncharacterized protein LOC111461168 [Cucurbita moschata] | e value: 1.4e-76 | % identity: 38.7
Query:  RLLQQNPPLEQNEQQNNQAK---NHILVANDRARAIRAYVFPMFGELNRGIARPQIEATNFEMKPTMFQMLQTVGQFHGLSSKDPHLHLKSFLGVNDRGV
        ++ Q N   E      NQ +   N I +A+DR RAIRAY  P   ELN  I RP+++AT FE+KP MFQMLQT+GQFHGL S+DPHLHLKSFLG      
Subjt:  RLLQQNPPLEQNEQQNNQAK---NHILVANDRARAIRAYVFPMFGELNRGIARPQIEATNFEMKPTMFQMLQTVGQFHGLSSKDPHLHLKSFLGVNDRGV

Query:  KKSGRVGCTDDPTRPDTSRLNERRQLTTLATANKKPYDGVDVDKEQASDDNGEEQKASDDKMFIVREEEEAKGMRCRRRSFYRKPRTEPNQIGSVQFGFD
                                                                                                            
Subjt:  KKSGRVGCTDDPTRPDTSRLNERRQLTTLATANKKPYDGVDVDKEQASDDNGEEQKASDDKMFIVREEEEAKGMRCRRRSFYRKPRTEPNQIGSVQFGFD

Query:  LMKNLNQPEPNRFGSVRFSDSVRVLTPLVNDSFVIQGVPRDALRLTLFPYSLRDGAKAW----------------------------NAKLRSEIVGFRQ
                                    V+DSF  QGV +D +RL+LFPYSLRDGAK+W                            NA+ R+EIV F+Q
Subjt:  LMKNLNQPEPNRFGSVRFSDSVRVLTPLVNDSFVIQGVPRDALRLTLFPYSLRDGAKAW----------------------------NAKLRSEIVGFRQ

Query:  LEDETFCEAWERFKELLRKCPHHGLPHCIQMEIFYNGLNGATQGMIDASAGGALLTKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIR
         EDET  EAWERFKE+LRKCPHHGLPHCIQME FYNGLN AT+ ++DASA GA+L+KT+NEA+EILERI++N+CQW+DVR    KK + VLEVD +S+I 
Subjt:  LEDETFCEAWERFKELLRKCPHHGLPHCIQMEIFYNGLNGATQGMIDASAGGALLTKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIR

Query:  ADIAMLANALKNVTMFSHQQPPAVVF-AAMVKQVAEEACVYCGEEHNYEFCPNNPASVFFVLPQQNKQALPQQNSESS
        A +A + N L+N+         A    AA++ Q A E+CVYCGEEH ++ CP NPAS+ +V  Q ++    Q+N+ SS
Subjt:  ADIAMLANALKNVTMFSHQQPPAVVF-AAMVKQVAEEACVYCGEEHNYEFCPNNPASVFFVLPQQNKQALPQQNSESS

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa] | e value: 3.9e-74 | % identity: 33.9
Query:  NNQAKNHILVANDRARAIRAYVFPMFGELNRGIARPQIEATNFEMKPTMFQMLQTVGQFHGLSSKDPHLHLKSFLGVNDRGVKKSGRVGCTDDPTRPDTS
        +N+A N I +A+DR RAIR Y  PMF ELN GI RP+I+A +FE+KP MFQMLQTVGQF G  ++DPHLH++SFL                         
Subjt:  NNQAKNHILVANDRARAIRAYVFPMFGELNRGIARPQIEATNFEMKPTMFQMLQTVGQFHGLSSKDPHLHLKSFLGVNDRGVKKSGRVGCTDDPTRPDTS

Query:  RLNERRQLTTLATANKKPYDGVDVDKEQASDDNGEEQKASDDKMFIVREEEEAKGMRCRRRSFYRKPRTEPNQIGSVQFGFDLMKNLNQPEPNRFGSVRF
                                                                                                            
Subjt:  RLNERRQLTTLATANKKPYDGVDVDKEQASDDNGEEQKASDDKMFIVREEEEAKGMRCRRRSFYRKPRTEPNQIGSVQFGFDLMKNLNQPEPNRFGSVRF

Query:  SDSVRVLTPLVNDSFVIQGVPRDALRLTLFPYSLRDGAKAW----------------------------NAKLRSEIVGFRQLEDETFCEAWERFKELLR
                  V+DSF +QGV  +ALRL LFP+SLRD A+AW                            NAK RSEI+ F+Q EDET  +AWERFKELLR
Subjt:  SDSVRVLTPLVNDSFVIQGVPRDALRLTLFPYSLRDGAKAW----------------------------NAKLRSEIVGFRQLEDETFCEAWERFKELLR

Query:  KCPHHGLPHCIQMEIFYNGLNGATQGMIDASAGGALLTKTFNEAHEILERISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADIAMLANALKNVTMFSH
        KCPHHG+PHCIQ+E FYNGLN A++ ++DASA GA+L+K++NEA EILERI++N+ QWS  R  T++KV  VLEVD ++ + A +A + N LKN+ M   
Subjt:  KCPHHGLPHCIQMEIFYNGLNGATQGMIDASAGGALLTKTFNEAHEILERISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADIAMLANALKNVTMFSH

Query:  QQPPAVVFAAMVKQVAEEACVYCGEEHNYEFCPNNPASVFFVLPQ---------------------------QNKQAL---------------PQQNSES
         QP A +      Q A+ +CVYCG+ H +E CP+N ASV +V  Q                           Q KQ+                PQ +  S
Subjt:  QQPPAVVFAAMVKQVAEEACVYCGEEHNYEFCPNNPASVFFVLPQ---------------------------QNKQAL---------------PQQNSES

Query:  SLEAMMKEYMARTDAAIQ------------------------------NTEHPRREGKEQVKAVTLRCGKPLEERKEPIKTKD---IEKNCD-KNVVAEK
        SLE++M++YMA+ D  IQ                              +TE+PRR+GKE  KAVTLR GK +E      ++K+   I+K  + K   A  
Subjt:  SLEAMMKEYMARTDAAIQ------------------------------NTEHPRREGKEQVKAVTLRCGKPLEERKEPIKTKD---IEKNCD-KNVVAEK

Query:  ELESGQGARGSNNDAGAFGSFPDVEPPYVPPPPYVPPLPFPQR
         +E       S+  + A       E     PPP     PFPQR
Subjt:  ELESGQGARGSNNDAGAFGSFPDVEPPYVPPPPYVPPLPFPQR

TrEMBL top hits (e value, % identity, alignment)
A0A6J1EEI2 uncharacterized protein LOC111433394 | e value: 1.2e-76 | % identity: 38.64
Query:  QNPPLEQNEQQNNQAKNHILVANDRARAIRAYVFPMFGELNRGIARPQIEATNFEMKPTMFQMLQTVGQFHGLSSKDPHLHLKSFLGVNDRGVKKSGRVG
        +NP +  N  Q     N I +A+DR RAIRAY  P   ELN  I RP+++AT FE+KP MFQMLQT+GQFHGL S+DPHLHLKSFLG             
Subjt:  QNPPLEQNEQQNNQAKNHILVANDRARAIRAYVFPMFGELNRGIARPQIEATNFEMKPTMFQMLQTVGQFHGLSSKDPHLHLKSFLGVNDRGVKKSGRVG

Query:  CTDDPTRPDTSRLNERRQLTTLATANKKPYDGVDVDKEQASDDNGEEQKASDDKMFIVREEEEAKGMRCRRRSFYRKPRTEPNQIGSVQFGFDLMKNLNQ
                                                                                                            
Subjt:  CTDDPTRPDTSRLNERRQLTTLATANKKPYDGVDVDKEQASDDNGEEQKASDDKMFIVREEEEAKGMRCRRRSFYRKPRTEPNQIGSVQFGFDLMKNLNQ

Query:  PEPNRFGSVRFSDSVRVLTPLVNDSFVIQGVPRDALRLTLFPYSLRDGAKAW----------------------------NAKLRSEIVGFRQLEDETFC
                             V+DSF  Q V +D +RL+LFPYSLRDGAK+W                            NA+ R+EIV F+Q ED+T  
Subjt:  PEPNRFGSVRFSDSVRVLTPLVNDSFVIQGVPRDALRLTLFPYSLRDGAKAW----------------------------NAKLRSEIVGFRQLEDETFC

Query:  EAWERFKELLRKCPHHGLPHCIQMEIFYNGLNGATQGMIDASAGGALLTKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLA
        EAWERFKE+LRKCPHHGLPHCIQME FYNGLN AT+ ++DASA GA+L+KT+NEA+EILERI++N+CQW+DVR    +K + VLEVD +S+I A +A + 
Subjt:  EAWERFKELLRKCPHHGLPHCIQMEIFYNGLNGATQGMIDASAGGALLTKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLA

Query:  NALKNVTMFSHQQPPAVVF-AAMVKQVAEEACVYCGEEHNYEFCPNNPASVFFVLPQQNKQALPQQNSESS
        N L+N+ +       A V   A++ Q A E+CVYCGEEH ++ CP+NPAS+F+V   Q  Q  P+ N  S+
Subjt:  NALKNVTMFSHQQPPAVVF-AAMVKQVAEEACVYCGEEHNYEFCPNNPASVFFVLPQQNKQALPQQNSESS

A0A6J1EQ90 uncharacterized protein LOC111436411 | e value: 3.0e-80 | % identity: 34.63
Query:  MSDSPGVRFELDPKIERTFKRRREQRRQQNPMADVLCLPQGPGDPLDPQNRLLQQNPPLEQNEQQNNQAKNHILVANDRARAIRAYVFPMFGELNRGIAR
        M+   G+ F LDP+IERTF+RR +++++     ++  +  G       Q     +NP +  N  Q     N I +A+DR RAIRAY  P   ELN  I R
Subjt:  MSDSPGVRFELDPKIERTFKRRREQRRQQNPMADVLCLPQGPGDPLDPQNRLLQQNPPLEQNEQQNNQAKNHILVANDRARAIRAYVFPMFGELNRGIAR

Query:  PQIEATNFEMKPTMFQMLQTVGQFHGLSSKDPHLHLKSFLGVNDRGVKKSGRVGCTDDPTRPDTSRLNERRQLTTLATANKKPYDGVDVDKEQASDDNGE
        P+I+ T FE+KP MFQMLQT+GQFHGL  +DPHLHLKSFLGV+D                                                        
Subjt:  PQIEATNFEMKPTMFQMLQTVGQFHGLSSKDPHLHLKSFLGVNDRGVKKSGRVGCTDDPTRPDTSRLNERRQLTTLATANKKPYDGVDVDKEQASDDNGE

Query:  EQKASDDKMFIVREEEEAKGMRCRRRSFYRKPRTEPNQIGSVQFGFDLMKNLNQPEPNRFGSVRFSDSVRVLTPLVNDSFVIQGVPRDALRLTLFPYSLR
                                                                     S RF           +DSF  QGV +D +RL+LFPY LR
Subjt:  EQKASDDKMFIVREEEEAKGMRCRRRSFYRKPRTEPNQIGSVQFGFDLMKNLNQPEPNRFGSVRFSDSVRVLTPLVNDSFVIQGVPRDALRLTLFPYSLR

Query:  DGAKAW----------------------------NAKLRSEIVGFRQLEDETFCEAWERFKELLRKCPHHGLPHCIQMEIFYNGLNGATQGMIDASAGGA
        DGAK+W                            NA+ ++EIV F+Q EDET  EA ERFKE+LRKCPHHGLPHCIQME FYNGLN  T+ ++DASA GA
Subjt:  DGAKAW----------------------------NAKLRSEIVGFRQLEDETFCEAWERFKELLRKCPHHGLPHCIQMEIFYNGLNGATQGMIDASAGGA

Query:  LLTKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANALKNVTMFSHQQPPAVVF-AAMVKQVAEEACVYCGEEHNYEFCPN
        +L+KT+NEA+EILERI++N+CQW+DVR    +K + VLEVD +S+I A +A + N L+N+ +       A V  AA + Q A E+CVYCGEEH ++ CP+
Subjt:  LLTKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANALKNVTMFSHQQPPAVVF-AAMVKQVAEEACVYCGEEHNYEFCPN

Query:  NPASVFFVLPQ--------------------------------QNKQALPQQN-------------------------------SESSLEAMMKEYMART
        NPAS+F+V  Q                                 N+Q  P+ N                               SE+S+E+++KEYMA+ 
Subjt:  NPASVFFVLPQ--------------------------------QNKQALPQQN-------------------------------SESSLEAMMKEYMART

Query:  DAAIQNTEHPRREGKEQV
        DA IQ+ +   R  + Q+
Subjt:  DAAIQNTEHPRREGKEQV

A0A6J1G7Q6 uncharacterized protein LOC111451598 | e value: 5.5e-74 | % identity: 38.39
Query:  QNNQAKNHILVANDRARAIRAYVFPMFGELNRGIARPQIEATNFEMKPTMFQMLQTVGQFHGLSSKDPHLHLKSFLGVNDRGVKKSGRVGCTDDPTRPDT
        Q     N I VA+DR RAIRAY  P   ELN  I RP+++AT FE+KP MFQMLQT+GQFHGLSSKDPHLHLKSFLG                       
Subjt:  QNNQAKNHILVANDRARAIRAYVFPMFGELNRGIARPQIEATNFEMKPTMFQMLQTVGQFHGLSSKDPHLHLKSFLGVNDRGVKKSGRVGCTDDPTRPDT

Query:  SRLNERRQLTTLATANKKPYDGVDVDKEQASDDNGEEQKASDDKMFIVREEEEAKGMRCRRRSFYRKPRTEPNQIGSVQFGFDLMKNLNQPEPNRFGSVR
                                                                                                            
Subjt:  SRLNERRQLTTLATANKKPYDGVDVDKEQASDDNGEEQKASDDKMFIVREEEEAKGMRCRRRSFYRKPRTEPNQIGSVQFGFDLMKNLNQPEPNRFGSVR

Query:  FSDSVRVLTPLVNDSFVIQGVPRDALRLTLFPYSLRDGAKAW----------------------------NAKLRSEIVGFRQLEDETFCEAWERFKELL
                   V+DSF  QGV +D +RL+ F YSLRDGAK+W                            +A+ R+EIV F++ E+ET  EAWERFKE L
Subjt:  FSDSVRVLTPLVNDSFVIQGVPRDALRLTLFPYSLRDGAKAW----------------------------NAKLRSEIVGFRQLEDETFCEAWERFKELL

Query:  RKCPHHGLPHCIQMEIFYNGLNGATQGMIDASAGGALLTKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANALKNVTMFS
        RKCPHHGLPHCIQ+E FYNGLN AT+ ++DASA G +L+KT+NEA+EILERI++N+CQW DVR    KK + VLEVD +S+I A +A + N L+N+    
Subjt:  RKCPHHGLPHCIQMEIFYNGLNGATQGMIDASAGGALLTKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANALKNVTMFS

Query:  HQQPPAVVFAAMVK-QVAEEACVYCGEEHNYEFCPNNPASVFFVLPQQNKQALPQQNSESS
             A    A V  Q A E+CVYCGE+H ++ CP+NPAS+F+V   Q  Q  P+ N  S+
Subjt:  HQQPPAVVFAAMVK-QVAEEACVYCGEEHNYEFCPNNPASVFFVLPQQNKQALPQQNSESS

A0A6J1H7E4 uncharacterized protein LOC111461168 | e value: 6.9e-77 | % identity: 38.7
Query:  RLLQQNPPLEQNEQQNNQAK---NHILVANDRARAIRAYVFPMFGELNRGIARPQIEATNFEMKPTMFQMLQTVGQFHGLSSKDPHLHLKSFLGVNDRGV
        ++ Q N   E      NQ +   N I +A+DR RAIRAY  P   ELN  I RP+++AT FE+KP MFQMLQT+GQFHGL S+DPHLHLKSFLG      
Subjt:  RLLQQNPPLEQNEQQNNQAK---NHILVANDRARAIRAYVFPMFGELNRGIARPQIEATNFEMKPTMFQMLQTVGQFHGLSSKDPHLHLKSFLGVNDRGV

Query:  KKSGRVGCTDDPTRPDTSRLNERRQLTTLATANKKPYDGVDVDKEQASDDNGEEQKASDDKMFIVREEEEAKGMRCRRRSFYRKPRTEPNQIGSVQFGFD
                                                                                                            
Subjt:  KKSGRVGCTDDPTRPDTSRLNERRQLTTLATANKKPYDGVDVDKEQASDDNGEEQKASDDKMFIVREEEEAKGMRCRRRSFYRKPRTEPNQIGSVQFGFD

Query:  LMKNLNQPEPNRFGSVRFSDSVRVLTPLVNDSFVIQGVPRDALRLTLFPYSLRDGAKAW----------------------------NAKLRSEIVGFRQ
                                    V+DSF  QGV +D +RL+LFPYSLRDGAK+W                            NA+ R+EIV F+Q
Subjt:  LMKNLNQPEPNRFGSVRFSDSVRVLTPLVNDSFVIQGVPRDALRLTLFPYSLRDGAKAW----------------------------NAKLRSEIVGFRQ

Query:  LEDETFCEAWERFKELLRKCPHHGLPHCIQMEIFYNGLNGATQGMIDASAGGALLTKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIR
         EDET  EAWERFKE+LRKCPHHGLPHCIQME FYNGLN AT+ ++DASA GA+L+KT+NEA+EILERI++N+CQW+DVR    KK + VLEVD +S+I 
Subjt:  LEDETFCEAWERFKELLRKCPHHGLPHCIQMEIFYNGLNGATQGMIDASAGGALLTKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIR

Query:  ADIAMLANALKNVTMFSHQQPPAVVF-AAMVKQVAEEACVYCGEEHNYEFCPNNPASVFFVLPQQNKQALPQQNSESS
        A +A + N L+N+         A    AA++ Q A E+CVYCGEEH ++ CP NPAS+ +V  Q ++    Q+N+ SS
Subjt:  ADIAMLANALKNVTMFSHQQPPAVVF-AAMVKQVAEEACVYCGEEHNYEFCPNNPASVFFVLPQQNKQALPQQNSESS

U5CUI2 Retrotrans_gag domain-containing protein | e value: 3.4e-68 | % identity: 37.1
Query:  QAKNHILVANDRARAIRAYVFPMFGELNRGIARPQIEATNFEMKPTMFQMLQTVGQFHGLSSKDPHLHLKSFLGVNDRGVKKSGRVGCTDDPTRPDTSRL
        Q  N I++A+DRARAIR Y  PMF ELN GI RP+I+A  FE+KP MFQMLQTVGQF G+ ++DPHLHL+SFL                           
Subjt:  QAKNHILVANDRARAIRAYVFPMFGELNRGIARPQIEATNFEMKPTMFQMLQTVGQFHGLSSKDPHLHLKSFLGVNDRGVKKSGRVGCTDDPTRPDTSRL

Query:  NERRQLTTLATANKKPYDGVDVDKEQASDDNGEEQKASDDKMFIVREEEEAKGMRCRRRSFYRKPRTEPNQIGSVQFGFDLMKNLNQPEPNRFGSVRFSD
                                                                                                            
Subjt:  NERRQLTTLATANKKPYDGVDVDKEQASDDNGEEQKASDDKMFIVREEEEAKGMRCRRRSFYRKPRTEPNQIGSVQFGFDLMKNLNQPEPNRFGSVRFSD

Query:  SVRVLTPLVNDSFVIQGVPRDALRLTLFPYSLRDGAKAW----------------------------NAKLRSEIVGFRQLEDETFCEAWERFKELLRKC
                V+DSF IQGV  + LRL LFP+SLRD A++W                            NAK RSEI+ F+QLEDE+  +AWERFKELLRKC
Subjt:  SVRVLTPLVNDSFVIQGVPRDALRLTLFPYSLRDGAKAW----------------------------NAKLRSEIVGFRQLEDETFCEAWERFKELLRKC

Query:  PHHGLPHCIQMEIFYNGLNGATQGMIDASAGGALLTKTFNEAHEILERISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADIAMLANALKNVTMFSHQ-
        PHHG+PHCIQME FYNGLN A++ ++DASA GA+L+K++NEA EILE I++N+ QWS+ R  T++KV  VLEVD ++ + A +A + N LKN+++ + + 
Subjt:  PHHGLPHCIQMEIFYNGLNGATQGMIDASAGGALLTKTFNEAHEILERISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADIAMLANALKNVTMFSHQ-

Query:  -QPPAVVFAAMVKQVAEEACVYCGEEHNYEFCPNNPASVFFV
         QP A +      Q  + +CV+CGE H +E CP+NP SV ++
Subjt:  -QPPAVVFAAMVKQVAEEACVYCGEEHNYEFCPNNPASVFFV

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGAGCGATTCGCCTGGAGTAAGATTCGAGCTTGATCCAAAAATCGAGAGGACATTCAAGAGAAGGAGAGAGCAACGTAGACAACAAAATCCAATGGCTGACGTGCTGTG
TCTCCCACAGGGTCCAGGAGATCCACTTGATCCCCAGAATCGTTTGTTGCAGCAAAATCCACCGCTGGAGCAAAATGAACAGCAAAATAATCAGGCTAAGAATCATATCT
TGGTAGCCAATGATAGAGCCAGAGCCATTCGGGCGTATGTTTTTCCAATGTTTGGTGAGTTAAATCGAGGGATTGCACGTCCTCAAATTGAGGCAACAAATTTTGAAATG
AAACCGACAATGTTTCAGATGTTACAAACCGTGGGGCAGTTCCATGGTTTGTCATCTAAAGACCCACATTTGCATCTTAAGTCTTTTCTAGGAGTTAATGATAGGGGTGT
AAAAAAATCGGGTCGGGTCGGTTGTACGGACGACCCGACCCGACCCGACACAAGTCGGTTGAACGAAAGAAGGCAACTGACGACATTGGCGACGGCGAATAAGAAGCCAT
ACGACGGCGTCGACGTCGACAAAGAACAAGCAAGCGACGACAATGGCGAAGAACAAAAAGCAAGCGATGACAAGATGTTTATCGTGAGAGAAGAAGAAGAAGCGAAGGGG
ATGAGATGCAGGCGGCGCAGTTTTTATCGGAAACCGAGAACCGAACCGAACCAGATCGGTTCGGTTCAGTTCGGTTTTGATTTAATGAAAAATCTGAACCAACCCGAACC
GAACCGGTTCGGTTCGGTTCGGTTTTCCGATTCGGTTCGGGTTCTTACACCCCTAGTTAATGATTCTTTTGTAATTCAAGGAGTGCCTAGGGATGCCCTTAGATTAACTT
TGTTCCCGTATTCTCTTAGAGATGGAGCAAAGGCATGGAATGCTAAATTAAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAAGATGAAACTTTTTGTGAGGCTTGGGAG
AGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCACATTGTATCCAAATGGAAATATTTTACAATGGGTTAAATGGAGCAACCCAAGGTATGATTGATGC
TTCGGCTGGAGGGGCCCTTTTGACAAAAACTTTTAATGAAGCCCATGAAATTTTAGAAAGAATATCAACCAATAGTTGTCAGTGGTCAGATGTTAGAGGCACAAATAAAA
AAGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACAATTAGGGCTGATATTGCAATGTTAGCTAACGCTCTTAAAAATGTGACAATGTTTAGTCATCAGCAGCCGCCA
GCTGTGGTGTTTGCTGCAATGGTGAAACAAGTTGCAGAGGAAGCATGTGTCTATTGTGGTGAAGAGCACAACTACGAGTTTTGCCCCAACAATCCAGCTTCTGTGTTTTT
TGTATTGCCCCAGCAAAATAAGCAGGCTTTGCCCCAGCAAAATTCAGAGAGTTCTCTTGAGGCAATGATGAAAGAATATATGGCTCGTACAGATGCCGCAATTCAAAATA
CTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTGACTCTTAGGTGTGGTAAGCCACTAGAGGAAAGAAAAGAACCTATTAAAACCAAGGATATAGAAAAG
AATTGTGATAAAAATGTTGTTGCTGAAAAAGAGTTGGAGTCTGGTCAAGGTGCTAGAGGCAGCAATAATGATGCTGGAGCATTTGGATCTTTTCCAGATGTGGAACCACC
TTATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGGTTGCGAACCACGCGATTTGCAGCAGCGAGAGAGTGAAGGAAAACACGAAAAATCAGGTAG
AAGAAATTCGTAGTTGCAGGTCACATTTTGGCTTGGAACTTGGTTTCAGAGGCTTGTTTTTTTTCCGTGTGAGTGTCGGTGTTGCGTTGGAGAAGGAAGTTTTCGTGGGT
CTTGATTCAAAGGCTTTTTAG
mRNA sequence
ATGAGCGATTCGCCTGGAGTAAGATTCGAGCTTGATCCAAAAATCGAGAGGACATTCAAGAGAAGGAGAGAGCAACGTAGACAACAAAATCCAATGGCTGACGTGCTGTG
TCTCCCACAGGGTCCAGGAGATCCACTTGATCCCCAGAATCGTTTGTTGCAGCAAAATCCACCGCTGGAGCAAAATGAACAGCAAAATAATCAGGCTAAGAATCATATCT
TGGTAGCCAATGATAGAGCCAGAGCCATTCGGGCGTATGTTTTTCCAATGTTTGGTGAGTTAAATCGAGGGATTGCACGTCCTCAAATTGAGGCAACAAATTTTGAAATG
AAACCGACAATGTTTCAGATGTTACAAACCGTGGGGCAGTTCCATGGTTTGTCATCTAAAGACCCACATTTGCATCTTAAGTCTTTTCTAGGAGTTAATGATAGGGGTGT
AAAAAAATCGGGTCGGGTCGGTTGTACGGACGACCCGACCCGACCCGACACAAGTCGGTTGAACGAAAGAAGGCAACTGACGACATTGGCGACGGCGAATAAGAAGCCAT
ACGACGGCGTCGACGTCGACAAAGAACAAGCAAGCGACGACAATGGCGAAGAACAAAAAGCAAGCGATGACAAGATGTTTATCGTGAGAGAAGAAGAAGAAGCGAAGGGG
ATGAGATGCAGGCGGCGCAGTTTTTATCGGAAACCGAGAACCGAACCGAACCAGATCGGTTCGGTTCAGTTCGGTTTTGATTTAATGAAAAATCTGAACCAACCCGAACC
GAACCGGTTCGGTTCGGTTCGGTTTTCCGATTCGGTTCGGGTTCTTACACCCCTAGTTAATGATTCTTTTGTAATTCAAGGAGTGCCTAGGGATGCCCTTAGATTAACTT
TGTTCCCGTATTCTCTTAGAGATGGAGCAAAGGCATGGAATGCTAAATTAAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAAGATGAAACTTTTTGTGAGGCTTGGGAG
AGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCACATTGTATCCAAATGGAAATATTTTACAATGGGTTAAATGGAGCAACCCAAGGTATGATTGATGC
TTCGGCTGGAGGGGCCCTTTTGACAAAAACTTTTAATGAAGCCCATGAAATTTTAGAAAGAATATCAACCAATAGTTGTCAGTGGTCAGATGTTAGAGGCACAAATAAAA
AAGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACAATTAGGGCTGATATTGCAATGTTAGCTAACGCTCTTAAAAATGTGACAATGTTTAGTCATCAGCAGCCGCCA
GCTGTGGTGTTTGCTGCAATGGTGAAACAAGTTGCAGAGGAAGCATGTGTCTATTGTGGTGAAGAGCACAACTACGAGTTTTGCCCCAACAATCCAGCTTCTGTGTTTTT
TGTATTGCCCCAGCAAAATAAGCAGGCTTTGCCCCAGCAAAATTCAGAGAGTTCTCTTGAGGCAATGATGAAAGAATATATGGCTCGTACAGATGCCGCAATTCAAAATA
CTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTGACTCTTAGGTGTGGTAAGCCACTAGAGGAAAGAAAAGAACCTATTAAAACCAAGGATATAGAAAAG
AATTGTGATAAAAATGTTGTTGCTGAAAAAGAGTTGGAGTCTGGTCAAGGTGCTAGAGGCAGCAATAATGATGCTGGAGCATTTGGATCTTTTCCAGATGTGGAACCACC
TTATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGGTTGCGAACCACGCGATTTGCAGCAGCGAGAGAGTGAAGGAAAACACGAAAAATCAGGTAG
AAGAAATTCGTAGTTGCAGGTCACATTTTGGCTTGGAACTTGGTTTCAGAGGCTTGTTTTTTTTCCGTGTGAGTGTCGGTGTTGCGTTGGAGAAGGAAGTTTTCGTGGGT
CTTGATTCAAAGGCTTTTTAG
Protein sequence
MSDSPGVRFELDPKIERTFKRRREQRRQQNPMADVLCLPQGPGDPLDPQNRLLQQNPPLEQNEQQNNQAKNHILVANDRARAIRAYVFPMFGELNRGIARPQIEATNFEM
KPTMFQMLQTVGQFHGLSSKDPHLHLKSFLGVNDRGVKKSGRVGCTDDPTRPDTSRLNERRQLTTLATANKKPYDGVDVDKEQASDDNGEEQKASDDKMFIVREEEEAKG
MRCRRRSFYRKPRTEPNQIGSVQFGFDLMKNLNQPEPNRFGSVRFSDSVRVLTPLVNDSFVIQGVPRDALRLTLFPYSLRDGAKAWNAKLRSEIVGFRQLEDETFCEAWE
RFKELLRKCPHHGLPHCIQMEIFYNGLNGATQGMIDASAGGALLTKTFNEAHEILERISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRADIAMLANALKNVTMFSHQQPP
AVVFAAMVKQVAEEACVYCGEEHNYEFCPNNPASVFFVLPQQNKQALPQQNSESSLEAMMKEYMARTDAAIQNTEHPRREGKEQVKAVTLRCGKPLEERKEPIKTKDIEK
NCDKNVVAEKELESGQGARGSNNDAGAFGSFPDVEPPYVPPPPYVPPLPFPQRVANHAICSSERVKENTKNQVEEIRSCRSHFGLELGFRGLFFFRVSVGVALEKEVFVG
LDSKAF
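
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code applies; the function name `translate` and the choice to report unknown codons as `X` are illustrative assumptions, not part of the database record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Drop line breaks and other whitespace so multi-line input works.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X = codon not in table
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS sequence listed above.
print(translate("ATGAGCGATTCGCCTGGA"))
```

Passing the full CDS text (line breaks included) to `translate` and comparing the result with the protein sequence, once its line breaks are removed, is one way to confirm that the two records agree.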