; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008600 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008600
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr9:26215668..26227510
RNA-Seq ExpressionLag0008600
SyntenyLag0008600
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022926214.1 uncharacterized protein LOC111433394 [Cucurbita moschata]6.6e-9247.22Show/hide
Query:  QNPSLEQNGQQNNQAKYPILIANDRDRVIRSYVFPIIDELNPVIMRPIIDASNFEIKPVMFHMLEMVGQFHGLPSEDPYLHLKSFLGVSDSFAFEGVSRD
        +NP++  N  Q       I +A+DR+R IR+Y  P ++ELNP I+RP + A+ FE+KPVMF ML+ +GQFHGLPSEDP+LHLKSFLGVSDSF F+ V +D
Subjt:  QNPSLEQNGQQNNQAKYPILIANDRDRVIRSYVFPIIDELNPVIMRPIIDASNFEIKPVMFHMLEMVGQFHGLPSEDPYLHLKSFLGVSDSFAFEGVSRD

Query:  ALRLTLFPYSLRDGAKAWLNSFAPASISTWNELVEKFLSKYFPPIRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIA
         +RL+LFPYSLRDGAK+WLN+ A  +I +WN LVEKFL KYFPP R+A+ R+EIV F+Q E++T SEAWERFKE+LRKCPHH LP+CIQME FYNGLNIA
Subjt:  ALRLTLFPYSLRDGAKAWLNSFAPASISTWNELVEKFLSKYFPPIRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIA

Query:  TQCMVDASTGGALLAKSFNEAYEILERISTYSFQWSDVRGS-NKKVKSVLEVD-----------------------------------------------
        T+ +VDAS  GA+L+K++NEAYEILERI++ + QW+DVR +  +K + VLEVD                                               
Subjt:  TQCMVDASTGGALLAKSFNEAYEILERISTYSFQWSDVRGS-NKKVKSVLEVD-----------------------------------------------

Query:  --------------------------GNQRNNLYSNFYNQGWRNHPNFAWAGQGSNSQVPQAQQKAVNQSGFAKSQELLQQNKQ------ALPQQNSEL-
                                  GN +NN +SN YN GWRNHPNF+W GQGS +Q  Q   KA    GF    +L   ++Q       +PQ    L 
Subjt:  --------------------------GNQRNNLYSNFYNQGWRNHPNFAWAGQGSNSQVPQAQQKAVNQSGFAKSQELLQQNKQ------ALPQQNSEL-

Query:  -QVGQLAKELKAR
          +  L KE  A+
Subjt:  -QVGQLAKELKAR

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata]8.1e-9039.47Show/hide
Query:  PEAELTAEWRSKKTICDSTQGDEVLPCVTLQNPSLEQNGQQNNQAKY---PILIANDRDRVIRSYVFPIIDELNPVIMRPIIDASNFEIKPVMFHMLEMV
        PE E T   R KK    + Q  + +      N   E      NQ +    PI +A+DR+R IR+Y  P ++ELNP I+RP I  + FE+KPVMF ML+ +
Subjt:  PEAELTAEWRSKKTICDSTQGDEVLPCVTLQNPSLEQNGQQNNQAKY---PILIANDRDRVIRSYVFPIIDELNPVIMRPIIDASNFEIKPVMFHMLEMV

Query:  GQFHGLPSEDPYLHLKSFLGV-------SDSFAFEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNELVEKFLSKYFPPIRSAKLRSEIVGFRQN
        GQFHGLP EDP+LHLKSFLGV       SDSF F+GV +D +RL+LFPY LRDGAK+WLN+ AP +I +WN L E FL KYFPP R+A+ ++EIV F+Q 
Subjt:  GQFHGLPSEDPYLHLKSFLGV-------SDSFAFEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNELVEKFLSKYFPPIRSAKLRSEIVGFRQN

Query:  EEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIATQCMVDASTGGALLAKSFNEAYEILERISTYSFQWSDVRGS-NKKVKSVLEVD-------
        E+ET SEA ERFKE+LRKCPHH LP+CIQME FYNGLNI T+ +VDAS  GA+L+K++NEAYEILERI++ + QW+DVR +  +K + VLEVD       
Subjt:  EEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIATQCMVDASTGGALLAKSFNEAYEILERISTYSFQWSDVRGS-NKKVKSVLEVD-------

Query:  ------------------------------------------------------------------GNQRNNLYSNFYNQGWRNHPNFAWAGQGSNSQVP
                                                                          GN +NN +SN YN GWRNHPNF+W GQ   +Q  
Subjt:  ------------------------------------------------------------------GNQRNNLYSNFYNQGWRNHPNFAWAGQGSNSQVP

Query:  QAQQKAVNQSGFAKSQELLQQNKQ--------ALPQQNSELQVGQLAKELKARLHGNIPLDIEHPIREGKKQVQTMTLGSDKPLEDRKEPS-KPQEVEKS
        Q   KA   SGF    +L   ++Q           Q  SE  +  L KE  A+       D     ++   +   + +G +K  E     S +  + ++ 
Subjt:  QAQQKAVNQSGFAKSQELLQQNKQ--------ALPQQNSELQVGQLAKELKARLHGNIPLDIEHPIREGKKQVQTMTLGSDKPLEDRKEPS-KPQEVEKS

Query:  SDSNVVEKELGSGQYDGGSSKDAGAISSVPDVEPQPYVPPPPYDPPFPFPQRQKSKNQEVKFNMF
        ++   V+KE  S  Y     +     ++  + E + Y P P      PFPQR K K +E  F  F
Subjt:  SDSNVVEKELGSGQYDGGSSKDAGAISSVPDVEPQPYVPPPPYDPPFPFPQRQKSKNQEVKFNMF

XP_022960432.1 uncharacterized protein LOC111461168 [Cucurbita moschata]1.3e-9550.54Show/hide
Query:  SLEQNGQQNNQAKYPILIAN-------------DRDRVIRSYVFPIIDELNPVIMRPIIDASNFEIKPVMFHMLEMVGQFHGLPSEDPYLHLKSFLGVSD
        +++Q  Q N + + P+++AN             DR+R IR+Y  P +DELNP I+RP + A+ FE+KPVMF ML+ +GQFHGLPSEDP+LHLKSFLGVSD
Subjt:  SLEQNGQQNNQAKYPILIAN-------------DRDRVIRSYVFPIIDELNPVIMRPIIDASNFEIKPVMFHMLEMVGQFHGLPSEDPYLHLKSFLGVSD

Query:  SFAFEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNELVEKFLSKYFPPIRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQM
        SF F+GV +D +RL+LFPYSLRDGAK+WLN+ AP +I +WN L EKFL KYFPP R+A+ R+EIV F+Q E+ET SEAWERFKE+LRKCPHH LP+CIQM
Subjt:  SFAFEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNELVEKFLSKYFPPIRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQM

Query:  EIFYNGLNIATQCMVDASTGGALLAKSFNEAYEILERISTYSFQWSDVRGS-NKKVKSVLEVD-------------------------------------
        E FYNGLNIAT+ +VDAS  GA+L+K++NEAYEILERI++ + QW+DVR +  KK + VLEVD                                     
Subjt:  EIFYNGLNIATQCMVDASTGGALLAKSFNEAYEILERISTYSFQWSDVRGS-NKKVKSVLEVD-------------------------------------

Query:  ------------------------------------GNQRNNLYSNFYNQGWRNHPNFAWAGQGSNSQ
                                            GNQ+NN  SN YN GWRNHPNF+W GQGS +Q
Subjt:  ------------------------------------GNQRNNLYSNFYNQGWRNHPNFAWAGQGSNSQ

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]1.8e-9742.12Show/hide
Query:  GQQNNQAKYPILIANDRDRVIRSYVFPIIDELNPVIMRPIIDASNFEIKPVMFHMLEMVGQFHGLPSEDPYLHLKSFLGVSDSFAFEGVSRDALRLTLFP
        G  +N+A  PI +A+DR R IR Y  P+ +ELNP I+RP I A +FE+KPVMF ML+ VGQF G P+EDP+LH++SFL VSDSF  +GVS +ALRL LFP
Subjt:  GQQNNQAKYPILIANDRDRVIRSYVFPIIDELNPVIMRPIIDASNFEIKPVMFHMLEMVGQFHGLPSEDPYLHLKSFLGVSDSFAFEGVSRDALRLTLFP

Query:  YSLRDGAKAWLNSFAPASISTWNELVEKFLSKYFPPIRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIATQCMVDAS
        +SLRD A+AWLN+  P S++ WN+L EKFL KYFPP R+AK RSEI+ F+Q+E+ET S+AWERFKELLRKCPHH +P+CIQ+E FYNGLN A++ ++DAS
Subjt:  YSLRDGAKAWLNSFAPASISTWNELVEKFLSKYFPPIRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIATQCMVDAS

Query:  TGGALLAKSFNEAYEILERISTYSFQWSDVRG-SNKKVKSVLEVDG------------------------------------------------------
          GA+L+KS+NEA+EILERI++ ++QWS  R  +++KV  VLEVD                                                       
Subjt:  TGGALLAKSFNEAYEILERISTYSFQWSDVRG-SNKKVKSVLEVDG------------------------------------------------------

Query:  ----------NQRNNLYSNFYNQGWRNHPNFAWAGQGSN------SQVPQAQQKAVNQSGFAKSQELLQQN----------KQALPQQNSELQVGQLAKE
                  N+ NN YSN YN  W++HPNF+W GQG        SQ P+ QQ    Q     S E L ++           QA   +N E+Q+GQLA +
Subjt:  ----------NQRNNLYSNFYNQGWRNHPNFAWAGQGSN------SQVPQAQQKAVNQSGFAKSQELLQQN----------KQALPQQNSELQVGQLAKE

Query:  LKARLHGNIPLDIEHPIREGKKQVQTMTLGSDKPLEDRKEPSKPQEVEKSSDSNVVEKELGSGQYDGGSSKDAGAISSVPDVEPQPYVPPPPYDPPFPFP
        LK R  G +P D E+P R+GK+  + +TL S K +E     ++ +E         ++K+  +   +      A +  S  +   Q         PP PFP
Subjt:  LKARLHGNIPLDIEHPIREGKKQVQTMTLGSDKPLEDRKEPSKPQEVEKSSDSNVVEKELGSGQYDGGSSKDAGAISSVPDVEPQPYVPPPPYDPPFPFP

Query:  QRQKSKNQEVKFNMF-DAIK
        QR K +  + +F  F D +K
Subjt:  QRQKSKNQEVKFNMF-DAIK

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]7.3e-9138.52Show/hide
Query:  PILIANDRDRVIRSYVFPIIDELNPVIMRPIIDASNFEIKPVMFHMLEMVGQFHGLPSEDPYLHLKSFLGVSDSFAFEGVSRDALRLTLFPYSLRDGAKA
        PI++ +DR R IR Y  P+ +ELNP I+RP I A  FE+KPVMF ML+ VGQF  +P+EDP+LHL+SFL +SDSF  +GVS +  RL LFP+SLRD A++
Subjt:  PILIANDRDRVIRSYVFPIIDELNPVIMRPIIDASNFEIKPVMFHMLEMVGQFHGLPSEDPYLHLKSFLGVSDSFAFEGVSRDALRLTLFPYSLRDGAKA

Query:  WLNSFAPASISTWNELVEKFLSKYFPPIRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIATQCMVDASTGGALLAKS
        WLN+ +P S++ WN+  EKFL KYFPP R+AK RSEI+ F Q E+E+ S+AWERFKELLRKCPHH +P+CIQME FYNGLN  +Q ++DAS  GA+L+KS
Subjt:  WLNSFAPASISTWNELVEKFLSKYFPPIRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIATQCMVDASTGGALLAKS

Query:  FNEAYEILERISTYSFQWSDVRG-SNKKVKSVLEVDG---------------------------------------------------------------
        +NEA+EILE I++ ++QWS+ R   ++KV  VLEVD                                                                
Subjt:  FNEAYEILERISTYSFQWSDVRG-SNKKVKSVLEVDG---------------------------------------------------------------

Query:  ---NQRNNLYSNFYNQGWRNHPNFAWAGQGSNSQVPQAQQ------------------------KAVNQSGFAKSQELLQQNKQALPQQNSELQVGQLAK
           N+ N  +SN YNQ W+NHPN +W  +       Q +Q                        +++ +   AK+  ++Q   QA   +N ELQ+G LA 
Subjt:  ---NQRNNLYSNFYNQGWRNHPNFAWAGQGSNSQVPQAQQ------------------------KAVNQSGFAKSQELLQQNKQALPQQNSELQVGQLAK

Query:  ELKARLHGNIPLDIEHPIREGKKQVQTMTLGSDKPLEDRKEPSKPQ------EVEKSSDSNVVEKELGSGQYDGGSSKDAGAISSVPDVEPQPYVPPPPY
        ELKAR  G++P D E+P R+GK+Q +++ L S K L++ +E  K        ++++       ++   + + D  + + + +  S P             
Subjt:  ELKARLHGNIPLDIEHPIREGKKQVQTMTLGSDKPLEDRKEPSKPQ------EVEKSSDSNVVEKELGSGQYDGGSSKDAGAISSVPDVEPQPYVPPPPY

Query:  DPPFPFPQRQKSKNQEVKFNMF-DAIK
         PP PFPQR + + Q+ +F  F D +K
Subjt:  DPPFPFPQRQKSKNQEVKFNMF-DAIK

TrEMBL top hitse value%identityAlignment
A0A6J1EEI2 uncharacterized protein LOC1114333943.2e-9247.22Show/hide
Query:  QNPSLEQNGQQNNQAKYPILIANDRDRVIRSYVFPIIDELNPVIMRPIIDASNFEIKPVMFHMLEMVGQFHGLPSEDPYLHLKSFLGVSDSFAFEGVSRD
        +NP++  N  Q       I +A+DR+R IR+Y  P ++ELNP I+RP + A+ FE+KPVMF ML+ +GQFHGLPSEDP+LHLKSFLGVSDSF F+ V +D
Subjt:  QNPSLEQNGQQNNQAKYPILIANDRDRVIRSYVFPIIDELNPVIMRPIIDASNFEIKPVMFHMLEMVGQFHGLPSEDPYLHLKSFLGVSDSFAFEGVSRD

Query:  ALRLTLFPYSLRDGAKAWLNSFAPASISTWNELVEKFLSKYFPPIRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIA
         +RL+LFPYSLRDGAK+WLN+ A  +I +WN LVEKFL KYFPP R+A+ R+EIV F+Q E++T SEAWERFKE+LRKCPHH LP+CIQME FYNGLNIA
Subjt:  ALRLTLFPYSLRDGAKAWLNSFAPASISTWNELVEKFLSKYFPPIRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIA

Query:  TQCMVDASTGGALLAKSFNEAYEILERISTYSFQWSDVRGS-NKKVKSVLEVD-----------------------------------------------
        T+ +VDAS  GA+L+K++NEAYEILERI++ + QW+DVR +  +K + VLEVD                                               
Subjt:  TQCMVDASTGGALLAKSFNEAYEILERISTYSFQWSDVRGS-NKKVKSVLEVD-----------------------------------------------

Query:  --------------------------GNQRNNLYSNFYNQGWRNHPNFAWAGQGSNSQVPQAQQKAVNQSGFAKSQELLQQNKQ------ALPQQNSEL-
                                  GN +NN +SN YN GWRNHPNF+W GQGS +Q  Q   KA    GF    +L   ++Q       +PQ    L 
Subjt:  --------------------------GNQRNNLYSNFYNQGWRNHPNFAWAGQGSNSQVPQAQQKAVNQSGFAKSQELLQQNKQ------ALPQQNSEL-

Query:  -QVGQLAKELKAR
          +  L KE  A+
Subjt:  -QVGQLAKELKAR

A0A6J1EQ90 uncharacterized protein LOC1114364113.9e-9039.47Show/hide
Query:  PEAELTAEWRSKKTICDSTQGDEVLPCVTLQNPSLEQNGQQNNQAKY---PILIANDRDRVIRSYVFPIIDELNPVIMRPIIDASNFEIKPVMFHMLEMV
        PE E T   R KK    + Q  + +      N   E      NQ +    PI +A+DR+R IR+Y  P ++ELNP I+RP I  + FE+KPVMF ML+ +
Subjt:  PEAELTAEWRSKKTICDSTQGDEVLPCVTLQNPSLEQNGQQNNQAKY---PILIANDRDRVIRSYVFPIIDELNPVIMRPIIDASNFEIKPVMFHMLEMV

Query:  GQFHGLPSEDPYLHLKSFLGV-------SDSFAFEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNELVEKFLSKYFPPIRSAKLRSEIVGFRQN
        GQFHGLP EDP+LHLKSFLGV       SDSF F+GV +D +RL+LFPY LRDGAK+WLN+ AP +I +WN L E FL KYFPP R+A+ ++EIV F+Q 
Subjt:  GQFHGLPSEDPYLHLKSFLGV-------SDSFAFEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNELVEKFLSKYFPPIRSAKLRSEIVGFRQN

Query:  EEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIATQCMVDASTGGALLAKSFNEAYEILERISTYSFQWSDVRGS-NKKVKSVLEVD-------
        E+ET SEA ERFKE+LRKCPHH LP+CIQME FYNGLNI T+ +VDAS  GA+L+K++NEAYEILERI++ + QW+DVR +  +K + VLEVD       
Subjt:  EEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIATQCMVDASTGGALLAKSFNEAYEILERISTYSFQWSDVRGS-NKKVKSVLEVD-------

Query:  ------------------------------------------------------------------GNQRNNLYSNFYNQGWRNHPNFAWAGQGSNSQVP
                                                                          GN +NN +SN YN GWRNHPNF+W GQ   +Q  
Subjt:  ------------------------------------------------------------------GNQRNNLYSNFYNQGWRNHPNFAWAGQGSNSQVP

Query:  QAQQKAVNQSGFAKSQELLQQNKQ--------ALPQQNSELQVGQLAKELKARLHGNIPLDIEHPIREGKKQVQTMTLGSDKPLEDRKEPS-KPQEVEKS
        Q   KA   SGF    +L   ++Q           Q  SE  +  L KE  A+       D     ++   +   + +G +K  E     S +  + ++ 
Subjt:  QAQQKAVNQSGFAKSQELLQQNKQ--------ALPQQNSELQVGQLAKELKARLHGNIPLDIEHPIREGKKQVQTMTLGSDKPLEDRKEPS-KPQEVEKS

Query:  SDSNVVEKELGSGQYDGGSSKDAGAISSVPDVEPQPYVPPPPYDPPFPFPQRQKSKNQEVKFNMF
        ++   V+KE  S  Y     +     ++  + E + Y P P      PFPQR K K +E  F  F
Subjt:  SDSNVVEKELGSGQYDGGSSKDAGAISSVPDVEPQPYVPPPPYDPPFPFPQRQKSKNQEVKFNMF

A0A6J1G7Q6 uncharacterized protein LOC1114515981.5e-8945Show/hide
Query:  ILIANDRDRVIRSYVFPIIDELNPVIMRPIIDASNFEIKPVMFHMLEMVGQFHGLPSEDPYLHLKSFLGVSDSFAFEGVSRDALRLTLFPYSLRDGAKAW
        I +A+DR+R IR+Y  P ++ELNP I+RP + A+ FE+KPVMF ML+ +GQFHGL S+DP+LHLKSFLGVSDSF F+GV +D +RL+ F YSLRDGAK+W
Subjt:  ILIANDRDRVIRSYVFPIIDELNPVIMRPIIDASNFEIKPVMFHMLEMVGQFHGLPSEDPYLHLKSFLGVSDSFAFEGVSRDALRLTLFPYSLRDGAKAW

Query:  LNSFAPASISTWNELVEKFLSKYFPPIRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIATQCMVDASTGGALLAKSF
        LN  A   I +WN L EKFL KYFPP RSA+ R+EIV F++ E ET SEAWERFKE LRKCPHH LP+CIQ+E FYNGLN AT+ +VDAS  G +L+K++
Subjt:  LNSFAPASISTWNELVEKFLSKYFPPIRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIATQCMVDASTGGALLAKSF

Query:  NEAYEILERISTYSFQWSDVRGS-NKKVKSVLEVD-----------------------------------------------------------------
        NEAYEILERI++ + QW DVR +  KK + VLEVD                                                                 
Subjt:  NEAYEILERISTYSFQWSDVRGS-NKKVKSVLEVD-----------------------------------------------------------------

Query:  --------GNQRNNLYSNFYNQGWRNHPNFAWAGQGSNSQVPQAQQKAVNQSGFAKSQELLQQNKQALPQ------------------------------
                GN + N  SN YN GWRNHPNF   GQGS +Q  Q   KA    GF    +L   ++QA  Q                              
Subjt:  --------GNQRNNLYSNFYNQGWRNHPNFAWAGQGSNSQVPQAQQKAVNQSGFAKSQELLQQNKQALPQ------------------------------

Query:  -------QNSELQVGQLAKELKARLHGNIPLDIEHPIREG
               +N E+QVGQLA EL+ R  G +P D E P REG
Subjt:  -------QNSELQVGQLAKELKARLHGNIPLDIEHPIREG

A0A6J1H7E4 uncharacterized protein LOC1114611686.2e-9650.54Show/hide
Query:  SLEQNGQQNNQAKYPILIAN-------------DRDRVIRSYVFPIIDELNPVIMRPIIDASNFEIKPVMFHMLEMVGQFHGLPSEDPYLHLKSFLGVSD
        +++Q  Q N + + P+++AN             DR+R IR+Y  P +DELNP I+RP + A+ FE+KPVMF ML+ +GQFHGLPSEDP+LHLKSFLGVSD
Subjt:  SLEQNGQQNNQAKYPILIAN-------------DRDRVIRSYVFPIIDELNPVIMRPIIDASNFEIKPVMFHMLEMVGQFHGLPSEDPYLHLKSFLGVSD

Query:  SFAFEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNELVEKFLSKYFPPIRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQM
        SF F+GV +D +RL+LFPYSLRDGAK+WLN+ AP +I +WN L EKFL KYFPP R+A+ R+EIV F+Q E+ET SEAWERFKE+LRKCPHH LP+CIQM
Subjt:  SFAFEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNELVEKFLSKYFPPIRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQM

Query:  EIFYNGLNIATQCMVDASTGGALLAKSFNEAYEILERISTYSFQWSDVRGS-NKKVKSVLEVD-------------------------------------
        E FYNGLNIAT+ +VDAS  GA+L+K++NEAYEILERI++ + QW+DVR +  KK + VLEVD                                     
Subjt:  EIFYNGLNIATQCMVDASTGGALLAKSFNEAYEILERISTYSFQWSDVRGS-NKKVKSVLEVD-------------------------------------

Query:  ------------------------------------GNQRNNLYSNFYNQGWRNHPNFAWAGQGSNSQ
                                            GNQ+NN  SN YN GWRNHPNF+W GQGS +Q
Subjt:  ------------------------------------GNQRNNLYSNFYNQGWRNHPNFAWAGQGSNSQ

U5CUI2 Retrotrans_gag domain-containing protein8.7e-8261.86Show/hide
Query:  PILIANDRDRVIRSYVFPIIDELNPVIMRPIIDASNFEIKPVMFHMLEMVGQFHGLPSEDPYLHLKSFLGVSDSFAFEGVSRDALRLTLFPYSLRDGAKA
        PI++A+DR R IR Y  P+ +ELNP I+RP I A  FE+KPVMF ML+ VGQF G+P+EDP+LHL+SFL VSDSF  +GVS + LRL LFP+SLRD A++
Subjt:  PILIANDRDRVIRSYVFPIIDELNPVIMRPIIDASNFEIKPVMFHMLEMVGQFHGLPSEDPYLHLKSFLGVSDSFAFEGVSRDALRLTLFPYSLRDGAKA

Query:  WLNSFAPASISTWNELVEKFLSKYFPPIRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIATQCMVDASTGGALLAKS
        WLN+  P S++ WN+L EKFL KYFPP R+AK RSEI+ F+Q E+E+ S+AWERFKELLRKCPHH +P+CIQME FYNGLN A++ ++DAS  GA+L+KS
Subjt:  WLNSFAPASISTWNELVEKFLSKYFPPIRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIATQCMVDASTGGALLAKS

Query:  FNEAYEILERISTYSFQWSDVRG-SNKKVKSVLEVD
        +NEA+EILE I++ ++QWS+ R  +++KV  VLEVD
Subjt:  FNEAYEILERISTYSFQWSDVRG-SNKKVKSVLEVD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACACCCATGGTTTGCCACTGTATGAAAGGTTTACTTAGGGAAGTGAGTACCAATCCAAGCTTAAAAACAGAAGAAATCTGGAAAAACTTAGAAATGCGGCCGCATTT
CTGGGAAGGCAAAATTGAAATGCGACTGCATATCTGGGCAGACAGAGGCAGTTCCGAGTCCGTCGCGGGCCGTTTTGAACAAGAGTTTCACGCACCTATGACGGAGCCTG
TCGTGGGGCTCAAATCACACCGGCGTCAAGACGCCAGGGCACAGTGTCTCGACGCTGCGAACATTAGTCTTCAGACTGAAATCGCGTCGGCATCTCGACGCCAAGACCGA
CGTCTCAATGCCACCCTCTTGACTCGAATTCCTGCTCCTTTTCTCGGCTTGGTCTCTCTTCTTCTTGGTTTGGGCTTCAATCTTTGGCTTTCGGTGTCACGATTCAGGCC
CGCTTCAGCACAATTGGGTCTCTTCACCTCCTCTCTGTCCAAGACATCGAAATGGCTCCAAAAACTCCATAATTGGCTCGAATTAACACCAATTAGCGTGATCGGCTCCC
AACACGGCCTAAGCTCAATTAACTTGGCGAGGTTGCCTAAGGCACCCTGTAGCAACTATAGAGTTATAGATTGCATGCTTGATTTATCTATGTTGGAGTCGAGTCACAGA
CATGTGCATAATGTTTCTTCTTCAAATCTGTGTTTCCAACTTAGTCCTCGTGGGATCGATACCACCCTTGGAATACTTCTAAGGAGGAGCCGTCAAAGACAAAGGCATTC
ACGATTTGGAGACCGAATCAATCGGCCGGAGGCCGAATTGACGGCGGAATGGAGATCTAAGAAGACCATTTGCGATTCCACTCAAGGAGACGAAGTTCTTCCATGTGTGA
CTTTGCAAAATCCGTCGCTGGAGCAAAATGGACAGCAAAATAATCAGGCTAAGTATCCTATCCTTATAGCAAATGATAGGGACAGAGTCATTAGATCGTATGTTTTCCCA
ATAATTGACGAGTTAAATCCAGTGATAATGCGTCCAATAATTGATGCATCAAATTTTGAAATAAAACCGGTCATGTTTCACATGTTAGAGATGGTCGGCCAATTCCATGG
TTTACCATCTGAGGACCCTTATCTTCATCTTAAGTCTTTTCTAGGAGTTAGTGATTCATTTGCTTTCGAGGGAGTGTCGAGAGATGCCCTTAGATTAACCCTATTTCCTT
ATTCTCTTAGAGATGGAGCAAAAGCGTGGTTGAATTCTTTTGCTCCAGCATCGATAAGTACGTGGAATGAGCTAGTAGAGAAATTTCTTAGTAAGTATTTTCCACCAATT
AGGAGTGCCAAGTTAAGGAGTGAAATAGTGGGATTTAGGCAAAATGAGGAAGAAACTTTTAGTGAGGCTTGGGAAAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCA
CTGCTTGCCATATTGTATTCAAATGGAGATATTTTACAATGGATTAAACATAGCAACCCAGTGTATGGTTGATGCTTCTACGGGAGGGGCTCTTTTGGCAAAATCTTTTA
ATGAAGCTTATGAGATTTTAGAGAGAATATCAACCTACAGTTTTCAATGGTCAGATGTTAGAGGCTCTAATAAAAAAGTTAAGAGTGTGTTAGAAGTTGATGGTAATCAA
AGGAATAACCTTTATTCTAACTTTTATAATCAAGGTTGGCGCAACCACCCCAACTTTGCGTGGGCAGGGCAAGGAAGCAATTCACAAGTCCCCCAAGCACAGCAAAAGGC
GGTGAACCAGTCCGGATTTGCTAAATCACAGGAATTGCTCCAGCAAAATAAGCAGGCTTTGCCCCAGCAAAACTCGGAGTTGCAAGTGGGTCAGCTAGCTAAGGAGCTGA
AGGCACGACTTCACGGGAATATTCCTTTAGATATTGAACACCCTATAAGGGAAGGTAAGAAGCAGGTGCAGACAATGACTTTAGGGAGTGATAAGCCACTAGAAGATAGA
AAAGAGCCTAGTAAACCCCAGGAAGTAGAAAAGAGTAGTGATAGTAATGTTGTCGAAAAAGAATTGGGGTCTGGTCAATATGATGGAGGCAGCAGCAAAGATGCTGGAGC
AATTAGTTCTGTTCCAGATGTAGAACCCCAACCTTATGTACCGCCCCCACCCTATGACCCACCCTTTCCTTTTCCACAAAGGCAGAAGTCTAAGAACCAAGAGGTGAAGT
TTAATATGTTTGATGCAATAAAATATCCTAATGATCTTGAGGATTGCTCGTGCATTCATGTGTTGGATGAGTTTGTTGAGGACCACTTTGAGAAGGAATTGATTGAGTAC
CATACCCAAAAATTTGGAGAAATCCAAATTGAGGATTTGGAAATAGGTGGATTGGAGCATGAGTATGACGATGTAGGTGAGATTTCTAGTTTTAAGAGGAATTTTGAATC
CTTGGAGCCAATCGATGAGAAATCCAAGCCTATTGAACCTTGTAATTCATTGACATTGCTCCAGCAACCTGAGATTAGGAAATCCTTCATTGATGAGCGGTTATTTACTG
TAGCTCATATTAAGGAAGTGAAAACACCTTGGTATGATGACTTTTCCAATTACCTTGATTTTGGAAATTTGCCTCCTGGTTTATCAAAAGAACAAATGAAAGAATTTTTC
CACGAGAGCCCTGGGGTGATTAATTCTTTATTTAATTTACAAGACTTCCCCCACGCTATTTTCAATGAAATATTAGTTGCTTCCTCAAACGAGCAACTAAATGTCCAGTG
GAGGTTGTCTAAGATGAGGGCAAGAACATTCCAGTCAACATACCTGAAGTGTGAGGCTAACACTTGGCTCAGCTTTGTCAAGCAGAGATTATTGGCTACAACGTACGCCT
CCACTATCCCCTGTGATCGAGTGCACTTAAGGTCAAGAGTCCCAGAGAGTTACAGTGAAATAAAGTTGTTTGACAAGGGGATTATTGGCACTGAAACCTTTTGCTTGAGC
ATTCCTTCTAGCCTGGTCATGACTGCAGCAAAAGTTCTGAGTGCTTGGTTTTGCAGGATGCTCAGGAGGAGTCGTCAAAGACAAAGGCATTCACGATTTGGAGACCGAAT
CAATCGGCCGGAGGCCGAATTGACGGCGGAATGGAGATCTAAGAAGACCATTTGCGATTCCACTCAAGGAGACGAAGTTCTTCCATGTTCTTGGTGGTTTGGAGCAGTCC
GTTTCATCCCCATTTGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGACACCCATGGTTTGCCACTGTATGAAAGGTTTACTTAGGGAAGTGAGTACCAATCCAAGCTTAAAAACAGAAGAAATCTGGAAAAACTTAGAAATGCGGCCGCATTT
CTGGGAAGGCAAAATTGAAATGCGACTGCATATCTGGGCAGACAGAGGCAGTTCCGAGTCCGTCGCGGGCCGTTTTGAACAAGAGTTTCACGCACCTATGACGGAGCCTG
TCGTGGGGCTCAAATCACACCGGCGTCAAGACGCCAGGGCACAGTGTCTCGACGCTGCGAACATTAGTCTTCAGACTGAAATCGCGTCGGCATCTCGACGCCAAGACCGA
CGTCTCAATGCCACCCTCTTGACTCGAATTCCTGCTCCTTTTCTCGGCTTGGTCTCTCTTCTTCTTGGTTTGGGCTTCAATCTTTGGCTTTCGGTGTCACGATTCAGGCC
CGCTTCAGCACAATTGGGTCTCTTCACCTCCTCTCTGTCCAAGACATCGAAATGGCTCCAAAAACTCCATAATTGGCTCGAATTAACACCAATTAGCGTGATCGGCTCCC
AACACGGCCTAAGCTCAATTAACTTGGCGAGGTTGCCTAAGGCACCCTGTAGCAACTATAGAGTTATAGATTGCATGCTTGATTTATCTATGTTGGAGTCGAGTCACAGA
CATGTGCATAATGTTTCTTCTTCAAATCTGTGTTTCCAACTTAGTCCTCGTGGGATCGATACCACCCTTGGAATACTTCTAAGGAGGAGCCGTCAAAGACAAAGGCATTC
ACGATTTGGAGACCGAATCAATCGGCCGGAGGCCGAATTGACGGCGGAATGGAGATCTAAGAAGACCATTTGCGATTCCACTCAAGGAGACGAAGTTCTTCCATGTGTGA
CTTTGCAAAATCCGTCGCTGGAGCAAAATGGACAGCAAAATAATCAGGCTAAGTATCCTATCCTTATAGCAAATGATAGGGACAGAGTCATTAGATCGTATGTTTTCCCA
ATAATTGACGAGTTAAATCCAGTGATAATGCGTCCAATAATTGATGCATCAAATTTTGAAATAAAACCGGTCATGTTTCACATGTTAGAGATGGTCGGCCAATTCCATGG
TTTACCATCTGAGGACCCTTATCTTCATCTTAAGTCTTTTCTAGGAGTTAGTGATTCATTTGCTTTCGAGGGAGTGTCGAGAGATGCCCTTAGATTAACCCTATTTCCTT
ATTCTCTTAGAGATGGAGCAAAAGCGTGGTTGAATTCTTTTGCTCCAGCATCGATAAGTACGTGGAATGAGCTAGTAGAGAAATTTCTTAGTAAGTATTTTCCACCAATT
AGGAGTGCCAAGTTAAGGAGTGAAATAGTGGGATTTAGGCAAAATGAGGAAGAAACTTTTAGTGAGGCTTGGGAAAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCA
CTGCTTGCCATATTGTATTCAAATGGAGATATTTTACAATGGATTAAACATAGCAACCCAGTGTATGGTTGATGCTTCTACGGGAGGGGCTCTTTTGGCAAAATCTTTTA
ATGAAGCTTATGAGATTTTAGAGAGAATATCAACCTACAGTTTTCAATGGTCAGATGTTAGAGGCTCTAATAAAAAAGTTAAGAGTGTGTTAGAAGTTGATGGTAATCAA
AGGAATAACCTTTATTCTAACTTTTATAATCAAGGTTGGCGCAACCACCCCAACTTTGCGTGGGCAGGGCAAGGAAGCAATTCACAAGTCCCCCAAGCACAGCAAAAGGC
GGTGAACCAGTCCGGATTTGCTAAATCACAGGAATTGCTCCAGCAAAATAAGCAGGCTTTGCCCCAGCAAAACTCGGAGTTGCAAGTGGGTCAGCTAGCTAAGGAGCTGA
AGGCACGACTTCACGGGAATATTCCTTTAGATATTGAACACCCTATAAGGGAAGGTAAGAAGCAGGTGCAGACAATGACTTTAGGGAGTGATAAGCCACTAGAAGATAGA
AAAGAGCCTAGTAAACCCCAGGAAGTAGAAAAGAGTAGTGATAGTAATGTTGTCGAAAAAGAATTGGGGTCTGGTCAATATGATGGAGGCAGCAGCAAAGATGCTGGAGC
AATTAGTTCTGTTCCAGATGTAGAACCCCAACCTTATGTACCGCCCCCACCCTATGACCCACCCTTTCCTTTTCCACAAAGGCAGAAGTCTAAGAACCAAGAGGTGAAGT
TTAATATGTTTGATGCAATAAAATATCCTAATGATCTTGAGGATTGCTCGTGCATTCATGTGTTGGATGAGTTTGTTGAGGACCACTTTGAGAAGGAATTGATTGAGTAC
CATACCCAAAAATTTGGAGAAATCCAAATTGAGGATTTGGAAATAGGTGGATTGGAGCATGAGTATGACGATGTAGGTGAGATTTCTAGTTTTAAGAGGAATTTTGAATC
CTTGGAGCCAATCGATGAGAAATCCAAGCCTATTGAACCTTGTAATTCATTGACATTGCTCCAGCAACCTGAGATTAGGAAATCCTTCATTGATGAGCGGTTATTTACTG
TAGCTCATATTAAGGAAGTGAAAACACCTTGGTATGATGACTTTTCCAATTACCTTGATTTTGGAAATTTGCCTCCTGGTTTATCAAAAGAACAAATGAAAGAATTTTTC
CACGAGAGCCCTGGGGTGATTAATTCTTTATTTAATTTACAAGACTTCCCCCACGCTATTTTCAATGAAATATTAGTTGCTTCCTCAAACGAGCAACTAAATGTCCAGTG
GAGGTTGTCTAAGATGAGGGCAAGAACATTCCAGTCAACATACCTGAAGTGTGAGGCTAACACTTGGCTCAGCTTTGTCAAGCAGAGATTATTGGCTACAACGTACGCCT
CCACTATCCCCTGTGATCGAGTGCACTTAAGGTCAAGAGTCCCAGAGAGTTACAGTGAAATAAAGTTGTTTGACAAGGGGATTATTGGCACTGAAACCTTTTGCTTGAGC
ATTCCTTCTAGCCTGGTCATGACTGCAGCAAAAGTTCTGAGTGCTTGGTTTTGCAGGATGCTCAGGAGGAGTCGTCAAAGACAAAGGCATTCACGATTTGGAGACCGAAT
CAATCGGCCGGAGGCCGAATTGACGGCGGAATGGAGATCTAAGAAGACCATTTGCGATTCCACTCAAGGAGACGAAGTTCTTCCATGTTCTTGGTGGTTTGGAGCAGTCC
GTTTCATCCCCATTTGCTGA
Protein sequenceShow/hide protein sequence
MTPMVCHCMKGLLREVSTNPSLKTEEIWKNLEMRPHFWEGKIEMRLHIWADRGSSESVAGRFEQEFHAPMTEPVVGLKSHRRQDARAQCLDAANISLQTEIASASRRQDR
RLNATLLTRIPAPFLGLVSLLLGLGFNLWLSVSRFRPASAQLGLFTSSLSKTSKWLQKLHNWLELTPISVIGSQHGLSSINLARLPKAPCSNYRVIDCMLDLSMLESSHR
HVHNVSSSNLCFQLSPRGIDTTLGILLRRSRQRQRHSRFGDRINRPEAELTAEWRSKKTICDSTQGDEVLPCVTLQNPSLEQNGQQNNQAKYPILIANDRDRVIRSYVFP
IIDELNPVIMRPIIDASNFEIKPVMFHMLEMVGQFHGLPSEDPYLHLKSFLGVSDSFAFEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNELVEKFLSKYFPPI
RSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCLPYCIQMEIFYNGLNIATQCMVDASTGGALLAKSFNEAYEILERISTYSFQWSDVRGSNKKVKSVLEVDGNQ
RNNLYSNFYNQGWRNHPNFAWAGQGSNSQVPQAQQKAVNQSGFAKSQELLQQNKQALPQQNSELQVGQLAKELKARLHGNIPLDIEHPIREGKKQVQTMTLGSDKPLEDR
KEPSKPQEVEKSSDSNVVEKELGSGQYDGGSSKDAGAISSVPDVEPQPYVPPPPYDPPFPFPQRQKSKNQEVKFNMFDAIKYPNDLEDCSCIHVLDEFVEDHFEKELIEY
HTQKFGEIQIEDLEIGGLEHEYDDVGEISSFKRNFESLEPIDEKSKPIEPCNSLTLLQQPEIRKSFIDERLFTVAHIKEVKTPWYDDFSNYLDFGNLPPGLSKEQMKEFF
HESPGVINSLFNLQDFPHAIFNEILVASSNEQLNVQWRLSKMRARTFQSTYLKCEANTWLSFVKQRLLATTYASTIPCDRVHLRSRVPESYSEIKLFDKGIIGTETFCLS
IPSSLVMTAAKVLSAWFCRMLRRSRQRQRHSRFGDRINRPEAELTAEWRSKKTICDSTQGDEVLPCSWWFGAVRFIPIC