; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036226 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036226
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr3:41978819..41982670
RNA-Seq ExpressionLag0036226
SyntenyLag0036226
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]6.1e-9637.73Show/hide
Query:  QNRLLQQNQLLEQNEQQNNQAENPIL-----VANDRVRAIRAYVFPMFDELNLGIARPQIEAANFEIKPVMFQMLQTV----------------------
        Q ++ Q    ++ N    N  + PI+     + +D+ RAIR Y  P F+ELN GI RP I+A  FE+KPVMFQMLQT+                      
Subjt:  QNRLLQQNQLLEQNEQQNNQAENPIL-----VANDRVRAIRAYVFPMFDELNLGIARPQIEAANFEIKPVMFQMLQTV----------------------

Query:  ------GVSKDALRLTLFPYSLRDEAKAWLNSFAPGSIRTWNELAEKLLSKYFPPNRNAKLRSKIVGFRQLEDETFSEAWERFKELLQKCPHHGLPHCIQ
              GV +DALRL LFPYS+RD A+ WLNS   GS+ TWN+L EK LSKYFPPN NAKLR++I  F+Q +DE+  +AWERFKELL+KCPHHG+ HCIQ
Subjt:  ------GVSKDALRLTLFPYSLRDEAKAWLNSFAPGSIRTWNELAEKLLSKYFPPNRNAKLRSKIVGFRQLEDETFSEAWERFKELLQKCPHHGLPHCIQ

Query:  METFYNGLNGVTQGMVDVSTQGALLAKTFNEAHEILERISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMISNALKNVTVISHQQPPVVEPAALV
        METFYNGLN  T+ +VD S  GALL+K++N+A+EILE I+T + QWS  R  T KKV  + +VD +++++A LA + + LKN++ + + Q      ++ +
Subjt:  METFYNGLNGVTQGMVDVSTQGALLAKTFNEAHEILERISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMISNALKNVTVISHQQPPVVEPAALV

Query:  NQVAEEACVYCGENHNYEFCPNNPTSVFF-----------------------------------------------------------------------
        NQ    +CV+CGE H Y+ CP+NP SVF+                                                                       
Subjt:  NQVAEEACVYCGENHNYEFCPNNPTSVFF-----------------------------------------------------------------------

Query:  -------------------------VANELKARPQGKLPSDTEHPRREGKEQVKAVTFRSGKPLEERKEPSK-PQNVEISCDKNVVVEKELESGQGAGGS
                                 +ANEL+ RP G LPSDTE P+  G E  KA+T +SGK L      +K   +VE S ++ +  +KE E+ +     
Subjt:  -------------------------VANELKARPQGKLPSDTEHPRREGKEQVKAVTFRSGKPLEERKEPSK-PQNVEISCDKNVVVEKELESGQGAGGS

Query:  NNDAGASGSIPDVEPPY------------------VPPHLMYHLYLFHKGKGL----RIRMDILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSF
        +    +  S    +PP+                  V   L  ++ L    + +    +   DILTKK+RLGEFETV+LT+ECS  L++ LPTK KDPGSF
Subjt:  NNDAGASGSIPDVEPPY------------------VPPHLMYHLYLFHKGKGL----RIRMDILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSF

Query:  TIPMSIG
        TIP +IG
Subjt:  TIPMSIG

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata]2.1e-10437.41Show/hide
Query:  QNRLLQQNQLLEQNEQQ-------NNQAENPILVAN-------------DRVRAIRAYVFPMFDELNLGIARPQIEAANFEIKPVMFQMLQTV-------
        + RL ++ ++ EQN QQ       N + ENP ++AN             DR RAIRAY  P  +ELN  I RP+I+   FE+KPVMFQMLQT+       
Subjt:  QNRLLQQNQLLEQNEQQ-------NNQAENPILVAN-------------DRVRAIRAYVFPMFDELNLGIARPQIEAANFEIKPVMFQMLQTV-------

Query:  ----------------------------GVSKDALRLTLFPYSLRDEAKAWLNSFAPGSIRTWNELAEKLLSKYFPPNRNAKLRSKIVGFRQLEDETFSE
                                    GV KD +RL+LFPY LRD AK+WLN+ APG+I +WN LAE  L KYFPP RNA+ +++IV F+Q EDET SE
Subjt:  ----------------------------GVSKDALRLTLFPYSLRDEAKAWLNSFAPGSIRTWNELAEKLLSKYFPPNRNAKLRSKIVGFRQLEDETFSE

Query:  AWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDVSTQGALLAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMISN
        A ERFKE+L+KCPHHGLPHCIQMETFYNGLN VT+ +VD S  GA+L+KT+NEA+EILERI++N+CQW+DVR    +K + VLEVD +S+I A LA ++N
Subjt:  AWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDVSTQGALLAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMISN

Query:  ALKNVTVISHQQ-PPVVEPAALVNQVAEEACVYCGENHNYEFCPNNPTSVFFVANE-----LKARP----------------------------------
         L+N+ +         V  AA +NQ A E+CVYCGE H ++ CP+NP S+F+V N+     LK  P                                  
Subjt:  ALKNVTVISHQQ-PPVVEPAALVNQVAEEACVYCGENHNYEFCPNNPTSVFFVANE-----LKARP----------------------------------

Query:  -----QGKLPSDTEHPRREGK--------------EQVKAVTFRSGKPLEERKEPSKPQNVEISCDKN--------------------VVVEKELESGQG
             Q +L   ++    +GK                +K    ++   ++ ++   +   V+I  +KN                      V+KE      
Subjt:  -----QGKLPSDTEHPRREGK--------------EQVKAVTFRSGKPLEERKEPSKPQNVEISCDKN--------------------VVVEKELESGQG

Query:  AGGSNNDAGASGSIPDVEPPYVPP--------------------------HLMYHLY--LFHKGKGLRIRMDILTKKKRLGEFETVSLTEECSVILKNGL
                  + S       Y P                           H+   L   L      ++   D+L  +++  EF+ VSL EECS ILKN +
Subjt:  AGGSNNDAGASGSIPDVEPPYVPP--------------------------HLMYHLY--LFHKGKGLRIRMDILTKKKRLGEFETVSLTEECSVILKNGL

Query:  PTKAKDPGSFTIPMSI---------------------------GIGEARPTTVTLQLADKSITYLEGKIEDVLVKAMKY
        P K KDPGSFTIP+SI                           GIGEARPTTVTLQLAD+SITY EGKIED+L++  K+
Subjt:  PTKAKDPGSFTIPMSI---------------------------GIGEARPTTVTLQLADKSITYLEGKIEDVLVKAMKY

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]1.3e-11140.22Show/hide
Query:  QAENPILVANDRVRAIRAYVFPMFDELNLGIARPQIEAANFEIKPVMFQMLQTV----------------------------GVSKDALRLTLFPYSLRD
        Q  +PI++ +DR RAIR Y  PMF+ELN GI RP+I+A  FE+KPVMFQMLQTV                            GVS++  RL LFP+SLRD
Subjt:  QAENPILVANDRVRAIRAYVFPMFDELNLGIARPQIEAANFEIKPVMFQMLQTV----------------------------GVSKDALRLTLFPYSLRD

Query:  EAKAWLNSFAPGSIRTWNELAEKLLSKYFPPNRNAKLRSKIVGFRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDVSTQGAL
         A++WLN+ +P S+  WN+ AEK L KYFPP RNAK RS+I+ F QLEDE+ S+AWERFKELL+KCPHHG+PHCIQMETFYNGLN  +Q ++D S  GA+
Subjt:  EAKAWLNSFAPGSIRTWNELAEKLLSKYFPPNRNAKLRSKIVGFRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDVSTQGAL

Query:  LAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMISNALKNVTVISHQQPPVVEPAALVNQVAEEACVYCGENHNYEFCPNNP
        L+K++NEA EILE I++N+ QWS+ R   ++KV  VLEVD ++ +   +A ++N LKN+++ + +    ++PAA + Q  + +CV+C E H +E CP+NP
Subjt:  LAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMISNALKNVTVISHQQPPVVEPAALVNQVAEEACVYCGENHNYEFCPNNP

Query:  TSVFF-----------------------------------------------------------------------------------------------
         SV +                                                                                               
Subjt:  TSVFF-----------------------------------------------------------------------------------------------

Query:  ---VANELKARPQGKLPSDTEHPRREGKEQVKAVTFRSGKPL----EERKEPSKPQNVEISCDKNVVVEKELESGQ----GAGGSNNDAGASGSIPDVEP
           +ANELKARPQG LPSDTE+PRR+GKEQ K++  RSGK L    EE K   +P +++I    +    +E+   +      G  +N   ++      +P
Subjt:  ---VANELKARPQGKLPSDTEHPRREGKEQVKAVTFRSGKPL----EERKEPSKPQNVEISCDKNVVVEKELESGQ----GAGGSNNDAGASGSIPDVEP

Query:  PYVPP----------------------HLMYHLY--LFHKGKGLRIRMDILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPMSI-----GI
        P   P                      H+   L   L      ++   DILTKK+RLGEFE+  LTE    +LKN +P K KDPGSFTIP+SI     GI
Subjt:  PYVPP----------------------HLMYHLY--LFHKGKGLRIRMDILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPMSI-----GI

Query:  GEARPTTVTLQLADKSITYLEGKIEDVLVKAMKY
        GEARPTTVTLQLAD+S+ + +GKIEDVLV+  K+
Subjt:  GEARPTTVTLQLADKSITYLEGKIEDVLVKAMKY

XP_030508947.1 uncharacterized protein LOC115723603 [Cannabis sativa]1.4e-9541.73Show/hide
Query:  ENPILVANDRVRAIRAYVFPMFDELNLGIARPQIEAANFEIKPVMFQMLQTV----------------------------GVSKDALRLTLFPYSLRDEA
        +NPI +A+DR RA R Y   +F+ELN G  RP+I+A +FE+KPVMFQMLQ V                            GVS++ALRL LFP+SLRD A
Subjt:  ENPILVANDRVRAIRAYVFPMFDELNLGIARPQIEAANFEIKPVMFQMLQTV----------------------------GVSKDALRLTLFPYSLRDEA

Query:  KAWLNSFAPGSIRTWNELAEKLLSKYFPPNRNAKLRSKIVGFRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDVSTQGALLA
        +AWLN+  P  + +WN+LAEK L KYFPP RNA  RS+I+ F+QLEDET S+AWERFKELL+KCPHHG+PHCIQ+ETFYNGLN  ++ ++D S  GA+L+
Subjt:  KAWLNSFAPGSIRTWNELAEKLLSKYFPPNRNAKLRSKIVGFRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDVSTQGALLA

Query:  KTFNEAHEILERISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMISNALKNVTVISHQQPPVVEPAALVNQVAEEACVYCGENHNYEFCPNNPTS
        K++NEA EILE I+ N+ QWS  R  T++KV  VL+VD ++ + A +A ++N LKN+ +        V+PAA + Q A+ +CVYCG+ H +E  P+NP S
Subjt:  KTFNEAHEILERISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMISNALKNVTVISHQQPPVVEPAALVNQVAEEACVYCGENHNYEFCPNNPTS

Query:  VF-----------------------------------------------------FVANELKARPQGKLPSDTEHPRREGKEQVKAVTFRSGKPLEER--
                                                               ++A     RPQG LPSDT +PRR+GK+     T RSGK LE    
Subjt:  VF-----------------------------------------------------FVANELKARPQGKLPSDTEHPRREGKEQVKAVTFRSGKPLEER--

Query:  ----KEPSKPQNVEISCDKNVVVEKELESGQGAGGSNNDAGASGSIPDVEPPYVPPHL-------MYHLYL---------FHKGKGL-------RIRMDI
            KEPS  Q       K  +   E+ S      SN  +    S+    PP+ P  L        +  +L            G+ L       +   DI
Subjt:  ----KEPSKPQNVEISCDKNVVVEKELESGQGAGGSNNDAGASGSIPDVEPPYVPPHL-------MYHLYL---------FHKGKGL-------RIRMDI

Query:  LTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPMSIG
        LT+K+RLGEFETV+LTE  S +LK+ +P K KDPGSFTIP+SIG
Subjt:  LTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPMSIG

XP_030509259.1 uncharacterized protein LOC115723937 [Cannabis sativa]2.8e-9343.74Show/hide
Query:  PANPQQNRLLQQNQLLEQNEQQNNQAE-----------NPILVANDRVRAIRAYVFPMFDELNLGIARPQIEAANFEIKPVMFQMLQTV-GVSKDALRLT
        P +P+  R  +Q +  ++ +++ N A+           NPI +A+DR RAIR Y  PMF+ELN GI RP+I+A +FE+KPVMFQMLQT+ GVS++ALRL 
Subjt:  PANPQQNRLLQQNQLLEQNEQQNNQAE-----------NPILVANDRVRAIRAYVFPMFDELNLGIARPQIEAANFEIKPVMFQMLQTV-GVSKDALRLT

Query:  LFPYSLRDEAKAWLNSFAPGSIRTWNELAEKLLSKYFPPNRNAKLRSKIVGFRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMV
        LFP+SLRD A+AWLN+  P S+  WN+LAEK L KYFPP RNAK RS+I+ F+QLEDET S+AWERFKELL+KCPHHG+PHCIQ+ETFYNGLN  ++ ++
Subjt:  LFPYSLRDEAKAWLNSFAPGSIRTWNELAEKLLSKYFPPNRNAKLRSKIVGFRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMV

Query:  DVSTQGALLAKTFNEAHEILERISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMISNALKNVTVISHQQPPVVEPAALVNQVAEEACVYCGENHN
        D S  GA+L+K++NEA EILERI++N+ QWS  R  T++KV  VLEVD ++ + A +A ++N LKN+ +        V+PAA + Q AE +CVYCG+ H 
Subjt:  DVSTQGALLAKTFNEAHEILERISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMISNALKNVTVISHQQPPVVEPAALVNQVAEEACVYCGENHN

Query:  YEFCPNNPTSVFFV--------------------------------------------------------------------------------------
        +E CP+NP SV +V                                                                                      
Subjt:  YEFCPNNPTSVFFV--------------------------------------------------------------------------------------

Query:  -----------------ANELKARPQGKLPSDTEHPRREGKEQVKAVTFRSGKPLEER------KEPSKPQ
                         AN+LK RPQG LPSDTE+PRR+GKE  KAVT RSGK +E        KEPS  Q
Subjt:  -----------------ANELKARPQGKLPSDTEHPRREGKEQVKAVTFRSGKPLEER------KEPSKPQ

TrEMBL top hitse value%identityAlignment
A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129452.6e-8434.26Show/hide
Query:  EQQNNQAENPILVANDRVRAIRAYVFPMFDELNLGIARPQIEAANFEIKPVMFQMLQTV----------------------------GVSKDALRLTLFP
        E  NN   N I +  +  RA+R YV P+   L+  I RP I A NFEIKP   QM+Q+                             GV+ DA+RL LFP
Subjt:  EQQNNQAENPILVANDRVRAIRAYVFPMFDELNLGIARPQIEAANFEIKPVMFQMLQTV----------------------------GVSKDALRLTLFP

Query:  YSLRDEAKAWLNSFAPGSIRTWNELAEKLLSKYFPPNRNAKLRSKIVGFRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDVS
        +SLRD+AK+WLNS   GSI TW +LA+K L+K+FPP + AK+R+ I  F Q + E+  EAWERFKELL++CPHHG+P  +Q++TFYNGL G  + ++D +
Subjt:  YSLRDEAKAWLNSFAPGSIRTWNELAEKLLSKYFPPNRNAKLRSKIVGFRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDVS

Query:  TQGALLAKTFNEAHEILERISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMISNALKNVTVISHQQPPVVEPAALVNQVAEEACVYCGENHNYEFC
          GAL++K   +A+ +LE +++N+ QW   R  ++K     E+D + T+   +A +S  L  + V + Q   VV             C  CG++H+Y+ C
Subjt:  TQGALLAKTFNEAHEILERISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMISNALKNVTVISHQQPPVVEPAALVNQVAEEACVYCGENHNYEFC

Query:  PNNPTSVFFV------------------------------------------------------------------------------------------
        P N  SV FV                                                                                          
Subjt:  PNNPTSVFFV------------------------------------------------------------------------------------------

Query:  -ANELKARPQGKLPSDTE-HPRREGKEQVKAVTFRSGKPLEERKEPSKPQNVEISCDKNVVVEKELESGQGAGGSNNDAGASGSIPDVEPPYVPPHLMYH
         AN +  RPQG LPSDT+ +P+  GKEQ +A+T RSGK +E   + +    +E   DK  + E E+E  Q       + G S  I    PP  P  L   
Subjt:  -ANELKARPQGKLPSDTE-HPRREGKEQVKAVTFRSGKPLEERKEPSKPQNVEISCDKNVVVEKELESGQGAGGSNNDAGASGSIPDVEPPYVPPHLMYH

Query:  ---------LYLFHK--------------GKGLRIRMDILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPMSI------------------
                 L +F K                 ++   DIL+KK++LGEFETV LTEECS IL+N LP K KDPGSFTIP +I                  
Subjt:  ---------LYLFHK--------------GKGLRIRMDILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPMSI------------------

Query:  ---------GIGEARPTTVTLQLADKSITYLEGKIEDVLVKAMKY
                 G+GE +PT+VTLQLAD+S  Y  G IEDVLVK  K+
Subjt:  ---------GIGEARPTTVTLQLADKSITYLEGKIEDVLVKAMKY

A0A6J1EEI2 uncharacterized protein LOC1114333941.8e-9049.31Show/hide
Query:  QQNRLLQQN-QLLEQNEQQNNQAENPILVAN-------------DRVRAIRAYVFPMFDELNLGIARPQIEAANFEIKPVMFQMLQTVG-----------
        ++ ++ +QN Q +E   Q N + ENP ++AN             DR RAIRAY  P  +ELN  I RP+++A  FE+KPVMFQMLQT+G           
Subjt:  QQNRLLQQN-QLLEQNEQQNNQAENPILVAN-------------DRVRAIRAYVFPMFDELNLGIARPQIEAANFEIKPVMFQMLQTVG-----------

Query:  -----------------VSKDALRLTLFPYSLRDEAKAWLNSFAPGSIRTWNELAEKLLSKYFPPNRNAKLRSKIVGFRQLEDETFSEAWERFKELLQKC
                         V KD +RL+LFPYSLRD AK+WLN+ A G+I +WN L EK L KYFPP RNA+ R++IV F+Q ED+T SEAWERFKE+L+KC
Subjt:  -----------------VSKDALRLTLFPYSLRDEAKAWLNSFAPGSIRTWNELAEKLLSKYFPPNRNAKLRSKIVGFRQLEDETFSEAWERFKELLQKC

Query:  PHHGLPHCIQMETFYNGLNGVTQGMVDVSTQGALLAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMISNALKNVTVISHQQ
        PHHGLPHCIQMETFYNGLN  T+ +VD S  GA+L+KT+NEA+EILERI++N+CQW+DVR    +K + VLEVD +S+I A LA ++N L+N+ +     
Subjt:  PHHGLPHCIQMETFYNGLNGVTQGMVDVSTQGALLAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMISNALKNVTVISHQQ

Query:  -PPVVEPAALVNQVAEEACVYCGENHNYEFCPNNPTSVFFVANEL-KARPQGKLPSDTEHP
            V   A++NQ A E+CVYCGE H ++ CP+NP S+F+V N+  +  P+    S+T +P
Subjt:  -PPVVEPAALVNQVAEEACVYCGENHNYEFCPNNPTSVFFVANEL-KARPQGKLPSDTEHP

A0A6J1EQ90 uncharacterized protein LOC1114364111.0e-10437.41Show/hide
Query:  QNRLLQQNQLLEQNEQQ-------NNQAENPILVAN-------------DRVRAIRAYVFPMFDELNLGIARPQIEAANFEIKPVMFQMLQTV-------
        + RL ++ ++ EQN QQ       N + ENP ++AN             DR RAIRAY  P  +ELN  I RP+I+   FE+KPVMFQMLQT+       
Subjt:  QNRLLQQNQLLEQNEQQ-------NNQAENPILVAN-------------DRVRAIRAYVFPMFDELNLGIARPQIEAANFEIKPVMFQMLQTV-------

Query:  ----------------------------GVSKDALRLTLFPYSLRDEAKAWLNSFAPGSIRTWNELAEKLLSKYFPPNRNAKLRSKIVGFRQLEDETFSE
                                    GV KD +RL+LFPY LRD AK+WLN+ APG+I +WN LAE  L KYFPP RNA+ +++IV F+Q EDET SE
Subjt:  ----------------------------GVSKDALRLTLFPYSLRDEAKAWLNSFAPGSIRTWNELAEKLLSKYFPPNRNAKLRSKIVGFRQLEDETFSE

Query:  AWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDVSTQGALLAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMISN
        A ERFKE+L+KCPHHGLPHCIQMETFYNGLN VT+ +VD S  GA+L+KT+NEA+EILERI++N+CQW+DVR    +K + VLEVD +S+I A LA ++N
Subjt:  AWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDVSTQGALLAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMISN

Query:  ALKNVTVISHQQ-PPVVEPAALVNQVAEEACVYCGENHNYEFCPNNPTSVFFVANE-----LKARP----------------------------------
         L+N+ +         V  AA +NQ A E+CVYCGE H ++ CP+NP S+F+V N+     LK  P                                  
Subjt:  ALKNVTVISHQQ-PPVVEPAALVNQVAEEACVYCGENHNYEFCPNNPTSVFFVANE-----LKARP----------------------------------

Query:  -----QGKLPSDTEHPRREGK--------------EQVKAVTFRSGKPLEERKEPSKPQNVEISCDKN--------------------VVVEKELESGQG
             Q +L   ++    +GK                +K    ++   ++ ++   +   V+I  +KN                      V+KE      
Subjt:  -----QGKLPSDTEHPRREGK--------------EQVKAVTFRSGKPLEERKEPSKPQNVEISCDKN--------------------VVVEKELESGQG

Query:  AGGSNNDAGASGSIPDVEPPYVPP--------------------------HLMYHLY--LFHKGKGLRIRMDILTKKKRLGEFETVSLTEECSVILKNGL
                  + S       Y P                           H+   L   L      ++   D+L  +++  EF+ VSL EECS ILKN +
Subjt:  AGGSNNDAGASGSIPDVEPPYVPP--------------------------HLMYHLY--LFHKGKGLRIRMDILTKKKRLGEFETVSLTEECSVILKNGL

Query:  PTKAKDPGSFTIPMSI---------------------------GIGEARPTTVTLQLADKSITYLEGKIEDVLVKAMKY
        P K KDPGSFTIP+SI                           GIGEARPTTVTLQLAD+SITY EGKIED+L++  K+
Subjt:  PTKAKDPGSFTIPMSI---------------------------GIGEARPTTVTLQLADKSITYLEGKIEDVLVKAMKY

A0A6J1H7E4 uncharacterized protein LOC1114611683.7e-9151Show/hide
Query:  QLLEQNEQQNNQAENPILVAN-------------DRVRAIRAYVFPMFDELNLGIARPQIEAANFEIKPVMFQMLQTV----------------------
        Q ++Q  Q N + ENP+++AN             DR RAIRAY  P  DELN  I RP+++A  FE+KPVMFQMLQT+                      
Subjt:  QLLEQNEQQNNQAENPILVAN-------------DRVRAIRAYVFPMFDELNLGIARPQIEAANFEIKPVMFQMLQTV----------------------

Query:  ------GVSKDALRLTLFPYSLRDEAKAWLNSFAPGSIRTWNELAEKLLSKYFPPNRNAKLRSKIVGFRQLEDETFSEAWERFKELLQKCPHHGLPHCIQ
              GV KD +RL+LFPYSLRD AK+WLN+ AP +I +WN LAEK L KYFPP RNA+ R++IV F+Q EDET SEAWERFKE+L+KCPHHGLPHCIQ
Subjt:  ------GVSKDALRLTLFPYSLRDEAKAWLNSFAPGSIRTWNELAEKLLSKYFPPNRNAKLRSKIVGFRQLEDETFSEAWERFKELLQKCPHHGLPHCIQ

Query:  METFYNGLNGVTQGMVDVSTQGALLAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMISNALKNVTVISHQQPPVVEP---A
        METFYNGLN  T+ +VD S  GA+L+KT+NEA+EILERI++N+CQW+DVR    KK + VLEVD +S+I A LA ++N L+N+     Q   +  P   A
Subjt:  METFYNGLNGVTQGMVDVSTQGALLAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMISNALKNVTVISHQQPPVVEP---A

Query:  ALVNQVAEEACVYCGENHNYEFCPNNPTSVFFVANELKARPQGKLPSDTEH
        A++ Q A E+CVYCGE H ++ CP NP S+ +V N+     Q   PS   +
Subjt:  ALVNQVAEEACVYCGENHNYEFCPNNPTSVFFVANELKARPQGKLPSDTEH

U5CUI2 Retrotrans_gag domain-containing protein5.6e-8752.43Show/hide
Query:  QAENPILVANDRVRAIRAYVFPMFDELNLGIARPQIEAANFEIKPVMFQMLQTV----------------------------GVSKDALRLTLFPYSLRD
        Q  NPI++A+DR RAIR Y  PMF+ELN GI RP+I+A  FE+KPVMFQMLQTV                            GVS++ LRL LFP+SLRD
Subjt:  QAENPILVANDRVRAIRAYVFPMFDELNLGIARPQIEAANFEIKPVMFQMLQTV----------------------------GVSKDALRLTLFPYSLRD

Query:  EAKAWLNSFAPGSIRTWNELAEKLLSKYFPPNRNAKLRSKIVGFRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDVSTQGAL
         A++WLN+  P S+  WN+LAEK L KYFPP RNAK RS+I+ F+QLEDE+ S+AWERFKELL+KCPHHG+PHCIQMETFYNGLN  ++ ++D S  GA+
Subjt:  EAKAWLNSFAPGSIRTWNELAEKLLSKYFPPNRNAKLRSKIVGFRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDVSTQGAL

Query:  LAKTFNEAHEILERISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMISNALKNVTVISHQQPPVVEPAALVNQVAEEACVYCGENHNYEFCPNNP
        L+K++NEA EILE I++N+ QWS+ R  T++KV  VLEVD ++ + A +A ++N LKN+++ + +    ++PAA + Q  + +CV+CGE H +E CP+NP
Subjt:  LAKTFNEAHEILERISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMISNALKNVTVISHQQPPVVEPAALVNQVAEEACVYCGENHNYEFCPNNP

Query:  TSVFFVANE
         SV ++ N+
Subjt:  TSVFFVANE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAATGTACGCCCTAACAAATGCGGAAAGTCTCGCAGCGTCGAGACGCTGTGGAGGTGTGAGTTTGGTGCATGAGCGATCCGCTTGGGGTACGGGTCCTGAAGATCC
AGCGAACCCCCAGCAGAATCGTTTGCTGCAGCAGAACCAGTTGCTTGAGCAAAATGAGCAGCAAAATAATCAGGCTGAGAATCCTATCTTGGTAGCAAATGATAGGGTCA
GAGCAATTCGAGCGTATGTTTTTCCAATGTTTGATGAGTTAAATCTAGGGATTGCACGTCCTCAAATTGAGGCAGCAAATTTTGAAATAAAACCGGTAATGTTTCAGATG
TTGCAAACCGTGGGAGTGTCTAAAGATGCCCTCAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGAAGCAAAGGCATGGTTAAATTCTTTTGCTCCAGGATCGATTAG
GACATGGAATGAATTAGCAGAAAAACTTCTTAGTAAATATTTCCCACCAAATAGAAATGCTAAATTGAGGAGTAAAATAGTAGGGTTTAGGCAACTTGAAGATGAGACTT
TTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCAAAAGTGTCCCCACCATGGTTTACCTCATTGTATCCAAATGGAAACATTTTACAATGGGTTAAATGGAGTAACC
CAGGGTATGGTTGATGTTTCTACTCAAGGGGCCCTTTTGGCAAAAACTTTCAATGAAGCCCATGAAATTTTAGAAAGAATATCAACTAATAGTTGTCAGTGGTCTGATGT
TAGAGGCACAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGGGCTGATCTTGCAATGATTTCTAACGCTCTTAAGAATGTGACAGTGATTA
GTCATCAGCAGCCACCAGTTGTAGAGCCTGCTGCACTGGTGAACCAAGTCGCAGAGGAAGCATGTGTCTATTGTGGTGAAAACCACAACTACGAGTTTTGCCCCAACAAC
CCAACTTCTGTGTTTTTTGTAGCTAATGAGCTGAAGGCAAGGCCTCAAGGGAAGCTTCCTTCAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGT
GACTTTTAGGAGTGGTAAGCCATTAGAAGAAAGAAAAGAGCCTAGTAAACCCCAGAATGTAGAAATTAGTTGTGATAAAAATGTTGTTGTTGAGAAAGAGTTGGAGTCTG
GTCAAGGTGCTGGAGGCAGCAATAATGATGCTGGAGCATCTGGTTCTATTCCAGATGTGGAACCACCTTATGTGCCGCCCCACCTTATGTACCACCTCTACCTTTTCCAC
AAAGGCAAAGGCCTAAGAATTAGGATGGATATTTTAACTAAAAAGAAGAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAATGTAGTGTTATTCTTAAGAATGG
GCTACCAACCAAGGCTAAGGATCCAGGATCATTCACTATACCTATGTCAATAGGTATTGGTGAAGCTAGGCCTACTACAGTCACACTCCAATTAGCTGATAAGTCTATCA
CATATCTAGAGGGAAAAATTGAGGATGTCTTAGTAAAGGCCATGAAGTACCCAGACGAAATGGAAGATTGCTCTTTCATTAGGATTCTGGAGAACACAATTGTTGAGACA
ACAATTCAGAATTCGGCTAACAAGCATTTGGAAGATCATAGAGAGATTAGTGTAGAAGATTTAGAAGACCAGAGCGTCTCGACTCTAGGTGATCAGCGAAGAACGCTCCA
AAATCGTGGTAGCGTCTCGACGCTGTCTGAGCAGCGTCTCGACGCTGCTACGACTTCTGCCCCAAGACAACCTGACGTCACAGCGTCGAGACGCTGTGACGTAGCGTCTC
GATGCTGTCGGCTTGTTGAGCAAGAAACCCCTCCGTCCAGCACCTATAAAAAGCCCCCTCTCCCAAGGTTCAAATCATCCCATTTTTTGGGGAGCTCTCCCCTAGCTCAA
TAG
mRNA sequenceShow/hide mRNA sequence
ATGTCAATGTACGCCCTAACAAATGCGGAAAGTCTCGCAGCGTCGAGACGCTGTGGAGGTGTGAGTTTGGTGCATGAGCGATCCGCTTGGGGTACGGGTCCTGAAGATCC
AGCGAACCCCCAGCAGAATCGTTTGCTGCAGCAGAACCAGTTGCTTGAGCAAAATGAGCAGCAAAATAATCAGGCTGAGAATCCTATCTTGGTAGCAAATGATAGGGTCA
GAGCAATTCGAGCGTATGTTTTTCCAATGTTTGATGAGTTAAATCTAGGGATTGCACGTCCTCAAATTGAGGCAGCAAATTTTGAAATAAAACCGGTAATGTTTCAGATG
TTGCAAACCGTGGGAGTGTCTAAAGATGCCCTCAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGAAGCAAAGGCATGGTTAAATTCTTTTGCTCCAGGATCGATTAG
GACATGGAATGAATTAGCAGAAAAACTTCTTAGTAAATATTTCCCACCAAATAGAAATGCTAAATTGAGGAGTAAAATAGTAGGGTTTAGGCAACTTGAAGATGAGACTT
TTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCAAAAGTGTCCCCACCATGGTTTACCTCATTGTATCCAAATGGAAACATTTTACAATGGGTTAAATGGAGTAACC
CAGGGTATGGTTGATGTTTCTACTCAAGGGGCCCTTTTGGCAAAAACTTTCAATGAAGCCCATGAAATTTTAGAAAGAATATCAACTAATAGTTGTCAGTGGTCTGATGT
TAGAGGCACAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGGGCTGATCTTGCAATGATTTCTAACGCTCTTAAGAATGTGACAGTGATTA
GTCATCAGCAGCCACCAGTTGTAGAGCCTGCTGCACTGGTGAACCAAGTCGCAGAGGAAGCATGTGTCTATTGTGGTGAAAACCACAACTACGAGTTTTGCCCCAACAAC
CCAACTTCTGTGTTTTTTGTAGCTAATGAGCTGAAGGCAAGGCCTCAAGGGAAGCTTCCTTCAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGT
GACTTTTAGGAGTGGTAAGCCATTAGAAGAAAGAAAAGAGCCTAGTAAACCCCAGAATGTAGAAATTAGTTGTGATAAAAATGTTGTTGTTGAGAAAGAGTTGGAGTCTG
GTCAAGGTGCTGGAGGCAGCAATAATGATGCTGGAGCATCTGGTTCTATTCCAGATGTGGAACCACCTTATGTGCCGCCCCACCTTATGTACCACCTCTACCTTTTCCAC
AAAGGCAAAGGCCTAAGAATTAGGATGGATATTTTAACTAAAAAGAAGAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAATGTAGTGTTATTCTTAAGAATGG
GCTACCAACCAAGGCTAAGGATCCAGGATCATTCACTATACCTATGTCAATAGGTATTGGTGAAGCTAGGCCTACTACAGTCACACTCCAATTAGCTGATAAGTCTATCA
CATATCTAGAGGGAAAAATTGAGGATGTCTTAGTAAAGGCCATGAAGTACCCAGACGAAATGGAAGATTGCTCTTTCATTAGGATTCTGGAGAACACAATTGTTGAGACA
ACAATTCAGAATTCGGCTAACAAGCATTTGGAAGATCATAGAGAGATTAGTGTAGAAGATTTAGAAGACCAGAGCGTCTCGACTCTAGGTGATCAGCGAAGAACGCTCCA
AAATCGTGGTAGCGTCTCGACGCTGTCTGAGCAGCGTCTCGACGCTGCTACGACTTCTGCCCCAAGACAACCTGACGTCACAGCGTCGAGACGCTGTGACGTAGCGTCTC
GATGCTGTCGGCTTGTTGAGCAAGAAACCCCTCCGTCCAGCACCTATAAAAAGCCCCCTCTCCCAAGGTTCAAATCATCCCATTTTTTGGGGAGCTCTCCCCTAGCTCAA
TAG
Protein sequenceShow/hide protein sequence
MSMYALTNAESLAASRRCGGVSLVHERSAWGTGPEDPANPQQNRLLQQNQLLEQNEQQNNQAENPILVANDRVRAIRAYVFPMFDELNLGIARPQIEAANFEIKPVMFQM
LQTVGVSKDALRLTLFPYSLRDEAKAWLNSFAPGSIRTWNELAEKLLSKYFPPNRNAKLRSKIVGFRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVT
QGMVDVSTQGALLAKTFNEAHEILERISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMISNALKNVTVISHQQPPVVEPAALVNQVAEEACVYCGENHNYEFCPNN
PTSVFFVANELKARPQGKLPSDTEHPRREGKEQVKAVTFRSGKPLEERKEPSKPQNVEISCDKNVVVEKELESGQGAGGSNNDAGASGSIPDVEPPYVPPHLMYHLYLFH
KGKGLRIRMDILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPMSIGIGEARPTTVTLQLADKSITYLEGKIEDVLVKAMKYPDEMEDCSFIRILENTIVET
TIQNSANKHLEDHREISVEDLEDQSVSTLGDQRRTLQNRGSVSTLSEQRLDAATTSAPRQPDVTASRRCDVASRCCRLVEQETPPSSTYKKPPLPRFKSSHFLGSSPLAQ