; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038205 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038205
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRibonuclease H
Genome locationchr2:13777718..13780550
RNA-Seq ExpressionLag0038205
SyntenyLag0038205
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFS38726.1 hypothetical protein Acr_00g0059080 [Actinidia rufa]8.1e-8240Show/hide
Query:  LESLIGQADPPFVDEIMQAEVPHKFKLPT-FPQYDGKKDPVQHLDTYRSWMGFHGASEATKCRAFSLTLTGTAQQWYGKLPPKSVDSFKELSRLFATQFL
        +++L+ Q DPPF + +++  +  KFKLPT    Y+GK DP+ HLD+Y+S M   G S+   C+AFS TL G A+ W+ KL P +VDSF ELSRLF   F+
Subjt:  LESLIGQADPPFVDEIMQAEVPHKFKLPT-FPQYDGKKDPVQHLDTYRSWMGFHGASEATKCRAFSLTLTGTAQQWYGKLPPKSVDSFKELSRLFATQFL

Query:  GARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRTYTEFVARAQKYINAEELMKSKRAEREAQRVT
          R+R+K   +L TI Q+  ESL  ++ RF+  V+ VE   D V + A++ GL+   L +S+ ++ P T +   ++A KYI AEEL ++KR  R      
Subjt:  GARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRTYTEFVARAQKYINAEELMKSKRAEREAQRVT

Query:  TIDKGRRKEERGKRSREEDEGRSRLKYSVGRGQQDQKEGRGRPEYKGKYERDGYLKEFVDGEESKKPRPTRGDGKEEGNDPPL-EIKTILGG-PVGGDSN
          +  +RKE   +R    +E R +      R  QD K    R         +  L            RP+  + K   N P    I+TI GG   GG S 
Subjt:  TIDKGRRKEERGKRSREEDEGRSRLKYSVGRGQQDQKEGRGRPEYKGKYERDGYLKEFVDGEESKKPRPTRGDGKEEGNDPPL-EIKTILGG-PVGGDSN

Query:  RKRKAAAREARYDLEERRVYSVR--LSKGLPSVEFTELEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLRTSAAPLVG
          RK  AR A   +EE  VY++    +   P + F+  +  G+H PH+DALVVS  +AN  V RIL+D GSSADIL    FE MK+G ++L     PL+G
Subjt:  RKRKAAAREARYDLEERRVYSVR--LSKGLPSVEFTELEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLRTSAAPLVG

Query:  FGGEKVLPEGSIELPVTYGEGQNVVTKMVNFLVVNCVSAYNAILGRPALHELKAVASTYHQMMKFPTDSGIGVVRGEQQASRECYYTALR
        FGG    P G I LP+T G      T   +F+VV+C S YN+ILGRP L  +KA+ STYH  MKFPT +GIG V+G+Q+ +R+C+ +A++
Subjt:  FGGEKVLPEGSIELPVTYGEGQNVVTKMVNFLVVNCVSAYNAILGRPALHELKAVASTYHQMMKFPTDSGIGVVRGEQQASRECYYTALR

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]4.5e-8036.51Show/hide
Query:  PNPGTTYKRSSARDQGRKRGASGEEEDETDSATSKLRQPRGGEKPVLKEPGPSRGAERKGALDVPDEVSTVGSHRRIEAEAEAEAEAI--AKAKMQEKVE
        P  GT+ + +  R +G     +    +      + L      E P++++P   +G       D  +  ++VGS  RI              K K Q K  
Subjt:  PNPGTTYKRSSARDQGRKRGASGEEEDETDSATSKLRQPRGGEKPVLKEPGPSRGAERKGALDVPDEVSTVGSHRRIEAEAEAEAEAI--AKAKMQEKVE

Query:  LEAKIRVELEGKLRAEAEAAAKAKAEAEQSKVRAKEGTQGSTRPRDADRDY--------LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDPVQ
                      + A    K+  +   S+  + +  + + RP  +++ +        LE L+ QAD PF +EIM+ +VP KFKLPT  Q+D   DPV 
Subjt:  LEAKIRVELEGKLRAEAEAAAKAKAEAEQSKVRAKEGTQGSTRPRDADRDY--------LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDPVQ

Query:  HLDTYRSWMGFHGASEATKCRAFSLTLTGTAQQWYGKLPPKSVDSFKELSRLFATQFLGARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDD
        HLD YR WM  +G SEA +CR FS TL G+A+ W+ +L   S+ SFK L+R F TQF+G R R +P   LLTIKQR  ESL  Y+ RF+ E +QVEG  D
Subjt:  HLDTYRSWMGFHGASEATKCRAFSLTLTGTAQQWYGKLPPKSVDSFKELSRLFATQFLGARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDD

Query:  AVALTAVIAGLQDERLLNSVGEDQPRTYTEFVARAQKYINAEELMKS------KRAEREAQRVTTIDKGRRKEERGKRSREEDEGRSRLKYS--------
        AV+L A ++G++DE L  S G+  P T++E ++RAQ+Y++A E   S      KR + + +R     +G R E+R  RS ++D  R   KY+        
Subjt:  AVALTAVIAGLQDERLLNSVGEDQPRTYTEFVARAQKYINAEELMKS------KRAEREAQRVTTIDKGRRKEERGKRSREEDEGRSRLKYS--------

Query:  VGRGQQDQK--------EGRGRPEYKGKY------------------------ERDGYLKEFVDGEESKKPRPTRGDGKEEGNDPPLEIKTILGGPVGGD
        V    +DQ+        +       KG+Y                         R GYLKE+V+     +P+ T+    E    P  EI+TI+GGP+  +
Subjt:  VGRGQQDQK--------EGRGRPEYKGKY------------------------ERDGYLKEFVDGEESKKPRPTRGDGKEEGNDPPLEIKTILGGPVGGD

Query:  SNRKRKAAAREARYDLEERRVYSVRLSKGLPSVEFTELEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLRTSAAPLVG
        S RKRKA  REAR   E+  VY    +    ++EF+E EA+ +  PHNDALV+ L +AN +VHR+LVDGGSSADI+S   ++AM L ++ L+ S APLVG
Subjt:  SNRKRKAAAREARYDLEERRVYSVRLSKGLPSVEFTELEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLRTSAAPLVG

Query:  FGGEKVLPEGSIELPVTYG
        FG E+V+PEG IELPVT+G
Subjt:  FGGEKVLPEGSIELPVTYG

XP_024041095.1 uncharacterized protein LOC112098853 [Citrus clementina]4.6e-9338.55Show/hide
Query:  LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDPVQHLDTYRSWMGFHGASEATKCRAFSLTLTGTAQQWYGKLPPKSVDSFKELSRLFATQFLG
        L+ +  + +PPF  +IM A+ P +F LP    YDG++DP +HL+ YR+ M   GAS+A  CRAF LTL G A++W+ +L P S+ SF +LSR F + F  
Subjt:  LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDPVQHLDTYRSWMGFHGASEATKCRAFSLTLTGTAQQWYGKLPPKSVDSFKELSRLFATQFLG

Query:  ARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRTYTEFVARAQKYINAEELMKS------------
        AR R KP   LLT+KQ+ GE+L  YI R++NE+ QV+GYDD +AL+ ++ GL+  +L  SV +  P +Y+E +ARA+KY NAEE  K+            
Subjt:  ARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRTYTEFVARAQKYINAEELMKS------------

Query:  -KRAEREAQRVTTIDK---------GRRKEERGKRSREEDEGRSRLKYSVGRGQ----------------QDQKEGRGRP------------------EY
         K+ ++  +RV   D+         G R E R  R R   +  +  + +  R Q                      R  P                  E 
Subjt:  -KRAEREAQRVTTIDK---------GRRKEERGKRSREEDEGRSRLKYSVGRGQ----------------QDQKEGRGRP------------------EY

Query:  KGKYE---RDGYLKEF-------VDGEESKKPRPTRGDGKEEGNDPPLEIKTILGGPVGGDSNRKRKAAAREARYDLEERRVYSVRLS-----KGLPSVE
        K + E   R G L+E+       V  E+ +  +  +G  KE+ ++   ++  I GGP  GDS + RK  AR+AR++     V     S     +G+P + 
Subjt:  KGKYE---RDGYLKEF-------VDGEESKKPRPTRGDGKEEGNDPPLEIKTILGGPVGGDSNRKRKAAAREARYDLEERRVYSVRLS-----KGLPSVE

Query:  FTELEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLRTSAAPLVGFGGEKVLPEGSIELPVTYGEGQNVVTKMVNFLVV
        F+E +A G+H PH DALVV+L VAN R+HRIL+D GSSADIL    F  M L + +L+    PL GF G  V+PEG IEL V++G+    VT MVNF+VV
Subjt:  FTELEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLRTSAAPLVGFGGEKVLPEGSIELPVTYGEGQNVVTKMVNFLVV

Query:  NCVSAYNAILGRPALHELKAVASTYHQMMKFPTDSGIGVVRGEQQASRECYYTALRGASQQAR----QHGSAQDEGRGRPV
        +  S+YNA+LGRP L+ LKA  S YH  +KFPT++G+GVVRGEQ+ +RECY  A R   +  +       + +D+ R  PV
Subjt:  NCVSAYNAILGRPALHELKAVASTYHQMMKFPTDSGIGVVRGEQQASRECYYTALRGASQQAR----QHGSAQDEGRGRPV

XP_024047974.1 uncharacterized protein LOC112101548 [Citrus clementina]1.3e-8237.44Show/hide
Query:  ESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDPVQHLDTYRSWMGFHGASEATKCRAFSLTLTGTAQQWYGKLPPKSVDSFKELSRLFATQFLGA
        + ++ +++PPF  EIM+A  P  F+LP+   YDG+K P++H++ YRS M   G S A  CRAF LTL+  A++W+  L P S+ SF EL R F   F  A
Subjt:  ESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDPVQHLDTYRSWMGFHGASEATKCRAFSLTLTGTAQQWYGKLPPKSVDSFKELSRLFATQFLGA

Query:  RDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRTYTEFVARAQKYINAEELMKSKRAEREAQ-----
        R R KP   LLT+KQ  GESL  YI R++ E  QV+GYDD VAL+ ++ GLQ  RL  SV ++ P TY+E ++RA+KY NAEE  +SK+   + +     
Subjt:  RDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRTYTEFVARAQKYINAEELMKSKRAEREAQ-----

Query:  ---------RVTTIDKGRRKEE-RGKRSREEDEGRSR---------------------LKYSV----------GRGQQDQKE--------GRGRPE---Y
                 R    D+  RK++ R        E RSR                     +K S            R +++QK+        G    E    
Subjt:  ---------RVTTIDKGRRKEE-RGKRSREEDEGRSR---------------------LKYSV----------GRGQQDQKE--------GRGRPE---Y

Query:  KGKYE---RDGYLKEFVDGEESKK-------PRPTRGDGKEEGNDPPLE-IKTILGGPVGGDSNRKRKAAAREARYDLEERRVYSVRLSKGLPS------
        K + E   R   L+ +V G+ S++        R  +G  KE  +D  +  +  I GGP  G+S + RK  AR+AR+   E     + L+  L +      
Subjt:  KGKYE---RDGYLKEFVDGEESKK-------PRPTRGDGKEEGNDPPLE-IKTILGGPVGGDSNRKRKAAAREARYDLEERRVYSVRLSKGLPS------

Query:  -VEFTELEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLRTSAAPLVGFGGEKVLPEGSIELPVTYGEGQNVVTKMVNF
         + F+E +  GIH PH DA+VV+L VAN R+HRIL+D GSSADIL    F  M L + +L      L GF G  V+PEG IEL V++G+    VT MV F
Subjt:  -VEFTELEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLRTSAAPLVGFGGEKVLPEGSIELPVTYGEGQNVVTKMVNF

Query:  LVVNCVSAYNAILGRPALHELKAVASTYHQMMKFPTDSGIGVVRGEQQASRECYYTALRGASQQ----ARQHGSAQDEGRGRPVQ
        +VV+  SAYN++LGRP L+ +KA  S YH  +KFPT++ IGVVRG Q+ +RECY  + +  ++     +    + +D+ R  PV+
Subjt:  LVVNCVSAYNAILGRPALHELKAVASTYHQMMKFPTDSGIGVVRGEQQASRECYYTALRGASQQ----ARQHGSAQDEGRGRPVQ

XP_030924688.1 uncharacterized protein LOC115951668 [Quercus lobata]2.6e-8036.67Show/hide
Query:  LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDPVQHLDTYRSWMGFHGASEATKCRAFSLTLTGTAQQWYGKLPPKSVDSFKELSRLFATQFLG
        L+ L+ + D PF   +  + +P KF++P    YDG KDP+ HL+T+++ M   G  +   CRAF  TL G A+ W+ +L P S+++FKELS  F + F+G
Subjt:  LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDPVQHLDTYRSWMGFHGASEATKCRAFSLTLTGTAQQWYGKLPPKSVDSFKELSRLFATQFLG

Query:  ARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRTYTEFVARAQKYINAEELMKSKRAEREAQRVTT
            K+    L++IKQR  E+L  YIT F+ E + ++  DD + + A  +GL+  + L S+ ++ P+T  + + RA KY+NAE+ + +            
Subjt:  ARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRTYTEFVARAQKYINAEELMKSKRAEREAQRVTT

Query:  IDKGRRKEERGKRSREEDEGRSRLKYSVGRGQQDQKEGRGRPEYKGKYE--------RDGYLKEFVDGEESKKPRPTRGDGKEEGNDPPL-EIKTILGGP
             R+E+  KR R+ED  + R +  V R    + E R RP  +              G L+ FV  E++ KP       + E   PP+ +I+ I+GG 
Subjt:  IDKGRRKEERGKRSREEDEGRSRLKYSVGRGQQDQKEGRGRPEYKGKYE--------RDGYLKEFVDGEESKKPRPTRGDGKEEGNDPPL-EIKTILGGP

Query:  VGGDSNRKRKAAAREARYDLEERRVYSVRLSKGLPSVEFTELEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLRTSAA
        V   S++K +       + ++           G P + F+E +A  +H PH+DALVVSL + +  +HR+LVD GSS DIL    F+ M++ +E+L  + A
Subjt:  VGGDSNRKRKAAAREARYDLEERRVYSVRLSKGLPSVEFTELEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLRTSAA

Query:  PLVGFGGEKVLPEGSIELPVTYGEGQNVVTKMVNFLVVNCVSAYNAILGRPALHELKAVASTYHQMMKFPTDSGIGVVRGEQQASRECYYTALRGASQQ
        PLVGFGG K+ P G+I L VT  +    +TK V FLV++C  AYNAILGRP L   K V STYH M+KF T+ GIG +RG Q  +R+CY   +    QQ
Subjt:  PLVGFGGEKVLPEGSIELPVTYGEGQNVVTKMVNFLVVNCVSAYNAILGRPALHELKAVASTYHQMMKFPTDSGIGVVRGEQQASRECYYTALRGASQQ

TrEMBL top hitse value%identityAlignment
A0A2N9FDC4 Ribonuclease H3.1e-7935.84Show/hide
Query:  SHRRIEAEAEAEAEAIAKAKMQEKVELEAKIRVELEGKLRAEAEAAAKAKAEAEQSKVRAKEGTQGSTRPRDADRDYLESLIGQADPPFVDEIMQAEVPH
        S  R +AE+  ++ + +  K   K  ++ +   E +G+        + AK E E   ++A+ G         A R+ L++L+ + D PF ++++   +P 
Subjt:  SHRRIEAEAEAEAEAIAKAKMQEKVELEAKIRVELEGKLRAEAEAAAKAKAEAEQSKVRAKEGTQGSTRPRDADRDYLESLIGQADPPFVDEIMQAEVPH

Query:  KFKLPTFPQYDGKKDPVQHLDTYRSWMGFHGASEATKCRAFSLTLTGTAQQWYGKLPPKSVDSFKELSRLFATQFLGARDRKKPQFNLLTIKQRPGESLN
        KF++P+   +DG KDP+ HL+++++ M   G  +   CRAF  TL G A+ W+ KL P SV SF +LSR F   F+G +   +P  +LL ++QR GE+L 
Subjt:  KFKLPTFPQYDGKKDPVQHLDTYRSWMGFHGASEATKCRAFSLTLTGTAQQWYGKLPPKSVDSFKELSRLFATQFLGARDRKKPQFNLLTIKQRPGESLN

Query:  GYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRTYTEFVARAQKYINAEELMKSKRAEREAQRVTTIDKG-RRKEERGKRSREEDEGRS
         Y+TRF+ E + V+G DD V LTA I+GLQ    L SV +D P + +E +  AQ+Y+N E+ ++++            D G +R+ +   RS E  E + 
Subjt:  GYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRTYTEFVARAQKYINAEELMKSKRAEREAQRVTTIDKG-RRKEERGKRSREEDEGRS

Query:  RLKYSVGRGQQDQKEGRGRPE----------------YKGKYE---RDGYLKEFV--DGEESKKP--RPTRGDGKEEGNDPPLEIKTILGG-PVGGDSNR
        +++ + G+ ++D+   RG  E                 K + E   + G L+ FV  D  E + P  RP R   ++    P  EI  I GG   GG S  
Subjt:  RLKYSVGRGQQDQKEGRGRPE----------------YKGKYE---RDGYLKEFV--DGEESKKP--RPTRGDGKEEGNDPPLEIKTILGG-PVGGDSNR

Query:  KRKAAAREARYDLEERRVYSVRLSKGLPSVEFTELEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLRTSAAPLVGFGG
         RKA AR+    L  ++       + LP + FTE +A  +  PH+DALVV+L +A     R+L+D GSSADI+    F+ M++GK++LR    PLVGF G
Subjt:  KRKAAAREARYDLEERRVYSVRLSKGLPSVEFTELEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLRTSAAPLVGFGG

Query:  EKVLPEGSIELPVTYGEGQNVVTKMVNFLVVNCVSAYNAILGRPALHELKAVASTYHQMMKFPTDSGIGVVRGEQQASRECYYTAL
          V P G I L +  G      TK V FLVV+C SAYN I+GRP L+ L+AV STYH +++FPT+ GIG ++G+Q  +RECY T++
Subjt:  EKVLPEGSIELPVTYGEGQNVVTKMVNFLVVNCVSAYNAILGRPALHELKAVASTYHQMMKFPTDSGIGVVRGEQQASRECYYTAL

A0A2N9G7K4 Uncharacterized protein5.7e-8136.85Show/hide
Query:  AKAEAEQSKVRAKEGTQGSTRPRDADRDYLESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDPVQHLDTYRSWMGFHGASEATKCRAFSLTLTGT
        A+ E E  ++R + G   ++    A R+ L++L+ +AD PF+  I    +P +FK+P    +DG KDP  +L+ +++ M      E   CRAF L L G+
Subjt:  AKAEAEQSKVRAKEGTQGSTRPRDADRDYLESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDPVQHLDTYRSWMGFHGASEATKCRAFSLTLTGT

Query:  AQQWYGKLPPKSVDSFKELSRLFATQFLGARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRTYTE
        A+ W+ KL  +S+ SF +LSR F   F+G++ R +P  +LL++KQ  GESL  ++ RF+ E ++++   + V +TAV+AGL+    L  + +D P T +E
Subjt:  AQQWYGKLPPKSVDSFKELSRLFATQFLGARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRTYTE

Query:  FVARAQKYINAEELMKSKRAEREAQRVTTIDKGRRKEERGKRSREEDEGRSRLKYSVGRGQQDQ----KEGRGRPEYKGKYERDGYLKEFVDGEESKKPR
         +  A K++NAE+ +++         +      RRK+   ++ R +   R +  Y   R  +D     +E     E      R G L+++V+   + +P 
Subjt:  FVARAQKYINAEELMKSKRAEREAQRVTTIDKGRRKEERGKRSREEDEGRSRLKYSVGRGQQDQ----KEGRGRPEYKGKYERDGYLKEFVDGEESKKPR

Query:  --PTRGDGKEEGNDPPL-EIKTILGGPV-GGDSNRKRKAAAREARYDLEERRVYSVRLSKGLPSVEFTELEASGIHQPHNDALVVSLTVANTRVHRILVD
          P + +  E     P+ EI+TI+GGP  GG S   RKA AR+                     + F+E +A G HQPH+DALV+ + +A   + R++VD
Subjt:  --PTRGDGKEEGNDPPL-EIKTILGGPV-GGDSNRKRKAAAREARYDLEERRVYSVRLSKGLPSVEFTELEASGIHQPHNDALVVSLTVANTRVHRILVD

Query:  GGSSADILSTKVFEAMKLGKERLRTSAAPLVGFGGEKVLPEGSIELPVTYGEGQNVVTKMVNFLVVNCVSAYNAILGRPALHELKAVASTYHQMMKFPTD
         GSSADIL    ++ M+L + +LR   APLVGF G+KV P G + LP+  G     V+K V+FLV+NC SAYNAI+GRP L+ L+AV STYH ++KFPT+
Subjt:  GGSSADILSTKVFEAMKLGKERLRTSAAPLVGFGGEKVLPEGSIELPVTYGEGQNVVTKMVNFLVVNCVSAYNAILGRPALHELKAVASTYHQMMKFPTD

Query:  SGIGVVRGEQQASRECYYTAL
         G+G VRG+Q A+RECY T+L
Subjt:  SGIGVVRGEQQASRECYYTAL

A0A6J1DWY0 uncharacterized protein LOC1110252931.7e-8036.67Show/hide
Query:  PNPGTTYKRSSARDQGRKRGASGEEEDETDSATSKLRQPRGGEKPVLKEPGPSRGAERKGALDVPDEVSTVGSHRRIEAEAEAEAEAI--AKAKMQEKVE
        P  GT+ + +  R +G     +    +      + L      E P++++P   +G       D  +  ++VGS  RI              K K Q K  
Subjt:  PNPGTTYKRSSARDQGRKRGASGEEEDETDSATSKLRQPRGGEKPVLKEPGPSRGAERKGALDVPDEVSTVGSHRRIEAEAEAEAEAI--AKAKMQEKVE

Query:  LEAKIRVELEGKLRAEAEAAAKAKAEAEQSKVRAKEGTQGSTRPRDADRDY--------LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDPVQ
                      + A    K+  +   S+  + +  + + RP  +++ +        LE L+ QAD PF +EIM+ +VP KFKLPT  Q+D   DPV 
Subjt:  LEAKIRVELEGKLRAEAEAAAKAKAEAEQSKVRAKEGTQGSTRPRDADRDY--------LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDPVQ

Query:  HLDTYRSWMGFHGASEATKCRAFSLTLTGTAQQWYGKLPPKSVDSFKELSRLFATQFLGARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDD
        HLD YR WM  +G SEA +CR FS TL G+A+ W+ +L   S+ SFK L+R F TQF+G R R +P   LLTIKQR  ESL  Y+ RF+ E +QVEG  D
Subjt:  HLDTYRSWMGFHGASEATKCRAFSLTLTGTAQQWYGKLPPKSVDSFKELSRLFATQFLGARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDD

Query:  AVALTAVIAGLQDERLLNSVGEDQPRTYTEFVARAQKYINAEELMKS------KRAEREAQRVTTIDKGRRKEERGKRSREEDEGRSRLKYS--------
        AV+L A ++G++DE L  S G+  P T++E ++RAQ+Y++A E   S      KR + + +R     +G R E+R  RS ++D  R   KY+        
Subjt:  AVALTAVIAGLQDERLLNSVGEDQPRTYTEFVARAQKYINAEELMKS------KRAEREAQRVTTIDKGRRKEERGKRSREEDEGRSRLKYS--------

Query:  VGRGQQDQK--------EGRGRPEYKGKY------------------------ERDGYLKEFVDGEESKKPRPTRGDGKEEGNDPPLEIKTILGGPVGGD
        V    +DQ+        +       KG+Y                         R GYLKE+V+     +P+ T+    E    P  EI+TI+GGP+  +
Subjt:  VGRGQQDQK--------EGRGRPEYKGKY------------------------ERDGYLKEFVDGEESKKPRPTRGDGKEEGNDPPLEIKTILGGPVGGD

Query:  SNRKRKAAAREARYDLEERRVYSVRLSKGLPSVEFTELEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLRTSAAPLVG
        S RKRKA  REAR   E+  VY    +    ++EF+E EA+ +  PHNDALV+ L +AN +VHR+LVDGGSSADILS   ++AM L ++ L+ S APLVG
Subjt:  SNRKRKAAAREARYDLEERRVYSVRLSKGLPSVEFTELEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLRTSAAPLVG

Query:  FGGEKVLPEGSIELPVTYG
        FG E+V+PEG IELPVT+G
Subjt:  FGGEKVLPEGSIELPVTYG

A0A7J0DNL3 Retrotrans_gag domain-containing protein3.9e-8240Show/hide
Query:  LESLIGQADPPFVDEIMQAEVPHKFKLPT-FPQYDGKKDPVQHLDTYRSWMGFHGASEATKCRAFSLTLTGTAQQWYGKLPPKSVDSFKELSRLFATQFL
        +++L+ Q DPPF + +++  +  KFKLPT    Y+GK DP+ HLD+Y+S M   G S+   C+AFS TL G A+ W+ KL P +VDSF ELSRLF   F+
Subjt:  LESLIGQADPPFVDEIMQAEVPHKFKLPT-FPQYDGKKDPVQHLDTYRSWMGFHGASEATKCRAFSLTLTGTAQQWYGKLPPKSVDSFKELSRLFATQFL

Query:  GARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRTYTEFVARAQKYINAEELMKSKRAEREAQRVT
          R+R+K   +L TI Q+  ESL  ++ RF+  V+ VE   D V + A++ GL+   L +S+ ++ P T +   ++A KYI AEEL ++KR  R      
Subjt:  GARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRTYTEFVARAQKYINAEELMKSKRAEREAQRVT

Query:  TIDKGRRKEERGKRSREEDEGRSRLKYSVGRGQQDQKEGRGRPEYKGKYERDGYLKEFVDGEESKKPRPTRGDGKEEGNDPPL-EIKTILGG-PVGGDSN
          +  +RKE   +R    +E R +      R  QD K    R         +  L            RP+  + K   N P    I+TI GG   GG S 
Subjt:  TIDKGRRKEERGKRSREEDEGRSRLKYSVGRGQQDQKEGRGRPEYKGKYERDGYLKEFVDGEESKKPRPTRGDGKEEGNDPPL-EIKTILGG-PVGGDSN

Query:  RKRKAAAREARYDLEERRVYSVR--LSKGLPSVEFTELEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLRTSAAPLVG
          RK  AR A   +EE  VY++    +   P + F+  +  G+H PH+DALVVS  +AN  V RIL+D GSSADIL    FE MK+G ++L     PL+G
Subjt:  RKRKAAAREARYDLEERRVYSVR--LSKGLPSVEFTELEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLRTSAAPLVG

Query:  FGGEKVLPEGSIELPVTYGEGQNVVTKMVNFLVVNCVSAYNAILGRPALHELKAVASTYHQMMKFPTDSGIGVVRGEQQASRECYYTALR
        FGG    P G I LP+T G      T   +F+VV+C S YN+ILGRP L  +KA+ STYH  MKFPT +GIG V+G+Q+ +R+C+ +A++
Subjt:  FGGEKVLPEGSIELPVTYGEGQNVVTKMVNFLVVNCVSAYNAILGRPALHELKAVASTYHQMMKFPTDSGIGVVRGEQQASRECYYTALR

A0A7N2N9G0 Reverse transcriptase1.4e-7936.43Show/hide
Query:  LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDPVQHLDTYRSWMGFHGASEATKCRAFSLTLTGTAQQWYGKLPPKSVDSFKELSRLFATQFLG
        L+ L+ + D PF   +    +P KF++P    YDG KDP+ HL+T+++ M   G ++A  CRAF  TL G A+ W+ +L P S+ +FKELS  F   F+G
Subjt:  LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDPVQHLDTYRSWMGFHGASEATKCRAFSLTLTGTAQQWYGKLPPKSVDSFKELSRLFATQFLG

Query:  ARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRTYTEFVARAQKYINAEELM---KSKRAEREAQR
            KK    L+++KQR  E+L  YI+RF+ E + V+  DD + + A   GL+  + L S+ ++ P+T +E + RA KY+NAE+ +   + K  +RE Q 
Subjt:  ARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRTYTEFVARAQKYINAEELM---KSKRAEREAQR

Query:  VTTIDKGRRKEERGKRSREE-------------------DEGRSRLKYS-----VGRGQQDQKEGRGRPEY-----------------KGKYE---RDGY
            D+GR+K   G R  E                    D+   ++K        G+ + D  + R R +Y                 K + E   R G 
Subjt:  VTTIDKGRRKEERGKRSREE-------------------DEGRSRLKYS-----VGRGQQDQKEGRGRPEY-----------------KGKYE---RDGY

Query:  LKEFVDGEESKKPRPTRGDGKE-EGNDPPL-EIKTILGGPVGGDSNRKRKAAAREARYDLEERRVYSVRLSKGLPSVEFTELEASGIHQPHNDALVVSLT
        L+ FV  E +  P   +   ++ E   PP+ +I+ I+GG     S++K +        +++   V      +  P + F+E +A   H PH+DALVVSL 
Subjt:  LKEFVDGEESKKPRPTRGDGKE-EGNDPPL-EIKTILGGPVGGDSNRKRKAAAREARYDLEERRVYSVRLSKGLPSVEFTELEASGIHQPHNDALVVSLT

Query:  VANTRVHRILVDGGSSADILSTKVFEAMKLGKERLRTSAAPLVGFGGEKVLPEGSIELPVTYGEGQNVVTKMVNFLVVNCVSAYNAILGRPALHELKAVA
        V +  +HR+LVD GSSADIL    F+ M++ +ERL  + APLVGFGG +V P G++ L V  G+    + + V FLVV+C SAYNAILGRP L+  KAV 
Subjt:  VANTRVHRILVDGGSSADILSTKVFEAMKLGKERLRTSAAPLVGFGGEKVLPEGSIELPVTYGEGQNVVTKMVNFLVVNCVSAYNAILGRPALHELKAVA

Query:  STYHQMMKFPTDSGIGVVRGEQQASRECYYTALRGASQ
        STYH M+KFPTD G+G +RG Q A+RECY   +    Q
Subjt:  STYHQMMKFPTDSGIGVVRGEQQASRECYYTALRGASQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGAGGCCGAGCGTCTGGTCGGCCTCGGCCCAATGCCGAAACCGACCAGGCCAATTATTCCCACTGGCCGTTTGGGCTGCCCGGGTCAATCCCCTTGGGCCCAGTAG
CAGGAGGCTACGTGCGGTGGACAATGCGGAAAGGGAAAAAACCCTAGCGAGAGAGGTTTTCTGGACGGGGAGACCGGCCGGAGGTTTTCCGATCACTGACTTAGGCATCG
GAGCCTGTGTGGCGAGCACCACACCGATTTTGGCATCAACAGTTGGCGCCGTCTGTGGGGAAGTGTGTCGGATTGAGAGCATTGTAGGCTGGTGCAACAGGGTAATGGAA
CAGGAGAATCCAGCGGGCGAGAGAGGTCAGCAGTCCAAGGTGAGTGCCCCGGGCACTGAGATGGAGGACCTTAGAGGAAGGGTGAACGAGATGGGCCAGAGTTTGGCTGA
GATTCTGGGTATATTGAAGCAACCGAATCCTGGTACGACGTACAAGAGAAGTTCCGCGCGGGACCAAGGGAGGAAGAGAGGGGCCTCCGGCGAAGAGGAAGATGAAACGG
ATAGTGCCACCAGCAAGCTGCGTCAGCCAAGGGGTGGTGAAAAACCTGTCTTGAAGGAGCCAGGGCCGAGCAGAGGGGCAGAGCGCAAGGGTGCACTCGATGTCCCAGAT
GAGGTTAGCACAGTAGGTTCGCACAGGAGGATCGAAGCCGAGGCCGAGGCCGAGGCCGAGGCCATAGCCAAAGCCAAGATGCAGGAAAAGGTCGAGCTCGAGGCAAAGAT
CAGGGTTGAGCTAGAAGGTAAGCTGAGGGCCGAGGCCGAAGCTGCGGCCAAGGCAAAGGCCGAGGCCGAGCAGTCAAAGGTTAGGGCCAAGGAGGGCACCCAGGGAAGCA
CACGCCCTAGGGATGCGGACAGGGATTATTTGGAAAGTTTGATAGGGCAGGCTGACCCACCCTTTGTTGATGAAATTATGCAGGCCGAGGTTCCACATAAGTTCAAATTA
CCGACTTTCCCGCAGTACGACGGGAAGAAAGATCCGGTGCAACATTTGGACACCTACCGATCTTGGATGGGATTTCATGGGGCATCCGAGGCGACTAAGTGCCGGGCGTT
TTCGTTAACCCTAACCGGAACGGCACAGCAGTGGTACGGTAAACTGCCACCTAAGTCTGTTGATTCATTCAAAGAATTGTCCCGCCTTTTTGCCACTCAGTTTTTGGGAG
CAAGGGATCGGAAGAAGCCGCAGTTCAATTTGTTGACCATAAAGCAAAGGCCAGGGGAGAGCTTGAATGGGTATATCACAAGGTTCAGTAACGAGGTTGTGCAGGTAGAG
GGGTATGACGATGCAGTGGCGCTAACCGCAGTTATCGCCGGGTTACAGGATGAAAGATTGTTGAATTCCGTGGGGGAAGACCAGCCACGAACGTACACGGAATTCGTGGC
CAGAGCACAAAAATACATAAACGCAGAGGAGCTGATGAAGTCCAAGCGGGCGGAAAGGGAAGCGCAGAGGGTAACCACCATCGATAAAGGCAGACGAAAGGAGGAGAGGG
GCAAACGGTCGCGTGAAGAGGACGAGGGCCGGAGCCGTCTCAAGTATTCTGTGGGTCGGGGCCAGCAAGACCAGAAGGAAGGCCGAGGCCGACCAGAATACAAAGGAAAA
TATGAGAGGGACGGATATCTGAAAGAATTTGTGGACGGTGAGGAGAGCAAAAAGCCGAGGCCGACCAGAGGTGATGGCAAGGAGGAGGGCAATGACCCCCCACTGGAAAT
TAAAACAATTTTGGGAGGACCCGTAGGAGGAGACTCGAACCGGAAAAGGAAAGCAGCAGCTAGGGAGGCAAGGTATGATCTTGAGGAAAGAAGGGTGTATTCGGTACGGC
TCTCAAAGGGGTTGCCGTCAGTAGAGTTCACCGAGCTGGAAGCTTCCGGCATACACCAGCCTCACAATGATGCACTAGTGGTGAGCCTGACAGTTGCCAACACGCGGGTT
CATCGGATTTTGGTTGATGGAGGAAGTTCAGCAGACATCCTGTCCACGAAGGTGTTTGAGGCCATGAAATTGGGGAAGGAACGGTTGAGGACGAGCGCTGCACCGCTGGT
TGGTTTTGGTGGAGAAAAGGTGCTACCAGAGGGAAGCATCGAGTTGCCGGTGACCTATGGGGAAGGACAGAATGTGGTGACGAAAATGGTTAATTTCTTGGTAGTAAATT
GTGTATCCGCCTATAACGCCATACTGGGAAGGCCAGCTCTGCATGAATTGAAAGCGGTGGCATCGACATACCACCAAATGATGAAGTTTCCCACTGATAGCGGAATAGGA
GTAGTGAGGGGAGAGCAACAAGCTTCACGAGAATGCTATTACACCGCGCTCAGAGGAGCAAGCCAGCAGGCAAGGCAGCACGGCTCGGCCCAGGACGAGGGCCGAGGCCG
ACCAGTCCAAGGCCGAGCAGAGGGATGCCCTTTGTCAATGGAATTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCGAGGCCGAGCGTCTGGTCGGCCTCGGCCCAATGCCGAAACCGACCAGGCCAATTATTCCCACTGGCCGTTTGGGCTGCCCGGGTCAATCCCCTTGGGCCCAGTAG
CAGGAGGCTACGTGCGGTGGACAATGCGGAAAGGGAAAAAACCCTAGCGAGAGAGGTTTTCTGGACGGGGAGACCGGCCGGAGGTTTTCCGATCACTGACTTAGGCATCG
GAGCCTGTGTGGCGAGCACCACACCGATTTTGGCATCAACAGTTGGCGCCGTCTGTGGGGAAGTGTGTCGGATTGAGAGCATTGTAGGCTGGTGCAACAGGGTAATGGAA
CAGGAGAATCCAGCGGGCGAGAGAGGTCAGCAGTCCAAGGTGAGTGCCCCGGGCACTGAGATGGAGGACCTTAGAGGAAGGGTGAACGAGATGGGCCAGAGTTTGGCTGA
GATTCTGGGTATATTGAAGCAACCGAATCCTGGTACGACGTACAAGAGAAGTTCCGCGCGGGACCAAGGGAGGAAGAGAGGGGCCTCCGGCGAAGAGGAAGATGAAACGG
ATAGTGCCACCAGCAAGCTGCGTCAGCCAAGGGGTGGTGAAAAACCTGTCTTGAAGGAGCCAGGGCCGAGCAGAGGGGCAGAGCGCAAGGGTGCACTCGATGTCCCAGAT
GAGGTTAGCACAGTAGGTTCGCACAGGAGGATCGAAGCCGAGGCCGAGGCCGAGGCCGAGGCCATAGCCAAAGCCAAGATGCAGGAAAAGGTCGAGCTCGAGGCAAAGAT
CAGGGTTGAGCTAGAAGGTAAGCTGAGGGCCGAGGCCGAAGCTGCGGCCAAGGCAAAGGCCGAGGCCGAGCAGTCAAAGGTTAGGGCCAAGGAGGGCACCCAGGGAAGCA
CACGCCCTAGGGATGCGGACAGGGATTATTTGGAAAGTTTGATAGGGCAGGCTGACCCACCCTTTGTTGATGAAATTATGCAGGCCGAGGTTCCACATAAGTTCAAATTA
CCGACTTTCCCGCAGTACGACGGGAAGAAAGATCCGGTGCAACATTTGGACACCTACCGATCTTGGATGGGATTTCATGGGGCATCCGAGGCGACTAAGTGCCGGGCGTT
TTCGTTAACCCTAACCGGAACGGCACAGCAGTGGTACGGTAAACTGCCACCTAAGTCTGTTGATTCATTCAAAGAATTGTCCCGCCTTTTTGCCACTCAGTTTTTGGGAG
CAAGGGATCGGAAGAAGCCGCAGTTCAATTTGTTGACCATAAAGCAAAGGCCAGGGGAGAGCTTGAATGGGTATATCACAAGGTTCAGTAACGAGGTTGTGCAGGTAGAG
GGGTATGACGATGCAGTGGCGCTAACCGCAGTTATCGCCGGGTTACAGGATGAAAGATTGTTGAATTCCGTGGGGGAAGACCAGCCACGAACGTACACGGAATTCGTGGC
CAGAGCACAAAAATACATAAACGCAGAGGAGCTGATGAAGTCCAAGCGGGCGGAAAGGGAAGCGCAGAGGGTAACCACCATCGATAAAGGCAGACGAAAGGAGGAGAGGG
GCAAACGGTCGCGTGAAGAGGACGAGGGCCGGAGCCGTCTCAAGTATTCTGTGGGTCGGGGCCAGCAAGACCAGAAGGAAGGCCGAGGCCGACCAGAATACAAAGGAAAA
TATGAGAGGGACGGATATCTGAAAGAATTTGTGGACGGTGAGGAGAGCAAAAAGCCGAGGCCGACCAGAGGTGATGGCAAGGAGGAGGGCAATGACCCCCCACTGGAAAT
TAAAACAATTTTGGGAGGACCCGTAGGAGGAGACTCGAACCGGAAAAGGAAAGCAGCAGCTAGGGAGGCAAGGTATGATCTTGAGGAAAGAAGGGTGTATTCGGTACGGC
TCTCAAAGGGGTTGCCGTCAGTAGAGTTCACCGAGCTGGAAGCTTCCGGCATACACCAGCCTCACAATGATGCACTAGTGGTGAGCCTGACAGTTGCCAACACGCGGGTT
CATCGGATTTTGGTTGATGGAGGAAGTTCAGCAGACATCCTGTCCACGAAGGTGTTTGAGGCCATGAAATTGGGGAAGGAACGGTTGAGGACGAGCGCTGCACCGCTGGT
TGGTTTTGGTGGAGAAAAGGTGCTACCAGAGGGAAGCATCGAGTTGCCGGTGACCTATGGGGAAGGACAGAATGTGGTGACGAAAATGGTTAATTTCTTGGTAGTAAATT
GTGTATCCGCCTATAACGCCATACTGGGAAGGCCAGCTCTGCATGAATTGAAAGCGGTGGCATCGACATACCACCAAATGATGAAGTTTCCCACTGATAGCGGAATAGGA
GTAGTGAGGGGAGAGCAACAAGCTTCACGAGAATGCTATTACACCGCGCTCAGAGGAGCAAGCCAGCAGGCAAGGCAGCACGGCTCGGCCCAGGACGAGGGCCGAGGCCG
ACCAGTCCAAGGCCGAGCAGAGGGATGCCCTTTGTCAATGGAATTCTGA
Protein sequenceShow/hide protein sequence
MPRPSVWSASAQCRNRPGQLFPLAVWAARVNPLGPSSRRLRAVDNAEREKTLAREVFWTGRPAGGFPITDLGIGACVASTTPILASTVGAVCGEVCRIESIVGWCNRVME
QENPAGERGQQSKVSAPGTEMEDLRGRVNEMGQSLAEILGILKQPNPGTTYKRSSARDQGRKRGASGEEEDETDSATSKLRQPRGGEKPVLKEPGPSRGAERKGALDVPD
EVSTVGSHRRIEAEAEAEAEAIAKAKMQEKVELEAKIRVELEGKLRAEAEAAAKAKAEAEQSKVRAKEGTQGSTRPRDADRDYLESLIGQADPPFVDEIMQAEVPHKFKL
PTFPQYDGKKDPVQHLDTYRSWMGFHGASEATKCRAFSLTLTGTAQQWYGKLPPKSVDSFKELSRLFATQFLGARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVE
GYDDAVALTAVIAGLQDERLLNSVGEDQPRTYTEFVARAQKYINAEELMKSKRAEREAQRVTTIDKGRRKEERGKRSREEDEGRSRLKYSVGRGQQDQKEGRGRPEYKGK
YERDGYLKEFVDGEESKKPRPTRGDGKEEGNDPPLEIKTILGGPVGGDSNRKRKAAAREARYDLEERRVYSVRLSKGLPSVEFTELEASGIHQPHNDALVVSLTVANTRV
HRILVDGGSSADILSTKVFEAMKLGKERLRTSAAPLVGFGGEKVLPEGSIELPVTYGEGQNVVTKMVNFLVVNCVSAYNAILGRPALHELKAVASTYHQMMKFPTDSGIG
VVRGEQQASRECYYTALRGASQQARQHGSAQDEGRGRPVQGRAEGCPLSMEF