CuGenDBv2

Lsi04G011930 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi04G011930
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr04:17989640..17991781
RNA-Seq Expression: Lsi04G011930
Synteny: Lsi04G011930
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
                  IPR021109 - Aspartic peptidase domain superfamily


Homology
GenBank top hits | e value | % identity (alignment below each hit)
KAA0035961.1 uncharacterized protein E6C27_scaffold56G001660 [Cucumis melo var. makuwa] | 7.3e-80 | 45.41
Query:  NVEITSAAKEMIEELGRMFGREIAALSKEVANLRKFVEGEVHELYKELDNTRAECSKN--CNASSSASTSATSA------TPCP----------------
        NVEI++AAKEMI+ LG   G+E+  +  EV NLRKFVE E+H L K +D  RAE       N ++S STS T         P P                
Subjt:  NVEITSAAKEMIEELGRMFGREIAALSKEVANLRKFVEGEVHELYKELDNTRAECSKN--CNASSSASTSATSA------TPCP----------------

Query:  -------------------------------LW---KHLERERDICSIRTWEQFKAELQKHFVPHNTEMEARGKLRRLRQIGSIPEYNKEFTTLMLEIYD
                                       LW   KH E E+D   ++TWEQFKAEL+KHFVPHN +MEARGKLRRLRQIGSIP+Y KEFTTLMLEI D
Subjt:  -------------------------------LW---KHLERERDICSIRTWEQFKAELQKHFVPHNTEMEARGKLRRLRQIGSIPEYNKEFTTLMLEIYD

Query:  LSDKDTLLYFKDDLKDW----LDRRNVRTLDDAIAAVEVLIDYSIKGKQATKSEREVSDGKNGRNGKATSK-GKSSTSQP---PKPCFLCNGPHWTMDCP
        LSDKD L +F+DDLKDW    LDRRNVRTLDDAIAA   LID   K K+    E E  + K   N K   K GK+  +     P  CFLC GPHWT DCP
Subjt:  LSDKDTLLYFKDDLKDW----LDRRNVRTLDDAIAAVEVLIDYSIKGKQATKSEREVSDGKNGRNGKATSK-GKSSTSQP---PKPCFLCNGPHWTMDCP

Query:  SKKKLNALVAKSRNEEQGKGEPNAKMGSLQLLSAMTKASSSKGSGEEGQLLVEAKINGRVTDALLDMGASLNFIDPKEAKRLGPK-IKKEGGNMKAVNSR
         K KL+A+VA  R EE+   +  A++G ++ LSAM K+ SSK   E G L V+A  N +   A+LD GA+ N +DP+EAKRLG +  ++    +K ++++
Subjt:  SKKKLNALVAKSRNEEQGKGEPNAKMGSLQLLSAMTKASSSKGSGEEGQLLVEAKINGRVTDALLDMGASLNFIDPKEAKRLGPK-IKKEGGNMKAVNSR

Query:  TRKAKGIAENVPIKIGKWGGSLDFTVVPMDDYRTIL
        + K  G A++V  ++G W G L+F +  +DD + IL
Subjt:  TRKAKGIAENVPIKIGKWGGSLDFTVVPMDDYRTIL

KAA0040659.1 uncharacterized protein E6C27_scaffold370G00130 [Cucumis melo var. makuwa] | 2.6e-77 | 41.45
Query:  NVEITSAAKEMIEELGRMFGREIAALSKEVANLRKFVEGEVHELYKE-------LDNTRAECSK---NCNASSSASTSATSAT-----PCP---------
        N EITS AKEMIE++GR F +E+  L+  V  L+ FVEGE+H+L+ +       LD    EC       NA S+++   TS T     P P         
Subjt:  NVEITSAAKEMIEELGRMFGREIAALSKEVANLRKFVEGEVHELYKE-------LDNTRAECSK---NCNASSSASTSATSAT-----PCP---------

Query:  ---------------------------------------LWKHLERERDICSIRTWEQFKAELQKHFVPHNTEMEARGKLRRLRQIGSIPEYNKEFTTLM
                                                W+    ++   +I +WEQFKAEL+KHFVPHN E+E+RGKLRRLR  GSI +Y KEFTTLM
Subjt:  ---------------------------------------LWKHLERERDICSIRTWEQFKAELQKHFVPHNTEMEARGKLRRLRQIGSIPEYNKEFTTLM

Query:  LEIYDLSDKDTLLYFKDDLKDW----LDRRNVRTLDDAIAAVEVLIDYSIKG--------KQATKSEREVSDG-KNG--------RNGKATSKGKSSTSQ
        LEI DL +K+ L  FKD LKDW    LDRRNV+TLDDAIA  E L+DYS +         K   KS++  + G K+G        +NGK     +  +S 
Subjt:  LEIYDLSDKDTLLYFKDDLKDW----LDRRNVRTLDDAIAAVEVLIDYSIKG--------KQATKSEREVSDG-KNG--------RNGKATSKGKSSTSQ

Query:  PPKPCFLCNGPHWTMDCPSKKKLNALVAKSRNEEQGKGEPNAKMGSLQLLSAMTKASSSKGSGEEGQLLVEAKINGRVTDALLDMGASLNFIDPKEAKRL
        PPKPCF+C GPHWT DCP++K LNALVAK +  +Q +  P  ++GS+Q +  M K ++ +    +G L    +I G+    + D GAS NFID +EAKRL
Subjt:  PPKPCFLCNGPHWTMDCPSKKKLNALVAKSRNEEQGKGEPNAKMGSLQLLSAMTKASSSKGSGEEGQLLVEAKINGRVTDALLDMGASLNFIDPKEAKRL

Query:  GPKIKKEGGNMKAVNSRTRKAKGIAENVPIKIGKWGGSLDFTVVPMDDYRTILEYG
        G K K+E G +K VN++ +   G+A+ V +KIG W   LDF+V+PMDD+  +L  G
Subjt:  GPKIKKEGGNMKAVNSRTRKAKGIAENVPIKIGKWGGSLDFTVVPMDDYRTILEYG

KAA0042140.1 uncharacterized protein E6C27_scaffold67G006290 [Cucumis melo var. makuwa] | 8.9e-78 | 41.23
Query:  NVEITSAAKEMIEELGRMFGREIAALSKEVANLRKFVEGEVHELYKE-------LDNTRAECSK---NCNASSSASTSATSAT-----PCP---------
        N EITS AKEMIE++GR F +E+  L+  V  L+ FVEGE+H+L+ +       LD    EC       NA S+++   TS T     P P         
Subjt:  NVEITSAAKEMIEELGRMFGREIAALSKEVANLRKFVEGEVHELYKE-------LDNTRAECSK---NCNASSSASTSATSAT-----PCP---------

Query:  ---------------------------------------LWKHLERERDICSIRTWEQFKAELQKHFVPHNTEMEARGKLRRLRQIGSIPEYNKEFTTLM
                                                W+H   ++   +I +WEQFK EL+KHFVPHN E+E+RGKLR LR IGSI +Y KEFTTLM
Subjt:  ---------------------------------------LWKHLERERDICSIRTWEQFKAELQKHFVPHNTEMEARGKLRRLRQIGSIPEYNKEFTTLM

Query:  LEIYDLSDKDTLLYFKDDLKDW----LDRRNVRTLDDAIAAVEVLIDYSIKGK-------------QATKSEREVSDGK----NGRNGKATSKGKSSTSQ
        LEI DL +K+ L  FKD LKDW    LDRRNV+TLDDAIAA E L+DYS + K               TK+      GK      +NGK     +  +S 
Subjt:  LEIYDLSDKDTLLYFKDDLKDW----LDRRNVRTLDDAIAAVEVLIDYSIKGK-------------QATKSEREVSDGK----NGRNGKATSKGKSSTSQ

Query:  PPKPCFLCNGPHWTMDCPSKKKLNALVAKSRNEEQGKGEPNAKMGSLQLLSAMTKASSSKGSGEEGQLLVEAKINGRVTDALLDMGASLNFIDPKEAKRL
        PPKPCF+C GPHWT DCP++K LNALVAK +  +Q +  P  ++GS+Q +  M K ++ +    +G L    +I G+    + D GAS NF+D +EAKRL
Subjt:  PPKPCFLCNGPHWTMDCPSKKKLNALVAKSRNEEQGKGEPNAKMGSLQLLSAMTKASSSKGSGEEGQLLVEAKINGRVTDALLDMGASLNFIDPKEAKRL

Query:  GPKIKKEGGNMKAVNSRTRKAKGIAENVPIKIGKWGGSLDFTVVPMDDYRTILEYG
        G K K+E G +K VN++ +   G+A+ V +KIG W   LDF+V+PMDD+  +L  G
Subjt:  GPKIKKEGGNMKAVNSRTRKAKGIAENVPIKIGKWGGSLDFTVVPMDDYRTILEYG

KAA0065760.1 polyprotein [Cucumis melo var. makuwa] | 6.8e-78 | 41.89
Query:  NVEITSAAKEMIEELGRMFGREIAALSKEVANLRKFVEGEVHELYKE-------LDNTRAECSK---NCNASSSASTSATSAT-----PCP---------
        N EITS AKEMIE++GR F  E+  L+  V  L+ FVEGE+H L+ +       LD    EC       NA S+++   TS T     P P         
Subjt:  NVEITSAAKEMIEELGRMFGREIAALSKEVANLRKFVEGEVHELYKE-------LDNTRAECSK---NCNASSSASTSATSAT-----PCP---------

Query:  ---------------------------------------LWKHLERERDICSIRTWEQFKAELQKHFVPHNTEMEARGKLRRLRQIGSIPEYNKEFTTLM
                                                W+    ++   +I +WEQFKAEL+KHFVPHN E+E+RGKLRRLR  GSI EY KEFTTLM
Subjt:  ---------------------------------------LWKHLERERDICSIRTWEQFKAELQKHFVPHNTEMEARGKLRRLRQIGSIPEYNKEFTTLM

Query:  LEIYDLSDKDTLLYFKDDLKDW----LDRRNVRTLDDAIAAVEVLIDYSI--KGKQ------ATKSEREVSDGKNG---------RNGKATSKGKSSTSQ
        LEI DL +K+ L  FKD LKDW    LDRRNV+TLDDAIAA E L+DYS   KGK+        KS++  + G+           +NGK     +  +S 
Subjt:  LEIYDLSDKDTLLYFKDDLKDW----LDRRNVRTLDDAIAAVEVLIDYSI--KGKQ------ATKSEREVSDGKNG---------RNGKATSKGKSSTSQ

Query:  PPKPCFLCNGPHWTMDCPSKKKLNALVAKSRNEEQGKGEPNAKMGSLQLLSAMTKASSSKGSGEEGQLLVEAKINGRVTDALLDMGASLNFIDPKEAKRL
        PPKPCF+C GPHWT DCP++K LNALVAK +  +Q +  P  ++GS+Q +  M K ++ +    +G L    +I G+   A+ D GAS NF+D +EAKRL
Subjt:  PPKPCFLCNGPHWTMDCPSKKKLNALVAKSRNEEQGKGEPNAKMGSLQLLSAMTKASSSKGSGEEGQLLVEAKINGRVTDALLDMGASLNFIDPKEAKRL

Query:  GPKIKKEGGNMKAVNSRTRKAKGIAENVPIKIGKWGGSLDFTVVPMDDYRTILEYG
        G K K+E G +K VN++ +   G+A+ V +KIG W   LDF+V+PMDD+  +L  G
Subjt:  GPKIKKEGGNMKAVNSRTRKAKGIAENVPIKIGKWGGSLDFTVVPMDDYRTILEYG

TYK18079.1 uncharacterized protein E5676_scaffold306G004150 [Cucumis melo var. makuwa] | 8.9e-78 | 41.23
Query:  NVEITSAAKEMIEELGRMFGREIAALSKEVANLRKFVEGEVHELYKE-------LDNTRAECSK---NCNASSSASTSATSAT-----PCP---------
        N EITS AKEMIE++GR F +E+  L+  V  L+ FVEGE+H+L+ +       LD    EC       NA S+++   TS T     P P         
Subjt:  NVEITSAAKEMIEELGRMFGREIAALSKEVANLRKFVEGEVHELYKE-------LDNTRAECSK---NCNASSSASTSATSAT-----PCP---------

Query:  ---------------------------------------LWKHLERERDICSIRTWEQFKAELQKHFVPHNTEMEARGKLRRLRQIGSIPEYNKEFTTLM
                                                W+H   ++   +I +WEQFK EL+KHFVPHN E+E+RGKLR LR IGSI +Y KEFTTLM
Subjt:  ---------------------------------------LWKHLERERDICSIRTWEQFKAELQKHFVPHNTEMEARGKLRRLRQIGSIPEYNKEFTTLM

Query:  LEIYDLSDKDTLLYFKDDLKDW----LDRRNVRTLDDAIAAVEVLIDYSIKGK-------------QATKSEREVSDGK----NGRNGKATSKGKSSTSQ
        LEI DL +K+ L  FKD LKDW    LDRRNV+TLDDAIAA E L+DYS + K               TK+      GK      +NGK     +  +S 
Subjt:  LEIYDLSDKDTLLYFKDDLKDW----LDRRNVRTLDDAIAAVEVLIDYSIKGK-------------QATKSEREVSDGK----NGRNGKATSKGKSSTSQ

Query:  PPKPCFLCNGPHWTMDCPSKKKLNALVAKSRNEEQGKGEPNAKMGSLQLLSAMTKASSSKGSGEEGQLLVEAKINGRVTDALLDMGASLNFIDPKEAKRL
        PPKPCF+C GPHWT DCP++K LNALVAK +  +Q +  P  ++GS+Q +  M K ++ +    +G L    +I G+    + D GAS NF+D +EAKRL
Subjt:  PPKPCFLCNGPHWTMDCPSKKKLNALVAKSRNEEQGKGEPNAKMGSLQLLSAMTKASSSKGSGEEGQLLVEAKINGRVTDALLDMGASLNFIDPKEAKRL

Query:  GPKIKKEGGNMKAVNSRTRKAKGIAENVPIKIGKWGGSLDFTVVPMDDYRTILEYG
        G K K+E G +K VN++ +   G+A+ V +KIG W   LDF+V+PMDD+  +L  G
Subjt:  GPKIKKEGGNMKAVNSRTRKAKGIAENVPIKIGKWGGSLDFTVVPMDDYRTILEYG

TrEMBL top hits | e value | % identity (alignment below each hit)
A0A5A7T2W8 Retrotrans_gag domain-containing protein | 3.5e-80 | 45.41
Query:  NVEITSAAKEMIEELGRMFGREIAALSKEVANLRKFVEGEVHELYKELDNTRAECSKN--CNASSSASTSATSA------TPCP----------------
        NVEI++AAKEMI+ LG   G+E+  +  EV NLRKFVE E+H L K +D  RAE       N ++S STS T         P P                
Subjt:  NVEITSAAKEMIEELGRMFGREIAALSKEVANLRKFVEGEVHELYKELDNTRAECSKN--CNASSSASTSATSA------TPCP----------------

Query:  -------------------------------LW---KHLERERDICSIRTWEQFKAELQKHFVPHNTEMEARGKLRRLRQIGSIPEYNKEFTTLMLEIYD
                                       LW   KH E E+D   ++TWEQFKAEL+KHFVPHN +MEARGKLRRLRQIGSIP+Y KEFTTLMLEI D
Subjt:  -------------------------------LW---KHLERERDICSIRTWEQFKAELQKHFVPHNTEMEARGKLRRLRQIGSIPEYNKEFTTLMLEIYD

Query:  LSDKDTLLYFKDDLKDW----LDRRNVRTLDDAIAAVEVLIDYSIKGKQATKSEREVSDGKNGRNGKATSK-GKSSTSQP---PKPCFLCNGPHWTMDCP
        LSDKD L +F+DDLKDW    LDRRNVRTLDDAIAA   LID   K K+    E E  + K   N K   K GK+  +     P  CFLC GPHWT DCP
Subjt:  LSDKDTLLYFKDDLKDW----LDRRNVRTLDDAIAAVEVLIDYSIKGKQATKSEREVSDGKNGRNGKATSK-GKSSTSQP---PKPCFLCNGPHWTMDCP

Query:  SKKKLNALVAKSRNEEQGKGEPNAKMGSLQLLSAMTKASSSKGSGEEGQLLVEAKINGRVTDALLDMGASLNFIDPKEAKRLGPK-IKKEGGNMKAVNSR
         K KL+A+VA  R EE+   +  A++G ++ LSAM K+ SSK   E G L V+A  N +   A+LD GA+ N +DP+EAKRLG +  ++    +K ++++
Subjt:  SKKKLNALVAKSRNEEQGKGEPNAKMGSLQLLSAMTKASSSKGSGEEGQLLVEAKINGRVTDALLDMGASLNFIDPKEAKRLGPK-IKKEGGNMKAVNSR

Query:  TRKAKGIAENVPIKIGKWGGSLDFTVVPMDDYRTIL
        + K  G A++V  ++G W G L+F +  +DD + IL
Subjt:  TRKAKGIAENVPIKIGKWGGSLDFTVVPMDDYRTIL

A0A5A7TFP3 Retrotrans_gag domain-containing protein | 4.3e-78 | 41.23
Query:  NVEITSAAKEMIEELGRMFGREIAALSKEVANLRKFVEGEVHELYKE-------LDNTRAECSK---NCNASSSASTSATSAT-----PCP---------
        N EITS AKEMIE++GR F +E+  L+  V  L+ FVEGE+H+L+ +       LD    EC       NA S+++   TS T     P P         
Subjt:  NVEITSAAKEMIEELGRMFGREIAALSKEVANLRKFVEGEVHELYKE-------LDNTRAECSK---NCNASSSASTSATSAT-----PCP---------

Query:  ---------------------------------------LWKHLERERDICSIRTWEQFKAELQKHFVPHNTEMEARGKLRRLRQIGSIPEYNKEFTTLM
                                                W+H   ++   +I +WEQFK EL+KHFVPHN E+E+RGKLR LR IGSI +Y KEFTTLM
Subjt:  ---------------------------------------LWKHLERERDICSIRTWEQFKAELQKHFVPHNTEMEARGKLRRLRQIGSIPEYNKEFTTLM

Query:  LEIYDLSDKDTLLYFKDDLKDW----LDRRNVRTLDDAIAAVEVLIDYSIKGK-------------QATKSEREVSDGK----NGRNGKATSKGKSSTSQ
        LEI DL +K+ L  FKD LKDW    LDRRNV+TLDDAIAA E L+DYS + K               TK+      GK      +NGK     +  +S 
Subjt:  LEIYDLSDKDTLLYFKDDLKDW----LDRRNVRTLDDAIAAVEVLIDYSIKGK-------------QATKSEREVSDGK----NGRNGKATSKGKSSTSQ

Query:  PPKPCFLCNGPHWTMDCPSKKKLNALVAKSRNEEQGKGEPNAKMGSLQLLSAMTKASSSKGSGEEGQLLVEAKINGRVTDALLDMGASLNFIDPKEAKRL
        PPKPCF+C GPHWT DCP++K LNALVAK +  +Q +  P  ++GS+Q +  M K ++ +    +G L    +I G+    + D GAS NF+D +EAKRL
Subjt:  PPKPCFLCNGPHWTMDCPSKKKLNALVAKSRNEEQGKGEPNAKMGSLQLLSAMTKASSSKGSGEEGQLLVEAKINGRVTDALLDMGASLNFIDPKEAKRL

Query:  GPKIKKEGGNMKAVNSRTRKAKGIAENVPIKIGKWGGSLDFTVVPMDDYRTILEYG
        G K K+E G +K VN++ +   G+A+ V +KIG W   LDF+V+PMDD+  +L  G
Subjt:  GPKIKKEGGNMKAVNSRTRKAKGIAENVPIKIGKWGGSLDFTVVPMDDYRTILEYG

A0A5A7THC0 Reverse transcriptase domain-containing protein | 1.3e-77 | 41.45
Query:  NVEITSAAKEMIEELGRMFGREIAALSKEVANLRKFVEGEVHELYKE-------LDNTRAECSK---NCNASSSASTSATSAT-----PCP---------
        N EITS AKEMIE++GR F +E+  L+  V  L+ FVEGE+H+L+ +       LD    EC       NA S+++   TS T     P P         
Subjt:  NVEITSAAKEMIEELGRMFGREIAALSKEVANLRKFVEGEVHELYKE-------LDNTRAECSK---NCNASSSASTSATSAT-----PCP---------

Query:  ---------------------------------------LWKHLERERDICSIRTWEQFKAELQKHFVPHNTEMEARGKLRRLRQIGSIPEYNKEFTTLM
                                                W+    ++   +I +WEQFKAEL+KHFVPHN E+E+RGKLRRLR  GSI +Y KEFTTLM
Subjt:  ---------------------------------------LWKHLERERDICSIRTWEQFKAELQKHFVPHNTEMEARGKLRRLRQIGSIPEYNKEFTTLM

Query:  LEIYDLSDKDTLLYFKDDLKDW----LDRRNVRTLDDAIAAVEVLIDYSIKG--------KQATKSEREVSDG-KNG--------RNGKATSKGKSSTSQ
        LEI DL +K+ L  FKD LKDW    LDRRNV+TLDDAIA  E L+DYS +         K   KS++  + G K+G        +NGK     +  +S 
Subjt:  LEIYDLSDKDTLLYFKDDLKDW----LDRRNVRTLDDAIAAVEVLIDYSIKG--------KQATKSEREVSDG-KNG--------RNGKATSKGKSSTSQ

Query:  PPKPCFLCNGPHWTMDCPSKKKLNALVAKSRNEEQGKGEPNAKMGSLQLLSAMTKASSSKGSGEEGQLLVEAKINGRVTDALLDMGASLNFIDPKEAKRL
        PPKPCF+C GPHWT DCP++K LNALVAK +  +Q +  P  ++GS+Q +  M K ++ +    +G L    +I G+    + D GAS NFID +EAKRL
Subjt:  PPKPCFLCNGPHWTMDCPSKKKLNALVAKSRNEEQGKGEPNAKMGSLQLLSAMTKASSSKGSGEEGQLLVEAKINGRVTDALLDMGASLNFIDPKEAKRL

Query:  GPKIKKEGGNMKAVNSRTRKAKGIAENVPIKIGKWGGSLDFTVVPMDDYRTILEYG
        G K K+E G +K VN++ +   G+A+ V +KIG W   LDF+V+PMDD+  +L  G
Subjt:  GPKIKKEGGNMKAVNSRTRKAKGIAENVPIKIGKWGGSLDFTVVPMDDYRTILEYG

A0A5A7VEX8 Polyprotein | 3.3e-78 | 41.89
Query:  NVEITSAAKEMIEELGRMFGREIAALSKEVANLRKFVEGEVHELYKE-------LDNTRAECSK---NCNASSSASTSATSAT-----PCP---------
        N EITS AKEMIE++GR F  E+  L+  V  L+ FVEGE+H L+ +       LD    EC       NA S+++   TS T     P P         
Subjt:  NVEITSAAKEMIEELGRMFGREIAALSKEVANLRKFVEGEVHELYKE-------LDNTRAECSK---NCNASSSASTSATSAT-----PCP---------

Query:  ---------------------------------------LWKHLERERDICSIRTWEQFKAELQKHFVPHNTEMEARGKLRRLRQIGSIPEYNKEFTTLM
                                                W+    ++   +I +WEQFKAEL+KHFVPHN E+E+RGKLRRLR  GSI EY KEFTTLM
Subjt:  ---------------------------------------LWKHLERERDICSIRTWEQFKAELQKHFVPHNTEMEARGKLRRLRQIGSIPEYNKEFTTLM

Query:  LEIYDLSDKDTLLYFKDDLKDW----LDRRNVRTLDDAIAAVEVLIDYSI--KGKQ------ATKSEREVSDGKNG---------RNGKATSKGKSSTSQ
        LEI DL +K+ L  FKD LKDW    LDRRNV+TLDDAIAA E L+DYS   KGK+        KS++  + G+           +NGK     +  +S 
Subjt:  LEIYDLSDKDTLLYFKDDLKDW----LDRRNVRTLDDAIAAVEVLIDYSI--KGKQ------ATKSEREVSDGKNG---------RNGKATSKGKSSTSQ

Query:  PPKPCFLCNGPHWTMDCPSKKKLNALVAKSRNEEQGKGEPNAKMGSLQLLSAMTKASSSKGSGEEGQLLVEAKINGRVTDALLDMGASLNFIDPKEAKRL
        PPKPCF+C GPHWT DCP++K LNALVAK +  +Q +  P  ++GS+Q +  M K ++ +    +G L    +I G+   A+ D GAS NF+D +EAKRL
Subjt:  PPKPCFLCNGPHWTMDCPSKKKLNALVAKSRNEEQGKGEPNAKMGSLQLLSAMTKASSSKGSGEEGQLLVEAKINGRVTDALLDMGASLNFIDPKEAKRL

Query:  GPKIKKEGGNMKAVNSRTRKAKGIAENVPIKIGKWGGSLDFTVVPMDDYRTILEYG
        G K K+E G +K VN++ +   G+A+ V +KIG W   LDF+V+PMDD+  +L  G
Subjt:  GPKIKKEGGNMKAVNSRTRKAKGIAENVPIKIGKWGGSLDFTVVPMDDYRTILEYG

A0A5D3D3V4 Retrotrans_gag domain-containing protein | 4.3e-78 | 41.23
Query:  NVEITSAAKEMIEELGRMFGREIAALSKEVANLRKFVEGEVHELYKE-------LDNTRAECSK---NCNASSSASTSATSAT-----PCP---------
        N EITS AKEMIE++GR F +E+  L+  V  L+ FVEGE+H+L+ +       LD    EC       NA S+++   TS T     P P         
Subjt:  NVEITSAAKEMIEELGRMFGREIAALSKEVANLRKFVEGEVHELYKE-------LDNTRAECSK---NCNASSSASTSATSAT-----PCP---------

Query:  ---------------------------------------LWKHLERERDICSIRTWEQFKAELQKHFVPHNTEMEARGKLRRLRQIGSIPEYNKEFTTLM
                                                W+H   ++   +I +WEQFK EL+KHFVPHN E+E+RGKLR LR IGSI +Y KEFTTLM
Subjt:  ---------------------------------------LWKHLERERDICSIRTWEQFKAELQKHFVPHNTEMEARGKLRRLRQIGSIPEYNKEFTTLM

Query:  LEIYDLSDKDTLLYFKDDLKDW----LDRRNVRTLDDAIAAVEVLIDYSIKGK-------------QATKSEREVSDGK----NGRNGKATSKGKSSTSQ
        LEI DL +K+ L  FKD LKDW    LDRRNV+TLDDAIAA E L+DYS + K               TK+      GK      +NGK     +  +S 
Subjt:  LEIYDLSDKDTLLYFKDDLKDW----LDRRNVRTLDDAIAAVEVLIDYSIKGK-------------QATKSEREVSDGK----NGRNGKATSKGKSSTSQ

Query:  PPKPCFLCNGPHWTMDCPSKKKLNALVAKSRNEEQGKGEPNAKMGSLQLLSAMTKASSSKGSGEEGQLLVEAKINGRVTDALLDMGASLNFIDPKEAKRL
        PPKPCF+C GPHWT DCP++K LNALVAK +  +Q +  P  ++GS+Q +  M K ++ +    +G L    +I G+    + D GAS NF+D +EAKRL
Subjt:  PPKPCFLCNGPHWTMDCPSKKKLNALVAKSRNEEQGKGEPNAKMGSLQLLSAMTKASSSKGSGEEGQLLVEAKINGRVTDALLDMGASLNFIDPKEAKRL

Query:  GPKIKKEGGNMKAVNSRTRKAKGIAENVPIKIGKWGGSLDFTVVPMDDYRTILEYG
        G K K+E G +K VN++ +   G+A+ V +KIG W   LDF+V+PMDD+  +L  G
Subjt:  GPKIKKEGGNMKAVNSRTRKAKGIAENVPIKIGKWGGSLDFTVVPMDDYRTILEYG

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
AATGTCGAGATCACGTCTGCTGCCAAGGAAATGATCGAGGAACTTGGACGCATGTTTGGAAGGGAGATTGCTGCCCTATCCAAAGAAGTTGCCAACCTACGAAAGTTTGT
GGAGGGGGAGGTCCACGAACTTTACAAAGAACTTGACAACACACGTGCCGAATGTTCGAAAAATTGTAATGCGAGTAGTTCGGCATCCACTAGTGCAACATCCGCTACAC
CATGCCCCCTATGGAAGCATTTGGAGCGAGAGCGGGACATTTGCAGCATTAGAACGTGGGAGCAATTCAAAGCAGAACTCCAAAAGCATTTTGTTCCCCACAATACTGAG
ATGGAGGCACGAGGTAAACTCCGACGCCTAAGGCAAATTGGTAGCATTCCTGAATACAACAAGGAGTTCACAACCCTCATGCTTGAGATATATGATCTATCCGACAAAGA
CACACTCCTTTATTTCAAGGATGATCTCAAGGATTGGCTCGACCGGCGCAATGTGCGGACGCTCGATGATGCTATTGCCGCCGTCGAAGTGCTCATAGATTATTCAATCA
AGGGAAAGCAGGCCACCAAAAGTGAAAGAGAAGTGTCGGATGGCAAGAATGGGAGAAACGGGAAGGCTACTAGCAAAGGCAAGTCCTCCACATCACAACCTCCCAAACCG
TGTTTTCTTTGTAATGGGCCACATTGGACTATGGACTGTCCAAGTAAGAAGAAGCTAAACGCCCTAGTTGCTAAATCCCGCAATGAAGAACAAGGAAAAGGTGAACCCAA
TGCTAAGATGGGTTCCTTGCAATTACTTAGCGCCATGACTAAAGCTTCCTCTTCCAAGGGGAGTGGAGAGGAAGGACAACTCCTCGTTGAAGCTAAAATCAATGGAAGAG
TCACAGATGCGTTGCTCGACATGGGAGCATCGCTCAACTTCATAGATCCAAAAGAGGCTAAACGTCTTGGCCCCAAGATCAAGAAGGAAGGCGGTAACATGAAGGCGGTG
AACTCAAGAACCCGAAAGGCCAAGGGAATTGCCGAGAATGTTCCCATTAAGATTGGCAAGTGGGGAGGTTCGCTGGATTTCACCGTTGTTCCGATGGATGACTATAGAAC
CATCCTAGAATATGGGAAAATTTCAAATCGCCAAGATGTCGACAATGACAATCAAGAAGGGATGCATGAAGCAACGAAGGGGCTTAAAGAAGCCACACGCAACGAGGGCA
TTGCAAAATCGAGTGGGGGAGGGTGCACGTGTCACGAAAATGGTCCAACAAATTGCGCTAAAATGCCTAAAGACTCGTCTAGCAAGGCCAAGGATGAGTCAAGGAAGAAG
GGCCAACAAGCCCCCAAGTATGTCCCACGCACATACCAAGACGCCCCAGCAAGCTGCCATGGCCGTAGGGCAGCGCCCATGCCTCAGGCGCACGGGCATGACACAGTCGA
GCACCCAGGCGCACAGCCAAGTGTGCGCCCCAGTCAGCCCAGCGTTGCCATGCGTGCGCCCCACGCCGTGAGCAAGTTGCATGCACACAGACCCATCGCGCCCTGCGAGC
GGCCAGCCCCAGACGCGCACCATGTGCATGCGCCCGTGCTCGGTTTCCTAGAGCCAGAGCTTAACGCGCGCGCCCGCTTGCACGCCTGCCTTGCGTGCTCGGCCACTCAG
CGCCCATGCTTAGTGCGCACGCCCCATGGTCCAAACTTCTCTGAACAATCCCAAAAGCTTTTAGAAAGCTCGGGACACCTCCAGCTCCCTCGCCCAAGGCCTAGATGTGC
CTGGAAGGATATGGGCAGCTTTGGACAGTGCCAGAACACCCTGGAATGA
mRNA sequence
AATGTCGAGATCACGTCTGCTGCCAAGGAAATGATCGAGGAACTTGGACGCATGTTTGGAAGGGAGATTGCTGCCCTATCCAAAGAAGTTGCCAACCTACGAAAGTTTGT
GGAGGGGGAGGTCCACGAACTTTACAAAGAACTTGACAACACACGTGCCGAATGTTCGAAAAATTGTAATGCGAGTAGTTCGGCATCCACTAGTGCAACATCCGCTACAC
CATGCCCCCTATGGAAGCATTTGGAGCGAGAGCGGGACATTTGCAGCATTAGAACGTGGGAGCAATTCAAAGCAGAACTCCAAAAGCATTTTGTTCCCCACAATACTGAG
ATGGAGGCACGAGGTAAACTCCGACGCCTAAGGCAAATTGGTAGCATTCCTGAATACAACAAGGAGTTCACAACCCTCATGCTTGAGATATATGATCTATCCGACAAAGA
CACACTCCTTTATTTCAAGGATGATCTCAAGGATTGGCTCGACCGGCGCAATGTGCGGACGCTCGATGATGCTATTGCCGCCGTCGAAGTGCTCATAGATTATTCAATCA
AGGGAAAGCAGGCCACCAAAAGTGAAAGAGAAGTGTCGGATGGCAAGAATGGGAGAAACGGGAAGGCTACTAGCAAAGGCAAGTCCTCCACATCACAACCTCCCAAACCG
TGTTTTCTTTGTAATGGGCCACATTGGACTATGGACTGTCCAAGTAAGAAGAAGCTAAACGCCCTAGTTGCTAAATCCCGCAATGAAGAACAAGGAAAAGGTGAACCCAA
TGCTAAGATGGGTTCCTTGCAATTACTTAGCGCCATGACTAAAGCTTCCTCTTCCAAGGGGAGTGGAGAGGAAGGACAACTCCTCGTTGAAGCTAAAATCAATGGAAGAG
TCACAGATGCGTTGCTCGACATGGGAGCATCGCTCAACTTCATAGATCCAAAAGAGGCTAAACGTCTTGGCCCCAAGATCAAGAAGGAAGGCGGTAACATGAAGGCGGTG
AACTCAAGAACCCGAAAGGCCAAGGGAATTGCCGAGAATGTTCCCATTAAGATTGGCAAGTGGGGAGGTTCGCTGGATTTCACCGTTGTTCCGATGGATGACTATAGAAC
CATCCTAGAATATGGGAAAATTTCAAATCGCCAAGATGTCGACAATGACAATCAAGAAGGGATGCATGAAGCAACGAAGGGGCTTAAAGAAGCCACACGCAACGAGGGCA
TTGCAAAATCGAGTGGGGGAGGGTGCACGTGTCACGAAAATGGTCCAACAAATTGCGCTAAAATGCCTAAAGACTCGTCTAGCAAGGCCAAGGATGAGTCAAGGAAGAAG
GGCCAACAAGCCCCCAAGTATGTCCCACGCACATACCAAGACGCCCCAGCAAGCTGCCATGGCCGTAGGGCAGCGCCCATGCCTCAGGCGCACGGGCATGACACAGTCGA
GCACCCAGGCGCACAGCCAAGTGTGCGCCCCAGTCAGCCCAGCGTTGCCATGCGTGCGCCCCACGCCGTGAGCAAGTTGCATGCACACAGACCCATCGCGCCCTGCGAGC
GGCCAGCCCCAGACGCGCACCATGTGCATGCGCCCGTGCTCGGTTTCCTAGAGCCAGAGCTTAACGCGCGCGCCCGCTTGCACGCCTGCCTTGCGTGCTCGGCCACTCAG
CGCCCATGCTTAGTGCGCACGCCCCATGGTCCAAACTTCTCTGAACAATCCCAAAAGCTTTTAGAAAGCTCGGGACACCTCCAGCTCCCTCGCCCAAGGCCTAGATGTGC
CTGGAAGGATATGGGCAGCTTTGGACAGTGCCAGAACACCCTGGAATGA
Protein sequence
NVEITSAAKEMIEELGRMFGREIAALSKEVANLRKFVEGEVHELYKELDNTRAECSKNCNASSSASTSATSATPCPLWKHLERERDICSIRTWEQFKAELQKHFVPHNTE
MEARGKLRRLRQIGSIPEYNKEFTTLMLEIYDLSDKDTLLYFKDDLKDWLDRRNVRTLDDAIAAVEVLIDYSIKGKQATKSEREVSDGKNGRNGKATSKGKSSTSQPPKP
CFLCNGPHWTMDCPSKKKLNALVAKSRNEEQGKGEPNAKMGSLQLLSAMTKASSSKGSGEEGQLLVEAKINGRVTDALLDMGASLNFIDPKEAKRLGPKIKKEGGNMKAV
NSRTRKAKGIAENVPIKIGKWGGSLDFTVVPMDDYRTILEYGKISNRQDVDNDNQEGMHEATKGLKEATRNEGIAKSSGGGCTCHENGPTNCAKMPKDSSSKAKDESRKK
GQQAPKYVPRTYQDAPASCHGRRAAPMPQAHGHDTVEHPGAQPSVRPSQPSVAMRAPHAVSKLHAHRPIAPCERPAPDAHHVHAPVLGFLEPELNARARLHACLACSATQ
RPCLVRTPHGPNFSEQSQKLLESSGHLQLPRPRPRCAWKDMGSFGQCQNTLE