CuGenDBv2

Lag0018953 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0018953
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Copia-like polyprotein
Genome location: chr5:36948241..36949620
RNA-Seq Expression: Lag0018953
Synteny: Lag0018953
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits | e value | % identity | Alignment
TXG55646.1 hypothetical protein EZV62_020902 [Acer yangbiense] | e value: 2.3e-81 | % identity: 40.91
Query:  SSTTLPQATVTFPISSTQTPT-----------TSSFGHPLGTVLIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHS---VGNEDSSTTAT
        S+T+  Q+T+    SST+TPT           +S FG+ L     +KLD +N+ LW+ MV  +++G ++DG +  T+  P + L S    G  DS + + 
Subjt:  SSTTLPQATVTFPISSTQTPT-----------TSSFGHPLGTVLIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHS---VGNEDSSTTAT

Query:  NQVINPAYEEWRATDQALLGWMFGSITLFVACEVVDLTSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESVSLN
            NP YE+W   DQ L+GW++ S+T  VA  V+  T++  +W ALE+LFGA SK++   +R  +QTT+KG+  M EYLT MK  ++SL +AG+    N
Subjt:  NQVINPAYEEWRATDQALLGWMFGSITLFVACEVVDLTSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESVSLN

Query:  YLMSCVLSGLEAEYIPIVCHIEGKDSMTWQELYATLLTFKNTLVRLNVVS-QVDTTEAQSANYA---------HAKSSAEGNFNQGGQ---NRNNF-GGR
         L + +L+GL++EY+PIV  IE ++  TWQE+Y TLL++ + L  +N VS + +   + SA+ A           K+S + N NQGG    NR  F GG 
Subjt:  YLMSCVLSGLEAEYIPIVCHIEGKDSMTWQELYATLLTFKNTLVRLNVVS-QVDTTEAQSANYA---------HAKSSAEGNFNQGGQ---NRNNF-GGR

Query:  GNMRGRGRDKSYSRNNNSKSTCQLCGKYGHTAAICYNRFNKEFAGFYSNNHSSNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLNLKS
        G  RGRG      RNNNS+ TCQ+CGK+GH+A++CY R++  + G      +SN+NS       + F+ATPE V D +  ADSGAT+H+T D GNL+LKS
Subjt:  GNMRGRGRDKSYSRNNNSKSTCQLCGKYGHTAAICYNRFNKEFAGFYSNNHSSNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLNLKS

Query:  EYTGKEKLTVGNGSRLNISHIGQNVINIQHADKSPLVMKNI--CAKYQKNLLSVAKFTAQNNCFIEFHSTCCLVKDKETKRVLL
         Y G E L VGNG +L+ISH+G  + ++    K  +++K +    + +KNLLSV++    N+ FIEFH+ CC VKDK T+  +L
Subjt:  EYTGKEKLTVGNGSRLNISHIGQNVINIQHADKSPLVMKNI--CAKYQKNLLSVAKFTAQNNCFIEFHSTCCLVKDKETKRVLL

TXG67243.1 hypothetical protein EZV62_008518 [Acer yangbiense] | e value: 1.6e-79 | % identity: 40.87
Query:  SSTTLPQATVTFPISSTQTPT-----------TSSFGHPLGTVLIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHS--VGNEDSSTTA--
        S+T+  Q+T+    SST TPT           +S FG+ L     +KLD +N+ LW+ MV  +++G ++DG +  T+  P + L S       S TT   
Subjt:  SSTTLPQATVTFPISSTQTPT-----------TSSFGHPLGTVLIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHS--VGNEDSSTTA--

Query:  --TNQVINPAYEEWRATDQALLGWMFGSITLFVACEVVDLTSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESV
          +    NP YE+W   DQ L+GW++ S+T  VA  V+  T++  +W ALE+LFGA SK++   +R  +QTT+KG+  M EYLT MK  ++SL +AG+  
Subjt:  --TNQVINPAYEEWRATDQALLGWMFGSITLFVACEVVDLTSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESV

Query:  SLNYLMSCVLSGLEAEYIPIVCHIEGKDSMTWQELYATLLTFKNTLVRLNVVS-QVDTTEAQSANYA---------HAKSSAEGNFNQGGQ---NRNNF-
          N L +  L+GL++EY+PIV  IE ++  TWQE+Y TLL++ + L  +N VS + +   + SA+ A           K+S + N NQGG    NR  F 
Subjt:  SLNYLMSCVLSGLEAEYIPIVCHIEGKDSMTWQELYATLLTFKNTLVRLNVVS-QVDTTEAQSANYA---------HAKSSAEGNFNQGGQ---NRNNF-

Query:  GGRGNMRGRGRDKSYSRNNNSKSTCQLCGKYGHTAAICYNRFNKEFAGFYSNNHSSNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLN
        GG G  RGRG      RNNNS+ TCQ+CGK+GH+A++CY R++  + G      +SN+NS       + F+ATPE V D +  ADSGAT H+T D GNL+
Subjt:  GGRGNMRGRGRDKSYSRNNNSKSTCQLCGKYGHTAAICYNRFNKEFAGFYSNNHSSNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLN

Query:  LKSEYTGKEKLTVGNGSRLNISHIGQNVINIQHADKSPLVMKNI--CAKYQKNLLSVAKFTAQNNCFIEFHSTCCLVKDKET
        LKS+Y G E L VGNG +L+ISH+G  + ++    K  +++K +    + +KNLLSV++    N+ FIEFH+ CC VKDK T
Subjt:  LKSEYTGKEKLTVGNGSRLNISHIGQNVINIQHADKSPLVMKNI--CAKYQKNLLSVAKFTAQNNCFIEFHSTCCLVKDKET

TXG69253.1 hypothetical protein EZV62_004188 [Acer yangbiense] | e value: 1.2e-79 | % identity: 40.87
Query:  SSTTLPQATVTFPISSTQTPT-----------TSSFGHPLGTVLIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHS--VGNEDSSTTA--
        S+T+  Q+T+    SST TPT           +S FG+ L     +KLD +N+ LW+ MV  +++G ++DG +  T+  P + L S       S TT   
Subjt:  SSTTLPQATVTFPISSTQTPT-----------TSSFGHPLGTVLIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHS--VGNEDSSTTA--

Query:  --TNQVINPAYEEWRATDQALLGWMFGSITLFVACEVVDLTSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESV
          +    NP YE+W   DQ L+GW++ S+T  VA  V+  T++  +W ALE+LFGA SK++   +R  +QTT+KG+  M EYLT MK  ++SL +AG+  
Subjt:  --TNQVINPAYEEWRATDQALLGWMFGSITLFVACEVVDLTSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESV

Query:  SLNYLMSCVLSGLEAEYIPIVCHIEGKDSMTWQELYATLLTFKNTLVRLNVVS-QVDTTEAQSANYA---------HAKSSAEGNFNQGGQ---NRNNF-
          N L +  L+GL++EY+PIV  IE ++  TWQE+Y TLL++ + L  +N VS + +   + SA+ A           K+S + N NQGG    NR  F 
Subjt:  SLNYLMSCVLSGLEAEYIPIVCHIEGKDSMTWQELYATLLTFKNTLVRLNVVS-QVDTTEAQSANYA---------HAKSSAEGNFNQGGQ---NRNNF-

Query:  GGRGNMRGRGRDKSYSRNNNSKSTCQLCGKYGHTAAICYNRFNKEFAGFYSNNHSSNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLN
        GG G  RGRG      RNNNS+ TCQ+CGK+GH+A++CY R++  + G      +SN+NS       + F+ATPE V D +  ADSGAT+H+T D GNL+
Subjt:  GGRGNMRGRGRDKSYSRNNNSKSTCQLCGKYGHTAAICYNRFNKEFAGFYSNNHSSNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLN

Query:  LKSEYTGKEKLTVGNGSRLNISHIGQNVINIQHADKSPLVMKNI--CAKYQKNLLSVAKFTAQNNCFIEFHSTCCLVKDKET
        LKS+Y G E L VGNG +L+ISH+G  + ++    K  +++K +    + +KNLLSV++    N+ FIEFH+ CC VKDK T
Subjt:  LKSEYTGKEKLTVGNGSRLNISHIGQNVINIQHADKSPLVMKNI--CAKYQKNLLSVAKFTAQNNCFIEFHSTCCLVKDKET

XP_022157748.1 uncharacterized protein LOC111024384 isoform X1 [Momordica charantia] | e value: 4.4e-109 | % identity: 57.72
Query:  DDSSSTTLPQATVTFPISSTQTPT---TSSFGHPLGTVLIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHSVGNEDSSTTATNQVINPAY
        +++ ++++P   VT     T  P+    +SFGHPLGTVL VKLDDKNYSLWRGMVL VLRGQK DG+VLGT  +P + L S    ++  T+ +  +NP Y
Subjt:  DDSSSTTLPQATVTFPISSTQTPT---TSSFGHPLGTVLIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHSVGNEDSSTTATNQVINPAY

Query:  EEWRATDQALLGWMFGSITLFVACEVVDLTSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESVSLNYLMSCVLS
         EW+A DQALLGW+FGS+T  +AC+VVD  SSR+VW ALEDL+GATSKARI QLR VLQ TKK +++M+EYL  MKQ SESLKLAGE V+ NYLMSCVLS
Subjt:  EEWRATDQALLGWMFGSITLFVACEVVDLTSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESVSLNYLMSCVLS

Query:  GLEAEYIPIVCHIEGKDSMTWQELYATLLTFKNTLVRLNVVS--QVDTTEAQSANYAHAKSSAEGN--FNQG----GQNRNNFG---GRGNMRGRGRDK-
        GLEAEY+PIVC IEGKDS +WQEL+ATL+TF+NTL+RLN+VS    +     S NY H+K ++ GN  F+Q     GQ R ++     + N+RGRGR + 
Subjt:  GLEAEYIPIVCHIEGKDSMTWQELYATLLTFKNTLVRLNVVS--QVDTTEAQSANYAHAKSSAEGN--FNQG----GQNRNNFG---GRGNMRGRGRDK-

Query:  SYSRNNNSKSTCQLCGKYGHTAAICYNRFNKEFAGFYSNNHSSNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLNLKSEYTGK
        S  R NNSK +CQLCGKYGH AA+CY RF++ F     NN SS++N+     RN+A++A PE+V +PS LADSGAT H+T+DL NLN+KS+Y GK
Subjt:  SYSRNNNSKSTCQLCGKYGHTAAICYNRFNKEFAGFYSNNHSSNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLNLKSEYTGK

XP_022157750.1 uncharacterized protein LOC111024384 isoform X2 [Momordica charantia] | e value: 1.3e-108 | % identity: 57.47
Query:  DDSSSTTLPQATVTFPISSTQTPT---TSSFGHPLGTVLIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHSVGNEDSSTTATNQVINPAY
        +++ ++++P   VT     T  P+    +SFGHPLGTVL VKLDDKNYSLWRGMVL VLRGQK DG+VLGT  +P + L S    ++  T+ +  +NP Y
Subjt:  DDSSSTTLPQATVTFPISSTQTPT---TSSFGHPLGTVLIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHSVGNEDSSTTATNQVINPAY

Query:  EEWRATDQALLGWMFGSITLFVACEVVDLTSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESVSLNYLMSCVLS
         EW+A DQALLGW+FGS+T  +AC+VVD  SSR+VW ALEDL+GATSKARI QLR VLQ TKK +++M+EYL  MKQ SESLKLAGE V+ NYLMSCVLS
Subjt:  EEWRATDQALLGWMFGSITLFVACEVVDLTSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESVSLNYLMSCVLS

Query:  GLEAEYIPIVCHIEGKDSMTWQELYATLLTFKNTLVRLNVVS--QVDTTEAQSANYAHAKSSAEGN--FNQG----GQNRNNFG---GRGNMRGRGRDK-
        GLEAEY+PIVC IEGKDS +WQEL+ATL+TF+NTL+RLN+VS    +     S NY H+K ++ GN  F+Q     GQ R ++     + N+RGRGR + 
Subjt:  GLEAEYIPIVCHIEGKDSMTWQELYATLLTFKNTLVRLNVVS--QVDTTEAQSANYAHAKSSAEGN--FNQG----GQNRNNFG---GRGNMRGRGRDK-

Query:  SYSRNNNSKSTCQLCGKYGHTAAICYNRFNKEFAGFYSNNHSSNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLNLKSEYTGK
        S  R NNSK +CQLCGKYGH AA+CY RF++ F     NN SS++N+     RN+A++A PE+V +PS LADSGAT H+T+DL NLN+KS+Y G+
Subjt:  SYSRNNNSKSTCQLCGKYGHTAAICYNRFNKEFAGFYSNNHSSNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLNLKSEYTGK

TrEMBL top hits | e value | % identity | Alignment
A0A5C7HHE9 Uncharacterized protein | e value: 1.1e-81 | % identity: 40.91
Query:  SSTTLPQATVTFPISSTQTPT-----------TSSFGHPLGTVLIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHS---VGNEDSSTTAT
        S+T+  Q+T+    SST+TPT           +S FG+ L     +KLD +N+ LW+ MV  +++G ++DG +  T+  P + L S    G  DS + + 
Subjt:  SSTTLPQATVTFPISSTQTPT-----------TSSFGHPLGTVLIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHS---VGNEDSSTTAT

Query:  NQVINPAYEEWRATDQALLGWMFGSITLFVACEVVDLTSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESVSLN
            NP YE+W   DQ L+GW++ S+T  VA  V+  T++  +W ALE+LFGA SK++   +R  +QTT+KG+  M EYLT MK  ++SL +AG+    N
Subjt:  NQVINPAYEEWRATDQALLGWMFGSITLFVACEVVDLTSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESVSLN

Query:  YLMSCVLSGLEAEYIPIVCHIEGKDSMTWQELYATLLTFKNTLVRLNVVS-QVDTTEAQSANYA---------HAKSSAEGNFNQGGQ---NRNNF-GGR
         L + +L+GL++EY+PIV  IE ++  TWQE+Y TLL++ + L  +N VS + +   + SA+ A           K+S + N NQGG    NR  F GG 
Subjt:  YLMSCVLSGLEAEYIPIVCHIEGKDSMTWQELYATLLTFKNTLVRLNVVS-QVDTTEAQSANYA---------HAKSSAEGNFNQGGQ---NRNNF-GGR

Query:  GNMRGRGRDKSYSRNNNSKSTCQLCGKYGHTAAICYNRFNKEFAGFYSNNHSSNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLNLKS
        G  RGRG      RNNNS+ TCQ+CGK+GH+A++CY R++  + G      +SN+NS       + F+ATPE V D +  ADSGAT+H+T D GNL+LKS
Subjt:  GNMRGRGRDKSYSRNNNSKSTCQLCGKYGHTAAICYNRFNKEFAGFYSNNHSSNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLNLKS

Query:  EYTGKEKLTVGNGSRLNISHIGQNVINIQHADKSPLVMKNI--CAKYQKNLLSVAKFTAQNNCFIEFHSTCCLVKDKETKRVLL
         Y G E L VGNG +L+ISH+G  + ++    K  +++K +    + +KNLLSV++    N+ FIEFH+ CC VKDK T+  +L
Subjt:  EYTGKEKLTVGNGSRLNISHIGQNVINIQHADKSPLVMKNI--CAKYQKNLLSVAKFTAQNNCFIEFHSTCCLVKDKETKRVLL

A0A5C7IJ06 Uncharacterized protein | e value: 6.0e-80 | % identity: 40.87
Query:  SSTTLPQATVTFPISSTQTPT-----------TSSFGHPLGTVLIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHS--VGNEDSSTTA--
        S+T+  Q+T+    SST TPT           +S FG+ L     +KLD +N+ LW+ MV  +++G ++DG +  T+  P + L S       S TT   
Subjt:  SSTTLPQATVTFPISSTQTPT-----------TSSFGHPLGTVLIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHS--VGNEDSSTTA--

Query:  --TNQVINPAYEEWRATDQALLGWMFGSITLFVACEVVDLTSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESV
          +    NP YE+W   DQ L+GW++ S+T  VA  V+  T++  +W ALE+LFGA SK++   +R  +QTT+KG+  M EYLT MK  ++SL +AG+  
Subjt:  --TNQVINPAYEEWRATDQALLGWMFGSITLFVACEVVDLTSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESV

Query:  SLNYLMSCVLSGLEAEYIPIVCHIEGKDSMTWQELYATLLTFKNTLVRLNVVS-QVDTTEAQSANYA---------HAKSSAEGNFNQGGQ---NRNNF-
          N L +  L+GL++EY+PIV  IE ++  TWQE+Y TLL++ + L  +N VS + +   + SA+ A           K+S + N NQGG    NR  F 
Subjt:  SLNYLMSCVLSGLEAEYIPIVCHIEGKDSMTWQELYATLLTFKNTLVRLNVVS-QVDTTEAQSANYA---------HAKSSAEGNFNQGGQ---NRNNF-

Query:  GGRGNMRGRGRDKSYSRNNNSKSTCQLCGKYGHTAAICYNRFNKEFAGFYSNNHSSNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLN
        GG G  RGRG      RNNNS+ TCQ+CGK+GH+A++CY R++  + G      +SN+NS       + F+ATPE V D +  ADSGAT+H+T D GNL+
Subjt:  GGRGNMRGRGRDKSYSRNNNSKSTCQLCGKYGHTAAICYNRFNKEFAGFYSNNHSSNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLN

Query:  LKSEYTGKEKLTVGNGSRLNISHIGQNVINIQHADKSPLVMKNI--CAKYQKNLLSVAKFTAQNNCFIEFHSTCCLVKDKET
        LKS+Y G E L VGNG +L+ISH+G  + ++    K  +++K +    + +KNLLSV++    N+ FIEFH+ CC VKDK T
Subjt:  LKSEYTGKEKLTVGNGSRLNISHIGQNVINIQHADKSPLVMKNI--CAKYQKNLLSVAKFTAQNNCFIEFHSTCCLVKDKET

A0A6J1DTZ7 uncharacterized protein LOC111024384 isoform X2 | e value: 6.1e-109 | % identity: 57.47
Query:  DDSSSTTLPQATVTFPISSTQTPT---TSSFGHPLGTVLIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHSVGNEDSSTTATNQVINPAY
        +++ ++++P   VT     T  P+    +SFGHPLGTVL VKLDDKNYSLWRGMVL VLRGQK DG+VLGT  +P + L S    ++  T+ +  +NP Y
Subjt:  DDSSSTTLPQATVTFPISSTQTPT---TSSFGHPLGTVLIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHSVGNEDSSTTATNQVINPAY

Query:  EEWRATDQALLGWMFGSITLFVACEVVDLTSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESVSLNYLMSCVLS
         EW+A DQALLGW+FGS+T  +AC+VVD  SSR+VW ALEDL+GATSKARI QLR VLQ TKK +++M+EYL  MKQ SESLKLAGE V+ NYLMSCVLS
Subjt:  EEWRATDQALLGWMFGSITLFVACEVVDLTSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESVSLNYLMSCVLS

Query:  GLEAEYIPIVCHIEGKDSMTWQELYATLLTFKNTLVRLNVVS--QVDTTEAQSANYAHAKSSAEGN--FNQG----GQNRNNFG---GRGNMRGRGRDK-
        GLEAEY+PIVC IEGKDS +WQEL+ATL+TF+NTL+RLN+VS    +     S NY H+K ++ GN  F+Q     GQ R ++     + N+RGRGR + 
Subjt:  GLEAEYIPIVCHIEGKDSMTWQELYATLLTFKNTLVRLNVVS--QVDTTEAQSANYAHAKSSAEGN--FNQG----GQNRNNFG---GRGNMRGRGRDK-

Query:  SYSRNNNSKSTCQLCGKYGHTAAICYNRFNKEFAGFYSNNHSSNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLNLKSEYTGK
        S  R NNSK +CQLCGKYGH AA+CY RF++ F     NN SS++N+     RN+A++A PE+V +PS LADSGAT H+T+DL NLN+KS+Y G+
Subjt:  SYSRNNNSKSTCQLCGKYGHTAAICYNRFNKEFAGFYSNNHSSNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLNLKSEYTGK

A0A6J1DU77 uncharacterized protein LOC111024384 isoform X1 | e value: 2.1e-109 | % identity: 57.72
Query:  DDSSSTTLPQATVTFPISSTQTPT---TSSFGHPLGTVLIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHSVGNEDSSTTATNQVINPAY
        +++ ++++P   VT     T  P+    +SFGHPLGTVL VKLDDKNYSLWRGMVL VLRGQK DG+VLGT  +P + L S    ++  T+ +  +NP Y
Subjt:  DDSSSTTLPQATVTFPISSTQTPT---TSSFGHPLGTVLIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHSVGNEDSSTTATNQVINPAY

Query:  EEWRATDQALLGWMFGSITLFVACEVVDLTSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESVSLNYLMSCVLS
         EW+A DQALLGW+FGS+T  +AC+VVD  SSR+VW ALEDL+GATSKARI QLR VLQ TKK +++M+EYL  MKQ SESLKLAGE V+ NYLMSCVLS
Subjt:  EEWRATDQALLGWMFGSITLFVACEVVDLTSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESVSLNYLMSCVLS

Query:  GLEAEYIPIVCHIEGKDSMTWQELYATLLTFKNTLVRLNVVS--QVDTTEAQSANYAHAKSSAEGN--FNQG----GQNRNNFG---GRGNMRGRGRDK-
        GLEAEY+PIVC IEGKDS +WQEL+ATL+TF+NTL+RLN+VS    +     S NY H+K ++ GN  F+Q     GQ R ++     + N+RGRGR + 
Subjt:  GLEAEYIPIVCHIEGKDSMTWQELYATLLTFKNTLVRLNVVS--QVDTTEAQSANYAHAKSSAEGN--FNQG----GQNRNNFG---GRGNMRGRGRDK-

Query:  SYSRNNNSKSTCQLCGKYGHTAAICYNRFNKEFAGFYSNNHSSNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLNLKSEYTGK
        S  R NNSK +CQLCGKYGH AA+CY RF++ F     NN SS++N+     RN+A++A PE+V +PS LADSGAT H+T+DL NLN+KS+Y GK
Subjt:  SYSRNNNSKSTCQLCGKYGHTAAICYNRFNKEFAGFYSNNHSSNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLNLKSEYTGK

A0A803QD97 Uncharacterized protein | e value: 5.4e-81 | % identity: 40.77
Query:  SSTQTPTTSSFGHPLGTVLIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHSVGNEDSSTTATNQVINPAYEEWRATDQALLGWMFGSITL
        SS+   +   FG  L     +KLD  N+SLW+ MV  + RG ++DG++ G +  P++ L +   E     A    INP +E W   DQ L+GW++GS+T 
Subjt:  SSTQTPTTSSFGHPLGTVLIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHSVGNEDSSTTATNQVINPAYEEWRATDQALLGWMFGSITL

Query:  FVACEVVDLTSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESVSLNYLMSCVLSGLEAEYIPIVCHIEGKDSMT
         +A E++  +SS ++W +LE LFGA SKA++ + R  +QT +KG++ M +YL   KQ S+ L LAG+    + L+S VLSGL+ EY+PIV  IE ++  T
Subjt:  FVACEVVDLTSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESVSLNYLMSCVLSGLEAEYIPIVCHIEGKDSMT

Query:  WQELYATLLTFKNTLVRLNVVSQVDTTEAQSANYAH-AKSSAEGNFNQGGQNRNNFGG-RGNMRGRGRDKSYSRNNNSKSTCQLCGKYGHTAAICYNRFN
        WQ L   LL+F + L RL+ +S+       ++  A+ A  S  GN+N G  N    GG   N RGR   +  S     K TCQ+CG+YGH+AA CYNRF+
Subjt:  WQELYATLLTFKNTLVRLNVVSQVDTTEAQSANYAH-AKSSAEGNFNQGGQNRNNFGG-RGNMRGRGRDKSYSRNNNSKSTCQLCGKYGHTAAICYNRFN

Query:  KEFAGFYSNNHSSNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLNLKSEYTGKEKLTVGNGSRLNISHIGQNVINIQHADKSPLVMKN
        + F G     ++ ++N   SG    AF+ATPEM+ D +  A+SGA++H+T++  NLN K++Y GK+ LTVG+GS+L I H G   ++  ++  SPL++K 
Subjt:  KEFAGFYSNNHSSNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLNLKSEYTGKEKLTVGNGSRLNISHIGQNVINIQHADKSPLVMKN

Query:  I--CAKYQKNLLSVAKFTAQNNCFIEFHSTCCLVKDKETKRVLL
        +    K  KNL+S++K TA NN  +EF S  C VKD +TK+ +L
Subjt:  I--CAKYQKNLLSVAKFTAQNNCFIEFHSTCCLVKDKETKRVLL

SwissProt top hits | e value | % identity | Alignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 1.1e-30 | % identity: 27.08
Query:  IVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHSVGNEDSSTTATNQVINPAYEEWRATDQALLGWMFGSITLFVACEVVDLTSSRDVWVAL
        + KL   NY +W   V  +  G ++ GF+ G+   P     ++G + +        +NP Y  W+  D+ +   + G+I++ V   V   T++  +W  L
Subjt:  IVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHSVGNEDSSTTATNQVINPAYEEWRATDQALLGWMFGSITLFVACEVVDLTSSRDVWVAL

Query:  EDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESVSLNYLMSCVLSGLEAEYIPIVCHIEGKDS-MTWQELYATLLTFKNTLVRL
          ++   S   + QLR  L+   KGT  + +Y+  +    + L L G+ +  +  +  VL  L  EY P++  I  KD+  T  E++  LL  ++ ++ +
Subjt:  EDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESVSLNYLMSCVLSGLEAEYIPIVCHIEGKDS-MTWQELYATLLTFKNTLVRL

Query:  NVVSQVDTTEAQSANYAHAKSSAEGNFNQGGQNRNNFGGRGNMRG-----RGRDKSYSRNNNSK---STCQLCGKYGHTAAICYNRFNKEFAGFYSNNHS
        +  + +  T    AN    +++   N N  G   N +  R N        +     +  NN SK     CQ+CG  GH+A  C      +   F S+ +S
Subjt:  NVVSQVDTTEAQSANYAHAKSSAEGNFNQGGQNRNNFGGRGNMRG-----RGRDKSYSRNNNSK---STCQLCGKYGHTAAICYNRFNKEFAGFYSNNHS

Query:  SNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLNLKSEYTGKEKLTVGNGSRLNISHIGQNVINIQHADKSPLVMKNI--CAKYQKNLL
            S  +  +  A +A        + L DSGAT HIT+D  NL+L   YTG + + V +GS + ISH G   ++ +     PL + NI       KNL+
Subjt:  SNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLNLKSEYTGKEKLTVGNGSRLNISHIGQNVINIQHADKSPLVMKNI--CAKYQKNLL

Query:  SVAKFTAQNNCFIEFHSTCCLVKDKETKRVLL
        SV +    N   +EF      VKD  T   LL
Subjt:  SVAKFTAQNNCFIEFHSTCCLVKDKETKRVLL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 8.5e-23 | % identity: 26.56
Query:  IVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHSVGNEDSSTTATNQV--INPAYEEWRATDQALLGWMFGSITLFVACEVVDLTSSRDVWV
        + KL   NY +W   V  +  G ++ GF+ G+   P            +T  T+ V  +NP Y  WR  D+ +   + G+I++ V   V   T++  +W 
Subjt:  IVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHSVGNEDSSTTATNQV--INPAYEEWRATDQALLGWMFGSITLFVACEVVDLTSSRDVWV

Query:  ALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESVSLNYLMSCVLSGLEAEYIPIVCHIEGKDS-MTWQELYATLLTFKNTLV
         L  ++   S   + QLR + +                    + L L G+ +  +  +  VL  L  +Y P++  I  KD+  +  E++  L+  ++ L+
Subjt:  ALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESVSLNYLMSCVLSGLEAEYIPIVCHIEGKDS-MTWQELYATLLTFKNTLV

Query:  RLNVVSQVDTTEAQSAN-YAHAKSSAEGNFNQGGQNR--NNFGGRGNMRGRGRDKSYSRNNNSK---STCQLCGKYGHTAAICYNRFNKEFAGFYSNNHS
         LN    V  T    AN   H  ++   N N  G NR  NN   R N        S S N   K     CQ+C   GH+A  C      +   F S  + 
Subjt:  RLNVVSQVDTTEAQSAN-YAHAKSSAEGNFNQGGQNR--NNFGGRGNMRGRGRDKSYSRNNNSK---STCQLCGKYGHTAAICYNRFNKEFAGFYSNNHS

Query:  SNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLNLKSEYTGKEKLTVGNGSRLNISHIGQNVINIQHAD---KSPLVMKNICAKYQKNL
          S S  +  +  A +A        + L DSGAT HIT+D  NL+    YTG + + + +GS + I+H G   +            L + NI     KNL
Subjt:  SNSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLNLKSEYTGKEKLTVGNGSRLNISHIGQNVINIQHAD---KSPLVMKNICAKYQKNL

Query:  LSVAKFTAQNNCFIEFHSTCCLVKDKETKRVLL
        +SV +    N   +EF      VKD  T   LL
Subjt:  LSVAKFTAQNNCFIEFHSTCCLVKDKETKRVLL

Arabidopsis top hits | e value | % identity | Alignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | e value: 1.9e-06 | % identity: 23.55
Query:  LIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHSVGNEDSSTTATNQVINPAYEEWRATDQALLGWMFGSIT-LFVACEVVDLTSSRDVWV
        +++ +++ NY  WR + L       + G + GT +  N                N V       W+  D  +   ++G++T        V  ++SRD+W+
Subjt:  LIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHSVGNEDSSTTATNQVINPAYEEWRATDQALLGWMFGSIT-LFVACEVVDLTSSRDVWV

Query:  ALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESVSLNYLMSCVLSGLEAEYIPIVCHIEGKDSMTWQELYATLLTFKNTLVR
         +++ F     AR ++L   L+T   G +R+ +Y   MK+ ++SL+     V+   L+  VL+GL  ++  I+  I+ +      +  AT+L  +   ++
Subjt:  ALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESVSLNYLMSCVLSGLEAEYIPIVCHIEGKDSMTWQELYATLLTFKNTLVR

Query:  LNVVSQVDTTEAQSANYAHAKSSAE--GNFNQGGQNRNNFGGRGN----MRGRGRDKSY
          +       +  S++   A S A    NF + G N+  + GRG      RGRG   SY
Subjt:  LNVVSQVDTTEAQSANYAHAKSSAE--GNFNQGGQNRNNFGGRGN----MRGRGRDKSY

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | e value: 1.3e-07 | % identity: 26.24
Query:  VGNEDSSTTATNQVINPAYEEWRATDQALLGWMFGSITLFVACEVVDL-TSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSE
        +G+ D S+T T        + W+  D  +  W++G+IT  +   ++ +  ++RD+W++LE+LF    +AR +Q    L+TT    + + EY   +K  S+
Subjt:  VGNEDSSTTATNQVINPAYEEWRATDQALLGWMFGSITLFVACEVVDL-TSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSE

Query:  SLKLAGESVSLNYLMSCVLSGLEAEYIPIVCHIEGKDSM-TWQELYATLLTFKNTLVRLNVVSQVDTTEAQSANYAHAKSSAEGNFNQGGQNRNNFGGRG
         L      +S   L+  +L+GL  +Y  I+  I+ K    ++ E  + LL  ++ L   +  S   T     +N        +  + Q   N N+  GRG
Subjt:  SLKLAGESVSLNYLMSCVLSGLEAEYIPIVCHIEGKDSM-TWQELYATLLTFKNTLVRLNVVSQVDTTEAQSANYAHAKSSAEGNFNQGGQNRNNFGGRG

Query:  -----NMRGRGRDKSYSRNNN
             N  G   D  Y+ NNN
Subjt:  -----NMRGRGRDKSYSRNNN


Sequences
CDS sequence
ATGGCAGACGATTCGAGCTCCACCACCCTTCCACAAGCCACTGTCACTTTCCCCATCAGTTCTACCCAAACTCCTACAACATCCTCGTTTGGTCACCCCCTTGGAACTGT
CCTTATTGTCAAGCTCGATGACAAGAACTACTCACTGTGGAGAGGGATGGTCCTTGTTGTCCTTCGTGGTCAAAAGATAGATGGCTTTGTTCTCGGTACAAAGGTTCAAC
CGAATAAGCTTCTCCACTCTGTTGGAAATGAAGACTCCTCAACTACTGCTACAAATCAGGTTATCAATCCTGCCTATGAAGAATGGAGGGCCACTGACCAAGCACTGCTT
GGATGGATGTTTGGCTCAATAACACTGTTCGTTGCCTGTGAAGTTGTGGATTTGACATCCTCACGAGATGTCTGGGTCGCCCTTGAAGATCTCTTTGGTGCAACCAGTAA
AGCAAGAATCATTCAATTGAGGAAAGTTCTTCAAACAACCAAGAAAGGAACAATACGGATGACTGAATATCTCACTTTTATGAAGCAAACATCCGAGAGTTTAAAACTCG
CAGGTGAATCAGTAAGCTTGAACTACTTGATGTCTTGTGTTTTATCTGGTTTAGAAGCAGAGTACATTCCCATTGTATGCCACATTGAGGGCAAAGATTCCATGACATGG
CAAGAATTATATGCCACCCTCCTGACCTTTAAAAATACTCTAGTAAGGCTCAATGTTGTGAGCCAAGTTGACACCACAGAAGCACAAAGTGCAAATTATGCTCATGCCAA
ATCCAGTGCTGAAGGAAATTTCAACCAAGGTGGTCAGAATCGTAACAACTTTGGTGGCAGAGGAAATATGAGAGGTAGGGGCAGAGATAAGTCATACTCTAGGAACAACA
ATTCAAAGTCAACATGTCAATTATGTGGGAAATATGGTCACACTGCTGCAATTTGTTACAATCGTTTTAACAAAGAATTTGCTGGCTTTTATTCTAACAATCATTCCTCT
AACTCTAATAGTCAAGACAGTGGAACTAGAAATGCAGCATTTATAGCCACTCCTGAGATGGTGGTTGACCCAAGTCGGCTAGCAGATAGTGGGGCTACTAGCCACATCAC
AGCAGATCTTGGAAATCTCAATCTGAAGTCTGAGTACACTGGTAAGGAAAAACTTACTGTTGGTAATGGTAGTCGTTTGAATATCTCTCACATTGGACAAAATGTCATAA
ATATACAGCATGCTGATAAGTCTCCTCTCGTGATGAAAAATATATGTGCCAAATATCAAAAGAACTTGTTAAGTGTAGCCAAGTTTACTGCTCAAAATAACTGTTTTATT
GAATTTCACTCTACATGTTGTTTGGTGAAGGACAAGGAAACAAAGAGGGTTCTACTGTAA
mRNA sequence
ATGGCAGACGATTCGAGCTCCACCACCCTTCCACAAGCCACTGTCACTTTCCCCATCAGTTCTACCCAAACTCCTACAACATCCTCGTTTGGTCACCCCCTTGGAACTGT
CCTTATTGTCAAGCTCGATGACAAGAACTACTCACTGTGGAGAGGGATGGTCCTTGTTGTCCTTCGTGGTCAAAAGATAGATGGCTTTGTTCTCGGTACAAAGGTTCAAC
CGAATAAGCTTCTCCACTCTGTTGGAAATGAAGACTCCTCAACTACTGCTACAAATCAGGTTATCAATCCTGCCTATGAAGAATGGAGGGCCACTGACCAAGCACTGCTT
GGATGGATGTTTGGCTCAATAACACTGTTCGTTGCCTGTGAAGTTGTGGATTTGACATCCTCACGAGATGTCTGGGTCGCCCTTGAAGATCTCTTTGGTGCAACCAGTAA
AGCAAGAATCATTCAATTGAGGAAAGTTCTTCAAACAACCAAGAAAGGAACAATACGGATGACTGAATATCTCACTTTTATGAAGCAAACATCCGAGAGTTTAAAACTCG
CAGGTGAATCAGTAAGCTTGAACTACTTGATGTCTTGTGTTTTATCTGGTTTAGAAGCAGAGTACATTCCCATTGTATGCCACATTGAGGGCAAAGATTCCATGACATGG
CAAGAATTATATGCCACCCTCCTGACCTTTAAAAATACTCTAGTAAGGCTCAATGTTGTGAGCCAAGTTGACACCACAGAAGCACAAAGTGCAAATTATGCTCATGCCAA
ATCCAGTGCTGAAGGAAATTTCAACCAAGGTGGTCAGAATCGTAACAACTTTGGTGGCAGAGGAAATATGAGAGGTAGGGGCAGAGATAAGTCATACTCTAGGAACAACA
ATTCAAAGTCAACATGTCAATTATGTGGGAAATATGGTCACACTGCTGCAATTTGTTACAATCGTTTTAACAAAGAATTTGCTGGCTTTTATTCTAACAATCATTCCTCT
AACTCTAATAGTCAAGACAGTGGAACTAGAAATGCAGCATTTATAGCCACTCCTGAGATGGTGGTTGACCCAAGTCGGCTAGCAGATAGTGGGGCTACTAGCCACATCAC
AGCAGATCTTGGAAATCTCAATCTGAAGTCTGAGTACACTGGTAAGGAAAAACTTACTGTTGGTAATGGTAGTCGTTTGAATATCTCTCACATTGGACAAAATGTCATAA
ATATACAGCATGCTGATAAGTCTCCTCTCGTGATGAAAAATATATGTGCCAAATATCAAAAGAACTTGTTAAGTGTAGCCAAGTTTACTGCTCAAAATAACTGTTTTATT
GAATTTCACTCTACATGTTGTTTGGTGAAGGACAAGGAAACAAAGAGGGTTCTACTGTAA
Protein sequence
MADDSSSTTLPQATVTFPISSTQTPTTSSFGHPLGTVLIVKLDDKNYSLWRGMVLVVLRGQKIDGFVLGTKVQPNKLLHSVGNEDSSTTATNQVINPAYEEWRATDQALL
GWMFGSITLFVACEVVDLTSSRDVWVALEDLFGATSKARIIQLRKVLQTTKKGTIRMTEYLTFMKQTSESLKLAGESVSLNYLMSCVLSGLEAEYIPIVCHIEGKDSMTW
QELYATLLTFKNTLVRLNVVSQVDTTEAQSANYAHAKSSAEGNFNQGGQNRNNFGGRGNMRGRGRDKSYSRNNNSKSTCQLCGKYGHTAAICYNRFNKEFAGFYSNNHSS
NSNSQDSGTRNAAFIATPEMVVDPSRLADSGATSHITADLGNLNLKSEYTGKEKLTVGNGSRLNISHIGQNVINIQHADKSPLVMKNICAKYQKNLLSVAKFTAQNNCFI
EFHSTCCLVKDKETKRVLL