CuGenDBv2

Lag0014399 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0014399
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: DUF4283 domain-containing protein
Genome location: chr12:416609..420343
RNA-Seq Expression: Lag0014399
Synteny: Lag0014399
Gene Ontology terms: NA
InterPro domains: IPR000477 - Reverse transcriptase domain
                  IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0039949.1 hypothetical protein E6C27_scaffold122G002290 [Cucumis melo var. makuwa] (e value: 5.1e-51; % identity: 30.94)
Query:  RKCVAERKLFSIDYSKDRNKRLVKIVEKNHDLRFEVVLESKYVLWVADSIDDLVITPSTQKFFRKTSCENGLIWLQKVSNKRGVFVEIAKVASSGNRSNL
        R C  E+K F +   K   +  + I E      F + +    + W+  +   L+ TP T +FF +    +  +W+QK+ N+RG   EI +V   G +  +
Subjt:  RKCVAERKLFSIDYSKDRNKRLVKIVEKNHDLRFEVVLESKYVLWVADSIDDLVITPSTQKFFRKTSCENGLIWLQKVSNKRGVFVEIAKVASSGNRSNL

Query:  IIPRGEDSKGWRAFLTMI-------NDFFSSKEEQHEDK---KVVYN---------PQKSFADAVKG------------------------------RHK
        ++P G D  GW  F  M+          F ++   ++DK   KV  +         P+K++ +AV                                + +
Subjt:  IIPRGEDSKGWRAFLTMI-------NDFFSSKEEQHEDK---KVVYN---------PQKSFADAVKG------------------------------RHK

Query:  VDGQARQISLPSQETSAMSTTCFHDEWGKIYDTLCN-AFQKDLIIN--PFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPS
        +  + ++  +  +++  +S  CFHD+W KI D L +   +KD      PFH DKALL   D E A ++  N GW+ +G F +KFE W    H    V+PS
Subjt:  VDGQARQISLPSQETSAMSTTCFHDEWGKIYDTLCN-AFQKDLIIN--PFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPS

Query:  YGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEA
        YGGW RFR IPL  W ++TF  IGEA GG+I+   + ++ +   E +IKVK NY GF+PA I I  E+G   I Q +T  + + L  R   +HG FT  A
Subjt:  YGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEA

Query:  ARVF
        A  F
Subjt:  ARVF

KAA0040039.1 hypothetical protein E6C27_scaffold366G00060 [Cucumis melo var. makuwa] (e value: 3.9e-51; % identity: 28.69)
Query:  RKCVAERKLF--SIDYSKDRNKRLVKIVEKNHDLRFEVVLESKYVLWVADSIDDLVITPSTQKFFRKTSCENGLIWLQKVSNKRGVFVEIAKVASSGNRS
        R C  E+K F  S+D      K L+  V         V LES  + W+  S   L+ TP T +FF +   E   +W+QK  N++G   EI +V   G + 
Subjt:  RKCVAERKLF--SIDYSKDRNKRLVKIVEKNHDLRFEVVLESKYVLWVADSIDDLVITPSTQKFFRKTSCENGLIWLQKVSNKRGVFVEIAKVASSGNRS

Query:  NLIIPRGEDSKGWRAFLTMIN--------------------DFFSSKEEQHEDKKVVYNPQKSFADAVKGRHKVDGQARQ--------------ISLPSQ
         +++P G +  GW  F+++++                    D FSS +   + K+      +S+A+AV      D ++                 S   +
Subjt:  NLIIPRGEDSKGWRAFLTMIN--------------------DFFSSKEEQHEDKKVVYNPQKSFADAVKGRHKVDGQARQ--------------ISLPSQ

Query:  ETSAMSTTCFHDEWGKIYDTLCNAFQKDLIINPFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWC
         T+ ++   FHD+W +I + L       +   PFH DKAL+   + E A+++  NKGW+ +G F +KFE W  + H    V+PSYGGW++ R +PL  W 
Subjt:  ETSAMSTTCFHDEWGKIYDTLCNAFQKDLIINPFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWC

Query:  VDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVFFGEDDNREDLCL
        +++F  IG+A GG++E   +   L    E  IK+K NY GFIPA I +  ++    I QV+   + +    R   IHG FT EAA+ F   + N E    
Subjt:  VDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVFFGEDDNREDLCL

Query:  VDKNRIEDQKAKSQIQGAVIKTSHVS-----------ISNNGEGSAAEREKGKSSVCRKRVIEDKVG
         D   +  +KA S + G      ++            ++ +G+  ++E+ K K++     VI  K G
Subjt:  VDKNRIEDQKAKSQIQGAVIKTSHVS-----------ISNNGEGSAAEREKGKSSVCRKRVIEDKVG

KAA0041398.1 hypothetical protein E6C27_scaffold206G00440 [Cucumis melo var. makuwa] (e value: 9.4e-53; % identity: 31.91)
Query:  RKCVAERKLFSIDYSKDRNKRLVKIVEKNHDLRFEVVLESKYVLWVADSIDDLVITPSTQKFFRKTSCENGLIWLQKVSNKRGVFVEIAKVASSGNRSNL
        R C  E+K F +   K   +  + I E      F + +    + W+  +   L+ TP T +FF +    +  +W+QK+ N+RG   EI +V   G +  +
Subjt:  RKCVAERKLFSIDYSKDRNKRLVKIVEKNHDLRFEVVLESKYVLWVADSIDDLVITPSTQKFFRKTSCENGLIWLQKVSNKRGVFVEIAKVASSGNRSNL

Query:  IIPRGEDSKGWRAFLTMIN-DFFSSKEE--------QHEDKKVVY----------NPQKSFADAVKG------------------------RHKVDGQAR
        ++P G D  GW  F  M+     S K+E        Q + K+ +           +P+K++A+ V                          + ++  + R
Subjt:  IIPRGEDSKGWRAFLTMIN-DFFSSKEE--------QHEDKKVVY----------NPQKSFADAVKG------------------------RHKVDGQAR

Query:  QISLPSQETSAMSTTCFHDEWGKIYDTLCNAFQKD---LIINPFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVR
        +  +  ++T  +S  CFHD+W KI D L     K        PFH DKALL   D + A+++  N GW+ +G F +KFE W   +H    V+PSYGGW R
Subjt:  QISLPSQETSAMSTTCFHDEWGKIYDTLCNAFQKD---LIINPFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVR

Query:  FRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF
        FR IPL  W ++TF  IGEA GG+I+   + ++ +   E +IKVK NY GF+PA I I  E+G V I Q+VT  + + L  R   IHG F   AA  F
Subjt:  FRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF

KAA0044449.1 hypothetical protein E6C27_scaffold46G001820 [Cucumis melo var. makuwa] (e value: 1.7e-54; % identity: 34.49)
Query:  RKCVAERKLFSIDYSKDRNKRLVKIVEKNHDLRFEVVLESKYVLWVADSIDDLVITPSTQKFFRKTSCENGLIWLQKVSNKRGVFVEIAKVASSGNRSNL
        R C  E+K F +   K   +  + I E      F + +    + W+  +   L+ TP T +FF +    +  +W+Q + N+RG   EI +V   G +  +
Subjt:  RKCVAERKLFSIDYSKDRNKRLVKIVEKNHDLRFEVVLESKYVLWVADSIDDLVITPSTQKFFRKTSCENGLIWLQKVSNKRGVFVEIAKVASSGNRSNL

Query:  IIPRGEDSKGWRAFLTMI------------------NDFFSSKEEQHEDKKV-VYNPQKSFADAVKGRHKVDGQARQISLPSQETSAMSTTCFHDEWGKI
        ++P G D  GW  F  M+                   D    K +Q  D      +P+K++A+AV             S  ++ TS++   CFHD+W KI
Subjt:  IIPRGEDSKGWRAFLTMI------------------NDFFSSKEEQHEDKKV-VYNPQKSFADAVKGRHKVDGQARQISLPSQETSAMSTTCFHDEWGKI

Query:  YDTLCN-AFQKDLIIN--PFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGY
         D L +   +KD      PFH DKALL   D E A+++  N GW+ +G F +KFE W    H    V+PSYGGW RFR IPL  W ++TF  IGEAYGG+
Subjt:  YDTLCN-AFQKDLIIN--PFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGY

Query:  IECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF
        I+   + ++ +   E +IKVK NY GF+PA I I  E+G   I Q VT    + L  R   IHG FT  AA  F
Subjt:  IECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF

TYK29576.1 hypothetical protein E5676_scaffold655G001820 [Cucumis melo var. makuwa] (e value: 1.7e-54; % identity: 34.49)
Query:  RKCVAERKLFSIDYSKDRNKRLVKIVEKNHDLRFEVVLESKYVLWVADSIDDLVITPSTQKFFRKTSCENGLIWLQKVSNKRGVFVEIAKVASSGNRSNL
        R C  E+K F +   K   +  + I E      F + +    + W+  +   L+ TP T +FF +    +  +W+Q + N+RG   EI +V   G +  +
Subjt:  RKCVAERKLFSIDYSKDRNKRLVKIVEKNHDLRFEVVLESKYVLWVADSIDDLVITPSTQKFFRKTSCENGLIWLQKVSNKRGVFVEIAKVASSGNRSNL

Query:  IIPRGEDSKGWRAFLTMI------------------NDFFSSKEEQHEDKKV-VYNPQKSFADAVKGRHKVDGQARQISLPSQETSAMSTTCFHDEWGKI
        ++P G D  GW  F  M+                   D    K +Q  D      +P+K++A+AV             S  ++ TS++   CFHD+W KI
Subjt:  IIPRGEDSKGWRAFLTMI------------------NDFFSSKEEQHEDKKV-VYNPQKSFADAVKGRHKVDGQARQISLPSQETSAMSTTCFHDEWGKI

Query:  YDTLCN-AFQKDLIIN--PFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGY
         D L +   +KD      PFH DKALL   D E A+++  N GW+ +G F +KFE W    H    V+PSYGGW RFR IPL  W ++TF  IGEAYGG+
Subjt:  YDTLCN-AFQKDLIIN--PFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGY

Query:  IECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF
        I+   + ++ +   E +IKVK NY GF+PA I I  E+G   I Q VT    + L  R   IHG FT  AA  F
Subjt:  IECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TEP0 DUF4283 domain-containing protein (e value: 4.5e-53; % identity: 31.91)
Query:  RKCVAERKLFSIDYSKDRNKRLVKIVEKNHDLRFEVVLESKYVLWVADSIDDLVITPSTQKFFRKTSCENGLIWLQKVSNKRGVFVEIAKVASSGNRSNL
        R C  E+K F +   K   +  + I E      F + +    + W+  +   L+ TP T +FF +    +  +W+QK+ N+RG   EI +V   G +  +
Subjt:  RKCVAERKLFSIDYSKDRNKRLVKIVEKNHDLRFEVVLESKYVLWVADSIDDLVITPSTQKFFRKTSCENGLIWLQKVSNKRGVFVEIAKVASSGNRSNL

Query:  IIPRGEDSKGWRAFLTMIN-DFFSSKEE--------QHEDKKVVY----------NPQKSFADAVKG------------------------RHKVDGQAR
        ++P G D  GW  F  M+     S K+E        Q + K+ +           +P+K++A+ V                          + ++  + R
Subjt:  IIPRGEDSKGWRAFLTMIN-DFFSSKEE--------QHEDKKVVY----------NPQKSFADAVKG------------------------RHKVDGQAR

Query:  QISLPSQETSAMSTTCFHDEWGKIYDTLCNAFQKD---LIINPFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVR
        +  +  ++T  +S  CFHD+W KI D L     K        PFH DKALL   D + A+++  N GW+ +G F +KFE W   +H    V+PSYGGW R
Subjt:  QISLPSQETSAMSTTCFHDEWGKIYDTLCNAFQKD---LIINPFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVR

Query:  FRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF
        FR IPL  W ++TF  IGEA GG+I+   + ++ +   E +IKVK NY GF+PA I I  E+G V I Q+VT  + + L  R   IHG F   AA  F
Subjt:  FRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF

A0A5A7TFK7 DUF4283 domain-containing protein (e value: 1.9e-51; % identity: 28.69)
Query:  RKCVAERKLF--SIDYSKDRNKRLVKIVEKNHDLRFEVVLESKYVLWVADSIDDLVITPSTQKFFRKTSCENGLIWLQKVSNKRGVFVEIAKVASSGNRS
        R C  E+K F  S+D      K L+  V         V LES  + W+  S   L+ TP T +FF +   E   +W+QK  N++G   EI +V   G + 
Subjt:  RKCVAERKLF--SIDYSKDRNKRLVKIVEKNHDLRFEVVLESKYVLWVADSIDDLVITPSTQKFFRKTSCENGLIWLQKVSNKRGVFVEIAKVASSGNRS

Query:  NLIIPRGEDSKGWRAFLTMIN--------------------DFFSSKEEQHEDKKVVYNPQKSFADAVKGRHKVDGQARQ--------------ISLPSQ
         +++P G +  GW  F+++++                    D FSS +   + K+      +S+A+AV      D ++                 S   +
Subjt:  NLIIPRGEDSKGWRAFLTMIN--------------------DFFSSKEEQHEDKKVVYNPQKSFADAVKGRHKVDGQARQ--------------ISLPSQ

Query:  ETSAMSTTCFHDEWGKIYDTLCNAFQKDLIINPFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWC
         T+ ++   FHD+W +I + L       +   PFH DKAL+   + E A+++  NKGW+ +G F +KFE W  + H    V+PSYGGW++ R +PL  W 
Subjt:  ETSAMSTTCFHDEWGKIYDTLCNAFQKDLIINPFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWC

Query:  VDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVFFGEDDNREDLCL
        +++F  IG+A GG++E   +   L    E  IK+K NY GFIPA I +  ++    I QV+   + +    R   IHG FT EAA+ F   + N E    
Subjt:  VDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVFFGEDDNREDLCL

Query:  VDKNRIEDQKAKSQIQGAVIKTSHVS-----------ISNNGEGSAAEREKGKSSVCRKRVIEDKVG
         D   +  +KA S + G      ++            ++ +G+  ++E+ K K++     VI  K G
Subjt:  VDKNRIEDQKAKSQIQGAVIKTSHVS-----------ISNNGEGSAAEREKGKSSVCRKRVIEDKVG

A0A5A7TTA1 DUF4283 domain-containing protein (e value: 8.3e-55; % identity: 34.49)
Query:  RKCVAERKLFSIDYSKDRNKRLVKIVEKNHDLRFEVVLESKYVLWVADSIDDLVITPSTQKFFRKTSCENGLIWLQKVSNKRGVFVEIAKVASSGNRSNL
        R C  E+K F +   K   +  + I E      F + +    + W+  +   L+ TP T +FF +    +  +W+Q + N+RG   EI +V   G +  +
Subjt:  RKCVAERKLFSIDYSKDRNKRLVKIVEKNHDLRFEVVLESKYVLWVADSIDDLVITPSTQKFFRKTSCENGLIWLQKVSNKRGVFVEIAKVASSGNRSNL

Query:  IIPRGEDSKGWRAFLTMI------------------NDFFSSKEEQHEDKKV-VYNPQKSFADAVKGRHKVDGQARQISLPSQETSAMSTTCFHDEWGKI
        ++P G D  GW  F  M+                   D    K +Q  D      +P+K++A+AV             S  ++ TS++   CFHD+W KI
Subjt:  IIPRGEDSKGWRAFLTMI------------------NDFFSSKEEQHEDKKV-VYNPQKSFADAVKGRHKVDGQARQISLPSQETSAMSTTCFHDEWGKI

Query:  YDTLCN-AFQKDLIIN--PFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGY
         D L +   +KD      PFH DKALL   D E A+++  N GW+ +G F +KFE W    H    V+PSYGGW RFR IPL  W ++TF  IGEAYGG+
Subjt:  YDTLCN-AFQKDLIIN--PFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGY

Query:  IECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF
        I+   + ++ +   E +IKVK NY GF+PA I I  E+G   I Q VT    + L  R   IHG FT  AA  F
Subjt:  IECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF

A0A5D3DLT1 DUF4283 domain-containing protein (e value: 2.5e-51; % identity: 30.94)
Query:  RKCVAERKLFSIDYSKDRNKRLVKIVEKNHDLRFEVVLESKYVLWVADSIDDLVITPSTQKFFRKTSCENGLIWLQKVSNKRGVFVEIAKVASSGNRSNL
        R C  E+K F +   K   +  + I E      F + +    + W+  +   L+ TP T +FF +    +  +W+QK+ N+RG   EI +V   G +  +
Subjt:  RKCVAERKLFSIDYSKDRNKRLVKIVEKNHDLRFEVVLESKYVLWVADSIDDLVITPSTQKFFRKTSCENGLIWLQKVSNKRGVFVEIAKVASSGNRSNL

Query:  IIPRGEDSKGWRAFLTMI-------NDFFSSKEEQHEDK---KVVYN---------PQKSFADAVKG------------------------------RHK
        ++P G D  GW  F  M+          F ++   ++DK   KV  +         P+K++ +AV                                + +
Subjt:  IIPRGEDSKGWRAFLTMI-------NDFFSSKEEQHEDK---KVVYN---------PQKSFADAVKG------------------------------RHK

Query:  VDGQARQISLPSQETSAMSTTCFHDEWGKIYDTLCN-AFQKDLIIN--PFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPS
        +  + ++  +  +++  +S  CFHD+W KI D L +   +KD      PFH DKALL   D E A ++  N GW+ +G F +KFE W    H    V+PS
Subjt:  VDGQARQISLPSQETSAMSTTCFHDEWGKIYDTLCN-AFQKDLIIN--PFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPS

Query:  YGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEA
        YGGW RFR IPL  W ++TF  IGEA GG+I+   + ++ +   E +IKVK NY GF+PA I I  E+G   I Q +T  + + L  R   +HG FT  A
Subjt:  YGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEA

Query:  ARVF
        A  F
Subjt:  ARVF

A0A5D3E0Y8 DUF4283 domain-containing protein (e value: 8.3e-55; % identity: 34.49)
Query:  RKCVAERKLFSIDYSKDRNKRLVKIVEKNHDLRFEVVLESKYVLWVADSIDDLVITPSTQKFFRKTSCENGLIWLQKVSNKRGVFVEIAKVASSGNRSNL
        R C  E+K F +   K   +  + I E      F + +    + W+  +   L+ TP T +FF +    +  +W+Q + N+RG   EI +V   G +  +
Subjt:  RKCVAERKLFSIDYSKDRNKRLVKIVEKNHDLRFEVVLESKYVLWVADSIDDLVITPSTQKFFRKTSCENGLIWLQKVSNKRGVFVEIAKVASSGNRSNL

Query:  IIPRGEDSKGWRAFLTMI------------------NDFFSSKEEQHEDKKV-VYNPQKSFADAVKGRHKVDGQARQISLPSQETSAMSTTCFHDEWGKI
        ++P G D  GW  F  M+                   D    K +Q  D      +P+K++A+AV             S  ++ TS++   CFHD+W KI
Subjt:  IIPRGEDSKGWRAFLTMI------------------NDFFSSKEEQHEDKKV-VYNPQKSFADAVKGRHKVDGQARQISLPSQETSAMSTTCFHDEWGKI

Query:  YDTLCN-AFQKDLIIN--PFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGY
         D L +   +KD      PFH DKALL   D E A+++  N GW+ +G F +KFE W    H    V+PSYGGW RFR IPL  W ++TF  IGEAYGG+
Subjt:  YDTLCN-AFQKDLIIN--PFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGY

Query:  IECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF
        I+   + ++ +   E +IKVK NY GF+PA I I  E+G   I Q VT    + L  R   IHG FT  AA  F
Subjt:  IECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF

SwissProt top hits (e value, % identity, alignment)
A4GSN8 Nuclear-pore anchor (e value: 1.2e-05; % identity: 89.66)
Query:  NKLVELYKESSEEWSKKATELEGVIKALE
        NKLV+LYKESSEEWS+KA ELEGVIKALE
Subjt:  NKLVELYKESSEEWSKKATELEGVIKALE

O00370 LINE-1 retrotransposable element ORF2 protein (e value: 1.2e-05; % identity: 28.43)
Query:  MINGRPRGKIFAIRGLRQGDPISPFLFTLIGDSFSRLVRYDIEKRYIRGFCIGSQSIEVSHLQYADDTIMFCPDYGESIDNWWSFISIFLQALGFSLNLS
        ++NG+         G RQG P+SP LF ++ +  +R +R   +++ I+G  +G + +++S   +ADD I++  +   S  N    IS F +  G+ +N+ 
Subjt:  MINGRPRGKIFAIRGLRQGDPISPFLFTLIGDSFSRLVRYDIEKRYIRGFCIGSQSIEVSHLQYADDTIMFCPDYGESIDNWWSFISIFLQALGFSLNLS

Query:  KT
        K+
Subjt:  KT

P08548 LINE-1 reverse transcriptase homolog (e value: 2.6e-05; % identity: 32.69)
Query:  KIFAIR-GLRQGDPISPFLFTLIGDSFSRLVRYDIEKRYIRGFCIGSQSIEVSHLQYADDTIMFCPDYGESIDNWWSFISIFLQALGFSLNLSKTAIAGI
        K F +R G RQG P+SP LF ++ +  +  +R   E++ I+G  IGS+ I++S   +ADD I++  +  +S       I  +    G+ +N  K ++A I
Subjt:  KIFAIR-GLRQGDPISPFLFTLIGDSFSRLVRYDIEKRYIRGFCIGSQSIEVSHLQYADDTIMFCPDYGESIDNWWSFISIFLQALGFSLNLSKTAIAGI

Query:  NTEN
         T N
Subjt:  NTEN

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 4.7e-07; % identity: 30.91)
Query:  INGRPRGKIFAIRGLRQGDPISPFLFTLIGDSFSRLVRYDIEKRYIRGFCIGSQSIEVSHLQYADDTIMFCPDYGESIDNWWSFISIFLQALGFSLNLSK
        +NG     I    G RQG P+SP+LF ++ +  +R +R   +++ I+G  IG + +++S L  ADD I++  D   S     + I+ F + +G+ +N +K
Subjt:  INGRPRGKIFAIRGLRQGDPISPFLFTLIGDSFSRLVRYDIEKRYIRGFCIGSQSIEVSHLQYADDTIMFCPDYGESIDNWWSFISIFLQALGFSLNLSK

Query:  TAIAGINTEN
         ++A + T+N
Subjt:  TAIAGINTEN

P92555 Uncharacterized mitochondrial protein AtMg01250 (e value: 3.9e-09; % identity: 44.12)
Query:  MINGRPRGKIFAIRGLRQGDPISPFLFTLIGDSFSRLVRYDIEKRYIRGFCIGSQSIEVSHLQYADDT
        +ING P+G +   RGLRQGDP+SP+LF L  +  S L R   E+  + G  + + S  ++HL +ADDT
Subjt:  MINGRPRGKIFAIRGLRQGDPISPFLFTLIGDSFSRLVRYDIEKRYIRGFCIGSQSIEVSHLQYADDT

Arabidopsis top hits (e value, % identity, alignment)
AT1G79280.1 nuclear pore anchor (e value: 8.3e-07; % identity: 89.66)
Query:  NKLVELYKESSEEWSKKATELEGVIKALE
        NKLV+LYKESSEEWS+KA ELEGVIKALE
Subjt:  NKLVELYKESSEEWSKKATELEGVIKALE

AT1G79280.2 nuclear pore anchor (e value: 8.3e-07; % identity: 89.66)
Query:  NKLVELYKESSEEWSKKATELEGVIKALE
        NKLV+LYKESSEEWS+KA ELEGVIKALE
Subjt:  NKLVELYKESSEEWSKKATELEGVIKALE

AT1G79280.3 nuclear pore anchor (e value: 8.3e-07; % identity: 89.66)
Query:  NKLVELYKESSEEWSKKATELEGVIKALE
        NKLV+LYKESSEEWS+KA ELEGVIKALE
Subjt:  NKLVELYKESSEEWSKKATELEGVIKALE

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) (e value: 2.8e-10; % identity: 44.12)
Query:  MINGRPRGKIFAIRGLRQGDPISPFLFTLIGDSFSRLVRYDIEKRYIRGFCIGSQSIEVSHLQYADDT
        +ING P+G +   RGLRQGDP+SP+LF L  +  S L R   E+  + G  + + S  ++HL +ADDT
Subjt:  MINGRPRGKIFAIRGLRQGDPISPFLFTLIGDSFSRLVRYDIEKRYIRGFCIGSQSIEVSHLQYADDT


Sequences
CDS sequence
ATGATAAATGGTAGACCGAGGGGCAAGATCTTTGCAATTAGGGGCTTAAGGCAAGGCGATCCTATCTCCCCTTTCCTTTTTACCTTGATTGGGGATTCCTTTAGCCGTTT
GGTGCGGTATGACATTGAGAAGAGATATATTAGGGGCTTTTGTATTGGCAGTCAAAGTATAGAGGTTTCCCACTTACAATATGCTGATGACACTATAATGTTTTGCCCAG
ATTATGGAGAATCGATCGATAATTGGTGGAGCTTCATTTCTATTTTTCTACAAGCTTTGGGTTTCTCACTAAATCTTTCAAAAACAGCTATAGCAGGGATTAATACGGAG
AATGTTCACTTGAATGAGGGGAGACAAGTTGGGATCACCCCAACTTATTATTTATCCATTCTTAAAGCCCCTGCTTGTGTTACAAAAGAAATAGATAAATTGGCAAGGAA
CTTCTTTTGGAATGGGGGAGGCTTTAACCCTACTAGACATATGGTCAATAAGCTTGTTGAACTTTACAAAGAAAGTTCTGAGGAGTGGTCTAAGAAGGCAACAGAGCTTG
AGGGTGTCATAAAAGCCTTGGAGCTAAGTCGAAAAATGACGTCAAGATCAGCTCGCAAGTGTGTAGCGGAAAGAAAGTTGTTCTCCATAGATTATAGTAAAGACAGAAAC
AAACGTCTGGTGAAGATAGTAGAGAAAAACCACGATTTGAGGTTTGAAGTCGTCTTGGAAAGCAAGTATGTTCTATGGGTTGCGGACTCCATTGATGACCTGGTAATCAC
CCCCAGTACCCAAAAATTCTTTCGGAAGACTTCTTGTGAGAATGGACTCATCTGGCTTCAGAAAGTGTCGAACAAAAGAGGAGTGTTCGTGGAGATAGCGAAAGTTGCCT
CGTCCGGAAATAGAAGCAACTTGATAATCCCCCGTGGGGAAGATTCAAAAGGATGGAGAGCTTTTCTTACGATGATAAATGATTTCTTCAGTAGCAAAGAAGAGCAGCAC
GAAGACAAAAAGGTTGTTTACAATCCCCAAAAGTCCTTTGCAGATGCTGTCAAGGGGAGACACAAGGTGGATGGCCAAGCACGTCAAATATCGTTACCTTCCCAAGAGAC
CAGCGCAATGAGCACAACATGCTTCCACGATGAATGGGGTAAGATTTATGATACATTATGCAATGCTTTTCAGAAGGATTTGATTATTAACCCTTTTCATCCAGACAAGG
CCCTCTTAAAGTGCCCAGACGCAGAGTTTGCTCGGATAGTGGCTCATAATAAAGGATGGTCAGTGTTAGGGAATTTCACTCTGAAGTTTGAATATTGGGATTATGAGTTA
CATGGAAGGATCAACGTGGTCCCATCTTATGGGGGATGGGTGAGATTTCGAAACATTCCTCTGCAGAACTGGTGTGTGGACACATTTAAAGCGATTGGGGAAGCCTATGG
AGGGTATATTGAATGCGATGATAAATGCCTCTCGTTAGTGGGTTGTATGGAAGTGGTTATTAAAGTTAAAAGTAATTATTGTGGCTTCATTCCAGCAGAAATTGATATTA
TTCAGGAAGATGGTTCAGTTGCGATTGCTCAAGTAGTCACGTTTGAAGATCCTCAATTGCTGGAAAGTAGAAGAGTTTACATCCATGGCGGTTTTACCAGTGAAGCTGCT
AGGGTTTTCTTTGGGGAAGACGATAACAGAGAAGACCTTTGTCTAGTGGATAAAAACCGAATAGAGGATCAGAAGGCCAAAAGTCAGATACAAGGGGCTGTGATAAAGAC
TAGTCATGTCAGTATTAGTAACAATGGGGAAGGTTCAGCAGCTGAGAGAGAAAAAGGCAAAAGTTCTGTATGCAGAAAACGTGTGATAGAGGACAAAGTGGGGCCCAGTG
ATCAGATTACCAGAACCAATGAAAATTGGGACAATAATAAAAAAGCAGATGAAATGGGAGTGAATACCACGCGCGTGATAGGCTACCAATGGAAAGAAAAAGTCAAAGAT
AAAAAAGGCGAATCTAGCTGGGAGGACGAAATCCAGGGATTTGAAGGCCAAAAGGAGGCGTGGAACCAGGCTGAAGCTCTCGAAGATATAGCAGCTATGTTTGAAGATGA
AGAGAAAGATGAACAGAATCCTCCTCCAAAGGACAATCTGTCCATAGTTACAAGTATGCCTCAAGAAAGTATGATTGATGCTCCTCCTATTAGATCAGAAGACATAGGTA
AGGGGGAGATTAATGTTTTGGGAGCCCGATGA
mRNA sequence
ATGATAAATGGTAGACCGAGGGGCAAGATCTTTGCAATTAGGGGCTTAAGGCAAGGCGATCCTATCTCCCCTTTCCTTTTTACCTTGATTGGGGATTCCTTTAGCCGTTT
GGTGCGGTATGACATTGAGAAGAGATATATTAGGGGCTTTTGTATTGGCAGTCAAAGTATAGAGGTTTCCCACTTACAATATGCTGATGACACTATAATGTTTTGCCCAG
ATTATGGAGAATCGATCGATAATTGGTGGAGCTTCATTTCTATTTTTCTACAAGCTTTGGGTTTCTCACTAAATCTTTCAAAAACAGCTATAGCAGGGATTAATACGGAG
AATGTTCACTTGAATGAGGGGAGACAAGTTGGGATCACCCCAACTTATTATTTATCCATTCTTAAAGCCCCTGCTTGTGTTACAAAAGAAATAGATAAATTGGCAAGGAA
CTTCTTTTGGAATGGGGGAGGCTTTAACCCTACTAGACATATGGTCAATAAGCTTGTTGAACTTTACAAAGAAAGTTCTGAGGAGTGGTCTAAGAAGGCAACAGAGCTTG
AGGGTGTCATAAAAGCCTTGGAGCTAAGTCGAAAAATGACGTCAAGATCAGCTCGCAAGTGTGTAGCGGAAAGAAAGTTGTTCTCCATAGATTATAGTAAAGACAGAAAC
AAACGTCTGGTGAAGATAGTAGAGAAAAACCACGATTTGAGGTTTGAAGTCGTCTTGGAAAGCAAGTATGTTCTATGGGTTGCGGACTCCATTGATGACCTGGTAATCAC
CCCCAGTACCCAAAAATTCTTTCGGAAGACTTCTTGTGAGAATGGACTCATCTGGCTTCAGAAAGTGTCGAACAAAAGAGGAGTGTTCGTGGAGATAGCGAAAGTTGCCT
CGTCCGGAAATAGAAGCAACTTGATAATCCCCCGTGGGGAAGATTCAAAAGGATGGAGAGCTTTTCTTACGATGATAAATGATTTCTTCAGTAGCAAAGAAGAGCAGCAC
GAAGACAAAAAGGTTGTTTACAATCCCCAAAAGTCCTTTGCAGATGCTGTCAAGGGGAGACACAAGGTGGATGGCCAAGCACGTCAAATATCGTTACCTTCCCAAGAGAC
CAGCGCAATGAGCACAACATGCTTCCACGATGAATGGGGTAAGATTTATGATACATTATGCAATGCTTTTCAGAAGGATTTGATTATTAACCCTTTTCATCCAGACAAGG
CCCTCTTAAAGTGCCCAGACGCAGAGTTTGCTCGGATAGTGGCTCATAATAAAGGATGGTCAGTGTTAGGGAATTTCACTCTGAAGTTTGAATATTGGGATTATGAGTTA
CATGGAAGGATCAACGTGGTCCCATCTTATGGGGGATGGGTGAGATTTCGAAACATTCCTCTGCAGAACTGGTGTGTGGACACATTTAAAGCGATTGGGGAAGCCTATGG
AGGGTATATTGAATGCGATGATAAATGCCTCTCGTTAGTGGGTTGTATGGAAGTGGTTATTAAAGTTAAAAGTAATTATTGTGGCTTCATTCCAGCAGAAATTGATATTA
TTCAGGAAGATGGTTCAGTTGCGATTGCTCAAGTAGTCACGTTTGAAGATCCTCAATTGCTGGAAAGTAGAAGAGTTTACATCCATGGCGGTTTTACCAGTGAAGCTGCT
AGGGTTTTCTTTGGGGAAGACGATAACAGAGAAGACCTTTGTCTAGTGGATAAAAACCGAATAGAGGATCAGAAGGCCAAAAGTCAGATACAAGGGGCTGTGATAAAGAC
TAGTCATGTCAGTATTAGTAACAATGGGGAAGGTTCAGCAGCTGAGAGAGAAAAAGGCAAAAGTTCTGTATGCAGAAAACGTGTGATAGAGGACAAAGTGGGGCCCAGTG
ATCAGATTACCAGAACCAATGAAAATTGGGACAATAATAAAAAAGCAGATGAAATGGGAGTGAATACCACGCGCGTGATAGGCTACCAATGGAAAGAAAAAGTCAAAGAT
AAAAAAGGCGAATCTAGCTGGGAGGACGAAATCCAGGGATTTGAAGGCCAAAAGGAGGCGTGGAACCAGGCTGAAGCTCTCGAAGATATAGCAGCTATGTTTGAAGATGA
AGAGAAAGATGAACAGAATCCTCCTCCAAAGGACAATCTGTCCATAGTTACAAGTATGCCTCAAGAAAGTATGATTGATGCTCCTCCTATTAGATCAGAAGACATAGGTA
AGGGGGAGATTAATGTTTTGGGAGCCCGATGA
Protein sequence
MINGRPRGKIFAIRGLRQGDPISPFLFTLIGDSFSRLVRYDIEKRYIRGFCIGSQSIEVSHLQYADDTIMFCPDYGESIDNWWSFISIFLQALGFSLNLSKTAIAGINTE
NVHLNEGRQVGITPTYYLSILKAPACVTKEIDKLARNFFWNGGGFNPTRHMVNKLVELYKESSEEWSKKATELEGVIKALELSRKMTSRSARKCVAERKLFSIDYSKDRN
KRLVKIVEKNHDLRFEVVLESKYVLWVADSIDDLVITPSTQKFFRKTSCENGLIWLQKVSNKRGVFVEIAKVASSGNRSNLIIPRGEDSKGWRAFLTMINDFFSSKEEQH
EDKKVVYNPQKSFADAVKGRHKVDGQARQISLPSQETSAMSTTCFHDEWGKIYDTLCNAFQKDLIINPFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYEL
HGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAA
RVFFGEDDNREDLCLVDKNRIEDQKAKSQIQGAVIKTSHVSISNNGEGSAAEREKGKSSVCRKRVIEDKVGPSDQITRTNENWDNNKKADEMGVNTTRVIGYQWKEKVKD
KKGESSWEDEIQGFEGQKEAWNQAEALEDIAAMFEDEEKDEQNPPPKDNLSIVTSMPQESMIDAPPIRSEDIGKGEINVLGAR