; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039599 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039599
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr2:47026568..47032856
RNA-Seq ExpressionLag0039599
SyntenyLag0039599
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049700.1 T4.5 [Cucumis melo var. makuwa]4.4e-7249.85Show/hide
Query:  MTEQSPLKDAHSPIFLLTNICNLISIRLDSSNYVLWKFQFSSKLKAHKLFGFVGRSNKAPTEFLQSTFESESLSSSTRTLNPLYDDWCAKDQALMTLINT
        +   S  KD+ SPIFLL+NICNLIS+RLDS+N+VLWKFQ ++ LKAHKL+GF+  +N  P      T  S S S+     NP Y+DW AKDQALMT+IN 
Subjt:  MTEQSPLKDAHSPIFLLTNICNLISIRLDSSNYVLWKFQFSSKLKAHKLFGFVGRSNKAPTEFLQSTFESESLSSSTRTLNPLYDDWCAKDQALMTLINT

Query:  TLSTKALTYIVSCKSANE-----------------------------KADESIDAYVRRIKKIEDKLVNVSSVVNDEDLLIYALNGLPLEHNIFRTSMRT
        TLS +AL Y+V   S+ +                             K DESIDAY++RIK+I+DKL NVS+ +N+EDLLIYALNGLP E+N FRTSMRT
Subjt:  TLSTKALTYIVSCKSANE-----------------------------KADESIDAYVRRIKKIEDKLVNVSSVVNDEDLLIYALNGLPLEHNIFRTSMRT

Query:  CSQPVTFDELHVLMKSEEFALAKQSKREDLSIQPTAMVANQS-------THRNQFNLNGRGHG---------FVGQSSGKSSVSAGAKMN-----CQICN
         SQPVTF+ELHVL+++EE ALAKQSK +D   QPT ++++         T  N F + G GHG         F  Q+ G  S      ++     CQIC+
Subjt:  CSQPVTFDELHVLMKSEEFALAKQSKREDLSIQPTAMVANQS-------THRNQFNLNGRGHG---------FVGQSSGKSSVSAGAKMN-----CQICN

Query:  RPGHMTLDCYNRMDYHFEGRHPPPQLAAM
        R GH  LDC+NRM+Y+F+GRHPP QLAAM
Subjt:  RPGHMTLDCYNRMDYHFEGRHPPPQLAAM

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]4.4e-7249.85Show/hide
Query:  MTEQSPLKDAHSPIFLLTNICNLISIRLDSSNYVLWKFQFSSKLKAHKLFGFVGRSNKAPTEFLQSTFESESLSSSTRTLNPLYDDWCAKDQALMTLINT
        +   S  KD+ SPIFLL+NICNLIS+RLDS+N+VLWKFQ ++ LKAHKL+GF+  +N  P      T  S S S+     NP Y+DW AKDQALMT+IN 
Subjt:  MTEQSPLKDAHSPIFLLTNICNLISIRLDSSNYVLWKFQFSSKLKAHKLFGFVGRSNKAPTEFLQSTFESESLSSSTRTLNPLYDDWCAKDQALMTLINT

Query:  TLSTKALTYIVSCKSANE-----------------------------KADESIDAYVRRIKKIEDKLVNVSSVVNDEDLLIYALNGLPLEHNIFRTSMRT
        TLS +AL Y+V   S+ +                             K DESIDAY++RIK+I+DKL NVS+ +N+EDLLIYALNGLP E+N FRTSMRT
Subjt:  TLSTKALTYIVSCKSANE-----------------------------KADESIDAYVRRIKKIEDKLVNVSSVVNDEDLLIYALNGLPLEHNIFRTSMRT

Query:  CSQPVTFDELHVLMKSEEFALAKQSKREDLSIQPTAMVANQS-------THRNQFNLNGRGHG---------FVGQSSGKSSVSAGAKMN-----CQICN
         SQPVTF+ELHVL+++EE ALAKQSK +D   QPT ++++         T  N F + G GHG         F  Q+ G  S      ++     CQIC+
Subjt:  CSQPVTFDELHVLMKSEEFALAKQSKREDLSIQPTAMVANQS-------THRNQFNLNGRGHG---------FVGQSSGKSSVSAGAKMN-----CQICN

Query:  RPGHMTLDCYNRMDYHFEGRHPPPQLAAM
        R GH  LDC+NRM+Y+F+GRHPP QLAAM
Subjt:  RPGHMTLDCYNRMDYHFEGRHPPPQLAAM

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]4.4e-7249.85Show/hide
Query:  MTEQSPLKDAHSPIFLLTNICNLISIRLDSSNYVLWKFQFSSKLKAHKLFGFVGRSNKAPTEFLQSTFESESLSSSTRTLNPLYDDWCAKDQALMTLINT
        +   S  KD+ SPIFLL+NICNLIS+RLDS+N+VLWKFQ ++ LKAHKL+GF+  +N  P      T  S S S+     NP Y+DW AKDQALMT+IN 
Subjt:  MTEQSPLKDAHSPIFLLTNICNLISIRLDSSNYVLWKFQFSSKLKAHKLFGFVGRSNKAPTEFLQSTFESESLSSSTRTLNPLYDDWCAKDQALMTLINT

Query:  TLSTKALTYIVSCKSANE-----------------------------KADESIDAYVRRIKKIEDKLVNVSSVVNDEDLLIYALNGLPLEHNIFRTSMRT
        TLS +AL Y+V   S+ +                             K DESIDAY++RIK+I+DKL NVS+ +N+EDLLIYALNGLP E+N FRTSMRT
Subjt:  TLSTKALTYIVSCKSANE-----------------------------KADESIDAYVRRIKKIEDKLVNVSSVVNDEDLLIYALNGLPLEHNIFRTSMRT

Query:  CSQPVTFDELHVLMKSEEFALAKQSKREDLSIQPTAMVANQS-------THRNQFNLNGRGHG---------FVGQSSGKSSVSAGAKMN-----CQICN
         SQPVTF+ELHVL+++EE ALAKQSK +D   QPT ++++         T  N F + G GHG         F  Q+ G  S      ++     CQIC+
Subjt:  CSQPVTFDELHVLMKSEEFALAKQSKREDLSIQPTAMVANQS-------THRNQFNLNGRGHG---------FVGQSSGKSSVSAGAKMN-----CQICN

Query:  RPGHMTLDCYNRMDYHFEGRHPPPQLAAM
        R GH  LDC+NRM+Y+F+GRHPP QLAAM
Subjt:  RPGHMTLDCYNRMDYHFEGRHPPPQLAAM

XP_016900446.1 PREDICTED: uncharacterized protein LOC103490319 isoform X1 [Cucumis melo]4.4e-7249.85Show/hide
Query:  MTEQSPLKDAHSPIFLLTNICNLISIRLDSSNYVLWKFQFSSKLKAHKLFGFVGRSNKAPTEFLQSTFESESLSSSTRTLNPLYDDWCAKDQALMTLINT
        +   S  KD+ SPIFLL+NICNLIS+RLDS+N+VLWKFQ ++ LKAHKL+GF+  +N  P      T  S S S+     NP Y+DW AKDQALMT+IN 
Subjt:  MTEQSPLKDAHSPIFLLTNICNLISIRLDSSNYVLWKFQFSSKLKAHKLFGFVGRSNKAPTEFLQSTFESESLSSSTRTLNPLYDDWCAKDQALMTLINT

Query:  TLSTKALTYIVSCKSANE-----------------------------KADESIDAYVRRIKKIEDKLVNVSSVVNDEDLLIYALNGLPLEHNIFRTSMRT
        TLS +AL Y+V   S+ +                             K DESIDAY++RIK+I+DKL NVS+ +N+EDLLIYALNGLP E+N FRTSMRT
Subjt:  TLSTKALTYIVSCKSANE-----------------------------KADESIDAYVRRIKKIEDKLVNVSSVVNDEDLLIYALNGLPLEHNIFRTSMRT

Query:  CSQPVTFDELHVLMKSEEFALAKQSKREDLSIQPTAMVANQS-------THRNQFNLNGRGHG---------FVGQSSGKSSVSAGAKMN-----CQICN
         SQPVTF+ELHVL+++EE ALAKQSK +D   QPT ++++         T  N F + G GHG         F  Q+ G  S      ++     CQIC+
Subjt:  CSQPVTFDELHVLMKSEEFALAKQSKREDLSIQPTAMVANQS-------THRNQFNLNGRGHG---------FVGQSSGKSSVSAGAKMN-----CQICN

Query:  RPGHMTLDCYNRMDYHFEGRHPPPQLAAM
        R GH  LDC+NRM+Y+F+GRHPP QLAAM
Subjt:  RPGHMTLDCYNRMDYHFEGRHPPPQLAAM

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]1.6e-7451.06Show/hide
Query:  KDAHSPIFLLTNICNLISIRLDSSNYVLWKFQFSSKLKAHKLFGFVGRSNKAPTEFLQSTFESESLSSSTRTL---NPLYDDWCAKDQALMTLINTTLST
        KD HSPIFLL+NICNL+SIRLDS++++LWKFQ ++ LKAHKLFGF+  S  AP++FL S+ E+ES  ++T +L   NP ++DW AKDQALMTLIN TLS 
Subjt:  KDAHSPIFLLTNICNLISIRLDSSNYVLWKFQFSSKLKAHKLFGFVGRSNKAPTEFLQSTFESESLSSSTRTL---NPLYDDWCAKDQALMTLINTTLST

Query:  KALTYIV-----------------------------SCKSANEKADESIDAYVRRIKKIEDKLVNVSSVVNDEDLLIYALNGLPLEHNIFRTSMRTCSQP
        +AL Y+V                               +S  +K +ESIDAYV+RIK+I+DK  NVS  +NDE LLIYALNGL  E+N   TSMRT +Q 
Subjt:  KALTYIV-----------------------------SCKSANEKADESIDAYVRRIKKIEDKLVNVSSVVNDEDLLIYALNGLPLEHNIFRTSMRTCSQP

Query:  VTFDELHVLMKSEEFALAKQSKREDLSIQPTAMVA--------------NQSTHRNQFNLNGRGHG-----FVGQSSGKS------SVSAGAKMNCQICN
        V+F+ELHV MKSEE A+ KQ KREDL  QP A+ A              NQS  R +   NGRG       F  Q  G+S      S  A  +  CQIC 
Subjt:  VTFDELHVLMKSEEFALAKQSKREDLSIQPTAMVA--------------NQSTHRNQFNLNGRGHG-----FVGQSSGKS------SVSAGAKMNCQICN

Query:  RPGHMTLDCYNRMDYHFEGRHPPPQLAAM
        + GH  LDCYNRM++HF+GRHPPPQLAAM
Subjt:  RPGHMTLDCYNRMDYHFEGRHPPPQLAAM

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X22.1e-7249.85Show/hide
Query:  MTEQSPLKDAHSPIFLLTNICNLISIRLDSSNYVLWKFQFSSKLKAHKLFGFVGRSNKAPTEFLQSTFESESLSSSTRTLNPLYDDWCAKDQALMTLINT
        +   S  KD+ SPIFLL+NICNLIS+RLDS+N+VLWKFQ ++ LKAHKL+GF+  +N  P      T  S S S+     NP Y+DW AKDQALMT+IN 
Subjt:  MTEQSPLKDAHSPIFLLTNICNLISIRLDSSNYVLWKFQFSSKLKAHKLFGFVGRSNKAPTEFLQSTFESESLSSSTRTLNPLYDDWCAKDQALMTLINT

Query:  TLSTKALTYIVSCKSANE-----------------------------KADESIDAYVRRIKKIEDKLVNVSSVVNDEDLLIYALNGLPLEHNIFRTSMRT
        TLS +AL Y+V   S+ +                             K DESIDAY++RIK+I+DKL NVS+ +N+EDLLIYALNGLP E+N FRTSMRT
Subjt:  TLSTKALTYIVSCKSANE-----------------------------KADESIDAYVRRIKKIEDKLVNVSSVVNDEDLLIYALNGLPLEHNIFRTSMRT

Query:  CSQPVTFDELHVLMKSEEFALAKQSKREDLSIQPTAMVANQS-------THRNQFNLNGRGHG---------FVGQSSGKSSVSAGAKMN-----CQICN
         SQPVTF+ELHVL+++EE ALAKQSK +D   QPT ++++         T  N F + G GHG         F  Q+ G  S      ++     CQIC+
Subjt:  CSQPVTFDELHVLMKSEEFALAKQSKREDLSIQPTAMVANQS-------THRNQFNLNGRGHG---------FVGQSSGKSSVSAGAKMN-----CQICN

Query:  RPGHMTLDCYNRMDYHFEGRHPPPQLAAM
        R GH  LDC+NRM+Y+F+GRHPP QLAAM
Subjt:  RPGHMTLDCYNRMDYHFEGRHPPPQLAAM

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X32.1e-7249.85Show/hide
Query:  MTEQSPLKDAHSPIFLLTNICNLISIRLDSSNYVLWKFQFSSKLKAHKLFGFVGRSNKAPTEFLQSTFESESLSSSTRTLNPLYDDWCAKDQALMTLINT
        +   S  KD+ SPIFLL+NICNLIS+RLDS+N+VLWKFQ ++ LKAHKL+GF+  +N  P      T  S S S+     NP Y+DW AKDQALMT+IN 
Subjt:  MTEQSPLKDAHSPIFLLTNICNLISIRLDSSNYVLWKFQFSSKLKAHKLFGFVGRSNKAPTEFLQSTFESESLSSSTRTLNPLYDDWCAKDQALMTLINT

Query:  TLSTKALTYIVSCKSANE-----------------------------KADESIDAYVRRIKKIEDKLVNVSSVVNDEDLLIYALNGLPLEHNIFRTSMRT
        TLS +AL Y+V   S+ +                             K DESIDAY++RIK+I+DKL NVS+ +N+EDLLIYALNGLP E+N FRTSMRT
Subjt:  TLSTKALTYIVSCKSANE-----------------------------KADESIDAYVRRIKKIEDKLVNVSSVVNDEDLLIYALNGLPLEHNIFRTSMRT

Query:  CSQPVTFDELHVLMKSEEFALAKQSKREDLSIQPTAMVANQS-------THRNQFNLNGRGHG---------FVGQSSGKSSVSAGAKMN-----CQICN
         SQPVTF+ELHVL+++EE ALAKQSK +D   QPT ++++         T  N F + G GHG         F  Q+ G  S      ++     CQIC+
Subjt:  CSQPVTFDELHVLMKSEEFALAKQSKREDLSIQPTAMVANQS-------THRNQFNLNGRGHG---------FVGQSSGKSSVSAGAKMN-----CQICN

Query:  RPGHMTLDCYNRMDYHFEGRHPPPQLAAM
        R GH  LDC+NRM+Y+F+GRHPP QLAAM
Subjt:  RPGHMTLDCYNRMDYHFEGRHPPPQLAAM

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X12.1e-7249.85Show/hide
Query:  MTEQSPLKDAHSPIFLLTNICNLISIRLDSSNYVLWKFQFSSKLKAHKLFGFVGRSNKAPTEFLQSTFESESLSSSTRTLNPLYDDWCAKDQALMTLINT
        +   S  KD+ SPIFLL+NICNLIS+RLDS+N+VLWKFQ ++ LKAHKL+GF+  +N  P      T  S S S+     NP Y+DW AKDQALMT+IN 
Subjt:  MTEQSPLKDAHSPIFLLTNICNLISIRLDSSNYVLWKFQFSSKLKAHKLFGFVGRSNKAPTEFLQSTFESESLSSSTRTLNPLYDDWCAKDQALMTLINT

Query:  TLSTKALTYIVSCKSANE-----------------------------KADESIDAYVRRIKKIEDKLVNVSSVVNDEDLLIYALNGLPLEHNIFRTSMRT
        TLS +AL Y+V   S+ +                             K DESIDAY++RIK+I+DKL NVS+ +N+EDLLIYALNGLP E+N FRTSMRT
Subjt:  TLSTKALTYIVSCKSANE-----------------------------KADESIDAYVRRIKKIEDKLVNVSSVVNDEDLLIYALNGLPLEHNIFRTSMRT

Query:  CSQPVTFDELHVLMKSEEFALAKQSKREDLSIQPTAMVANQS-------THRNQFNLNGRGHG---------FVGQSSGKSSVSAGAKMN-----CQICN
         SQPVTF+ELHVL+++EE ALAKQSK +D   QPT ++++         T  N F + G GHG         F  Q+ G  S      ++     CQIC+
Subjt:  CSQPVTFDELHVLMKSEEFALAKQSKREDLSIQPTAMVANQS-------THRNQFNLNGRGHG---------FVGQSSGKSSVSAGAKMN-----CQICN

Query:  RPGHMTLDCYNRMDYHFEGRHPPPQLAAM
        R GH  LDC+NRM+Y+F+GRHPP QLAAM
Subjt:  RPGHMTLDCYNRMDYHFEGRHPPPQLAAM

A0A5D3CLI6 T4.52.1e-7249.85Show/hide
Query:  MTEQSPLKDAHSPIFLLTNICNLISIRLDSSNYVLWKFQFSSKLKAHKLFGFVGRSNKAPTEFLQSTFESESLSSSTRTLNPLYDDWCAKDQALMTLINT
        +   S  KD+ SPIFLL+NICNLIS+RLDS+N+VLWKFQ ++ LKAHKL+GF+  +N  P      T  S S S+     NP Y+DW AKDQALMT+IN 
Subjt:  MTEQSPLKDAHSPIFLLTNICNLISIRLDSSNYVLWKFQFSSKLKAHKLFGFVGRSNKAPTEFLQSTFESESLSSSTRTLNPLYDDWCAKDQALMTLINT

Query:  TLSTKALTYIVSCKSANE-----------------------------KADESIDAYVRRIKKIEDKLVNVSSVVNDEDLLIYALNGLPLEHNIFRTSMRT
        TLS +AL Y+V   S+ +                             K DESIDAY++RIK+I+DKL NVS+ +N+EDLLIYALNGLP E+N FRTSMRT
Subjt:  TLSTKALTYIVSCKSANE-----------------------------KADESIDAYVRRIKKIEDKLVNVSSVVNDEDLLIYALNGLPLEHNIFRTSMRT

Query:  CSQPVTFDELHVLMKSEEFALAKQSKREDLSIQPTAMVANQS-------THRNQFNLNGRGHG---------FVGQSSGKSSVSAGAKMN-----CQICN
         SQPVTF+ELHVL+++EE ALAKQSK +D   QPT ++++         T  N F + G GHG         F  Q+ G  S      ++     CQIC+
Subjt:  CSQPVTFDELHVLMKSEEFALAKQSKREDLSIQPTAMVANQS-------THRNQFNLNGRGHG---------FVGQSSGKSSVSAGAKMN-----CQICN

Query:  RPGHMTLDCYNRMDYHFEGRHPPPQLAAM
        R GH  LDC+NRM+Y+F+GRHPP QLAAM
Subjt:  RPGHMTLDCYNRMDYHFEGRHPPPQLAAM

A0A6J1D9L6 uncharacterized protein LOC1110188927.8e-7551.06Show/hide
Query:  KDAHSPIFLLTNICNLISIRLDSSNYVLWKFQFSSKLKAHKLFGFVGRSNKAPTEFLQSTFESESLSSSTRTL---NPLYDDWCAKDQALMTLINTTLST
        KD HSPIFLL+NICNL+SIRLDS++++LWKFQ ++ LKAHKLFGF+  S  AP++FL S+ E+ES  ++T +L   NP ++DW AKDQALMTLIN TLS 
Subjt:  KDAHSPIFLLTNICNLISIRLDSSNYVLWKFQFSSKLKAHKLFGFVGRSNKAPTEFLQSTFESESLSSSTRTL---NPLYDDWCAKDQALMTLINTTLST

Query:  KALTYIV-----------------------------SCKSANEKADESIDAYVRRIKKIEDKLVNVSSVVNDEDLLIYALNGLPLEHNIFRTSMRTCSQP
        +AL Y+V                               +S  +K +ESIDAYV+RIK+I+DK  NVS  +NDE LLIYALNGL  E+N   TSMRT +Q 
Subjt:  KALTYIV-----------------------------SCKSANEKADESIDAYVRRIKKIEDKLVNVSSVVNDEDLLIYALNGLPLEHNIFRTSMRTCSQP

Query:  VTFDELHVLMKSEEFALAKQSKREDLSIQPTAMVA--------------NQSTHRNQFNLNGRGHG-----FVGQSSGKS------SVSAGAKMNCQICN
        V+F+ELHV MKSEE A+ KQ KREDL  QP A+ A              NQS  R +   NGRG       F  Q  G+S      S  A  +  CQIC 
Subjt:  VTFDELHVLMKSEEFALAKQSKREDLSIQPTAMVA--------------NQSTHRNQFNLNGRGHG-----FVGQSSGKS------SVSAGAKMNCQICN

Query:  RPGHMTLDCYNRMDYHFEGRHPPPQLAAM
        + GH  LDCYNRM++HF+GRHPPPQLAAM
Subjt:  RPGHMTLDCYNRMDYHFEGRHPPPQLAAM

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.4e-2830.46Show/hide
Query:  QVKDYLTCKKVHKAL---KEKPKGMTDEDWEALDEEAVATIRMCFSMDMTSLVAHETTA---------VKLMEALTNKY---------------------
        +++D L  + +HK L    +KP  M  EDW  LDE A + IR+  S D+ + +  E TA         + + + LTNK                      
Subjt:  QVKDYLTCKKVHKAL---KEKPKGMTDEDWEALDEEAVATIRMCFSMDMTSLVAHETTA---------VKLMEALTNKY---------------------

Query:  --------------EKPSANDK---LLTSLPDSWETMKTAESNSTGNNTLKFSEVCDIAIAKEIRRQGSNKESTVGSTLVMT-KGKDKVDEENESSNSRK
                       K    DK   LL SLP S++ + T   +  G  T++  +V    +  E  R+   K    G  L+   +G+      N    S  
Subjt:  --------------EKPSANDK---LLTSLPDSWETMKTAESNSTGNNTLKFSEVCDIAIAKEIRRQGSNKESTVGSTLVMT-KGKDKVDEENESSNSRK

Query:  KWKSRNEVE-----CFYCHKKGQFKSQC---RKFKED---QKRKPKANVVVK----VVLVCVESD--IKYSNHSSDWILNSAASVHIASNRSLFTSFTGG
        + KS+N  +     C+ C++ G FK  C   RK K +   QK       +V+    VVL   E +  +  S   S+W++++AAS H    R LF  +  G
Subjt:  KWKSRNEVE-----CFYCHKKGQFKSQC---RKFKED---QKRKPKANVVVK----VVLVCVESD--IKYSNHSSDWILNSAASVHIASNRSLFTSFTGG

Query:  HHGLVRMGNDRTSKTRGIGDVSLKTECGGILVLRDVRIVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLYRCQLNVAKG
          G V+MGN   SK  GIGD+ +KT  G  LVL+DVR VP+++MNLIS   L  DGY   F +++ +L  GS V+A G  + TLYR    + +G
Subjt:  HHGLVRMGNDRTSKTRGIGDVSLKTECGGILVLRDVRIVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLYRCQLNVAKG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-0237.74Show/hide
Query:  STTAIQLAHNTVFHGRTKHVEVDYHFVHECVVHQDVLLKYTSTQSQFVDVFTK
        S +AI L+ N+++H RTKH++V YH++ E V  + + +   ST     D+ TK
Subjt:  STTAIQLAHNTVFHGRTKHVEVDYHFVHECVVHQDVLLKYTSTQSQFVDVFTK

P92519 Uncharacterized mitochondrial protein AtMg008102.0e-0650.88Show/hide
Query:  MHSPSLCHLSAAKRVLRYLQGTTFKGLLFKKSASGLVLNAFSDSDWVGSVLDKRSTT
        MH P+L      KRVLRY++GT F GL   K+ S L + AF DSDW G    +RSTT
Subjt:  MHSPSLCHLSAAKRVLRYLQGTTFKGLLFKKSASGLVLNAFSDSDWVGSVLDKRSTT

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.1e-0928.81Show/hide
Query:  FMHSPSLCHLSAAKRVLRYLQGTTFKGLLFKKSASGLVLNAFSDSDWVGSVLDKRSTT------------------------------------------
        FMH P+  HL A KR+LRYL GT   G+  KK  + L L+A+SD+DW G   D  ST                                           
Subjt:  FMHSPSLCHLSAAKRVLRYLQGTTFKGLLFKKSASGLVLNAFSDSDWVGSVLDKRSTT------------------------------------------

Query:  ------------------------AIQLAHNTVFHGRTKHVEVDYHFVHECVVHQDVLLKYTSTQSQFVDVFTKPLT
                                A  L  N VFH R KH+ +DYHF+   V    + + + ST  Q  D  TKPL+
Subjt:  ------------------------AIQLAHNTVFHGRTKHVEVDYHFVHECVVHQDVLLKYTSTQSQFVDVFTKPLT

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-0929.38Show/hide
Query:  FMHSPSLCHLSAAKRVLRYLQGTTFKGLLFKKSASGLVLNAFSDSDWVGSVLDKRSTT------------------------------------------
        +MH P+  H +A KRVLRYL GT   G+  KK  + L L+A+SD+DW G   D  ST                                           
Subjt:  FMHSPSLCHLSAAKRVLRYLQGTTFKGLLFKKSASGLVLNAFSDSDWVGSVLDKRSTT------------------------------------------

Query:  --------AIQLAH----------------NTVFHGRTKHVEVDYHFVHECVVHQDVLLKYTSTQSQFVDVFTKPLT
                 IQL+H                N VFH R KH+ +DYHF+   V    + + + ST  Q  D  TKPL+
Subjt:  --------AIQLAH----------------NTVFHGRTKHVEVDYHFVHECVVHQDVLLKYTSTQSQFVDVFTKPLT

Arabidopsis top hitse value%identityAlignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein7.1e-0423.08Show/hide
Query:  KSRNEVECFYCHKKGQFKSQCRKFKEDQKRKPKANVVVKVVLVCVESDIKYSNHSSDWILNSAASVHIASNRSLFTSFTGGHHGLVRMGNDRTSKTRGIG
        KS++E  C  C+K    +  C+      K + +  +VV   L  V +    +     WI++  A +++      FT+        V   +       G G
Subjt:  KSRNEVECFYCHKKGQFKSQCRKFKEDQKRKPKANVVVKVVLVCVESDIKYSNHSSDWILNSAASVHIASNRSLFTSFTGGHHGLVRMGNDRTSKTRGIG

Query:  DVSLKTECGGILVLRDVRIVPNIKMNLISIGKLADDGYMCEFG
        DV ++ + G    +R+V  VP +  N++S GK+    Y    G
Subjt:  DVSLKTECGGILVLRDVRIVPNIKMNLISIGKLADDGYMCEFG

AT3G29785.1 unknown protein1.2e-0634.18Show/hide
Query:  MQVKDYLTCKKVHKALKEKPKGMTDEDWEALDEEAVATIRMCFSMDMTSLVAHETTAVKLMEALTNKYEKPSANDKLLT
        M+++DYL  KK+H+ L +K + M+ +DW  L  + +  IR+  S ++   VA E +   LM+ L++ Y+KPS N+ +++
Subjt:  MQVKDYLTCKKVHKALKEKPKGMTDEDWEALDEEAVATIRMCFSMDMTSLVAHETTAVKLMEALTNKYEKPSANDKLLT

ATMG00810.1 DNA/RNA polymerases superfamily protein1.4e-0750.88Show/hide
Query:  MHSPSLCHLSAAKRVLRYLQGTTFKGLLFKKSASGLVLNAFSDSDWVGSVLDKRSTT
        MH P+L      KRVLRY++GT F GL   K+ S L + AF DSDW G    +RSTT
Subjt:  MHSPSLCHLSAAKRVLRYLQGTTFKGLLFKKSASGLVLNAFSDSDWVGSVLDKRSTT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGTCAAGGATTATTTAACTTGCAAGAAAGTGCATAAGGCATTGAAGGAGAAACCGAAAGGGATGACAGACGAAGATTGGGAAGCTCTAGATGAAGAGGCAGTTGC
AACCATAAGGATGTGTTTTTCAATGGATATGACAAGTCTAGTAGCCCATGAGACAACTGCAGTCAAGTTGATGGAAGCGCTTACAAACAAGTATGAAAAACCCTCTGCAA
ATGATAAGTTGTTAACATCTTTACCTGATAGTTGGGAAACGATGAAGACAGCAGAGTCTAATTCAACTGGAAATAACACTTTAAAATTTTCAGAAGTTTGTGATATAGCC
ATAGCTAAGGAAATTCGTAGGCAGGGTAGTAATAAAGAGTCTACAGTAGGGTCAACTTTGGTTATGACTAAAGGGAAAGATAAGGTTGATGAAGAAAATGAATCGAGTAA
CAGTAGGAAAAAGTGGAAAAGTAGGAATGAGGTAGAATGTTTTTACTGCCATAAGAAAGGTCAATTCAAGAGTCAGTGTAGGAAATTTAAAGAGGATCAGAAAAGAAAAC
CAAAGGCAAATGTAGTGGTGAAGGTTGTCTTAGTTTGTGTTGAGAGTGACATAAAATATAGTAACCACTCATCAGATTGGATATTAAACAGTGCAGCTTCTGTTCACATA
GCTTCAAATAGGAGTTTGTTCACATCATTCACAGGAGGGCATCATGGCCTAGTGAGGATGGGGAATGATAGAACCTCCAAGACTAGAGGGATTGGAGATGTTAGTCTGAA
GACAGAATGTGGAGGTATATTGGTACTACGAGATGTCAGGATCGTGCCTAATATCAAGATGAATCTTATTTCTATTGGTAAGTTGGCAGATGATGGTTACATGTGTGAGT
TTGGTAGTCGCCAGTGTAAACTCAAGTTCGGATCCCAGGTAGTGGCAGTTGGTCACAGGAAATCTACACTGTACAGATGTCAGTTGAATGTTGCCAAAGGTTCAAAGAGA
CAGTGGATGCCGGTTAAAGCTACAGATGGTAATTGTAGAGGTACAGCTGAGCCAGCATCAAGGATACCCAATTTCGATCAGTCCGATCAAGATCCTTCAGTTCAGATACA
ATTGGGAAGTCCAGGAGAGAAAGTTGATGGCTATCATGAATCCCCAGTTGTCAGACGCTCGAATGAATTGAAGAAGTCGCTTAGGCGAGTTGAGGCATCAAAGTGGAAGG
CCAGAGCAGTTGCTAAGGTCAAAGGTCTGGTCTCTAGCTTGGTAATAGGTTTGAATAGAGGATTCAAGCCATTCTCAGAGTGCATATTCTTCAAGAACAATTGTTCGGGT
TGGAAGAAGATGACAGAGCAATCTCCTTTGAAAGATGCTCACTCTCCTATTTTTCTTCTCACCAATATTTGCAATCTCATCTCTATTCGTCTTGATTCATCAAATTATGT
CCTCTGGAAATTTCAATTTTCTTCTAAGTTGAAGGCTCACAAATTGTTTGGTTTTGTTGGTAGATCTAACAAAGCACCAACCGAATTTCTTCAATCTACGTTTGAATCGG
AATCCCTGTCTTCTTCAACTCGTACTCTTAATCCTCTGTATGATGACTGGTGTGCCAAGGATCAAGCGTTGATGACTCTGATAAACACTACTCTCTCTACCAAAGCCCTA
ACTTATATTGTTAGTTGCAAATCTGCCAATGAGAAAGCGGATGAATCAATTGATGCTTATGTTAGACGAATTAAGAAGATTGAGGATAAACTTGTTAATGTTTCTTCAGT
TGTAAACGATGAAGACTTACTGATCTATGCTTTGAATGGTCTTCCTCTTGAGCATAATATCTTTCGAACGTCAATGAGGACTTGCTCACAACCAGTGACATTTGATGAAC
TCCATGTCTTGATGAAATCTGAAGAGTTTGCTTTAGCCAAGCAATCTAAACGAGAAGATTTGTCGATTCAGCCTACTGCTATGGTTGCGAATCAGTCAACTCATCGGAAT
CAGTTTAATTTGAATGGTCGTGGACATGGTTTTGTTGGTCAATCCTCTGGTAAGTCTTCTGTTTCAGCTGGTGCTAAAATGAATTGTCAAATTTGCAATCGTCCTGGTCA
CATGACTCTTGACTGCTACAATAGGATGGATTATCACTTTGAAGGTCGTCATCCTCCTCCACAACTGGCTGCCATGGCCAAACTTTGTTCGAAAGGACCTAGTGTTAATG
GTCTTTATCCCATTACACCTTATTCCTCTAGCATTTACTCTTATCTGGGAGGCTCACCCACTGCTCATGTTGGAGTCAAGTCTCCTTCTACTCTTTGGCACAACTGCTTT
TTTGTCTCTTGCCATGTTCTGTTTGATGAAACTGTCTTTCATTTCTCTTCTACTTCGTCTACTTCACCTAATTCTTCTTCATATACGTCTACTTCATCTTTACCCTCTTA
TTCATTGTCTATCCCTTTGCCTTTACTTTTATCTTCTCCATCATCTACTTCTTTACCGTCATCTCCCTCTGTTCCACAACCTTGTACTGATAATTTGGTGTTTGAATTAC
CTCATGCTAACACTGTTACACCATCTAGTAATGGTGCTAATGTTGAGCCAGCTTCTAATCTTGGTAGTCATGTTGAGCACATTACTAGTAGTGGTGTTTCAACACCTTGT
TCTTCAACTCAACCTTCCTCCAATGTCAAAAATACTCATGCTATGAAGACACGTTCAAAGTCTGAAGAAGATATTGATTATGATGAGACGTTTAGTTCTGTTGTTAAGAA
ACCCACTGTTCGTATTATTCTTTCATCACTTGCTGCACATTTTGGCTGGAAGTTGTGGCAGCTTGATTTTATGCATTCACCTTCGCTTTGTCATTTGTCTGCGGCAAAAC
GGGTCTTAAGATATCTTCAGGGCACGACATTTAAGGGTCTGTTGTTCAAGAAATCAGCTTCAGGCCTTGTTTTAAATGCTTTTTCGGACTCTGATTGGGTTGGCAGTGTG
TTGGATAAAAGGTCTACTACAGCTATTCAATTGGCTCATAACACTGTTTTTCATGGTCGTACAAAGCATGTTGAAGTCGACTATCACTTTGTTCATGAGTGTGTTGTTCA
TCAGGATGTTCTGTTGAAATATACATCTACTCAGTCTCAATTTGTTGATGTTTTTACAAAACCTTTGACCACAGATCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAAGTCAAGGATTATTTAACTTGCAAGAAAGTGCATAAGGCATTGAAGGAGAAACCGAAAGGGATGACAGACGAAGATTGGGAAGCTCTAGATGAAGAGGCAGTTGC
AACCATAAGGATGTGTTTTTCAATGGATATGACAAGTCTAGTAGCCCATGAGACAACTGCAGTCAAGTTGATGGAAGCGCTTACAAACAAGTATGAAAAACCCTCTGCAA
ATGATAAGTTGTTAACATCTTTACCTGATAGTTGGGAAACGATGAAGACAGCAGAGTCTAATTCAACTGGAAATAACACTTTAAAATTTTCAGAAGTTTGTGATATAGCC
ATAGCTAAGGAAATTCGTAGGCAGGGTAGTAATAAAGAGTCTACAGTAGGGTCAACTTTGGTTATGACTAAAGGGAAAGATAAGGTTGATGAAGAAAATGAATCGAGTAA
CAGTAGGAAAAAGTGGAAAAGTAGGAATGAGGTAGAATGTTTTTACTGCCATAAGAAAGGTCAATTCAAGAGTCAGTGTAGGAAATTTAAAGAGGATCAGAAAAGAAAAC
CAAAGGCAAATGTAGTGGTGAAGGTTGTCTTAGTTTGTGTTGAGAGTGACATAAAATATAGTAACCACTCATCAGATTGGATATTAAACAGTGCAGCTTCTGTTCACATA
GCTTCAAATAGGAGTTTGTTCACATCATTCACAGGAGGGCATCATGGCCTAGTGAGGATGGGGAATGATAGAACCTCCAAGACTAGAGGGATTGGAGATGTTAGTCTGAA
GACAGAATGTGGAGGTATATTGGTACTACGAGATGTCAGGATCGTGCCTAATATCAAGATGAATCTTATTTCTATTGGTAAGTTGGCAGATGATGGTTACATGTGTGAGT
TTGGTAGTCGCCAGTGTAAACTCAAGTTCGGATCCCAGGTAGTGGCAGTTGGTCACAGGAAATCTACACTGTACAGATGTCAGTTGAATGTTGCCAAAGGTTCAAAGAGA
CAGTGGATGCCGGTTAAAGCTACAGATGGTAATTGTAGAGGTACAGCTGAGCCAGCATCAAGGATACCCAATTTCGATCAGTCCGATCAAGATCCTTCAGTTCAGATACA
ATTGGGAAGTCCAGGAGAGAAAGTTGATGGCTATCATGAATCCCCAGTTGTCAGACGCTCGAATGAATTGAAGAAGTCGCTTAGGCGAGTTGAGGCATCAAAGTGGAAGG
CCAGAGCAGTTGCTAAGGTCAAAGGTCTGGTCTCTAGCTTGGTAATAGGTTTGAATAGAGGATTCAAGCCATTCTCAGAGTGCATATTCTTCAAGAACAATTGTTCGGGT
TGGAAGAAGATGACAGAGCAATCTCCTTTGAAAGATGCTCACTCTCCTATTTTTCTTCTCACCAATATTTGCAATCTCATCTCTATTCGTCTTGATTCATCAAATTATGT
CCTCTGGAAATTTCAATTTTCTTCTAAGTTGAAGGCTCACAAATTGTTTGGTTTTGTTGGTAGATCTAACAAAGCACCAACCGAATTTCTTCAATCTACGTTTGAATCGG
AATCCCTGTCTTCTTCAACTCGTACTCTTAATCCTCTGTATGATGACTGGTGTGCCAAGGATCAAGCGTTGATGACTCTGATAAACACTACTCTCTCTACCAAAGCCCTA
ACTTATATTGTTAGTTGCAAATCTGCCAATGAGAAAGCGGATGAATCAATTGATGCTTATGTTAGACGAATTAAGAAGATTGAGGATAAACTTGTTAATGTTTCTTCAGT
TGTAAACGATGAAGACTTACTGATCTATGCTTTGAATGGTCTTCCTCTTGAGCATAATATCTTTCGAACGTCAATGAGGACTTGCTCACAACCAGTGACATTTGATGAAC
TCCATGTCTTGATGAAATCTGAAGAGTTTGCTTTAGCCAAGCAATCTAAACGAGAAGATTTGTCGATTCAGCCTACTGCTATGGTTGCGAATCAGTCAACTCATCGGAAT
CAGTTTAATTTGAATGGTCGTGGACATGGTTTTGTTGGTCAATCCTCTGGTAAGTCTTCTGTTTCAGCTGGTGCTAAAATGAATTGTCAAATTTGCAATCGTCCTGGTCA
CATGACTCTTGACTGCTACAATAGGATGGATTATCACTTTGAAGGTCGTCATCCTCCTCCACAACTGGCTGCCATGGCCAAACTTTGTTCGAAAGGACCTAGTGTTAATG
GTCTTTATCCCATTACACCTTATTCCTCTAGCATTTACTCTTATCTGGGAGGCTCACCCACTGCTCATGTTGGAGTCAAGTCTCCTTCTACTCTTTGGCACAACTGCTTT
TTTGTCTCTTGCCATGTTCTGTTTGATGAAACTGTCTTTCATTTCTCTTCTACTTCGTCTACTTCACCTAATTCTTCTTCATATACGTCTACTTCATCTTTACCCTCTTA
TTCATTGTCTATCCCTTTGCCTTTACTTTTATCTTCTCCATCATCTACTTCTTTACCGTCATCTCCCTCTGTTCCACAACCTTGTACTGATAATTTGGTGTTTGAATTAC
CTCATGCTAACACTGTTACACCATCTAGTAATGGTGCTAATGTTGAGCCAGCTTCTAATCTTGGTAGTCATGTTGAGCACATTACTAGTAGTGGTGTTTCAACACCTTGT
TCTTCAACTCAACCTTCCTCCAATGTCAAAAATACTCATGCTATGAAGACACGTTCAAAGTCTGAAGAAGATATTGATTATGATGAGACGTTTAGTTCTGTTGTTAAGAA
ACCCACTGTTCGTATTATTCTTTCATCACTTGCTGCACATTTTGGCTGGAAGTTGTGGCAGCTTGATTTTATGCATTCACCTTCGCTTTGTCATTTGTCTGCGGCAAAAC
GGGTCTTAAGATATCTTCAGGGCACGACATTTAAGGGTCTGTTGTTCAAGAAATCAGCTTCAGGCCTTGTTTTAAATGCTTTTTCGGACTCTGATTGGGTTGGCAGTGTG
TTGGATAAAAGGTCTACTACAGCTATTCAATTGGCTCATAACACTGTTTTTCATGGTCGTACAAAGCATGTTGAAGTCGACTATCACTTTGTTCATGAGTGTGTTGTTCA
TCAGGATGTTCTGTTGAAATATACATCTACTCAGTCTCAATTTGTTGATGTTTTTACAAAACCTTTGACCACAGATCAATGA
Protein sequenceShow/hide protein sequence
MQVKDYLTCKKVHKALKEKPKGMTDEDWEALDEEAVATIRMCFSMDMTSLVAHETTAVKLMEALTNKYEKPSANDKLLTSLPDSWETMKTAESNSTGNNTLKFSEVCDIA
IAKEIRRQGSNKESTVGSTLVMTKGKDKVDEENESSNSRKKWKSRNEVECFYCHKKGQFKSQCRKFKEDQKRKPKANVVVKVVLVCVESDIKYSNHSSDWILNSAASVHI
ASNRSLFTSFTGGHHGLVRMGNDRTSKTRGIGDVSLKTECGGILVLRDVRIVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLYRCQLNVAKGSKR
QWMPVKATDGNCRGTAEPASRIPNFDQSDQDPSVQIQLGSPGEKVDGYHESPVVRRSNELKKSLRRVEASKWKARAVAKVKGLVSSLVIGLNRGFKPFSECIFFKNNCSG
WKKMTEQSPLKDAHSPIFLLTNICNLISIRLDSSNYVLWKFQFSSKLKAHKLFGFVGRSNKAPTEFLQSTFESESLSSSTRTLNPLYDDWCAKDQALMTLINTTLSTKAL
TYIVSCKSANEKADESIDAYVRRIKKIEDKLVNVSSVVNDEDLLIYALNGLPLEHNIFRTSMRTCSQPVTFDELHVLMKSEEFALAKQSKREDLSIQPTAMVANQSTHRN
QFNLNGRGHGFVGQSSGKSSVSAGAKMNCQICNRPGHMTLDCYNRMDYHFEGRHPPPQLAAMAKLCSKGPSVNGLYPITPYSSSIYSYLGGSPTAHVGVKSPSTLWHNCF
FVSCHVLFDETVFHFSSTSSTSPNSSSYTSTSSLPSYSLSIPLPLLLSSPSSTSLPSSPSVPQPCTDNLVFELPHANTVTPSSNGANVEPASNLGSHVEHITSSGVSTPC
SSTQPSSNVKNTHAMKTRSKSEEDIDYDETFSSVVKKPTVRIILSSLAAHFGWKLWQLDFMHSPSLCHLSAAKRVLRYLQGTTFKGLLFKKSASGLVLNAFSDSDWVGSV
LDKRSTTAIQLAHNTVFHGRTKHVEVDYHFVHECVVHQDVLLKYTSTQSQFVDVFTKPLTTDQ