CuGenDBv2

Lag0026437 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0026437
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotran_gag_3 domain-containing protein
Genome location: chr10:36729452..36731316
RNA-Seq Expression: Lag0026437
Synteny: Lag0026437
Gene Ontology terms: NA
InterPro domains: IPR029472 - Retrotransposon Copia-like, N-terminal
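
The genome location above follows a `chrom:start..end` pattern. A minimal sketch for parsing it and measuring the span is given here, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention). The helper names `parse_location` and `span_length` are hypothetical and are not part of any CuGenDB tooling.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("chr10:36729452..36731316")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 1865 bp on chr10.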


Homology
GenBank top hits (e value, %identity, alignment)
KAA0049700.1 T4.5 [Cucumis melo var. makuwa] (e value: 9.4e-66, %identity: 43.93)
Query:  MASTTT----HSLKDLHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLM
        M+S+TT     + KD  S IFLL+NICNLIS+RLDS+N+VLWKFQ +++L+ HKL+GF+DG+N     T +S+ +S+    S   NP+YEDW+ KDQ LM
Subjt:  MASTTT----HSLKDLHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLM

Query:  TLINATLSAEALTYIVGCSSSKDEWEALERHYSSSTQSNI-----------------------RIKEIKDKLANVSSVVNDEDLLIYALNG---------
        T+INATLS EAL Y+VG +SSK  W+ L + YSS ++SN+                       RIKEIKDKLANVS+ +N+EDLLIYALNG         
Subjt:  TLINATLSAEALTYIVGCSSSKDEWEALERHYSSSTQSNI-----------------------RIKEIKDKLANVSSVVNDEDLLIYALNG---------

Query:  ----------------------ESVIEKQAKREDLVVRPTVMYASQQNQFRSNQSSVSSSSFRDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSS
                              ES + KQ+K +D   +PTV+ +S Q+   S   +  ++  R  G G++    R +   F A  +G    +G++    S
Subjt:  ----------------------ESVIEKQAKREDLVVRPTVMYASQQNQFRSNQSSVSSSSFRDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSS

Query:  SADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSPWLADSGCDAHVTSDLSNLAISSEYN
          D+  +CQIC+R GH ALDC+NRMNYNFQGR+P  QLAAMVA+QNN +LS       NSS  L DSGC+  +TSD++ ++++ EYN
Subjt:  SADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSPWLADSGCDAHVTSDLSNLAISSEYN

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo] (e value: 9.4e-66, %identity: 43.93)
Query:  MASTTT----HSLKDLHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLM
        M+S+TT     + KD  S IFLL+NICNLIS+RLDS+N+VLWKFQ +++L+ HKL+GF+DG+N     T +S+ +S+    S   NP+YEDW+ KDQ LM
Subjt:  MASTTT----HSLKDLHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLM

Query:  TLINATLSAEALTYIVGCSSSKDEWEALERHYSSSTQSNI-----------------------RIKEIKDKLANVSSVVNDEDLLIYALNG---------
        T+INATLS EAL Y+VG +SSK  W+ L + YSS ++SN+                       RIKEIKDKLANVS+ +N+EDLLIYALNG         
Subjt:  TLINATLSAEALTYIVGCSSSKDEWEALERHYSSSTQSNI-----------------------RIKEIKDKLANVSSVVNDEDLLIYALNG---------

Query:  ----------------------ESVIEKQAKREDLVVRPTVMYASQQNQFRSNQSSVSSSSFRDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSS
                              ES + KQ+K +D   +PTV+ +S Q+   S   +  ++  R  G G++    R +   F A  +G    +G++    S
Subjt:  ----------------------ESVIEKQAKREDLVVRPTVMYASQQNQFRSNQSSVSSSSFRDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSS

Query:  SADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSPWLADSGCDAHVTSDLSNLAISSEYN
          D+  +CQIC+R GH ALDC+NRMNYNFQGR+P  QLAAMVA+QNN +LS       NSS  L DSGC+  +TSD++ ++++ EYN
Subjt:  SADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSPWLADSGCDAHVTSDLSNLAISSEYN

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo] (e value: 9.4e-66, %identity: 43.93)
Query:  MASTTT----HSLKDLHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLM
        M+S+TT     + KD  S IFLL+NICNLIS+RLDS+N+VLWKFQ +++L+ HKL+GF+DG+N     T +S+ +S+    S   NP+YEDW+ KDQ LM
Subjt:  MASTTT----HSLKDLHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLM

Query:  TLINATLSAEALTYIVGCSSSKDEWEALERHYSSSTQSNI-----------------------RIKEIKDKLANVSSVVNDEDLLIYALNG---------
        T+INATLS EAL Y+VG +SSK  W+ L + YSS ++SN+                       RIKEIKDKLANVS+ +N+EDLLIYALNG         
Subjt:  TLINATLSAEALTYIVGCSSSKDEWEALERHYSSSTQSNI-----------------------RIKEIKDKLANVSSVVNDEDLLIYALNG---------

Query:  ----------------------ESVIEKQAKREDLVVRPTVMYASQQNQFRSNQSSVSSSSFRDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSS
                              ES + KQ+K +D   +PTV+ +S Q+   S   +  ++  R  G G++    R +   F A  +G    +G++    S
Subjt:  ----------------------ESVIEKQAKREDLVVRPTVMYASQQNQFRSNQSSVSSSSFRDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSS

Query:  SADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSPWLADSGCDAHVTSDLSNLAISSEYN
          D+  +CQIC+R GH ALDC+NRMNYNFQGR+P  QLAAMVA+QNN +LS       NSS  L DSGC+  +TSD++ ++++ EYN
Subjt:  SADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSPWLADSGCDAHVTSDLSNLAISSEYN

XP_016900446.1 PREDICTED: uncharacterized protein LOC103490319 isoform X1 [Cucumis melo] (e value: 9.4e-66, %identity: 43.93)
Query:  MASTTT----HSLKDLHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLM
        M+S+TT     + KD  S IFLL+NICNLIS+RLDS+N+VLWKFQ +++L+ HKL+GF+DG+N     T +S+ +S+    S   NP+YEDW+ KDQ LM
Subjt:  MASTTT----HSLKDLHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLM

Query:  TLINATLSAEALTYIVGCSSSKDEWEALERHYSSSTQSNI-----------------------RIKEIKDKLANVSSVVNDEDLLIYALNG---------
        T+INATLS EAL Y+VG +SSK  W+ L + YSS ++SN+                       RIKEIKDKLANVS+ +N+EDLLIYALNG         
Subjt:  TLINATLSAEALTYIVGCSSSKDEWEALERHYSSSTQSNI-----------------------RIKEIKDKLANVSSVVNDEDLLIYALNG---------

Query:  ----------------------ESVIEKQAKREDLVVRPTVMYASQQNQFRSNQSSVSSSSFRDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSS
                              ES + KQ+K +D   +PTV+ +S Q+   S   +  ++  R  G G++    R +   F A  +G    +G++    S
Subjt:  ----------------------ESVIEKQAKREDLVVRPTVMYASQQNQFRSNQSSVSSSSFRDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSS

Query:  SADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSPWLADSGCDAHVTSDLSNLAISSEYN
          D+  +CQIC+R GH ALDC+NRMNYNFQGR+P  QLAAMVA+QNN +LS       NSS  L DSGC+  +TSD++ ++++ EYN
Subjt:  SADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSPWLADSGCDAHVTSDLSNLAISSEYN

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia] (e value: 1.9e-79, %identity: 47.96)
Query:  MASTTTHSLKDLHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTD----LNPAYEDWLTKDQMLM
        M S++T++ KDLHS IFLL+NICNL+SIRLDS++++LWKFQ +++L+ HKLFGF+DGS  A S+ L+S+  + S+ ++T     +NP +EDW+ KDQ LM
Subjt:  MASTTTHSLKDLHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTD----LNPAYEDWLTKDQMLM

Query:  TLINATLSAEALTYIVGCSSSKDEWEALERHYSSSTQSNI-----------------------RIKEIKDKLANVSSVVNDEDLLIYALNG---------
        TLINATLSAEAL Y+V   +SK  WE LE+HYSS++++N+                       RIKEIKDK ANVS  +NDE LLIYALNG         
Subjt:  TLINATLSAEALTYIVGCSSSKDEWEALERHYSSSTQSNI-----------------------RIKEIKDKLANVSSVVNDEDLLIYALNG---------

Query:  ----------------------ESVIEKQAKREDLVVRPTVMYASQQNQFRSNQSSVSSSSFRDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSS
                              ES IEKQ KREDLV +P  ++AS   Q ++  S+   +   D G+G+N+ R ++N +P + + QG    +GN   +S 
Subjt:  ----------------------ESVIEKQAKREDLVVRPTVMYASQQNQFRSNQSSVSSSSFRDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSS

Query:  SADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSP--WLADSGCDAHVTSDLSNL---AISSEYN
         AD+R  CQIC + GH ALDCYNRMN++FQGR+P PQLAAMVA QNN YL+       NSSP  WLADS C+ H+T+DLSNL   +I+S+YN
Subjt:  SADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSP--WLADSGCDAHVTSDLSNL---AISSEYN

TrEMBL top hits (e value, %identity, alignment)
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X2 (e value: 4.5e-66, %identity: 43.93)
Query:  MASTTT----HSLKDLHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLM
        M+S+TT     + KD  S IFLL+NICNLIS+RLDS+N+VLWKFQ +++L+ HKL+GF+DG+N     T +S+ +S+    S   NP+YEDW+ KDQ LM
Subjt:  MASTTT----HSLKDLHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLM

Query:  TLINATLSAEALTYIVGCSSSKDEWEALERHYSSSTQSNI-----------------------RIKEIKDKLANVSSVVNDEDLLIYALNG---------
        T+INATLS EAL Y+VG +SSK  W+ L + YSS ++SN+                       RIKEIKDKLANVS+ +N+EDLLIYALNG         
Subjt:  TLINATLSAEALTYIVGCSSSKDEWEALERHYSSSTQSNI-----------------------RIKEIKDKLANVSSVVNDEDLLIYALNG---------

Query:  ----------------------ESVIEKQAKREDLVVRPTVMYASQQNQFRSNQSSVSSSSFRDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSS
                              ES + KQ+K +D   +PTV+ +S Q+   S   +  ++  R  G G++    R +   F A  +G    +G++    S
Subjt:  ----------------------ESVIEKQAKREDLVVRPTVMYASQQNQFRSNQSSVSSSSFRDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSS

Query:  SADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSPWLADSGCDAHVTSDLSNLAISSEYN
          D+  +CQIC+R GH ALDC+NRMNYNFQGR+P  QLAAMVA+QNN +LS       NSS  L DSGC+  +TSD++ ++++ EYN
Subjt:  SADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSPWLADSGCDAHVTSDLSNLAISSEYN

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X3 (e value: 4.5e-66, %identity: 43.93)
Query:  MASTTT----HSLKDLHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLM
        M+S+TT     + KD  S IFLL+NICNLIS+RLDS+N+VLWKFQ +++L+ HKL+GF+DG+N     T +S+ +S+    S   NP+YEDW+ KDQ LM
Subjt:  MASTTT----HSLKDLHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLM

Query:  TLINATLSAEALTYIVGCSSSKDEWEALERHYSSSTQSNI-----------------------RIKEIKDKLANVSSVVNDEDLLIYALNG---------
        T+INATLS EAL Y+VG +SSK  W+ L + YSS ++SN+                       RIKEIKDKLANVS+ +N+EDLLIYALNG         
Subjt:  TLINATLSAEALTYIVGCSSSKDEWEALERHYSSSTQSNI-----------------------RIKEIKDKLANVSSVVNDEDLLIYALNG---------

Query:  ----------------------ESVIEKQAKREDLVVRPTVMYASQQNQFRSNQSSVSSSSFRDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSS
                              ES + KQ+K +D   +PTV+ +S Q+   S   +  ++  R  G G++    R +   F A  +G    +G++    S
Subjt:  ----------------------ESVIEKQAKREDLVVRPTVMYASQQNQFRSNQSSVSSSSFRDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSS

Query:  SADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSPWLADSGCDAHVTSDLSNLAISSEYN
          D+  +CQIC+R GH ALDC+NRMNYNFQGR+P  QLAAMVA+QNN +LS       NSS  L DSGC+  +TSD++ ++++ EYN
Subjt:  SADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSPWLADSGCDAHVTSDLSNLAISSEYN

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X1 (e value: 4.5e-66, %identity: 43.93)
Query:  MASTTT----HSLKDLHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLM
        M+S+TT     + KD  S IFLL+NICNLIS+RLDS+N+VLWKFQ +++L+ HKL+GF+DG+N     T +S+ +S+    S   NP+YEDW+ KDQ LM
Subjt:  MASTTT----HSLKDLHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLM

Query:  TLINATLSAEALTYIVGCSSSKDEWEALERHYSSSTQSNI-----------------------RIKEIKDKLANVSSVVNDEDLLIYALNG---------
        T+INATLS EAL Y+VG +SSK  W+ L + YSS ++SN+                       RIKEIKDKLANVS+ +N+EDLLIYALNG         
Subjt:  TLINATLSAEALTYIVGCSSSKDEWEALERHYSSSTQSNI-----------------------RIKEIKDKLANVSSVVNDEDLLIYALNG---------

Query:  ----------------------ESVIEKQAKREDLVVRPTVMYASQQNQFRSNQSSVSSSSFRDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSS
                              ES + KQ+K +D   +PTV+ +S Q+   S   +  ++  R  G G++    R +   F A  +G    +G++    S
Subjt:  ----------------------ESVIEKQAKREDLVVRPTVMYASQQNQFRSNQSSVSSSSFRDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSS

Query:  SADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSPWLADSGCDAHVTSDLSNLAISSEYN
          D+  +CQIC+R GH ALDC+NRMNYNFQGR+P  QLAAMVA+QNN +LS       NSS  L DSGC+  +TSD++ ++++ EYN
Subjt:  SADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSPWLADSGCDAHVTSDLSNLAISSEYN

A0A5D3CLI6 T4.5 (e value: 4.5e-66, %identity: 43.93)
Query:  MASTTT----HSLKDLHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLM
        M+S+TT     + KD  S IFLL+NICNLIS+RLDS+N+VLWKFQ +++L+ HKL+GF+DG+N     T +S+ +S+    S   NP+YEDW+ KDQ LM
Subjt:  MASTTT----HSLKDLHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLM

Query:  TLINATLSAEALTYIVGCSSSKDEWEALERHYSSSTQSNI-----------------------RIKEIKDKLANVSSVVNDEDLLIYALNG---------
        T+INATLS EAL Y+VG +SSK  W+ L + YSS ++SN+                       RIKEIKDKLANVS+ +N+EDLLIYALNG         
Subjt:  TLINATLSAEALTYIVGCSSSKDEWEALERHYSSSTQSNI-----------------------RIKEIKDKLANVSSVVNDEDLLIYALNG---------

Query:  ----------------------ESVIEKQAKREDLVVRPTVMYASQQNQFRSNQSSVSSSSFRDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSS
                              ES + KQ+K +D   +PTV+ +S Q+   S   +  ++  R  G G++    R +   F A  +G    +G++    S
Subjt:  ----------------------ESVIEKQAKREDLVVRPTVMYASQQNQFRSNQSSVSSSSFRDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSS

Query:  SADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSPWLADSGCDAHVTSDLSNLAISSEYN
          D+  +CQIC+R GH ALDC+NRMNYNFQGR+P  QLAAMVA+QNN +LS       NSS  L DSGC+  +TSD++ ++++ EYN
Subjt:  SADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSPWLADSGCDAHVTSDLSNLAISSEYN

A0A6J1D9L6 uncharacterized protein LOC111018892 (e value: 9.4e-80, %identity: 47.96)
Query:  MASTTTHSLKDLHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTD----LNPAYEDWLTKDQMLM
        M S++T++ KDLHS IFLL+NICNL+SIRLDS++++LWKFQ +++L+ HKLFGF+DGS  A S+ L+S+  + S+ ++T     +NP +EDW+ KDQ LM
Subjt:  MASTTTHSLKDLHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTD----LNPAYEDWLTKDQMLM

Query:  TLINATLSAEALTYIVGCSSSKDEWEALERHYSSSTQSNI-----------------------RIKEIKDKLANVSSVVNDEDLLIYALNG---------
        TLINATLSAEAL Y+V   +SK  WE LE+HYSS++++N+                       RIKEIKDK ANVS  +NDE LLIYALNG         
Subjt:  TLINATLSAEALTYIVGCSSSKDEWEALERHYSSSTQSNI-----------------------RIKEIKDKLANVSSVVNDEDLLIYALNG---------

Query:  ----------------------ESVIEKQAKREDLVVRPTVMYASQQNQFRSNQSSVSSSSFRDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSS
                              ES IEKQ KREDLV +P  ++AS   Q ++  S+   +   D G+G+N+ R ++N +P + + QG    +GN   +S 
Subjt:  ----------------------ESVIEKQAKREDLVVRPTVMYASQQNQFRSNQSSVSSSSFRDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSS

Query:  SADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSP--WLADSGCDAHVTSDLSNL---AISSEYN
         AD+R  CQIC + GH ALDCYNRMN++FQGR+P PQLAAMVA QNN YL+       NSSP  WLADS C+ H+T+DLSNL   +I+S+YN
Subjt:  SADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSP--WLADSGCDAHVTSDLSNL---AISSEYN

SwissProt top hits (e value, %identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 7.3e-05, %identity: 20.6)
Query:  LHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLMTLINATLSAEALTYI
        L+++  L  N+ N+   +L S+NY++W  Q  ++   ++L GF+DGS      T+ +       +++  +NP Y  W  +D+++ + +   +S      +
Subjt:  LHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLMTLINATLSAEALTYI

Query:  VGCSSSKDEWEALERHYSSSTQSNIR------------IKEIKDKLANVSSVVNDEDLLIYALNGESVIEKQAKREDLVVRPTVMYASQQN---------
           +++   WE L + Y++ +  ++              K I D +  + +  +   LL   ++ +  +E+  +      +P +   + ++         
Subjt:  VGCSSSKDEWEALERHYSSSTQSNIR------------IKEIKDKLANVSSVVNDEDLLIYALNGESVIEKQAKREDLVVRPTVMYASQQN---------

Query:  ----QFRSNQSSVSSSSF------------------RDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSSSADSRISCQICNRPGHMALDCYNRMN
               S   +VSS++                    + G   N    R+NN+     +Q   +F+ N   ++ S      CQIC   GH A  C    +
Subjt:  ----QFRSNQSSVSSSSF------------------RDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSSSADSRISCQICNRPGHMALDCYNRMN

Query:  Y----NFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSPWLADSGCDAHVTSDLSNLAISSEY
        +    N Q + PSP       T      +    + Y+S+ WL DSG   H+TSD +NL++   Y
Subjt:  Y----NFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSPWLADSGCDAHVTSDLSNLAISSEY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 2.7e-07, %identity: 21.59)
Query:  IFLLTNICNLIS---IRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLMTLINATLSAEALTYIV
        + + TNI N+      +L S+NY++W  Q  ++   ++L GF+DGS      T+ +       ++   +NP Y  W  +D+++ + I   +S      + 
Subjt:  IFLLTNICNLIS---IRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLMTLINATLSAEALTYIV

Query:  GCSSSKDEWEALERHYSSSTQSNI---RIKEIKDKLANVSSVVNDEDLLIYALNG-----ESVIEKQAKR----------EDLVVRPTVMYASQQNQ---
          +++   WE L + Y++ +  ++   R     D+LA +   ++ ++ +   L       + VI++ A +          E L+ R + + A    +   
Subjt:  GCSSSKDEWEALERHYSSSTQSNI---RIKEIKDKLANVSSVVNDEDLLIYALNG-----ESVIEKQAKR----------EDLVVRPTVMYASQQNQ---

Query:  ----FRSNQSSVSSSSFRDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSSSADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQ
              +++++ ++ +  + G  RN +   + ++ +  S  G    N    P          CQIC+  GH A  C              PQL    +T 
Subjt:  ----FRSNQSSVSSSSFRDYGQGRNSSRRRSNNSPFSASRQGCDHFNGNAVPSSSSADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQ

Query:  NNQYLST-----------HQNNQYNSSPWLADSGCDAHVTSDLSNLAISSEY
        N Q  ++             N+ YN++ WL DSG   H+TSD +NL+    Y
Subjt:  NNQYLST-----------HQNNQYNSSPWLADSGCDAHVTSDLSNLAISSEY

Arabidopsis top hits (e value, %identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). (e value: 2.6e-05, %identity: 28.81)
Query:  DSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLMTLINATLSAEALTYIVGCSSSKDEWEALERHYSS
        D  NYV WK +F S LRV K FGF+DG       TL   D  S         P Y+ W   + M+M  +  +++ + L  ++   ++   WE L R +  
Subjt:  DSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLMTLINATLSAEALTYIVGCSSSKDEWEALERHYSS

Query:  STQSNIRIKEIKDKLANV
            +++I +++ +LA +
Subjt:  STQSNIRIKEIKDKLANV
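
In BLAST-style pairwise output, the middle line of each block repeats a residue where query and subject are identical and shows `+` for a conservative substitution. The %identity values in the hit tables can therefore be roughly re-derived from the printed rows. The sketch below assumes the rows are passed without their `Query:` labels and that whitespace in the middle line survived extraction; `percent_identity` is a hypothetical helper name.

```python
def percent_identity(query_rows, midline_rows):
    """Return (identities, aligned_columns, percent) for one alignment.

    query_rows   -- the query strings of each block, label removed
    midline_rows -- the matching middle lines (letters = identical residues)
    """
    # Every query column, including gap characters, counts toward the length.
    aligned_columns = sum(len(row) for row in query_rows)
    if aligned_columns == 0:
        raise ValueError("no aligned columns")
    # Only letters on the middle line mark identities; '+' and spaces do not.
    identities = sum(ch.isalpha() for row in midline_rows for ch in row)
    return identities, aligned_columns, 100.0 * identities / aligned_columns

# Last block of the AT1G21280.1 alignment, as printed above.
print(percent_identity(["STQSNIRIKEIKDKLANV"], ["    +++I +++ +LA +"]))
```

Applied to both AT1G21280.1 blocks as printed, this appears to give 34 identities over 118 columns, about 28.81%, in line with the listed value.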


Sequences
CDS sequence
ATGGCTTCCACCACTACTCATTCTCTCAAGGATTTGCATTCTTCGATTTTTCTCTTGACGAACATCTGTAATTTGATCTCAATTCGTTTGGATTCTTCAAATTATGTCTT
ATGGAAGTTTCAATTTTCTTCAATGTTGCGAGTGCACAAGTTGTTTGGATTCGTTGATGGCTCCAACAAGGCTTTGTCGGAGACTCTTTCTTCTACGGATTCATCTTCAT
CTGAAGAATCTTCTACTGATCTCAATCCCGCGTATGAGGATTGGCTCACAAAAGATCAGATGTTGATGACTTTAATCAATGCTACGTTATCTGCGGAAGCCCTAACATAT
ATTGTTGGTTGTTCTTCTTCTAAGGATGAATGGGAAGCCTTAGAGCGGCATTATTCCTCTTCTACTCAGTCCAATATTCGCATCAAAGAGATCAAGGACAAGCTTGCGAA
TGTTTCGTCTGTTGTTAATGACGAGGATCTTCTCATCTATGCGCTTAATGGGGAATCTGTTATTGAAAAACAAGCCAAGCGTGAAGATTTAGTTGTTCGGCCTACTGTGA
TGTATGCGTCTCAACAAAATCAATTTCGTTCGAATCAATCCTCTGTTTCTTCATCTTCTTTTCGAGATTATGGTCAAGGACGAAATTCTAGCCGCAGACGTTCTAATAAT
TCTCCATTCTCTGCTTCTAGACAAGGTTGCGATCATTTTAATGGCAATGCGGTTCCTTCTTCTTCTTCGGCTGATTCTCGAATCAGTTGCCAGATCTGTAATAGACCTGG
ACATATGGCTCTCGACTGTTATAATCGCATGAATTATAACTTCCAAGGTCGATATCCTTCTCCTCAATTGGCAGCCATGGTAGCAACACAGAACAATCAGTATCTTTCTA
CTCATCAAAATAATCAGTATAATTCCTCTCCTTGGCTTGCAGATTCTGGTTGCGATGCTCATGTGACGTCAGATCTTTCGAATTTGGCCATCTCTAGTGAATATAATGAG
AGGAGAATGTTGTTGTTGGACAAACTTACGGGCAACATTTTATTCCAAGGTGCTAGTGTAAATGGACTTTATCCAATTACACCGTACGTGCATTTCCAAAGCCTCTCATA
TCTCTTCGGAAGGTTTTACTGCTGCTCATGTTGGAGTCAAATCCTCTACTACTCTTTGGCACAACCGGTTCACACAAGGACTCCACTCACAAGCACTTACTCAACTCAGC
ATGCAAACCAAATTCTCTCAGATGAAATCAGTCAAAGAAATCACACACTCGAACCTCTCTGA
mRNA sequence
ATGGCTTCCACCACTACTCATTCTCTCAAGGATTTGCATTCTTCGATTTTTCTCTTGACGAACATCTGTAATTTGATCTCAATTCGTTTGGATTCTTCAAATTATGTCTT
ATGGAAGTTTCAATTTTCTTCAATGTTGCGAGTGCACAAGTTGTTTGGATTCGTTGATGGCTCCAACAAGGCTTTGTCGGAGACTCTTTCTTCTACGGATTCATCTTCAT
CTGAAGAATCTTCTACTGATCTCAATCCCGCGTATGAGGATTGGCTCACAAAAGATCAGATGTTGATGACTTTAATCAATGCTACGTTATCTGCGGAAGCCCTAACATAT
ATTGTTGGTTGTTCTTCTTCTAAGGATGAATGGGAAGCCTTAGAGCGGCATTATTCCTCTTCTACTCAGTCCAATATTCGCATCAAAGAGATCAAGGACAAGCTTGCGAA
TGTTTCGTCTGTTGTTAATGACGAGGATCTTCTCATCTATGCGCTTAATGGGGAATCTGTTATTGAAAAACAAGCCAAGCGTGAAGATTTAGTTGTTCGGCCTACTGTGA
TGTATGCGTCTCAACAAAATCAATTTCGTTCGAATCAATCCTCTGTTTCTTCATCTTCTTTTCGAGATTATGGTCAAGGACGAAATTCTAGCCGCAGACGTTCTAATAAT
TCTCCATTCTCTGCTTCTAGACAAGGTTGCGATCATTTTAATGGCAATGCGGTTCCTTCTTCTTCTTCGGCTGATTCTCGAATCAGTTGCCAGATCTGTAATAGACCTGG
ACATATGGCTCTCGACTGTTATAATCGCATGAATTATAACTTCCAAGGTCGATATCCTTCTCCTCAATTGGCAGCCATGGTAGCAACACAGAACAATCAGTATCTTTCTA
CTCATCAAAATAATCAGTATAATTCCTCTCCTTGGCTTGCAGATTCTGGTTGCGATGCTCATGTGACGTCAGATCTTTCGAATTTGGCCATCTCTAGTGAATATAATGAG
AGGAGAATGTTGTTGTTGGACAAACTTACGGGCAACATTTTATTCCAAGGTGCTAGTGTAAATGGACTTTATCCAATTACACCGTACGTGCATTTCCAAAGCCTCTCATA
TCTCTTCGGAAGGTTTTACTGCTGCTCATGTTGGAGTCAAATCCTCTACTACTCTTTGGCACAACCGGTTCACACAAGGACTCCACTCACAAGCACTTACTCAACTCAGC
ATGCAAACCAAATTCTCTCAGATGAAATCAGTCAAAGAAATCACACACTCGAACCTCTCTGA
Protein sequence
MASTTTHSLKDLHSSIFLLTNICNLISIRLDSSNYVLWKFQFSSMLRVHKLFGFVDGSNKALSETLSSTDSSSSEESSTDLNPAYEDWLTKDQMLMTLINATLSAEALTY
IVGCSSSKDEWEALERHYSSSTQSNIRIKEIKDKLANVSSVVNDEDLLIYALNGESVIEKQAKREDLVVRPTVMYASQQNQFRSNQSSVSSSSFRDYGQGRNSSRRRSNN
SPFSASRQGCDHFNGNAVPSSSSADSRISCQICNRPGHMALDCYNRMNYNFQGRYPSPQLAAMVATQNNQYLSTHQNNQYNSSPWLADSGCDAHVTSDLSNLAISSEYNE
RRMLLLDKLTGNILFQGASVNGLYPITPYVHFQSLSYLFGRFYCCSCWSQILYYSLAQPVHTRTPLTSTYSTQHANQILSDEISQRNHTLEPL
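
As a consistency check, the CDS sequence can be translated and compared with the protein sequence listed above. This is a minimal sketch assuming the standard nuclear genetic code and a CDS that starts in frame; the `translate` function is a hypothetical helper written for this example, not a CuGenDB utility.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons and last four codons of the CDS shown above.
print(translate("ATGGCTTCCACCACTACTCATTCTCTCAAG"))
print(translate("GAACCTCTCTGA"))
```

The two fragments translate to `MASTTTHSLK` and `EPL`, which match the start and end of the listed protein sequence.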