; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0016098 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0016098
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr12:33104478..33105545
RNA-Seq ExpressionLag0016098
SyntenyLag0016098
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046195.1 putative Ty1-copia-like retrotransposon [Cucumis melo var. makuwa]7.9e-5845.28Show/hide
Query:  DNFLLWKFQILTALEGYNLESHLEN--DPPAQFLDVPNTTST----GDSSSTVKTSNPAYMQWKRQDKVVSSWLVGSMSEDILHHMIHCTTTKEIWSCLK
        D+F+L    ILTALE Y LES+ ++  +P  ++++ P   S+      S+  +   N  Y  WKRQD+++SSWL+GSMSEDIL+ M+H T+ K+IW  L+
Subjt:  DNFLLWKFQILTALEGYNLESHLEN--DPPAQFLDVPNTTST----GDSSSTVKTSNPAYMQWKRQDKVVSSWLVGSMSEDILHHMIHCTTTKEIWSCLK

Query:  QIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFESMVSVISAKIGPQSVQEVMSLLLTQENRNESKL
         I+++R LA+ M+ K KL  ++KG+M+LKEYF KIQQ VDALA++ KP+  +DHIL+IL+GLG++++S++S+ISA+    SVQ+ MSLLLTQE++ ESK+
Subjt:  QIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFESMVSVISAKIGPQSVQEVMSLLLTQENRNESKL

Query:  VHTEGSVPSVNLMVKT-------SEPDVSKTNSPQFSQSFGSGGRDGRGGRGGSNSNRGGRGSPWNNRNKVQCQVCSKFGHTATRCFFRYAPPSSHSPPG
          +E S+P+VN+   T        E +V+            +  +     R G  SNRGGRG    NR+K QCQ+CSKFGH A RC+FRY P    +PP 
Subjt:  VHTEGSVPSVNLMVKT-------SEPDVSKTNSPQFSQSFGSGGRDGRGGRGGSNSNRGGRGSPWNNRNKVQCQVCSKFGHTATRCFFRYAPPSSHSPPG

Query:  SYTPNFS
         Y+ N S
Subjt:  SYTPNFS

KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.5e-8552.65Show/hide
Query:  MSTQQSSLSAASEDSSSSSSQIFSPGNKISIVKLTDDNFLLWKFQILTALEGYNLESHL--ENDPPAQFLDVPNTTSTGDSSSTVKTSNPAYMQWKRQDK
        MS+  S L   + ++SS  +QIF  GNKIS+VKL DD FLLWKFQILTALE Y+LE+ L  E++PP+++L     ++   S+S   T NPAY  WKRQD+
Subjt:  MSTQQSSLSAASEDSSSSSSQIFSPGNKISIVKLTDDNFLLWKFQILTALEGYNLESHL--ENDPPAQFLDVPNTTSTGDSSSTVKTSNPAYMQWKRQDK

Query:  VVSSWLVGSMSEDILHHMIHCTTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFES
        ++SSWL+GSMSE+IL+ M+HC + KEIW  L+ IF++R LAQ M+ K KL  I+KGSM LKEYF KI Q VDALA++ KPV  +DHIL+IL+GLGSD++S
Subjt:  VVSSWLVGSMSEDILHHMIHCTTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFES

Query:  MVSVISAKIGPQSVQEVMSLLLTQENRNESKLVHTEGSVPSVNLMVKTSEPDVSKTNSPQFSQSFGSGGRDGRGGRGGSNSNRGGRGSPWNNRNKVQCQV
        M+SVISA+    SVQEVMSLLLTQE++NESKL+ +E ++PSVN++ +T+E           +    +   + RGGRG   SNRG RG    NRNK QCQ+
Subjt:  MVSVISAKIGPQSVQEVMSLLLTQENRNESKLVHTEGSVPSVNLMVKTSEPDVSKTNSPQFSQSFGSGGRDGRGGRGGSNSNRGGRGSPWNNRNKVQCQV

Query:  CSKFGHTATRCFFRYAPPSSHS--PPGSYTPNFSTFNRSPSYPQMTVMVATPDINHDTN
        C+K G++A RCFFRY P S+ S   P S+  +++  N   ++PQM+ MVA  D+N D+N
Subjt:  CSKFGHTATRCFFRYAPPSSHS--PPGSYTPNFSTFNRSPSYPQMTVMVATPDINHDTN

KAA0053143.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]2.8e-6355.78Show/hide
Query:  MSTQQSSLSAASEDSSSSSSQIFSPGNKISIVKLTDDNFLLWKFQILTALEGYNLESHLEN--DPPAQFLDVPNTTSTGDSS-STVKTSNPAYMQWKRQD
        MS+  S L   + + S     IF  GNKIS+VKL+DDNFLLWKFQILTALE Y+LE+  E+  +PP+++L     TSTG SS S  +T NP Y  WKR +
Subjt:  MSTQQSSLSAASEDSSSSSSQIFSPGNKISIVKLTDDNFLLWKFQILTALEGYNLESHLEN--DPPAQFLDVPNTTSTGDSS-STVKTSNPAYMQWKRQD

Query:  KVVSSWLVGSMSEDILHHMIHCTTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFE
        +++S WL+GSMSE+IL+ M+HC + KEIW  L+ IF++R LAQ M+ K KL  I+KGSM+LKEYF KIQQ VDALA++ KPV  +DHIL+IL GLG D++
Subjt:  KVVSSWLVGSMSEDILHHMIHCTTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFE

Query:  SMVSVISAKIGPQSVQEVMSLLLTQENRNESKLVHTEGSVPSVNLMVKTSE
        SM+S+ISA+    S+QEVMSLLLTQE++NESKL+ +E ++P V ++ +T+E
Subjt:  SMVSVISAKIGPQSVQEVMSLLLTQENRNESKLVHTEGSVPSVNLMVKTSE

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.5e-8552.65Show/hide
Query:  MSTQQSSLSAASEDSSSSSSQIFSPGNKISIVKLTDDNFLLWKFQILTALEGYNLESHL--ENDPPAQFLDVPNTTSTGDSSSTVKTSNPAYMQWKRQDK
        MS+  S L   + ++SS  +QIF  GNKIS+VKL DD FLLWKFQILTALE Y+LE+ L  E++PP+++L     ++   S+S   T NPAY  WKRQD+
Subjt:  MSTQQSSLSAASEDSSSSSSQIFSPGNKISIVKLTDDNFLLWKFQILTALEGYNLESHL--ENDPPAQFLDVPNTTSTGDSSSTVKTSNPAYMQWKRQDK

Query:  VVSSWLVGSMSEDILHHMIHCTTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFES
        ++SSWL+GSMSE+IL+ M+HC + KEIW  L+ IF++R LAQ M+ K KL  I+KGSM LKEYF KI Q VDALA++ KPV  +DHIL+IL+GLGSD++S
Subjt:  VVSSWLVGSMSEDILHHMIHCTTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFES

Query:  MVSVISAKIGPQSVQEVMSLLLTQENRNESKLVHTEGSVPSVNLMVKTSEPDVSKTNSPQFSQSFGSGGRDGRGGRGGSNSNRGGRGSPWNNRNKVQCQV
        M+SVISA+    SVQEVMSLLLTQE++NESKL+ +E ++PSVN++ +T+E           +    +   + RGGRG   SNRG RG    NRNK QCQ+
Subjt:  MVSVISAKIGPQSVQEVMSLLLTQENRNESKLVHTEGSVPSVNLMVKTSEPDVSKTNSPQFSQSFGSGGRDGRGGRGGSNSNRGGRGSPWNNRNKVQCQV

Query:  CSKFGHTATRCFFRYAPPSSHS--PPGSYTPNFSTFNRSPSYPQMTVMVATPDINHDTN
        C+K G++A RCFFRY P S+ S   P S+  +++  N   ++PQM+ MVA  D+N D+N
Subjt:  CSKFGHTATRCFFRYAPPSSHS--PPGSYTPNFSTFNRSPSYPQMTVMVATPDINHDTN

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]4.5e-6942.9Show/hide
Query:  SSQIFSPGNKISIVKLTDDNFLLWKFQILTALEGYNLESHLEN--DPPAQFLDVPNTTSTGDSSSTVKTSNPAYMQWKRQDKVVSSWLVGSMSEDILHHM
        +S+  +PG+K+SIV+L DDN LLWKFQI TAL+G  LES++++  D PAQF+     T+  +SSS+    NPAY +W +QDK++S+WL+GSM+EDIL  M
Subjt:  SSQIFSPGNKISIVKLTDDNFLLWKFQILTALEGYNLESHLEN--DPPAQFLDVPNTTSTGDSSSTVKTSNPAYMQWKRQDKVVSSWLVGSMSEDILHHM

Query:  IHCTTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFESMVSVISAKIGPQSVQEVM
        + C + +EIW+ L+ +F +R LA++M++K+KL+  +KG+++LK+YF KI+  VD+LA  GK +  EDHI+ IL+GLG +F++++SVI+A+  PQ++QEV 
Subjt:  IHCTTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFESMVSVISAKIGPQSVQEVM

Query:  SLLLTQENRNESKLVHTEGSVPSVNLMVKTSEPDVSKTNSPQFSQSFGSGGRDGRGGRGGSNSNRGGRGSPWNNRNKVQCQVCSKFGHTATRCFFRY---
        SLLL QE RNE  L++++GS+PSVNL +     D SK N+   S+ F +  +     RG   +NR      W   NK QCQ+C +FGHTA RC+ R+   
Subjt:  SLLLTQENRNESKLVHTEGSVPSVNLMVKTSEPDVSKTNSPQFSQSFGSGGRDGRGGRGGSNSNRGGRGSPWNNRNKVQCQVCSKFGHTATRCFFRY---

Query:  -----APPSSHSP-------------------PGSYTPNFSTFNRSPSYPQMTVMVATPDINHDTN
               P++ SP                   P +   NFS  + SPS  QM  ++   D N D+N
Subjt:  -----APPSSHSP-------------------PGSYTPNFSTFNRSPSYPQMTVMVATPDINHDTN

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-947.4e-8652.65Show/hide
Query:  MSTQQSSLSAASEDSSSSSSQIFSPGNKISIVKLTDDNFLLWKFQILTALEGYNLESHL--ENDPPAQFLDVPNTTSTGDSSSTVKTSNPAYMQWKRQDK
        MS+  S L   + ++SS  +QIF  GNKIS+VKL DD FLLWKFQILTALE Y+LE+ L  E++PP+++L     ++   S+S   T NPAY  WKRQD+
Subjt:  MSTQQSSLSAASEDSSSSSSQIFSPGNKISIVKLTDDNFLLWKFQILTALEGYNLESHL--ENDPPAQFLDVPNTTSTGDSSSTVKTSNPAYMQWKRQDK

Query:  VVSSWLVGSMSEDILHHMIHCTTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFES
        ++SSWL+GSMSE+IL+ M+HC + KEIW  L+ IF++R LAQ M+ K KL  I+KGSM LKEYF KI Q VDALA++ KPV  +DHIL+IL+GLGSD++S
Subjt:  VVSSWLVGSMSEDILHHMIHCTTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFES

Query:  MVSVISAKIGPQSVQEVMSLLLTQENRNESKLVHTEGSVPSVNLMVKTSEPDVSKTNSPQFSQSFGSGGRDGRGGRGGSNSNRGGRGSPWNNRNKVQCQV
        M+SVISA+    SVQEVMSLLLTQE++NESKL+ +E ++PSVN++ +T+E           +    +   + RGGRG   SNRG RG    NRNK QCQ+
Subjt:  MVSVISAKIGPQSVQEVMSLLLTQENRNESKLVHTEGSVPSVNLMVKTSEPDVSKTNSPQFSQSFGSGGRDGRGGRGGSNSNRGGRGSPWNNRNKVQCQV

Query:  CSKFGHTATRCFFRYAPPSSHS--PPGSYTPNFSTFNRSPSYPQMTVMVATPDINHDTN
        C+K G++A RCFFRY P S+ S   P S+  +++  N   ++PQM+ MVA  D+N D+N
Subjt:  CSKFGHTATRCFFRYAPPSSHS--PPGSYTPNFSTFNRSPSYPQMTVMVATPDINHDTN

A0A5A7UB21 Keratin, type II cytoskeletal 1-like1.4e-6355.78Show/hide
Query:  MSTQQSSLSAASEDSSSSSSQIFSPGNKISIVKLTDDNFLLWKFQILTALEGYNLESHLEN--DPPAQFLDVPNTTSTGDSS-STVKTSNPAYMQWKRQD
        MS+  S L   + + S     IF  GNKIS+VKL+DDNFLLWKFQILTALE Y+LE+  E+  +PP+++L     TSTG SS S  +T NP Y  WKR +
Subjt:  MSTQQSSLSAASEDSSSSSSQIFSPGNKISIVKLTDDNFLLWKFQILTALEGYNLESHLEN--DPPAQFLDVPNTTSTGDSS-STVKTSNPAYMQWKRQD

Query:  KVVSSWLVGSMSEDILHHMIHCTTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFE
        +++S WL+GSMSE+IL+ M+HC + KEIW  L+ IF++R LAQ M+ K KL  I+KGSM+LKEYF KIQQ VDALA++ KPV  +DHIL+IL GLG D++
Subjt:  KVVSSWLVGSMSEDILHHMIHCTTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFE

Query:  SMVSVISAKIGPQSVQEVMSLLLTQENRNESKLVHTEGSVPSVNLMVKTSE
        SM+S+ISA+    S+QEVMSLLLTQE++NESKL+ +E ++P V ++ +T+E
Subjt:  SMVSVISAKIGPQSVQEVMSLLLTQENRNESKLVHTEGSVPSVNLMVKTSE

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-947.4e-8652.65Show/hide
Query:  MSTQQSSLSAASEDSSSSSSQIFSPGNKISIVKLTDDNFLLWKFQILTALEGYNLESHL--ENDPPAQFLDVPNTTSTGDSSSTVKTSNPAYMQWKRQDK
        MS+  S L   + ++SS  +QIF  GNKIS+VKL DD FLLWKFQILTALE Y+LE+ L  E++PP+++L     ++   S+S   T NPAY  WKRQD+
Subjt:  MSTQQSSLSAASEDSSSSSSQIFSPGNKISIVKLTDDNFLLWKFQILTALEGYNLESHL--ENDPPAQFLDVPNTTSTGDSSSTVKTSNPAYMQWKRQDK

Query:  VVSSWLVGSMSEDILHHMIHCTTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFES
        ++SSWL+GSMSE+IL+ M+HC + KEIW  L+ IF++R LAQ M+ K KL  I+KGSM LKEYF KI Q VDALA++ KPV  +DHIL+IL+GLGSD++S
Subjt:  VVSSWLVGSMSEDILHHMIHCTTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFES

Query:  MVSVISAKIGPQSVQEVMSLLLTQENRNESKLVHTEGSVPSVNLMVKTSEPDVSKTNSPQFSQSFGSGGRDGRGGRGGSNSNRGGRGSPWNNRNKVQCQV
        M+SVISA+    SVQEVMSLLLTQE++NESKL+ +E ++PSVN++ +T+E           +    +   + RGGRG   SNRG RG    NRNK QCQ+
Subjt:  MVSVISAKIGPQSVQEVMSLLLTQENRNESKLVHTEGSVPSVNLMVKTSEPDVSKTNSPQFSQSFGSGGRDGRGGRGGSNSNRGGRGSPWNNRNKVQCQV

Query:  CSKFGHTATRCFFRYAPPSSHS--PPGSYTPNFSTFNRSPSYPQMTVMVATPDINHDTN
        C+K G++A RCFFRY P S+ S   P S+  +++  N   ++PQM+ MVA  D+N D+N
Subjt:  CSKFGHTATRCFFRYAPPSSHS--PPGSYTPNFSTFNRSPSYPQMTVMVATPDINHDTN

A0A5D3CRZ7 Putative Ty1-copia-like retrotransposon3.8e-5845.28Show/hide
Query:  DNFLLWKFQILTALEGYNLESHLEN--DPPAQFLDVPNTTST----GDSSSTVKTSNPAYMQWKRQDKVVSSWLVGSMSEDILHHMIHCTTTKEIWSCLK
        D+F+L    ILTALE Y LES+ ++  +P  ++++ P   S+      S+  +   N  Y  WKRQD+++SSWL+GSMSEDIL+ M+H T+ K+IW  L+
Subjt:  DNFLLWKFQILTALEGYNLESHLEN--DPPAQFLDVPNTTST----GDSSSTVKTSNPAYMQWKRQDKVVSSWLVGSMSEDILHHMIHCTTTKEIWSCLK

Query:  QIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFESMVSVISAKIGPQSVQEVMSLLLTQENRNESKL
         I+++R LA+ M+ K KL  ++KG+M+LKEYF KIQQ VDALA++ KP+  +DHIL+IL+GLG++++S++S+ISA+    SVQ+ MSLLLTQE++ ESK+
Subjt:  QIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFESMVSVISAKIGPQSVQEVMSLLLTQENRNESKL

Query:  VHTEGSVPSVNLMVKT-------SEPDVSKTNSPQFSQSFGSGGRDGRGGRGGSNSNRGGRGSPWNNRNKVQCQVCSKFGHTATRCFFRYAPPSSHSPPG
          +E S+P+VN+   T        E +V+            +  +     R G  SNRGGRG    NR+K QCQ+CSKFGH A RC+FRY P    +PP 
Subjt:  VHTEGSVPSVNLMVKT-------SEPDVSKTNSPQFSQSFGSGGRDGRGGRGGSNSNRGGRGSPWNNRNKVQCQVCSKFGHTATRCFFRYAPPSSHSPPG

Query:  SYTPNFS
         Y+ N S
Subjt:  SYTPNFS

A0A6J1DLT9 uncharacterized protein LOC1110217572.2e-6942.9Show/hide
Query:  SSQIFSPGNKISIVKLTDDNFLLWKFQILTALEGYNLESHLEN--DPPAQFLDVPNTTSTGDSSSTVKTSNPAYMQWKRQDKVVSSWLVGSMSEDILHHM
        +S+  +PG+K+SIV+L DDN LLWKFQI TAL+G  LES++++  D PAQF+     T+  +SSS+    NPAY +W +QDK++S+WL+GSM+EDIL  M
Subjt:  SSQIFSPGNKISIVKLTDDNFLLWKFQILTALEGYNLESHLEN--DPPAQFLDVPNTTSTGDSSSTVKTSNPAYMQWKRQDKVVSSWLVGSMSEDILHHM

Query:  IHCTTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFESMVSVISAKIGPQSVQEVM
        + C + +EIW+ L+ +F +R LA++M++K+KL+  +KG+++LK+YF KI+  VD+LA  GK +  EDHI+ IL+GLG +F++++SVI+A+  PQ++QEV 
Subjt:  IHCTTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFESMVSVISAKIGPQSVQEVM

Query:  SLLLTQENRNESKLVHTEGSVPSVNLMVKTSEPDVSKTNSPQFSQSFGSGGRDGRGGRGGSNSNRGGRGSPWNNRNKVQCQVCSKFGHTATRCFFRY---
        SLLL QE RNE  L++++GS+PSVNL +     D SK N+   S+ F +  +     RG   +NR      W   NK QCQ+C +FGHTA RC+ R+   
Subjt:  SLLLTQENRNESKLVHTEGSVPSVNLMVKTSEPDVSKTNSPQFSQSFGSGGRDGRGGRGGSNSNRGGRGSPWNNRNKVQCQVCSKFGHTATRCFFRY---

Query:  -----APPSSHSP-------------------PGSYTPNFSTFNRSPSYPQMTVMVATPDINHDTN
               P++ SP                   P +   NFS  + SPS  QM  ++   D N D+N
Subjt:  -----APPSSHSP-------------------PGSYTPNFSTFNRSPSYPQMTVMVATPDINHDTN

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.6e-2126.49Show/hide
Query:  LSAASEDSSSSSSQIFSPGNKISIVKLTDDNFLLWKFQILTALEGYNLESHLENDPPAQFLDVPNTTSTGDSSSTVKTSNPAYMQWKRQDKVVSSWLVGS
        ++A +E+   +++ I +  N  ++ KLT  N+L+W  Q+    +GY L   L+         +P  T   D++  V   NP Y +WKRQDK++ S ++G+
Subjt:  LSAASEDSSSSSSQIFSPGNKISIVKLTDDNFLLWKFQILTALEGYNLESHLENDPPAQFLDVPNTTSTGDSSSTVKTSNPAYMQWKRQDKVVSSWLVGS

Query:  MSEDILHHMIHCTTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFESMVSVISAKI
        +S  +   +   TT  +IW  L++I+   +   + +++ +L+   KG+ T+ +Y   +    D LA +GKP++ ++ +  +L  L  +++ ++  I+AK 
Subjt:  MSEDILHHMIHCTTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFESMVSVISAKI

Query:  GPQSVQEVMSLLLTQENRNESKLVHTEGSVPSVNLMVKTSEPDVSKTNSPQFSQSFGSGGRDGRGGRGGSNSNRGGRGSPW----------NNRNKV---
         P ++ E+   LL     +ESK++    +V S  ++  T+   VS  N+   + +  +G R+ R      N N      PW          NN++K    
Subjt:  GPQSVQEVMSLLLTQENRNESKLVHTEGSVPSVNLMVKTSEPDVSKTNSPQFSQSFGSGGRDGRGGRGGSNSNRGGRGSPW----------NNRNKV---

Query:  QCQVCSKFGHTATRCF---FRYAPPSSHSPPGSYTP
        +CQ+C   GH+A RC       +  +S  PP  +TP
Subjt:  QCQVCSKFGHTATRCF---FRYAPPSSHSPPGSYTP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.2e-1625.69Show/hide
Query:  NKISIVKLTDDNFLLWKFQILTALEGYNLESHLENDPPAQFLDVPNTTSTGDSSSTVKTSNPAYMQWKRQDKVVSSWLVGSMSEDILHHMIHCTTTKEIW
        N  ++ KLT  N+L+W  Q+    +GY L   L+   P      P T  T      V   NP Y +W+RQDK++ S ++G++S  +   +   TT  +IW
Subjt:  NKISIVKLTDDNFLLWKFQILTALEGYNLESHLENDPPAQFLDVPNTTSTGDSSSTVKTSNPAYMQWKRQDKVVSSWLVGSMSEDILHHMIHCTTTKEIW

Query:  SCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFESMVSVISAKIGPQSVQEVMSLLLTQENRN
          L++I+   +                G +T   + ++     D LA +GKP++ ++ +  +L  L  D++ ++  I+AK  P S+ E+   L+ +    
Subjt:  SCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFESMVSVISAKIGPQSVQEVMSLLLTQENRN

Query:  ESKLVHTEGSVPSVNLMVKTSEPDVSKTNSPQFSQSFGSGGRDGRGGRGGSNS-NRGGRGSPWNNRNKV----QCQVCSKFGHTATRCFFRYAPPSSHSP
        ESKL+    ++ S  ++  T+     +  +   +Q+     R+       SNS      GS  +NR       +CQ+CS  GH+A RC      P  H  
Subjt:  ESKLVHTEGSVPSVNLMVKTSEPDVSKTNSPQFSQSFGSGGRDGRGGRGGSNS-NRGGRGSPWNNRNKV----QCQVCSKFGHTATRCFFRYAPPSSHSP

Query:  PGSYTPNFSTFNRSPSYPQMTVMVATP
          +     ST   +P  P+  + V +P
Subjt:  PGSYTPNFSTFNRSPSYPQMTVMVATP

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.5e-0625Show/hide
Query:  SIVKLT--DDNFLLWKFQILTALEGYNLESHLENDPPAQFLDVPNTTSTGDSSSTVKTSNPAYMQWKRQDKVVSSWLVGSMSEDILHHMIHCTTTKEIWS
        SI KL+  +DN++ WK +           S L       F+D   T    D  S      P Y  W++ + +V  WL+ SM++ +L  +++  T  ++W 
Subjt:  SIVKLT--DDNFLLWKFQILTALEGYNLESHLENDPPAQFLDVPNTTSTGDSSSTVKTSNPAYMQWKRQDKVVSSWLVGSMSEDILHHMIHCTTTKEIWS

Query:  CLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQ
         L+++F      ++ +++ +L T+++G  +++EYF K+ +
Subjt:  CLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQ

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.6e-0825.34Show/hide
Query:  GDSSSTVKTSNPAYMQWKRQDKVVSSWLVGSMS-EDILHHMIHCTTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAV
        G    T+  +N   + W+++D +V   L G+++ +      +  +T+++IW  +K  F     A+ +++  +L+T   G M + +Y+ K+++  D+L  V
Subjt:  GDSSSTVKTSNPAYMQWKRQDKVVSSWLVGSMS-EDILHHMIHCTTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAV

Query:  GKPVEVEDHILFILSGLGSDFESMVSVISAKIGPQSVQEVMSLLLTQENRNESKL----VHTEGSVPSVNLMVKTSEPDVSKTNSPQFSQSFGSGGRDGR
          PV   + ++++L+GL   F+++++VI  +    S  +  ++L  +E+R +  +     H + S  S  L    + P    TN   F +S   G + G 
Subjt:  GKPVEVEDHILFILSGLGSDFESMVSVISAKIGPQSVQEVMSLLLTQENRNESKL----VHTEGSVPSVNLMVKTSEPDVSKTNSPQFSQSFGSGGRDGR

Query:  GGRGGSNS---NRGGRGSPWN
         GRG  N+    RGGR S +N
Subjt:  GGRGGSNS---NRGGRGSPWN

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)3.2e-0925.91Show/hide
Query:  GDSSSTVKTSNPAYMQWKRQDKVVSSWLVGSMSEDILHHMIHC-TTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAV
        G S+ T  T      +WK +D +V  W+ G++++ +L  +I    T +++W  L+ +F     A+ ++ + +L+T     +++ EY  K++   D L  V
Subjt:  GDSSSTVKTSNPAYMQWKRQDKVVSSWLVGSMSEDILHHMIHC-TTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAV

Query:  GKPVEVEDHILFILSGLGSDFESMVSVISAKIGPQSVQEVMSLLLTQENR--NESKLVHTEGSVPSVNLMVKTSEPDVSKTNSPQFSQSFGSG-GRDGRG
          P+     ++ +L+GL   ++ +++VI  K    S  E  S+LL +E+R  N+SK   +  + PS++ ++ T      +      + +   G GR  + 
Subjt:  GKPVEVEDHILFILSGLGSDFESMVSVISAKIGPQSVQEVMSLLLTQENR--NESKLVHTEGSVPSVNLMVKTSEPDVSKTNSPQFSQSFGSG-GRDGRG

Query:  GRGGSNSNRGGRGSPWNNRN
         RGG +S+  GR   +NN N
Subjt:  GRGGSNSNRGGRGSPWNNRN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGACGCAACAAAGCTCTCTGAGTGCCGCTTCGGAGGATTCCTCCAGTTCTTCGTCTCAGATATTTAGCCCGGGTAACAAAATTTCTATTGTGAAATTGACTGACGA
TAATTTTTTATTGTGGAAATTTCAGATTCTCACAGCCCTTGAAGGGTATAATCTTGAATCTCATCTTGAAAATGACCCACCTGCTCAGTTTCTTGATGTTCCTAATACTA
CTTCAACTGGTGATTCGTCTTCTACGGTTAAAACCTCGAACCCGGCCTACATGCAGTGGAAACGCCAAGACAAGGTCGTATCCTCCTGGCTAGTTGGTTCCATGTCTGAA
GACATTCTTCATCATATGATCCACTGTACCACTACTAAAGAGATTTGGTCCTGCTTGAAACAAATTTTTACTACTCGTAACTTGGCGCAAATGATGAAGATTAAGATGAA
ACTCCAAACTATTCAGAAAGGAAGCATGACCCTGAAGGAATATTTCTCAAAAATCCAGCAGTATGTTGACGCTCTAGCTGCAGTTGGTAAGCCAGTAGAGGTAGAAGATC
ATATTCTTTTCATACTTTCTGGTTTGGGCTCTGACTTTGAATCTATGGTCTCGGTAATCTCGGCCAAGATTGGGCCTCAATCTGTCCAAGAGGTGATGTCTCTTCTCCTA
ACTCAGGAGAATCGGAATGAGAGCAAACTTGTTCACACGGAAGGATCTGTCCCTTCGGTTAATCTTATGGTTAAAACATCTGAACCCGATGTTTCGAAAACTAACTCTCC
TCAGTTTTCTCAGTCTTTTGGCAGTGGTGGTAGAGATGGACGCGGTGGCCGTGGTGGTTCAAACTCTAATCGCGGAGGTCGTGGTAGTCCCTGGAATAATCGCAACAAGG
TGCAATGTCAAGTTTGCAGTAAATTTGGCCATACAGCCACCCGATGTTTTTTTCGATATGCCCCTCCATCCTCGCACAGTCCGCCAGGTTCGTATACTCCAAATTTCAGT
ACATTTAATCGATCTCCTTCTTATCCTCAGATGACCGTGATGGTTGCTACTCCTGATATTAATCACGATACCAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGACGCAACAAAGCTCTCTGAGTGCCGCTTCGGAGGATTCCTCCAGTTCTTCGTCTCAGATATTTAGCCCGGGTAACAAAATTTCTATTGTGAAATTGACTGACGA
TAATTTTTTATTGTGGAAATTTCAGATTCTCACAGCCCTTGAAGGGTATAATCTTGAATCTCATCTTGAAAATGACCCACCTGCTCAGTTTCTTGATGTTCCTAATACTA
CTTCAACTGGTGATTCGTCTTCTACGGTTAAAACCTCGAACCCGGCCTACATGCAGTGGAAACGCCAAGACAAGGTCGTATCCTCCTGGCTAGTTGGTTCCATGTCTGAA
GACATTCTTCATCATATGATCCACTGTACCACTACTAAAGAGATTTGGTCCTGCTTGAAACAAATTTTTACTACTCGTAACTTGGCGCAAATGATGAAGATTAAGATGAA
ACTCCAAACTATTCAGAAAGGAAGCATGACCCTGAAGGAATATTTCTCAAAAATCCAGCAGTATGTTGACGCTCTAGCTGCAGTTGGTAAGCCAGTAGAGGTAGAAGATC
ATATTCTTTTCATACTTTCTGGTTTGGGCTCTGACTTTGAATCTATGGTCTCGGTAATCTCGGCCAAGATTGGGCCTCAATCTGTCCAAGAGGTGATGTCTCTTCTCCTA
ACTCAGGAGAATCGGAATGAGAGCAAACTTGTTCACACGGAAGGATCTGTCCCTTCGGTTAATCTTATGGTTAAAACATCTGAACCCGATGTTTCGAAAACTAACTCTCC
TCAGTTTTCTCAGTCTTTTGGCAGTGGTGGTAGAGATGGACGCGGTGGCCGTGGTGGTTCAAACTCTAATCGCGGAGGTCGTGGTAGTCCCTGGAATAATCGCAACAAGG
TGCAATGTCAAGTTTGCAGTAAATTTGGCCATACAGCCACCCGATGTTTTTTTCGATATGCCCCTCCATCCTCGCACAGTCCGCCAGGTTCGTATACTCCAAATTTCAGT
ACATTTAATCGATCTCCTTCTTATCCTCAGATGACCGTGATGGTTGCTACTCCTGATATTAATCACGATACCAACTAG
Protein sequenceShow/hide protein sequence
MSTQQSSLSAASEDSSSSSSQIFSPGNKISIVKLTDDNFLLWKFQILTALEGYNLESHLENDPPAQFLDVPNTTSTGDSSSTVKTSNPAYMQWKRQDKVVSSWLVGSMSE
DILHHMIHCTTTKEIWSCLKQIFTTRNLAQMMKIKMKLQTIQKGSMTLKEYFSKIQQYVDALAAVGKPVEVEDHILFILSGLGSDFESMVSVISAKIGPQSVQEVMSLLL
TQENRNESKLVHTEGSVPSVNLMVKTSEPDVSKTNSPQFSQSFGSGGRDGRGGRGGSNSNRGGRGSPWNNRNKVQCQVCSKFGHTATRCFFRYAPPSSHSPPGSYTPNFS
TFNRSPSYPQMTVMVATPDINHDTN