; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0014920 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0014920
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr12:5900832..5903413
RNA-Seq ExpressionLag0014920
SyntenyLag0014920
Gene Ontology termsGO:0034641 - cellular nitrogen compound metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046195.1 putative Ty1-copia-like retrotransposon [Cucumis melo var. makuwa]2.0e-5948.77Show/hide
Query:  ILMALEGYDLEHHLVDDSPP--QFLTSTAQSSSV----EGASVTKTLNPAYTIWKRQDKVISSWLVGSMSEDILHQMIHCTSTKEIWTCLQQIFTSRNLA
        IL ALE Y LE +    + P  +++      SSV      A     LN  Y +WKRQD++ISSWL+GSMSEDIL+QM+H TS K+IW  LQ I++SR LA
Subjt:  ILMALEGYDLEHHLVDDSPP--QFLTSTAQSSSV----EGASVTKTLNPAYTIWKRQDKVISSWLVGSMSEDILHQMIHCTSTKEIWTCLQQIFTSRNLA

Query:  QVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFESMVSVISAKMGPQSVHEVMALLLTQENRNESKIATPDGSLPS
        + M+ K KL  ++KG MSLK+YF KIQ  VDALA++ KP+ T+DHILYIL+GLG++++S++S+ISA+    SV + M+LLLTQE++ ESKI T + SLP+
Subjt:  QVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFESMVSVISAKMGPQSVHEVMALLLTQENRNESKIATPDGSLPS

Query:  ANIVVNSKSIES--KSTKTNNAQSSQNF------SPGNRGSRGGGRSGRGGRSGSWNNRNKVQCQLCTKFGHTASHCFFRYAPSN
         N+  +++ I S  K ++  +   S N       S  +  SR GGRS RGGR     NR+K QCQ+C+KFGH A  C+FRY P N
Subjt:  ANIVVNSKSIES--KSTKTNNAQSSQNF------SPGNRGSRGGGRSGRGGRSGSWNNRNKVQCQLCTKFGHTASHCFFRYAPSN

KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.2e-8052.46Show/hide
Query:  MASLSSSNTAVDDSSASLSSQIFGPGNKISVVKLTDENFLLWKFQILMALEGYDLEHHLVDDS--PPQFLTSTAQSSSVEGASVTKTLNPAYTIWKRQDK
        M+S SS     +  ++S  +QIFG GNKIS+VKL D+ FLLWKFQIL ALE YDLE+ L  +S  P ++L ST  SS    AS T T NPAY +WKRQD+
Subjt:  MASLSSSNTAVDDSSASLSSQIFGPGNKISVVKLTDENFLLWKFQILMALEGYDLEHHLVDDS--PPQFLTSTAQSSSVEGASVTKTLNPAYTIWKRQDK

Query:  VISSWLVGSMSEDILHQMIHCTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFES
        +ISSWL+GSMSE+IL+QM+HC S KEIW  LQ IF+SR LAQ M+ K KL  ++KG M LK+YF KI   VDALA++ KPV ++DHILYIL+GLGSD++S
Subjt:  VISSWLVGSMSEDILHQMIHCTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFES

Query:  MVSVISAKMGPQSVHEVMALLLTQENRNESKIATPDGSLPSANIVVNSKSIESKS-TKTNNAQSSQNFSPGNRGSRGGGRSGRGGRSGSWNNRNKVQCQL
        M+SVISA+    SV EVM+LLLTQE++NESK+ + + +LPS NIV  +    ++S  +TN      N S   RG RG GRS RG R     NRNK QCQ+
Subjt:  MVSVISAKMGPQSVHEVMALLLTQENRNESKIATPDGSLPSANIVVNSKSIESKS-TKTNNAQSSQNFSPGNRGSRGGGRSGRGGRSGSWNNRNKVQCQL

Query:  CTKFGHTASHCFFRYAPSNSGNTGEESRPGCIWGHRENGPGMDIL
        C K G++A  CFFRY P ++ +    +     + +  N P M  +
Subjt:  CTKFGHTASHCFFRYAPSNSGNTGEESRPGCIWGHRENGPGMDIL

KAA0053143.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]8.9e-6353.18Show/hide
Query:  SLSSSNTAVDDSSASLSSQIFGPGNKISVVKLTDENFLLWKFQILMALEGYDLEHHLVD--DSPPQFLTSTAQSSSVEGASVTKTLNPAYTIWKRQDKVI
        S +SS   V+++  S    IFG GNKIS+VKL+D+NFLLWKFQIL ALE YDLE+      + P ++LTST  SS+    S T+T NP Y +WKR +++I
Subjt:  SLSSSNTAVDDSSASLSSQIFGPGNKISVVKLTDENFLLWKFQILMALEGYDLEHHLVD--DSPPQFLTSTAQSSSVEGASVTKTLNPAYTIWKRQDKVI

Query:  SSWLVGSMSEDILHQMIHCTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFESMV
        S WL+GSMSE+IL+QM+HC S KEIW  LQ IF+SR LAQ M+ K KL  ++KG MSLK+YF KIQ  VDALA++ KPV ++DHILYIL GLG D++SM+
Subjt:  SSWLVGSMSEDILHQMIHCTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFESMV

Query:  SVISAKMGPQSVHEVMALLLTQENRNESKIATPDGSLPSANIVVNS--KSIESKSTKTNNAQSSQNF
        S+ISA+    S+ EVM+LLLTQE++NESK+ + + +LP   IV  +  K  ES    + N   + +F
Subjt:  SVISAKMGPQSVHEVMALLLTQENRNESKIATPDGSLPSANIVVNS--KSIESKSTKTNNAQSSQNF

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.2e-8052.46Show/hide
Query:  MASLSSSNTAVDDSSASLSSQIFGPGNKISVVKLTDENFLLWKFQILMALEGYDLEHHLVDDS--PPQFLTSTAQSSSVEGASVTKTLNPAYTIWKRQDK
        M+S SS     +  ++S  +QIFG GNKIS+VKL D+ FLLWKFQIL ALE YDLE+ L  +S  P ++L ST  SS    AS T T NPAY +WKRQD+
Subjt:  MASLSSSNTAVDDSSASLSSQIFGPGNKISVVKLTDENFLLWKFQILMALEGYDLEHHLVDDS--PPQFLTSTAQSSSVEGASVTKTLNPAYTIWKRQDK

Query:  VISSWLVGSMSEDILHQMIHCTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFES
        +ISSWL+GSMSE+IL+QM+HC S KEIW  LQ IF+SR LAQ M+ K KL  ++KG M LK+YF KI   VDALA++ KPV ++DHILYIL+GLGSD++S
Subjt:  VISSWLVGSMSEDILHQMIHCTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFES

Query:  MVSVISAKMGPQSVHEVMALLLTQENRNESKIATPDGSLPSANIVVNSKSIESKS-TKTNNAQSSQNFSPGNRGSRGGGRSGRGGRSGSWNNRNKVQCQL
        M+SVISA+    SV EVM+LLLTQE++NESK+ + + +LPS NIV  +    ++S  +TN      N S   RG RG GRS RG R     NRNK QCQ+
Subjt:  MVSVISAKMGPQSVHEVMALLLTQENRNESKIATPDGSLPSANIVVNSKSIESKS-TKTNNAQSSQNFSPGNRGSRGGGRSGRGGRSGSWNNRNKVQCQL

Query:  CTKFGHTASHCFFRYAPSNSGNTGEESRPGCIWGHRENGPGMDIL
        C K G++A  CFFRY P ++ +    +     + +  N P M  +
Subjt:  CTKFGHTASHCFFRYAPSNSGNTGEESRPGCIWGHRENGPGMDIL

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]4.1e-6847.78Show/hide
Query:  MASLSSSNTAVDDSSASLSSQIFGPGNKISVVKLTDENFLLWKFQILMALEGYDLEHHL--VDDSPPQFLTSTAQSSSVEGASVTKTLNPAYTIWKRQDK
        MASLSS   + D +    +S+   PG+K+S+V+L D+N LLWKFQI  AL+G  LE ++   +D+P QF+ +T   SS    S +   NPAY  W +QDK
Subjt:  MASLSSSNTAVDDSSASLSSQIFGPGNKISVVKLTDENFLLWKFQILMALEGYDLEHHL--VDDSPPQFLTSTAQSSSVEGASVTKTLNPAYTIWKRQDK

Query:  VISSWLVGSMSEDILHQMIHCTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFES
        +IS+WL+GSM+EDIL QM+ C S +EIWT L+ +F SR LA+VM++K KL+  +KG +SLKDYF KI++ VD+LA  GK + TEDHI++IL+GLG +F++
Subjt:  VISSWLVGSMSEDILHQMIHCTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFES

Query:  MVSVISAKMGPQSVHEVMALLLTQENRNESKIATPDGSLPSANIVVNSKSIESKSTKTNNAQSSQNFSP--GNRGSRGGGRSGRGGRSGSWNNRNKVQCQ
        ++SVI+A+  PQ++ EV +LLL QE RNE  +   DGSLPS N+ +N       S+K NN   S+ F+P   N   RG G + R     +W   NK QCQ
Subjt:  MVSVISAKMGPQSVHEVMALLLTQENRNESKIATPDGSLPSANIVVNSKSIESKSTKTNNAQSSQNFSP--GNRGSRGGGRSGRGGRSGSWNNRNKVQCQ

Query:  LCTKFGHTASHCFFRY
        +C +FGHTA  C+ R+
Subjt:  LCTKFGHTASHCFFRY

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-946.0e-8152.46Show/hide
Query:  MASLSSSNTAVDDSSASLSSQIFGPGNKISVVKLTDENFLLWKFQILMALEGYDLEHHLVDDS--PPQFLTSTAQSSSVEGASVTKTLNPAYTIWKRQDK
        M+S SS     +  ++S  +QIFG GNKIS+VKL D+ FLLWKFQIL ALE YDLE+ L  +S  P ++L ST  SS    AS T T NPAY +WKRQD+
Subjt:  MASLSSSNTAVDDSSASLSSQIFGPGNKISVVKLTDENFLLWKFQILMALEGYDLEHHLVDDS--PPQFLTSTAQSSSVEGASVTKTLNPAYTIWKRQDK

Query:  VISSWLVGSMSEDILHQMIHCTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFES
        +ISSWL+GSMSE+IL+QM+HC S KEIW  LQ IF+SR LAQ M+ K KL  ++KG M LK+YF KI   VDALA++ KPV ++DHILYIL+GLGSD++S
Subjt:  VISSWLVGSMSEDILHQMIHCTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFES

Query:  MVSVISAKMGPQSVHEVMALLLTQENRNESKIATPDGSLPSANIVVNSKSIESKS-TKTNNAQSSQNFSPGNRGSRGGGRSGRGGRSGSWNNRNKVQCQL
        M+SVISA+    SV EVM+LLLTQE++NESK+ + + +LPS NIV  +    ++S  +TN      N S   RG RG GRS RG R     NRNK QCQ+
Subjt:  MVSVISAKMGPQSVHEVMALLLTQENRNESKIATPDGSLPSANIVVNSKSIESKS-TKTNNAQSSQNFSPGNRGSRGGGRSGRGGRSGSWNNRNKVQCQL

Query:  CTKFGHTASHCFFRYAPSNSGNTGEESRPGCIWGHRENGPGMDIL
        C K G++A  CFFRY P ++ +    +     + +  N P M  +
Subjt:  CTKFGHTASHCFFRYAPSNSGNTGEESRPGCIWGHRENGPGMDIL

A0A5A7UB21 Keratin, type II cytoskeletal 1-like4.3e-6353.18Show/hide
Query:  SLSSSNTAVDDSSASLSSQIFGPGNKISVVKLTDENFLLWKFQILMALEGYDLEHHLVD--DSPPQFLTSTAQSSSVEGASVTKTLNPAYTIWKRQDKVI
        S +SS   V+++  S    IFG GNKIS+VKL+D+NFLLWKFQIL ALE YDLE+      + P ++LTST  SS+    S T+T NP Y +WKR +++I
Subjt:  SLSSSNTAVDDSSASLSSQIFGPGNKISVVKLTDENFLLWKFQILMALEGYDLEHHLVD--DSPPQFLTSTAQSSSVEGASVTKTLNPAYTIWKRQDKVI

Query:  SSWLVGSMSEDILHQMIHCTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFESMV
        S WL+GSMSE+IL+QM+HC S KEIW  LQ IF+SR LAQ M+ K KL  ++KG MSLK+YF KIQ  VDALA++ KPV ++DHILYIL GLG D++SM+
Subjt:  SSWLVGSMSEDILHQMIHCTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFESMV

Query:  SVISAKMGPQSVHEVMALLLTQENRNESKIATPDGSLPSANIVVNS--KSIESKSTKTNNAQSSQNF
        S+ISA+    S+ EVM+LLLTQE++NESK+ + + +LP   IV  +  K  ES    + N   + +F
Subjt:  SVISAKMGPQSVHEVMALLLTQENRNESKIATPDGSLPSANIVVNS--KSIESKSTKTNNAQSSQNF

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-946.0e-8152.46Show/hide
Query:  MASLSSSNTAVDDSSASLSSQIFGPGNKISVVKLTDENFLLWKFQILMALEGYDLEHHLVDDS--PPQFLTSTAQSSSVEGASVTKTLNPAYTIWKRQDK
        M+S SS     +  ++S  +QIFG GNKIS+VKL D+ FLLWKFQIL ALE YDLE+ L  +S  P ++L ST  SS    AS T T NPAY +WKRQD+
Subjt:  MASLSSSNTAVDDSSASLSSQIFGPGNKISVVKLTDENFLLWKFQILMALEGYDLEHHLVDDS--PPQFLTSTAQSSSVEGASVTKTLNPAYTIWKRQDK

Query:  VISSWLVGSMSEDILHQMIHCTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFES
        +ISSWL+GSMSE+IL+QM+HC S KEIW  LQ IF+SR LAQ M+ K KL  ++KG M LK+YF KI   VDALA++ KPV ++DHILYIL+GLGSD++S
Subjt:  VISSWLVGSMSEDILHQMIHCTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFES

Query:  MVSVISAKMGPQSVHEVMALLLTQENRNESKIATPDGSLPSANIVVNSKSIESKS-TKTNNAQSSQNFSPGNRGSRGGGRSGRGGRSGSWNNRNKVQCQL
        M+SVISA+    SV EVM+LLLTQE++NESK+ + + +LPS NIV  +    ++S  +TN      N S   RG RG GRS RG R     NRNK QCQ+
Subjt:  MVSVISAKMGPQSVHEVMALLLTQENRNESKIATPDGSLPSANIVVNSKSIESKS-TKTNNAQSSQNFSPGNRGSRGGGRSGRGGRSGSWNNRNKVQCQL

Query:  CTKFGHTASHCFFRYAPSNSGNTGEESRPGCIWGHRENGPGMDIL
        C K G++A  CFFRY P ++ +    +     + +  N P M  +
Subjt:  CTKFGHTASHCFFRYAPSNSGNTGEESRPGCIWGHRENGPGMDIL

A0A5D3CRZ7 Putative Ty1-copia-like retrotransposon9.9e-6048.77Show/hide
Query:  ILMALEGYDLEHHLVDDSPP--QFLTSTAQSSSV----EGASVTKTLNPAYTIWKRQDKVISSWLVGSMSEDILHQMIHCTSTKEIWTCLQQIFTSRNLA
        IL ALE Y LE +    + P  +++      SSV      A     LN  Y +WKRQD++ISSWL+GSMSEDIL+QM+H TS K+IW  LQ I++SR LA
Subjt:  ILMALEGYDLEHHLVDDSPP--QFLTSTAQSSSV----EGASVTKTLNPAYTIWKRQDKVISSWLVGSMSEDILHQMIHCTSTKEIWTCLQQIFTSRNLA

Query:  QVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFESMVSVISAKMGPQSVHEVMALLLTQENRNESKIATPDGSLPS
        + M+ K KL  ++KG MSLK+YF KIQ  VDALA++ KP+ T+DHILYIL+GLG++++S++S+ISA+    SV + M+LLLTQE++ ESKI T + SLP+
Subjt:  QVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFESMVSVISAKMGPQSVHEVMALLLTQENRNESKIATPDGSLPS

Query:  ANIVVNSKSIES--KSTKTNNAQSSQNF------SPGNRGSRGGGRSGRGGRSGSWNNRNKVQCQLCTKFGHTASHCFFRYAPSN
         N+  +++ I S  K ++  +   S N       S  +  SR GGRS RGGR     NR+K QCQ+C+KFGH A  C+FRY P N
Subjt:  ANIVVNSKSIES--KSTKTNNAQSSQNF------SPGNRGSRGGGRSGRGGRSGSWNNRNKVQCQLCTKFGHTASHCFFRYAPSN

A0A6J1DLT9 uncharacterized protein LOC1110217572.0e-6847.78Show/hide
Query:  MASLSSSNTAVDDSSASLSSQIFGPGNKISVVKLTDENFLLWKFQILMALEGYDLEHHL--VDDSPPQFLTSTAQSSSVEGASVTKTLNPAYTIWKRQDK
        MASLSS   + D +    +S+   PG+K+S+V+L D+N LLWKFQI  AL+G  LE ++   +D+P QF+ +T   SS    S +   NPAY  W +QDK
Subjt:  MASLSSSNTAVDDSSASLSSQIFGPGNKISVVKLTDENFLLWKFQILMALEGYDLEHHL--VDDSPPQFLTSTAQSSSVEGASVTKTLNPAYTIWKRQDK

Query:  VISSWLVGSMSEDILHQMIHCTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFES
        +IS+WL+GSM+EDIL QM+ C S +EIWT L+ +F SR LA+VM++K KL+  +KG +SLKDYF KI++ VD+LA  GK + TEDHI++IL+GLG +F++
Subjt:  VISSWLVGSMSEDILHQMIHCTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFES

Query:  MVSVISAKMGPQSVHEVMALLLTQENRNESKIATPDGSLPSANIVVNSKSIESKSTKTNNAQSSQNFSP--GNRGSRGGGRSGRGGRSGSWNNRNKVQCQ
        ++SVI+A+  PQ++ EV +LLL QE RNE  +   DGSLPS N+ +N       S+K NN   S+ F+P   N   RG G + R     +W   NK QCQ
Subjt:  MVSVISAKMGPQSVHEVMALLLTQENRNESKIATPDGSLPSANIVVNSKSIESKSTKTNNAQSSQNFSP--GNRGSRGGGRSGRGGRSGSWNNRNKVQCQ

Query:  LCTKFGHTASHCFFRY
        +C +FGHTA  C+ R+
Subjt:  LCTKFGHTASHCFFRY

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.4e-2427.39Show/hide
Query:  NKISVVKLTDENFLLWKFQILMALEGYDLEHHLVDDSPPQFLTSTAQSSSVEGASVTKTLNPAYTIWKRQDKVISSWLVGSMSEDILHQMIHCTSTKEIW
        N  +V KLT  N+L+W  Q+    +GY+L   L          ST    +  G      +NP YT WKRQDK+I S ++G++S  +   +   T+  +IW
Subjt:  NKISVVKLTDENFLLWKFQILMALEGYDLEHHLVDDSPPQFLTSTAQSSSVEGASVTKTLNPAYTIWKRQDKVISSWLVGSMSEDILHQMIHCTSTKEIW

Query:  TCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFESMVSVISAKMGPQSVHEVMALLLTQENRN
          L++I+ + +   V +++T+L+   KG  ++ DY   +    D LA +GKP++ ++ +  +L  L  +++ ++  I+AK  P ++ E+   LL     +
Subjt:  TCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFESMVSVISAKMGPQSVHEVMALLLTQENRN

Query:  ESKIATPDGSLPSANIVVNSKSIESKSTKTNNAQSSQNFSPGNRGSRGGGRSGRGGRSGSWNNRNKV-----QCQLCTKFGHTASHCFFRYAPSNSGNTG
        ESKI     S     I  N+ S  + +T  NN   ++N    NR +    +  +   +    N N+      +CQ+C   GH+A  C       +S N+ 
Subjt:  ESKIATPDGSLPSANIVVNSKSIESKSTKTNNAQSSQNFSPGNRGSRGGGRSGRGGRSGSWNNRNKV-----QCQLCTKFGHTASHCFFRYAPSNSGNTG

Query:  EESRPGCIWGHREN
        +   P   W  R N
Subjt:  EESRPGCIWGHREN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.2e-1926.35Show/hide
Query:  NKISVVKLTDENFLLWKFQILMALEGYDLEHHLVDDSPPQFLTSTAQSSSVEGASVTKTLNPAYTIWKRQDKVISSWLVGSMSEDILHQMIHCTSTKEIW
        N  +V KLT  N+L+W  Q+    +GY+L   L          ST    +  G      +NP YT W+RQDK+I S ++G++S  +   +   T+  +IW
Subjt:  NKISVVKLTDENFLLWKFQILMALEGYDLEHHLVDDSPPQFLTSTAQSSSVEGASVTKTLNPAYTIWKRQDKVISSWLVGSMSEDILHQMIHCTSTKEIW

Query:  TCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFESMVSVISAKMGPQSVHEVMALLLTQENRN
          L++I+ + +   V    T+L+ + +                D LA +GKP++ ++ +  +L  L  D++ ++  I+AK  P S+ E+   L+ +    
Subjt:  TCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFESMVSVISAKMGPQSVHEVMALLLTQENRN

Query:  ESKIATPDGSLPSANIV-VNSKSIESKSTKTNNAQS----SQNFSPGNRGSRGGGRSGRGGRSGSWNNRNKV-QCQLCTKFGHTASHCFFRYAPSNSGNT
        ESK+     +L SA +V + +  +  ++T TN  Q+    ++N++  N  S     S  G RS +   +  + +CQ+C+  GH+A  C   +   ++ N 
Subjt:  ESKIATPDGSLPSANIV-VNSKSIESKSTKTNNAQS----SQNFSPGNRGSRGGGRSGRGGRSGSWNNRNKV-QCQLCTKFGHTASHCFFRYAPSNSGNT

Query:  GEESRPGCIWGHREN
         + + P   W  R N
Subjt:  GEESRPGCIWGHREN

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.9e-0720.58Show/hide
Query:  SSSNTAVDDSSASLSSQIFGPGNKISVVKLT--DENFLLWKFQILMALEGYDLEHHLVDDSPPQFLTSTAQSSSVEGA-SVTKTLNPAYTIWKRQDKVIS
        S S T+  DS   L   I  P +  S+ KL+  ++N++ WK +                     FL  T +   ++G        +P Y  W++ + ++ 
Subjt:  SSSNTAVDDSSASLSSQIFGPGNKISVVKLT--DENFLLWKFQILMALEGYDLEHHLVDDSPPQFLTSTAQSSSVEGA-SVTKTLNPAYTIWKRQDKVIS

Query:  SWLVGSMSEDILHQMIHCTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHY-------------------VDALAAVGKPVETE
         WL+ SM++ +L  +++  +  ++W  L+++F      ++ +++ +L TL++GG S+++YF K+                       +      +  E E
Subjt:  SWLVGSMSEDILHQMIHCTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHY-------------------VDALAAVGKPVETE

Query:  DHILYILS-GLGSDFESMVSVISAKMGPQSVHEVMALLLTQEN
            +++   L   FE++ + I  +  P S+HE  A++   E+
Subjt:  DHILYILS-GLGSDFESMVSVISAKMGPQSVHEVMALLLTQEN

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)6.0e-0925.49Show/hide
Query:  WKRQDKVISSWLVGSMS-EDILHQMIHCTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSG
        W+++D ++   L G+++ +      +  +++++IW  ++  F +   A+ +++ ++L+T   G M + DY+ K++   D+L  V  PV   + ++Y+L+G
Subjt:  WKRQDKVISSWLVGSMS-EDILHQMIHCTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSG

Query:  LGSDFESMVSVISAKMGPQSVHEVMALLLTQENRNESKIATPDGSLPSANIVVNSKSIE----SKSTKTNNAQSSQNFSPGNRGSRGGGRS---GRGGRS
        L   F+++++VI  +    S  +   +L  +E+R +  I       P+   V +S S      S++    N Q S     G RG RG G +   GRGGR 
Subjt:  LGSDFESMVSVISAKMGPQSVHEVMALLLTQENRNESKIATPDGSLPSANIVVNSKSIE----SKSTKTNNAQSSQNFSPGNRGSRGGGRS---GRGGRS

Query:  GSWN
          +N
Subjt:  GSWN

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.9e-1125.37Show/hide
Query:  WKRQDKVISSWLVGSMSEDILHQMIH--CTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILS
        WK +D ++  W+ G++++ +L  +I   CT+ +++W  L+ +F     A+ ++ + +L+T     +S+ +Y  K++   D L  V  P+     ++++L+
Subjt:  WKRQDKVISSWLVGSMSEDILHQMIH--CTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILS

Query:  GLGSDFESMVSVISAKMGPQSVHEVMALLLTQENR--NESKIATPDGSLPSANIVVNS--KSIESKSTKTNNAQSSQNFSPGNRGSRGGGRSGRGGRSGS
        GL   ++ +++VI  K    S  E  ++LL +E+R  N+SK +    + PS + V+ +  +  E    + +N  S+       + +RGGG S      G 
Subjt:  GLGSDFESMVSVISAKMGPQSVHEVMALLLTQENR--NESKIATPDGSLPSANIVVNS--KSIESKSTKTNNAQSSQNFSPGNRGSRGGGRSGRGGRSGS

Query:  WNNRN
        +NN N
Subjt:  WNNRN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCTCTGAGTTCTTCAAATACGGCGGTGGATGATTCCTCTGCTTCTTTATCCTCTCAGATCTTCGGTCCGGGTAACAAAATTTCAGTTGTTAAATTAACTGATGA
AAATTTTCTCTTATGGAAGTTTCAGATCCTTATGGCTCTCGAGGGCTATGATTTGGAACATCACCTTGTCGATGATTCTCCTCCTCAATTTCTTACATCTACTGCTCAGT
CTTCCTCCGTGGAGGGGGCGTCTGTAACGAAAACACTGAACCCAGCCTACACTATCTGGAAACGTCAAGACAAAGTCATCTCGTCATGGCTGGTGGGTTCAATGTCGGAG
GACATTCTTCACCAAATGATACATTGCACCTCAACGAAGGAAATTTGGACCTGTCTACAACAAATTTTTACCTCCCGTAACCTAGCTCAGGTAATGAAGGTTAAAACGAA
ACTCCAAACGCTGCAAAAGGGAGGTATGTCTCTTAAGGATTACTTTTCAAAAATACAGCACTATGTTGATGCATTGGCCGCTGTCGGTAAGCCTGTCGAAACTGAGGATC
ACATATTATACATTCTGTCCGGTCTTGGATCTGATTTTGAGTCGATGGTCTCTGTGATATCAGCTAAAATGGGTCCCCAATCTGTTCACGAAGTTATGGCTCTTTTATTA
ACTCAAGAAAATCGAAATGAGAGTAAAATAGCTACTCCGGATGGCTCTCTTCCCTCTGCTAACATTGTAGTTAATTCTAAATCGATTGAGTCTAAGTCCACCAAAACTAA
CAATGCTCAGTCTTCTCAGAATTTTTCTCCTGGAAACAGAGGAAGCAGAGGTGGGGGTCGTTCTGGTCGAGGGGGCCGATCTGGTTCTTGGAACAATCGCAACAAGGTTC
AGTGTCAACTGTGCACAAAATTTGGGCACACTGCTTCCCATTGCTTCTTCCGCTATGCTCCTTCCAACTCAGGAAACACAGGCGAAGAATCTCGACCGGGATGCATATGG
GGGCACCGGGAAAATGGGCCAGGAATGGATATTTTGAATCCCAAAACCGCGACCTCGGCCGGAAGCCACGAGAGATTTACCATCATGGCATTGGTACCAAAGATTGCACC
CGCTGACTACTTTCCTGCTGCTTTGACATGTTGTTCACATCTCGGAAAAACCGTTAAAAATATTAAGGATAAATTAACTGATACCCAGTTAGAAATGTTTAGGCAAACAT
GTTTTGGACATTTCTTAGATACGTCCTTGATGTTTAATGGACAACTTATTCATTATTTTCTTTTGAGGGAAGTGAATGAGCCTAGGATTGATGTTATTAGCTTTGAGATT
CTGGGAGAGAAAGTTTCATTTGGTCGGAGGGAATTTAACCTTATTACTGGAATTAGGCATAGGACCCAACATGTTAGGGGTAATGTATCTAGTACTAGACTGAGAAGACT
GTACCTTAACGATAGCATCAGCATGAAAGGGTTTGAACTAGATAGATTATTCCTACCATTAATTTTGAGAGCGATGAGGATGCTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCTCTGAGTTCTTCAAATACGGCGGTGGATGATTCCTCTGCTTCTTTATCCTCTCAGATCTTCGGTCCGGGTAACAAAATTTCAGTTGTTAAATTAACTGATGA
AAATTTTCTCTTATGGAAGTTTCAGATCCTTATGGCTCTCGAGGGCTATGATTTGGAACATCACCTTGTCGATGATTCTCCTCCTCAATTTCTTACATCTACTGCTCAGT
CTTCCTCCGTGGAGGGGGCGTCTGTAACGAAAACACTGAACCCAGCCTACACTATCTGGAAACGTCAAGACAAAGTCATCTCGTCATGGCTGGTGGGTTCAATGTCGGAG
GACATTCTTCACCAAATGATACATTGCACCTCAACGAAGGAAATTTGGACCTGTCTACAACAAATTTTTACCTCCCGTAACCTAGCTCAGGTAATGAAGGTTAAAACGAA
ACTCCAAACGCTGCAAAAGGGAGGTATGTCTCTTAAGGATTACTTTTCAAAAATACAGCACTATGTTGATGCATTGGCCGCTGTCGGTAAGCCTGTCGAAACTGAGGATC
ACATATTATACATTCTGTCCGGTCTTGGATCTGATTTTGAGTCGATGGTCTCTGTGATATCAGCTAAAATGGGTCCCCAATCTGTTCACGAAGTTATGGCTCTTTTATTA
ACTCAAGAAAATCGAAATGAGAGTAAAATAGCTACTCCGGATGGCTCTCTTCCCTCTGCTAACATTGTAGTTAATTCTAAATCGATTGAGTCTAAGTCCACCAAAACTAA
CAATGCTCAGTCTTCTCAGAATTTTTCTCCTGGAAACAGAGGAAGCAGAGGTGGGGGTCGTTCTGGTCGAGGGGGCCGATCTGGTTCTTGGAACAATCGCAACAAGGTTC
AGTGTCAACTGTGCACAAAATTTGGGCACACTGCTTCCCATTGCTTCTTCCGCTATGCTCCTTCCAACTCAGGAAACACAGGCGAAGAATCTCGACCGGGATGCATATGG
GGGCACCGGGAAAATGGGCCAGGAATGGATATTTTGAATCCCAAAACCGCGACCTCGGCCGGAAGCCACGAGAGATTTACCATCATGGCATTGGTACCAAAGATTGCACC
CGCTGACTACTTTCCTGCTGCTTTGACATGTTGTTCACATCTCGGAAAAACCGTTAAAAATATTAAGGATAAATTAACTGATACCCAGTTAGAAATGTTTAGGCAAACAT
GTTTTGGACATTTCTTAGATACGTCCTTGATGTTTAATGGACAACTTATTCATTATTTTCTTTTGAGGGAAGTGAATGAGCCTAGGATTGATGTTATTAGCTTTGAGATT
CTGGGAGAGAAAGTTTCATTTGGTCGGAGGGAATTTAACCTTATTACTGGAATTAGGCATAGGACCCAACATGTTAGGGGTAATGTATCTAGTACTAGACTGAGAAGACT
GTACCTTAACGATAGCATCAGCATGAAAGGGTTTGAACTAGATAGATTATTCCTACCATTAATTTTGAGAGCGATGAGGATGCTGTGA
Protein sequenceShow/hide protein sequence
MASLSSSNTAVDDSSASLSSQIFGPGNKISVVKLTDENFLLWKFQILMALEGYDLEHHLVDDSPPQFLTSTAQSSSVEGASVTKTLNPAYTIWKRQDKVISSWLVGSMSE
DILHQMIHCTSTKEIWTCLQQIFTSRNLAQVMKVKTKLQTLQKGGMSLKDYFSKIQHYVDALAAVGKPVETEDHILYILSGLGSDFESMVSVISAKMGPQSVHEVMALLL
TQENRNESKIATPDGSLPSANIVVNSKSIESKSTKTNNAQSSQNFSPGNRGSRGGGRSGRGGRSGSWNNRNKVQCQLCTKFGHTASHCFFRYAPSNSGNTGEESRPGCIW
GHRENGPGMDILNPKTATSAGSHERFTIMALVPKIAPADYFPAALTCCSHLGKTVKNIKDKLTDTQLEMFRQTCFGHFLDTSLMFNGQLIHYFLLREVNEPRIDVISFEI
LGEKVSFGRREFNLITGIRHRTQHVRGNVSSTRLRRLYLNDSISMKGFELDRLFLPLILRAMRML