; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026845 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026845
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr10:42508837..42510026
RNA-Seq ExpressionLag0026845
SyntenyLag0026845
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]4.2e-8650.38Show/hide
Query:  MSTPESSLSGALGDSSAGSSQAFCPGNKISIVKLTDDNFLLWKFQILIALEGYDLESHL--LEDPPPKTLGSPAGSSSSSDATPPPTPNPTYLQWKRQDK
        MS+  S L     ++S+  +Q F  GNKIS+VKL DD FLLWKFQIL ALE YDLE+ L    +PP K L S   SS+S+      TPNP Y  WKRQD+
Subjt:  MSTPESSLSGALGDSSAGSSQAFCPGNKISIVKLTDDNFLLWKFQILIALEGYDLESHL--LEDPPPKTLGSPAGSSSSSDATPPPTPNPTYLQWKRQDK

Query:  VISSWLVGSMTEDLLHQMIHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKG-------------------------GRGSHIVFILSGLGSDYES
        +ISSWL+GSM+E++L+QM+HC + KEIW  LQ IF++R LAQ M+ K KL  I+KG                             HI++IL+GLGSDY+S
Subjt:  VISSWLVGSMTEDLLHQMIHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKG-------------------------GRGSHIVFILSGLGSDYES

Query:  MVSVITAKIGPQTVQEVMSLLLTQENRIESKLVPIESSVPSVNLMVQSKPPESDS-SKTNQSQFSSNIGSGNRGGRGGRGGRDYSNRGGGRSWNNRNKVQ
        M+SVI+A+    +VQEVMSLLLTQE++ ESKL+  E+++PSVN++ Q+    ++S  +TNQ+ + +N     RGGRG   GR  SNRG      NRNK Q
Subjt:  MVSVITAKIGPQTVQEVMSLLLTQENRIESKLVPIESSVPSVNLMVQSKPPESDS-SKTNQSQFSSNIGSGNRGGRGGRGGRDYSNRGGGRSWNNRNKVQ

Query:  CQLCTKFGHTAAKCFFRYAPQSSS--MSPGAYSSGFNNFNRSTSFLQMSAMVASPDLNQDTNWYPDSGATNHLTHNFNNLSVGSEYGGSNQVHVGNGAG
        CQ+C K G++A +CFFRY P+S+S   SP ++++ + N N      QMSAMVA+ DLN D+NWYPDSGATNHLTH+ +NLS+GSEYGG NQ++  NG+G
Subjt:  CQLCTKFGHTAAKCFFRYAPQSSS--MSPGAYSSGFNNFNRSTSFLQMSAMVASPDLNQDTNWYPDSGATNHLTHNFNNLSVGSEYGGSNQVHVGNGAG

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]4.2e-8650.38Show/hide
Query:  MSTPESSLSGALGDSSAGSSQAFCPGNKISIVKLTDDNFLLWKFQILIALEGYDLESHL--LEDPPPKTLGSPAGSSSSSDATPPPTPNPTYLQWKRQDK
        MS+  S L     ++S+  +Q F  GNKIS+VKL DD FLLWKFQIL ALE YDLE+ L    +PP K L S   SS+S+      TPNP Y  WKRQD+
Subjt:  MSTPESSLSGALGDSSAGSSQAFCPGNKISIVKLTDDNFLLWKFQILIALEGYDLESHL--LEDPPPKTLGSPAGSSSSSDATPPPTPNPTYLQWKRQDK

Query:  VISSWLVGSMTEDLLHQMIHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKG-------------------------GRGSHIVFILSGLGSDYES
        +ISSWL+GSM+E++L+QM+HC + KEIW  LQ IF++R LAQ M+ K KL  I+KG                             HI++IL+GLGSDY+S
Subjt:  VISSWLVGSMTEDLLHQMIHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKG-------------------------GRGSHIVFILSGLGSDYES

Query:  MVSVITAKIGPQTVQEVMSLLLTQENRIESKLVPIESSVPSVNLMVQSKPPESDS-SKTNQSQFSSNIGSGNRGGRGGRGGRDYSNRGGGRSWNNRNKVQ
        M+SVI+A+    +VQEVMSLLLTQE++ ESKL+  E+++PSVN++ Q+    ++S  +TNQ+ + +N     RGGRG   GR  SNRG      NRNK Q
Subjt:  MVSVITAKIGPQTVQEVMSLLLTQENRIESKLVPIESSVPSVNLMVQSKPPESDS-SKTNQSQFSSNIGSGNRGGRGGRGGRDYSNRGGGRSWNNRNKVQ

Query:  CQLCTKFGHTAAKCFFRYAPQSSS--MSPGAYSSGFNNFNRSTSFLQMSAMVASPDLNQDTNWYPDSGATNHLTHNFNNLSVGSEYGGSNQVHVGNGAG
        CQ+C K G++A +CFFRY P+S+S   SP ++++ + N N      QMSAMVA+ DLN D+NWYPDSGATNHLTH+ +NLS+GSEYGG NQ++  NG+G
Subjt:  CQLCTKFGHTAAKCFFRYAPQSSS--MSPGAYSSGFNNFNRSTSFLQMSAMVASPDLNQDTNWYPDSGATNHLTHNFNNLSVGSEYGGSNQVHVGNGAG

XP_022136882.1 dr1-associated corepressor homolog isoform X1 [Momordica charantia]1.6e-4839.63Show/hide
Query:  RQDKVISSWLVGSMTEDLLHQMIHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKGGR-------------------------GSHIVFILSGLGS
        +QDK+I+SWL  SM E++L +MIHC T +E+W  L+ ++T+RNLA++M++K+KL+ I+KG                             HI+ IL+GL S
Subjt:  RQDKVISSWLVGSMTEDLLHQMIHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKGGR-------------------------GSHIVFILSGLGS

Query:  DYESMVSVITAKIGPQTVQEVMSLLLTQENRIESKLVPIESSVPSVNLMVQSKPPESDSSKTNQSQFSSNIGSGNRGGRGGRGGRDYSNRGGGRSWNNRN
        ++ES VSVI+A+   QT+QEV SLLL+ E R E   +  + ++PSVNL  Q+K   S  S   Q  +  N  S N G    R           R+WN+ N
Subjt:  DYESMVSVITAKIGPQTVQEVMSLLLTQENRIESKLVPIESSVPSVNLMVQSKPPESDSSKTNQSQFSSNIGSGNRGGRGGRGGRDYSNRGGGRSWNNRN

Query:  KVQCQLCTKFGHTAAKCFFRY-----APQSSSMSPGAYSSG------------------FNNFNRSTSFLQMSAMVASPDLNQDTNWYPDSGATNHLTHN
        + QCQ+  KFGHTA +C+ R+      P   S     +S G                  F N  +  S   M+A +A  D N+DTNWYPDSGATNH+T N
Subjt:  KVQCQLCTKFGHTAAKCFFRY-----APQSSSMSPGAYSSG------------------FNNFNRSTSFLQMSAMVASPDLNQDTNWYPDSGATNHLTHN

Query:  FNNLSVGSEYGGSNQVHVGNGAG
        FNNL+  +EY G NQV +GNG G
Subjt:  FNNLSVGSEYGGSNQVHVGNGAG

XP_022136883.1 dr1-associated corepressor homolog isoform X2 [Momordica charantia]5.9e-4839.56Show/hide
Query:  RQDKVISSWLVGSMTEDLLHQMIHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKGGR-------------------------GSHIVFILSGLGS
        +QDK+I+SWL  SM E++L +MIHC T +E+W  L+ ++T+RNLA++M++K+KL+ I+KG                             HI+ IL+GL S
Subjt:  RQDKVISSWLVGSMTEDLLHQMIHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKGGR-------------------------GSHIVFILSGLGS

Query:  DYESMVSVITAKIGPQTVQEVMSLLLTQENRIESKLVPIESSVPSVNLMVQSKPPESDSSKTNQSQFSSNIGSGNRGGRGGRGGRDYSNRGGGRSWNNRN
        ++ES VSVI+A+   QT+QEV SLLL+ E R E   +  + ++PSVNL  Q+K   S  S   Q  +  N  S N G    R           R+WN+ N
Subjt:  DYESMVSVITAKIGPQTVQEVMSLLLTQENRIESKLVPIESSVPSVNLMVQSKPPESDSSKTNQSQFSSNIGSGNRGGRGGRGGRDYSNRGGGRSWNNRN

Query:  KVQCQLCTKFGHTAAKCFFRY-----APQSSSMSPGAYSSG------------------FNNFNRSTSFLQMSAMVASPDLNQDTNWYPDSGATNHLTHN
        + QCQ+  KFGHTA +C+ R+      P   S     +S G                  F N  +  S   M+A +A  D N+DTNWYPDSGATNH+T N
Subjt:  KVQCQLCTKFGHTAAKCFFRY-----APQSSSMSPGAYSSG------------------FNNFNRSTSFLQMSAMVASPDLNQDTNWYPDSGATNHLTHN

Query:  FNNLSVGSEYGGSNQVHVGNG
        FNNL+  +EY G NQV +GNG
Subjt:  FNNLSVGSEYGGSNQVHVGNG

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]1.2e-6440.89Show/hide
Query:  SSQAFCPGNKISIVKLTDDNFLLWKFQILIALEGYDLESHL--LEDPPPKTLGSPAGSSSSSDATPPPTPNPTYLQWKRQDKVISSWLVGSMTEDLLHQM
        +S+   PG+K+SIV+L DDN LLWKFQI  AL+G  LES++   ED P + + +    SSSS        NP Y +W +QDK+IS+WL+GSM ED+L QM
Subjt:  SSQAFCPGNKISIVKLTDDNFLLWKFQILIALEGYDLESHL--LEDPPPKTLGSPAGSSSSSDATPPPTPNPTYLQWKRQDKVISSWLVGSMTEDLLHQM

Query:  IHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKGGRG-------------------------SHIVFILSGLGSDYESMVSVITAKIGPQTVQEVM
        + C + +EIWT L+ +F +R LA++M++K KL+  +KG                             HI+ IL+GLG ++++++SVITA+  PQT+QEV 
Subjt:  IHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKGGRG-------------------------SHIVFILSGLGSDYESMVSVITAKIGPQTVQEVM

Query:  SLLLTQENRIESKLVPIESSVPSVNLMVQSKPPESDSSKTNQSQFSS--NIGSGNRGGRGGRGGRDYSNRGGGRSWNNRNKVQCQLCTKFGHTAAKCFFR
        SLLL QE R E  L+  + S+PSVNL +      +DSSK N    S   N    N   RG       SNR   R+W   NK QCQ+C +FGHTA +C+ R
Subjt:  SLLLTQENRIESKLVPIESSVPSVNLMVQSKPPESDSSKTNQSQFSS--NIGSGNRGGRGGRGGRDYSNRGGGRSWNNRNKVQCQLCTKFGHTAAKCFFR

Query:  YA-------PQSSSMSPGAYSSGF-----------------NNFN-RSTSFLQMSAMVASPDLNQDTNWYPDSGATNHLTHNFNNLSVGSEYGGSNQVHV
        +           ++ SP  +SSGF                  NF+  S S  QM A++ + D N+D+NWY DSG TNH+T+ F N S+GSEY G  ++ V
Subjt:  YA-------PQSSSMSPGAYSSGF-----------------NNFN-RSTSFLQMSAMVASPDLNQDTNWYPDSGATNHLTHNFNNLSVGSEYGGSNQVHV

Query:  GNGAGH
        GNG G+
Subjt:  GNGAGH

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-942.0e-8650.38Show/hide
Query:  MSTPESSLSGALGDSSAGSSQAFCPGNKISIVKLTDDNFLLWKFQILIALEGYDLESHL--LEDPPPKTLGSPAGSSSSSDATPPPTPNPTYLQWKRQDK
        MS+  S L     ++S+  +Q F  GNKIS+VKL DD FLLWKFQIL ALE YDLE+ L    +PP K L S   SS+S+      TPNP Y  WKRQD+
Subjt:  MSTPESSLSGALGDSSAGSSQAFCPGNKISIVKLTDDNFLLWKFQILIALEGYDLESHL--LEDPPPKTLGSPAGSSSSSDATPPPTPNPTYLQWKRQDK

Query:  VISSWLVGSMTEDLLHQMIHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKG-------------------------GRGSHIVFILSGLGSDYES
        +ISSWL+GSM+E++L+QM+HC + KEIW  LQ IF++R LAQ M+ K KL  I+KG                             HI++IL+GLGSDY+S
Subjt:  VISSWLVGSMTEDLLHQMIHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKG-------------------------GRGSHIVFILSGLGSDYES

Query:  MVSVITAKIGPQTVQEVMSLLLTQENRIESKLVPIESSVPSVNLMVQSKPPESDS-SKTNQSQFSSNIGSGNRGGRGGRGGRDYSNRGGGRSWNNRNKVQ
        M+SVI+A+    +VQEVMSLLLTQE++ ESKL+  E+++PSVN++ Q+    ++S  +TNQ+ + +N     RGGRG   GR  SNRG      NRNK Q
Subjt:  MVSVITAKIGPQTVQEVMSLLLTQENRIESKLVPIESSVPSVNLMVQSKPPESDS-SKTNQSQFSSNIGSGNRGGRGGRGGRDYSNRGGGRSWNNRNKVQ

Query:  CQLCTKFGHTAAKCFFRYAPQSSS--MSPGAYSSGFNNFNRSTSFLQMSAMVASPDLNQDTNWYPDSGATNHLTHNFNNLSVGSEYGGSNQVHVGNGAG
        CQ+C K G++A +CFFRY P+S+S   SP ++++ + N N      QMSAMVA+ DLN D+NWYPDSGATNHLTH+ +NLS+GSEYGG NQ++  NG+G
Subjt:  CQLCTKFGHTAAKCFFRYAPQSSS--MSPGAYSSGFNNFNRSTSFLQMSAMVASPDLNQDTNWYPDSGATNHLTHNFNNLSVGSEYGGSNQVHVGNGAG

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-942.0e-8650.38Show/hide
Query:  MSTPESSLSGALGDSSAGSSQAFCPGNKISIVKLTDDNFLLWKFQILIALEGYDLESHL--LEDPPPKTLGSPAGSSSSSDATPPPTPNPTYLQWKRQDK
        MS+  S L     ++S+  +Q F  GNKIS+VKL DD FLLWKFQIL ALE YDLE+ L    +PP K L S   SS+S+      TPNP Y  WKRQD+
Subjt:  MSTPESSLSGALGDSSAGSSQAFCPGNKISIVKLTDDNFLLWKFQILIALEGYDLESHL--LEDPPPKTLGSPAGSSSSSDATPPPTPNPTYLQWKRQDK

Query:  VISSWLVGSMTEDLLHQMIHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKG-------------------------GRGSHIVFILSGLGSDYES
        +ISSWL+GSM+E++L+QM+HC + KEIW  LQ IF++R LAQ M+ K KL  I+KG                             HI++IL+GLGSDY+S
Subjt:  VISSWLVGSMTEDLLHQMIHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKG-------------------------GRGSHIVFILSGLGSDYES

Query:  MVSVITAKIGPQTVQEVMSLLLTQENRIESKLVPIESSVPSVNLMVQSKPPESDS-SKTNQSQFSSNIGSGNRGGRGGRGGRDYSNRGGGRSWNNRNKVQ
        M+SVI+A+    +VQEVMSLLLTQE++ ESKL+  E+++PSVN++ Q+    ++S  +TNQ+ + +N     RGGRG   GR  SNRG      NRNK Q
Subjt:  MVSVITAKIGPQTVQEVMSLLLTQENRIESKLVPIESSVPSVNLMVQSKPPESDS-SKTNQSQFSSNIGSGNRGGRGGRGGRDYSNRGGGRSWNNRNKVQ

Query:  CQLCTKFGHTAAKCFFRYAPQSSS--MSPGAYSSGFNNFNRSTSFLQMSAMVASPDLNQDTNWYPDSGATNHLTHNFNNLSVGSEYGGSNQVHVGNGAG
        CQ+C K G++A +CFFRY P+S+S   SP ++++ + N N      QMSAMVA+ DLN D+NWYPDSGATNHLTH+ +NLS+GSEYGG NQ++  NG+G
Subjt:  CQLCTKFGHTAAKCFFRYAPQSSS--MSPGAYSSGFNNFNRSTSFLQMSAMVASPDLNQDTNWYPDSGATNHLTHNFNNLSVGSEYGGSNQVHVGNGAG

A0A6J1C6N9 dr1-associated corepressor homolog isoform X17.5e-4939.63Show/hide
Query:  RQDKVISSWLVGSMTEDLLHQMIHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKGGR-------------------------GSHIVFILSGLGS
        +QDK+I+SWL  SM E++L +MIHC T +E+W  L+ ++T+RNLA++M++K+KL+ I+KG                             HI+ IL+GL S
Subjt:  RQDKVISSWLVGSMTEDLLHQMIHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKGGR-------------------------GSHIVFILSGLGS

Query:  DYESMVSVITAKIGPQTVQEVMSLLLTQENRIESKLVPIESSVPSVNLMVQSKPPESDSSKTNQSQFSSNIGSGNRGGRGGRGGRDYSNRGGGRSWNNRN
        ++ES VSVI+A+   QT+QEV SLLL+ E R E   +  + ++PSVNL  Q+K   S  S   Q  +  N  S N G    R           R+WN+ N
Subjt:  DYESMVSVITAKIGPQTVQEVMSLLLTQENRIESKLVPIESSVPSVNLMVQSKPPESDSSKTNQSQFSSNIGSGNRGGRGGRGGRDYSNRGGGRSWNNRN

Query:  KVQCQLCTKFGHTAAKCFFRY-----APQSSSMSPGAYSSG------------------FNNFNRSTSFLQMSAMVASPDLNQDTNWYPDSGATNHLTHN
        + QCQ+  KFGHTA +C+ R+      P   S     +S G                  F N  +  S   M+A +A  D N+DTNWYPDSGATNH+T N
Subjt:  KVQCQLCTKFGHTAAKCFFRY-----APQSSSMSPGAYSSG------------------FNNFNRSTSFLQMSAMVASPDLNQDTNWYPDSGATNHLTHN

Query:  FNNLSVGSEYGGSNQVHVGNGAG
        FNNL+  +EY G NQV +GNG G
Subjt:  FNNLSVGSEYGGSNQVHVGNGAG

A0A6J1C8R2 dr1-associated corepressor homolog isoform X22.9e-4839.56Show/hide
Query:  RQDKVISSWLVGSMTEDLLHQMIHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKGGR-------------------------GSHIVFILSGLGS
        +QDK+I+SWL  SM E++L +MIHC T +E+W  L+ ++T+RNLA++M++K+KL+ I+KG                             HI+ IL+GL S
Subjt:  RQDKVISSWLVGSMTEDLLHQMIHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKGGR-------------------------GSHIVFILSGLGS

Query:  DYESMVSVITAKIGPQTVQEVMSLLLTQENRIESKLVPIESSVPSVNLMVQSKPPESDSSKTNQSQFSSNIGSGNRGGRGGRGGRDYSNRGGGRSWNNRN
        ++ES VSVI+A+   QT+QEV SLLL+ E R E   +  + ++PSVNL  Q+K   S  S   Q  +  N  S N G    R           R+WN+ N
Subjt:  DYESMVSVITAKIGPQTVQEVMSLLLTQENRIESKLVPIESSVPSVNLMVQSKPPESDSSKTNQSQFSSNIGSGNRGGRGGRGGRDYSNRGGGRSWNNRN

Query:  KVQCQLCTKFGHTAAKCFFRY-----APQSSSMSPGAYSSG------------------FNNFNRSTSFLQMSAMVASPDLNQDTNWYPDSGATNHLTHN
        + QCQ+  KFGHTA +C+ R+      P   S     +S G                  F N  +  S   M+A +A  D N+DTNWYPDSGATNH+T N
Subjt:  KVQCQLCTKFGHTAAKCFFRY-----APQSSSMSPGAYSSG------------------FNNFNRSTSFLQMSAMVASPDLNQDTNWYPDSGATNHLTHN

Query:  FNNLSVGSEYGGSNQVHVGNG
        FNNL+  +EY G NQV +GNG
Subjt:  FNNLSVGSEYGGSNQVHVGNG

A0A6J1DLT9 uncharacterized protein LOC1110217575.7e-6540.89Show/hide
Query:  SSQAFCPGNKISIVKLTDDNFLLWKFQILIALEGYDLESHL--LEDPPPKTLGSPAGSSSSSDATPPPTPNPTYLQWKRQDKVISSWLVGSMTEDLLHQM
        +S+   PG+K+SIV+L DDN LLWKFQI  AL+G  LES++   ED P + + +    SSSS        NP Y +W +QDK+IS+WL+GSM ED+L QM
Subjt:  SSQAFCPGNKISIVKLTDDNFLLWKFQILIALEGYDLESHL--LEDPPPKTLGSPAGSSSSSDATPPPTPNPTYLQWKRQDKVISSWLVGSMTEDLLHQM

Query:  IHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKGGRG-------------------------SHIVFILSGLGSDYESMVSVITAKIGPQTVQEVM
        + C + +EIWT L+ +F +R LA++M++K KL+  +KG                             HI+ IL+GLG ++++++SVITA+  PQT+QEV 
Subjt:  IHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKGGRG-------------------------SHIVFILSGLGSDYESMVSVITAKIGPQTVQEVM

Query:  SLLLTQENRIESKLVPIESSVPSVNLMVQSKPPESDSSKTNQSQFSS--NIGSGNRGGRGGRGGRDYSNRGGGRSWNNRNKVQCQLCTKFGHTAAKCFFR
        SLLL QE R E  L+  + S+PSVNL +      +DSSK N    S   N    N   RG       SNR   R+W   NK QCQ+C +FGHTA +C+ R
Subjt:  SLLLTQENRIESKLVPIESSVPSVNLMVQSKPPESDSSKTNQSQFSS--NIGSGNRGGRGGRGGRDYSNRGGGRSWNNRNKVQCQLCTKFGHTAAKCFFR

Query:  YA-------PQSSSMSPGAYSSGF-----------------NNFN-RSTSFLQMSAMVASPDLNQDTNWYPDSGATNHLTHNFNNLSVGSEYGGSNQVHV
        +           ++ SP  +SSGF                  NF+  S S  QM A++ + D N+D+NWY DSG TNH+T+ F N S+GSEY G  ++ V
Subjt:  YA-------PQSSSMSPGAYSSGF-----------------NNFN-RSTSFLQMSAMVASPDLNQDTNWYPDSGATNHLTHNFNNLSVGSEYGGSNQVHV

Query:  GNGAGH
        GNG G+
Subjt:  GNGAGH

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.2e-2026.03Show/hide
Query:  NKISIVKLTDDNFLLWKFQILIALEGYDLESHL--LEDPPPKTLGSPAGSSSSSDATPPPTPNPTYLQWKRQDKVISSWLVGSMTEDLLHQMIHCTTTKE
        N  ++ KLT  N+L+W  Q+    +GY+L   L      PP T+G+ A           P  NP Y +WKRQDK+I S ++G+++  +   +   TT  +
Subjt:  NKISIVKLTDDNFLLWKFQILIALEGYDLESHL--LEDPPPKTLGSPAGSSSSSDATPPPTPNPTYLQWKRQDKVISSWLVGSMTEDLLHQMIHCTTTKE

Query:  IWTALQQIFTTRNLAQMMKIKTKLQTIQKGGR-------------------------GSHIVFILSGLGSDYESMVSVITAKIGPQTVQEVMSLLLTQEN
        IW  L++I+   +   + +++T+L+   KG +                            +  +L  L  +Y+ ++  I AK  P T+ E+   LL  E+
Subjt:  IWTALQQIFTTRNLAQMMKIKTKLQTIQKGGR-------------------------GSHIVFILSGLGSDYESMVSVITAKIGPQTVQEVMSLLLTQEN

Query:  RI----ESKLVPIESSVPSVNLMVQSKPPESDSSKTNQSQFSSNIGSGNRGGRGGRGGRDYSNRGGGRSW----------NNRNKV---QCQLCTKFGHT
        +I     + ++PI ++  S            +++ TN      N  +GNR  R      + +N    + W          NN++K    +CQ+C   GH+
Subjt:  RI----ESKLVPIESSVPSVNLMVQSKPPESDSSKTNQSQFSSNIGSGNRGGRGGRGGRDYSNRGGGRSW----------NNRNKV---QCQLCTKFGHT

Query:  AAKCFFRYAPQSSSMSPGAYSSGFNNFNRSTSFL--QMSAMVASPDLNQDTNWYPDSGATNHLTHNFNNLSVGSEYGGSNQVHVGNGA
        A +C        S +    + S  N+    + F   Q  A +A        NW  DSGAT+H+T +FNNLS+   Y G + V V +G+
Subjt:  AAKCFFRYAPQSSSMSPGAYSSGFNNFNRSTSFL--QMSAMVASPDLNQDTNWYPDSGATNHLTHNFNNLSVGSEYGGSNQVHVGNGA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-2026.87Show/hide
Query:  NKISIVKLTDDNFLLWKFQILIALEGYDLESHL--LEDPPPKTLGSPAGSSSSSDATPPPTPNPTYLQWKRQDKVISSWLVGSMTEDLLHQMIHCTTTKE
        N  ++ KLT  N+L+W  Q+    +GY+L   L      PP T+G+ A           P  NP Y +W+RQDK+I S ++G+++  +   +   TT  +
Subjt:  NKISIVKLTDDNFLLWKFQILIALEGYDLESHL--LEDPPPKTLGSPAGSSSSSDATPPPTPNPTYLQWKRQDKVISSWLVGSMTEDLLHQMIHCTTTKE

Query:  IWTALQQIFTTRNLAQMMKIK--TKLQTIQKGGR----GSHIVFILSGLGSDYESMVSVITAKIGPQTVQEVMSLLLTQENRI----ESKLVPIESSVPS
        IW  L++I+   +   + +++  T+   +   G+       +  +L  L  DY+ ++  I AK  P ++ E+   L+ +E+++     +++VPI ++V +
Subjt:  IWTALQQIFTTRNLAQMMKIK--TKLQTIQKGGR----GSHIVFILSGLGSDYESMVSVITAKIGPQTVQEVMSLLLTQENRI----ESKLVPIESSVPS

Query:  VNLMVQSKPPESDSSKTNQSQFSSNIGSG-NRGGRGGRGGRDYSNRGGGRSWNNRNKV---QCQLCTKFGHTAAKCFFRYAPQSSSMSPGAYSSGFNNFN
                        TN ++  +N G   N      R      +  G RS N + K    +CQ+C+  GH+A +C   +  QS++           N  
Subjt:  VNLMVQSKPPESDSSKTNQSQFSSNIGSG-NRGGRGGRGGRDYSNRGGGRSWNNRNKV---QCQLCTKFGHTAAKCFFRYAPQSSSMSPGAYSSGFNNFN

Query:  RSTSFL---QMSAMVASPDLNQDTNWYPDSGATNHLTHNFNNLSVGSEYGGSNQVHVGNGA
        +STS     Q  A +A        NW  DSGAT+H+T +FNNLS    Y G + V + +G+
Subjt:  RSTSFL---QMSAMVASPDLNQDTNWYPDSGATNHLTHNFNNLSVGSEYGGSNQVHVGNGA

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).3.9e-0528Show/hide
Query:  PPPTP-NPTYLQWKRQDKVISSWLVGSMTEDLLHQMIHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKGG
        P P P +P Y  W++ + ++  WL+ SMT+ LL  +++  T  ++W  L+++F      ++ +++ +L T+++GG
Subjt:  PPPTP-NPTYLQWKRQDKVISSWLVGSMTEDLLHQMIHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKGG

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.0e-0624.6Show/hide
Query:  YDLESHLLEDPPPKTLGSPAGSSSSSDATPPPTPNPTYLQWKRQDKVISSWLVGSMTEDLLHQMIHC-TTTKEIWTALQQIFTTRNLAQMMKIKTKLQTI
        YD+   L E     TL    G     D +  PTP  T  +WK +D ++  W+ G++T+ LL  +I    T +++W +L+ +F     A+ ++ + +L+T 
Subjt:  YDLESHLLEDPPPKTLGSPAGSSSSSDATPPPTPNPTYLQWKRQDKVISSWLVGSMTEDLLHQMIHC-TTTKEIWTALQQIFTTRNLAQMMKIKTKLQTI

Query:  QKGGRGSH-------------------------IVFILSGLGSDYESMVSVITAKIGPQTVQEVMSLLLTQENRI--ESKLVPIESSVPSVNLMVQSKPP
               H                         ++ +L+GL   Y+ +++VI  K    +  E  S+LL +E+R+  +SK     ++ PS++ ++ + P 
Subjt:  QKGGRGSH-------------------------IVFILSGLGSDYESMVSVITAKIGPQTVQEVMSLLLTQENRI--ESKLVPIESSVPSVNLMVQSKPP

Query:  ESDSSKTNQSQFSSNIGSGNRGGRGGRGGRDYSNRGGGRS---WNNRN
        + +         +SN+         GRG     NRGGG S   +NN N
Subjt:  ESDSSKTNQSQFSSNIGSGNRGGRGGRGGRDYSNRGGGRS---WNNRN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGACTCCTGAAAGCTCTCTAAGCGGTGCCCTAGGCGATTCTTCCGCTGGTTCTTCGCAAGCGTTTTGTCCGGGTAACAAAATCTCTATCGTCAAACTAACTGATGA
CAACTTTCTTCTCTGGAAATTTCAGATACTCATTGCCCTTGAGGGATATGACCTTGAATCGCATCTTTTGGAGGATCCGCCCCCGAAAACTCTCGGTTCTCCAGCTGGTT
CCTCCTCAAGCAGTGACGCGACGCCACCACCCACGCCCAATCCGACCTATCTTCAATGGAAGCGCCAAGATAAGGTGATTTCGTCTTGGCTTGTTGGGTCCATGACTGAG
GACCTTCTTCATCAAATGATTCATTGCACCACGACCAAAGAAATATGGACTGCTCTTCAACAGATCTTTACCACACGTAACCTAGCTCAGATGATGAAGATCAAAACCAA
GCTCCAAACGATTCAAAAAGGAGGTAGAGGATCACATATTGTGTTTATTCTCTCTGGTCTTGGGTCTGATTATGAGTCGATGGTGTCTGTTATCACGGCCAAGATTGGTC
CACAAACTGTTCAGGAAGTCATGTCTCTTCTTCTCACTCAGGAAAATCGTATAGAGAGCAAGCTTGTTCCCATTGAGAGTTCGGTCCCATCTGTGAACCTCATGGTTCAA
TCAAAACCCCCTGAATCAGATTCATCTAAAACTAATCAATCTCAATTTTCTTCAAATATTGGTAGTGGTAACAGGGGTGGACGTGGTGGTCGTGGTGGTCGTGATTACTC
TAATCGTGGTGGAGGACGTTCCTGGAACAATCGAAATAAAGTTCAATGCCAACTTTGCACCAAGTTTGGCCATACTGCTGCCAAATGCTTCTTTCGATATGCCCCTCAGT
CATCTTCAATGTCACCAGGTGCGTATTCTTCAGGTTTTAATAATTTTAACCGGTCTACATCTTTTCTTCAGATGTCAGCAATGGTAGCCTCACCTGACTTGAATCAAGAT
ACCAACTGGTATCCAGACTCGGGCGCCACCAATCATCTCACACATAATTTTAACAACCTCTCTGTTGGGTCAGAATATGGTGGCTCAAACCAAGTTCATGTTGGAAATGG
AGCAGGTCACTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGACTCCTGAAAGCTCTCTAAGCGGTGCCCTAGGCGATTCTTCCGCTGGTTCTTCGCAAGCGTTTTGTCCGGGTAACAAAATCTCTATCGTCAAACTAACTGATGA
CAACTTTCTTCTCTGGAAATTTCAGATACTCATTGCCCTTGAGGGATATGACCTTGAATCGCATCTTTTGGAGGATCCGCCCCCGAAAACTCTCGGTTCTCCAGCTGGTT
CCTCCTCAAGCAGTGACGCGACGCCACCACCCACGCCCAATCCGACCTATCTTCAATGGAAGCGCCAAGATAAGGTGATTTCGTCTTGGCTTGTTGGGTCCATGACTGAG
GACCTTCTTCATCAAATGATTCATTGCACCACGACCAAAGAAATATGGACTGCTCTTCAACAGATCTTTACCACACGTAACCTAGCTCAGATGATGAAGATCAAAACCAA
GCTCCAAACGATTCAAAAAGGAGGTAGAGGATCACATATTGTGTTTATTCTCTCTGGTCTTGGGTCTGATTATGAGTCGATGGTGTCTGTTATCACGGCCAAGATTGGTC
CACAAACTGTTCAGGAAGTCATGTCTCTTCTTCTCACTCAGGAAAATCGTATAGAGAGCAAGCTTGTTCCCATTGAGAGTTCGGTCCCATCTGTGAACCTCATGGTTCAA
TCAAAACCCCCTGAATCAGATTCATCTAAAACTAATCAATCTCAATTTTCTTCAAATATTGGTAGTGGTAACAGGGGTGGACGTGGTGGTCGTGGTGGTCGTGATTACTC
TAATCGTGGTGGAGGACGTTCCTGGAACAATCGAAATAAAGTTCAATGCCAACTTTGCACCAAGTTTGGCCATACTGCTGCCAAATGCTTCTTTCGATATGCCCCTCAGT
CATCTTCAATGTCACCAGGTGCGTATTCTTCAGGTTTTAATAATTTTAACCGGTCTACATCTTTTCTTCAGATGTCAGCAATGGTAGCCTCACCTGACTTGAATCAAGAT
ACCAACTGGTATCCAGACTCGGGCGCCACCAATCATCTCACACATAATTTTAACAACCTCTCTGTTGGGTCAGAATATGGTGGCTCAAACCAAGTTCATGTTGGAAATGG
AGCAGGTCACTAA
Protein sequenceShow/hide protein sequence
MSTPESSLSGALGDSSAGSSQAFCPGNKISIVKLTDDNFLLWKFQILIALEGYDLESHLLEDPPPKTLGSPAGSSSSSDATPPPTPNPTYLQWKRQDKVISSWLVGSMTE
DLLHQMIHCTTTKEIWTALQQIFTTRNLAQMMKIKTKLQTIQKGGRGSHIVFILSGLGSDYESMVSVITAKIGPQTVQEVMSLLLTQENRIESKLVPIESSVPSVNLMVQ
SKPPESDSSKTNQSQFSSNIGSGNRGGRGGRGGRDYSNRGGGRSWNNRNKVQCQLCTKFGHTAAKCFFRYAPQSSSMSPGAYSSGFNNFNRSTSFLQMSAMVASPDLNQD
TNWYPDSGATNHLTHNFNNLSVGSEYGGSNQVHVGNGAGH