; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0019279 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0019279
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr5:40580996..40583162
RNA-Seq ExpressionLag0019279
SyntenyLag0019279
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.0e-9439.52Show/hide
Query:  SSSQSSQVVNPGNKISTVKLSDENFLLWKFQILTALEGHDLDQHINDDCEPPPEKI--QVSGNGAMVSKPNPAFKLWKKQDKLVSSWIVGSMSESILEQI
        +SS  +Q+   GNKIS VKL+D+ FLLWKFQILTALE +DL+  +  + EPP + +    S + +    PNPA+K+WK+QD+L+SSW++GSMSE IL Q+
Subjt:  SSSQSSQVVNPGNKISTVKLSDENFLLWKFQILTALEGHDLDQHINDDCEPPPEKI--QVSGNGAMVSKPNPAFKLWKKQDKLVSSWIVGSMSESILEQI

Query:  LHCKSAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGAEYETMVSVITAKTGTQSVHDVV
        LHCKSAKEIW  L  IF+SR+LAQ M+ K+KL NI+KG   + EY  KI +CVDAL++I K V   DHI+YIL+GLG++Y++M+SVI+A+T + SV +V+
Subjt:  LHCKSAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGAEYETMVSVITAKTGTQSVHDVV

Query:  ALLLTHESRIESKSAVNPDNVLPSANLVVQNPIHNTVQ--NTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRPQCQICNKFGHTAIKCYSRVPMPG
        +LLLT ES+ ESK  +  +  LPS N+V Q           T+QN    N   N+  GR N   NRG R   NRN+PQCQIC K G++A +C+ R     
Subjt:  ALLLTHESRIESKSAVNPDNVLPSANLVVQNPIHNTVQ--NTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRPQCQICNKFGHTAIKCYSRVPMPG

Query:  AYATQFSPSGSAFSSGQTLGQQQVGGPFPQMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMSVSSDYPGNNQVLIGNGAGLPISNLGYASFTS-----
         Y  + + SG + +S  T          PQM AM+A  + N D NWYPDSGATNHLT+SLSN+S+ S+Y G NQ+   NG+GLPI++ G  SF S     
Subjt:  AYATQFSPSGSAFSSGQTLGQQQVGGPFPQMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMSVSSDYPGNNQVLIGNGAGLPISNLGYASFTS-----

Query:  -----------------------------------------------------------------PS---------------------------------
                                                                         PS                                 
Subjt:  -----------------------------------------------------------------PS---------------------------------

Query:  -----------------------------------KHHAMPFSRSTTTYYAPLQLIVTDLWGPSYKLSTHGFRYYISFVDAF
                                           KHHA+PFS S T Y  PLQLI  DLWGP+  +S +GFRYYISFVDA+
Subjt:  -----------------------------------KHHAMPFSRSTTTYYAPLQLIVTDLWGPSYKLSTHGFRYYISFVDAF

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.0e-9439.52Show/hide
Query:  SSSQSSQVVNPGNKISTVKLSDENFLLWKFQILTALEGHDLDQHINDDCEPPPEKI--QVSGNGAMVSKPNPAFKLWKKQDKLVSSWIVGSMSESILEQI
        +SS  +Q+   GNKIS VKL+D+ FLLWKFQILTALE +DL+  +  + EPP + +    S + +    PNPA+K+WK+QD+L+SSW++GSMSE IL Q+
Subjt:  SSSQSSQVVNPGNKISTVKLSDENFLLWKFQILTALEGHDLDQHINDDCEPPPEKI--QVSGNGAMVSKPNPAFKLWKKQDKLVSSWIVGSMSESILEQI

Query:  LHCKSAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGAEYETMVSVITAKTGTQSVHDVV
        LHCKSAKEIW  L  IF+SR+LAQ M+ K+KL NI+KG   + EY  KI +CVDAL++I K V   DHI+YIL+GLG++Y++M+SVI+A+T + SV +V+
Subjt:  LHCKSAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGAEYETMVSVITAKTGTQSVHDVV

Query:  ALLLTHESRIESKSAVNPDNVLPSANLVVQNPIHNTVQ--NTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRPQCQICNKFGHTAIKCYSRVPMPG
        +LLLT ES+ ESK  +  +  LPS N+V Q           T+QN    N   N+  GR N   NRG R   NRN+PQCQIC K G++A +C+ R     
Subjt:  ALLLTHESRIESKSAVNPDNVLPSANLVVQNPIHNTVQ--NTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRPQCQICNKFGHTAIKCYSRVPMPG

Query:  AYATQFSPSGSAFSSGQTLGQQQVGGPFPQMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMSVSSDYPGNNQVLIGNGAGLPISNLGYASFTS-----
         Y  + + SG + +S  T          PQM AM+A  + N D NWYPDSGATNHLT+SLSN+S+ S+Y G NQ+   NG+GLPI++ G  SF S     
Subjt:  AYATQFSPSGSAFSSGQTLGQQQVGGPFPQMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMSVSSDYPGNNQVLIGNGAGLPISNLGYASFTS-----

Query:  -----------------------------------------------------------------PS---------------------------------
                                                                         PS                                 
Subjt:  -----------------------------------------------------------------PS---------------------------------

Query:  -----------------------------------KHHAMPFSRSTTTYYAPLQLIVTDLWGPSYKLSTHGFRYYISFVDAF
                                           KHHA+PFS S T Y  PLQLI  DLWGP+  +S +GFRYYISFVDA+
Subjt:  -----------------------------------KHHAMPFSRSTTTYYAPLQLIVTDLWGPSYKLSTHGFRYYISFVDAF

XP_022136882.1 dr1-associated corepressor homolog isoform X1 [Momordica charantia]1.3e-6145.6Show/hide
Query:  KQDKLVSSWIVGSMSESILEQILHCKSAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGA
        KQDKL++SW+  SM E IL +++HC +A+E+W  L  ++ SR+LA++M++KSKL+NI+KG   + +Y  K+K  VD+L+A GK+V V+DHIM+IL+GL +
Subjt:  KQDKLVSSWIVGSMSESILEQILHCKSAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGA

Query:  EYETMVSVITAKTGTQSVHDVVALLLTHESRIESKSAVNPDNVLPSANLVVQNPIHNTVQNTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRPQCQ
        E+E+ VSVI+A+T TQ++ +V +LLL+HE R E ++++N D  LPS NL  Q    N+ Q  S + Q+    NNR +   N G     R WN+ NRPQCQ
Subjt:  EYETMVSVITAKTGTQSVHDVVALLLTHESRIESKSAVNPDNVLPSANLVVQNPIHNTVQNTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRPQCQ

Query:  ICNKFGHTAIKCYSRVPMP-----GAYATQFSPSG---SAFSSGQT-LGQQQ---VGGPFP----QMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMS
        I  KFGHTA++CY R         G    Q   SG   S+ +S  T  G QQ     G  P     M A +A  ++N+D NWYPDSGATNH+T++ +N++
Subjt:  ICNKFGHTAIKCYSRVPMP-----GAYATQFSPSG---SAFSSGQT-LGQQQ---VGGPFP----QMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMS

Query:  VSSDYPGNNQVLIGNGAG
         S++Y G+NQV IGNG G
Subjt:  VSSDYPGNNQVLIGNGAG

XP_022136883.1 dr1-associated corepressor homolog isoform X2 [Momordica charantia]4.9e-6145.57Show/hide
Query:  KQDKLVSSWIVGSMSESILEQILHCKSAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGA
        KQDKL++SW+  SM E IL +++HC +A+E+W  L  ++ SR+LA++M++KSKL+NI+KG   + +Y  K+K  VD+L+A GK+V V+DHIM+IL+GL +
Subjt:  KQDKLVSSWIVGSMSESILEQILHCKSAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGA

Query:  EYETMVSVITAKTGTQSVHDVVALLLTHESRIESKSAVNPDNVLPSANLVVQNPIHNTVQNTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRPQCQ
        E+E+ VSVI+A+T TQ++ +V +LLL+HE R E ++++N D  LPS NL  Q    N+ Q  S + Q+    NNR +   N G     R WN+ NRPQCQ
Subjt:  EYETMVSVITAKTGTQSVHDVVALLLTHESRIESKSAVNPDNVLPSANLVVQNPIHNTVQNTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRPQCQ

Query:  ICNKFGHTAIKCYSRVPMP-----GAYATQFSPSG---SAFSSGQT-LGQQQ---VGGPFP----QMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMS
        I  KFGHTA++CY R         G    Q   SG   S+ +S  T  G QQ     G  P     M A +A  ++N+D NWYPDSGATNH+T++ +N++
Subjt:  ICNKFGHTAIKCYSRVPMP-----GAYATQFSPSG---SAFSSGQT-LGQQQ---VGGPFP----QMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMS

Query:  VSSDYPGNNQVLIGNG
         S++Y G+NQV IGNG
Subjt:  VSSDYPGNNQVLIGNG

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]4.0e-7943.03Show/hide
Query:  SENNNSQVLSSSQSSQVVNPGNKISTVKLSDENFLLWKFQILTALEGHDLDQHINDDCEPPPEKIQVSGNGAMVS--KPNPAFKLWKKQDKLVSSWIVGS
        S   NS      Q+S+ +NPG+K+S V+L+D+N LLWKFQI TAL+G+ L+ +I+ + + P + +Q + + +  S  + NPA+  W KQDKL+S+W++GS
Subjt:  SENNNSQVLSSSQSSQVVNPGNKISTVKLSDENFLLWKFQILTALEGHDLDQHINDDCEPPPEKIQVSGNGAMVS--KPNPAFKLWKKQDKLVSSWIVGS

Query:  MSESILEQILHCKSAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGAEYETMVSVITAKT
        M+E IL Q+L CKSA+EIW+ L  +F SR LA++M++K KL+N +KG  S+ +Y  KIK  VD+L+  GK++  +DHIM+IL+GLG E++ ++SVITA+ 
Subjt:  MSESILEQILHCKSAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGAEYETMVSVITAKT

Query:  GTQSVHDVVALLLTHESRIESKSAVNPDNVLPSANLVVQNPI--HNTVQNTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRPQCQICNKFGHTAIK
          Q++ +V +LLL  E R E ++ +N D  LPS NL + +    +N  Q+   N  Q N+ + RGRG +N   NR  R W   N+PQCQIC +FGHTA++
Subjt:  GTQSVHDVVALLLTHESRIESKSAVNPDNVLPSANLVVQNPI--HNTVQNTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRPQCQICNKFGHTAIK

Query:  CYSRVP-------------MPGAYATQF---SPSGSAFSSGQTL-GQQQVGGPFP-QMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMSVSSDYPGNN
        CY R                P  +++ F   +PS +AFSS  T  G   +    P QMQA+M   ++N+D NWY DSG TNH+TN   N S+ S+Y G+ 
Subjt:  CYSRVP-------------MPGAYATQF---SPSGSAFSSGQTL-GQQQVGGPFP-QMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMSVSSDYPGNN

Query:  QVLIGNGAG
        ++ +GNG G
Subjt:  QVLIGNGAG

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-949.5e-9539.52Show/hide
Query:  SSSQSSQVVNPGNKISTVKLSDENFLLWKFQILTALEGHDLDQHINDDCEPPPEKI--QVSGNGAMVSKPNPAFKLWKKQDKLVSSWIVGSMSESILEQI
        +SS  +Q+   GNKIS VKL+D+ FLLWKFQILTALE +DL+  +  + EPP + +    S + +    PNPA+K+WK+QD+L+SSW++GSMSE IL Q+
Subjt:  SSSQSSQVVNPGNKISTVKLSDENFLLWKFQILTALEGHDLDQHINDDCEPPPEKI--QVSGNGAMVSKPNPAFKLWKKQDKLVSSWIVGSMSESILEQI

Query:  LHCKSAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGAEYETMVSVITAKTGTQSVHDVV
        LHCKSAKEIW  L  IF+SR+LAQ M+ K+KL NI+KG   + EY  KI +CVDAL++I K V   DHI+YIL+GLG++Y++M+SVI+A+T + SV +V+
Subjt:  LHCKSAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGAEYETMVSVITAKTGTQSVHDVV

Query:  ALLLTHESRIESKSAVNPDNVLPSANLVVQNPIHNTVQ--NTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRPQCQICNKFGHTAIKCYSRVPMPG
        +LLLT ES+ ESK  +  +  LPS N+V Q           T+QN    N   N+  GR N   NRG R   NRN+PQCQIC K G++A +C+ R     
Subjt:  ALLLTHESRIESKSAVNPDNVLPSANLVVQNPIHNTVQ--NTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRPQCQICNKFGHTAIKCYSRVPMPG

Query:  AYATQFSPSGSAFSSGQTLGQQQVGGPFPQMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMSVSSDYPGNNQVLIGNGAGLPISNLGYASFTS-----
         Y  + + SG + +S  T          PQM AM+A  + N D NWYPDSGATNHLT+SLSN+S+ S+Y G NQ+   NG+GLPI++ G  SF S     
Subjt:  AYATQFSPSGSAFSSGQTLGQQQVGGPFPQMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMSVSSDYPGNNQVLIGNGAGLPISNLGYASFTS-----

Query:  -----------------------------------------------------------------PS---------------------------------
                                                                         PS                                 
Subjt:  -----------------------------------------------------------------PS---------------------------------

Query:  -----------------------------------KHHAMPFSRSTTTYYAPLQLIVTDLWGPSYKLSTHGFRYYISFVDAF
                                           KHHA+PFS S T Y  PLQLI  DLWGP+  +S +GFRYYISFVDA+
Subjt:  -----------------------------------KHHAMPFSRSTTTYYAPLQLIVTDLWGPSYKLSTHGFRYYISFVDAF

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-949.5e-9539.52Show/hide
Query:  SSSQSSQVVNPGNKISTVKLSDENFLLWKFQILTALEGHDLDQHINDDCEPPPEKI--QVSGNGAMVSKPNPAFKLWKKQDKLVSSWIVGSMSESILEQI
        +SS  +Q+   GNKIS VKL+D+ FLLWKFQILTALE +DL+  +  + EPP + +    S + +    PNPA+K+WK+QD+L+SSW++GSMSE IL Q+
Subjt:  SSSQSSQVVNPGNKISTVKLSDENFLLWKFQILTALEGHDLDQHINDDCEPPPEKI--QVSGNGAMVSKPNPAFKLWKKQDKLVSSWIVGSMSESILEQI

Query:  LHCKSAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGAEYETMVSVITAKTGTQSVHDVV
        LHCKSAKEIW  L  IF+SR+LAQ M+ K+KL NI+KG   + EY  KI +CVDAL++I K V   DHI+YIL+GLG++Y++M+SVI+A+T + SV +V+
Subjt:  LHCKSAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGAEYETMVSVITAKTGTQSVHDVV

Query:  ALLLTHESRIESKSAVNPDNVLPSANLVVQNPIHNTVQ--NTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRPQCQICNKFGHTAIKCYSRVPMPG
        +LLLT ES+ ESK  +  +  LPS N+V Q           T+QN    N   N+  GR N   NRG R   NRN+PQCQIC K G++A +C+ R     
Subjt:  ALLLTHESRIESKSAVNPDNVLPSANLVVQNPIHNTVQ--NTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRPQCQICNKFGHTAIKCYSRVPMPG

Query:  AYATQFSPSGSAFSSGQTLGQQQVGGPFPQMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMSVSSDYPGNNQVLIGNGAGLPISNLGYASFTS-----
         Y  + + SG + +S  T          PQM AM+A  + N D NWYPDSGATNHLT+SLSN+S+ S+Y G NQ+   NG+GLPI++ G  SF S     
Subjt:  AYATQFSPSGSAFSSGQTLGQQQVGGPFPQMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMSVSSDYPGNNQVLIGNGAGLPISNLGYASFTS-----

Query:  -----------------------------------------------------------------PS---------------------------------
                                                                         PS                                 
Subjt:  -----------------------------------------------------------------PS---------------------------------

Query:  -----------------------------------KHHAMPFSRSTTTYYAPLQLIVTDLWGPSYKLSTHGFRYYISFVDAF
                                           KHHA+PFS S T Y  PLQLI  DLWGP+  +S +GFRYYISFVDA+
Subjt:  -----------------------------------KHHAMPFSRSTTTYYAPLQLIVTDLWGPSYKLSTHGFRYYISFVDAF

A0A6J1C6N9 dr1-associated corepressor homolog isoform X16.2e-6245.6Show/hide
Query:  KQDKLVSSWIVGSMSESILEQILHCKSAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGA
        KQDKL++SW+  SM E IL +++HC +A+E+W  L  ++ SR+LA++M++KSKL+NI+KG   + +Y  K+K  VD+L+A GK+V V+DHIM+IL+GL +
Subjt:  KQDKLVSSWIVGSMSESILEQILHCKSAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGA

Query:  EYETMVSVITAKTGTQSVHDVVALLLTHESRIESKSAVNPDNVLPSANLVVQNPIHNTVQNTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRPQCQ
        E+E+ VSVI+A+T TQ++ +V +LLL+HE R E ++++N D  LPS NL  Q    N+ Q  S + Q+    NNR +   N G     R WN+ NRPQCQ
Subjt:  EYETMVSVITAKTGTQSVHDVVALLLTHESRIESKSAVNPDNVLPSANLVVQNPIHNTVQNTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRPQCQ

Query:  ICNKFGHTAIKCYSRVPMP-----GAYATQFSPSG---SAFSSGQT-LGQQQ---VGGPFP----QMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMS
        I  KFGHTA++CY R         G    Q   SG   S+ +S  T  G QQ     G  P     M A +A  ++N+D NWYPDSGATNH+T++ +N++
Subjt:  ICNKFGHTAIKCYSRVPMP-----GAYATQFSPSG---SAFSSGQT-LGQQQ---VGGPFP----QMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMS

Query:  VSSDYPGNNQVLIGNGAG
         S++Y G+NQV IGNG G
Subjt:  VSSDYPGNNQVLIGNGAG

A0A6J1C8R2 dr1-associated corepressor homolog isoform X22.4e-6145.57Show/hide
Query:  KQDKLVSSWIVGSMSESILEQILHCKSAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGA
        KQDKL++SW+  SM E IL +++HC +A+E+W  L  ++ SR+LA++M++KSKL+NI+KG   + +Y  K+K  VD+L+A GK+V V+DHIM+IL+GL +
Subjt:  KQDKLVSSWIVGSMSESILEQILHCKSAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGA

Query:  EYETMVSVITAKTGTQSVHDVVALLLTHESRIESKSAVNPDNVLPSANLVVQNPIHNTVQNTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRPQCQ
        E+E+ VSVI+A+T TQ++ +V +LLL+HE R E ++++N D  LPS NL  Q    N+ Q  S + Q+    NNR +   N G     R WN+ NRPQCQ
Subjt:  EYETMVSVITAKTGTQSVHDVVALLLTHESRIESKSAVNPDNVLPSANLVVQNPIHNTVQNTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRPQCQ

Query:  ICNKFGHTAIKCYSRVPMP-----GAYATQFSPSG---SAFSSGQT-LGQQQ---VGGPFP----QMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMS
        I  KFGHTA++CY R         G    Q   SG   S+ +S  T  G QQ     G  P     M A +A  ++N+D NWYPDSGATNH+T++ +N++
Subjt:  ICNKFGHTAIKCYSRVPMP-----GAYATQFSPSG---SAFSSGQT-LGQQQ---VGGPFP----QMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMS

Query:  VSSDYPGNNQVLIGNG
         S++Y G+NQV IGNG
Subjt:  VSSDYPGNNQVLIGNG

A0A6J1DLT9 uncharacterized protein LOC1110217571.9e-7943.03Show/hide
Query:  SENNNSQVLSSSQSSQVVNPGNKISTVKLSDENFLLWKFQILTALEGHDLDQHINDDCEPPPEKIQVSGNGAMVS--KPNPAFKLWKKQDKLVSSWIVGS
        S   NS      Q+S+ +NPG+K+S V+L+D+N LLWKFQI TAL+G+ L+ +I+ + + P + +Q + + +  S  + NPA+  W KQDKL+S+W++GS
Subjt:  SENNNSQVLSSSQSSQVVNPGNKISTVKLSDENFLLWKFQILTALEGHDLDQHINDDCEPPPEKIQVSGNGAMVS--KPNPAFKLWKKQDKLVSSWIVGS

Query:  MSESILEQILHCKSAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGAEYETMVSVITAKT
        M+E IL Q+L CKSA+EIW+ L  +F SR LA++M++K KL+N +KG  S+ +Y  KIK  VD+L+  GK++  +DHIM+IL+GLG E++ ++SVITA+ 
Subjt:  MSESILEQILHCKSAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGAEYETMVSVITAKT

Query:  GTQSVHDVVALLLTHESRIESKSAVNPDNVLPSANLVVQNPI--HNTVQNTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRPQCQICNKFGHTAIK
          Q++ +V +LLL  E R E ++ +N D  LPS NL + +    +N  Q+   N  Q N+ + RGRG +N   NR  R W   N+PQCQIC +FGHTA++
Subjt:  GTQSVHDVVALLLTHESRIESKSAVNPDNVLPSANLVVQNPI--HNTVQNTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRPQCQICNKFGHTAIK

Query:  CYSRVP-------------MPGAYATQF---SPSGSAFSSGQTL-GQQQVGGPFP-QMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMSVSSDYPGNN
        CY R                P  +++ F   +PS +AFSS  T  G   +    P QMQA+M   ++N+D NWY DSG TNH+TN   N S+ S+Y G+ 
Subjt:  CYSRVP-------------MPGAYATQF---SPSGSAFSSGQTL-GQQQVGGPFP-QMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMSVSSDYPGNN

Query:  QVLIGNGAG
        ++ +GNG G
Subjt:  QVLIGNGAG

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.8e-3326.99Show/hide
Query:  NKISTVKLSDENFLLWKFQILTALEGHDLDQHINDDCEPPPEKIQVSGNGAMVSKPNPAFKLWKKQDKLVSSWIVGSMSESILEQILHCKSAKEIWSCLL
        N  +  KL+  N+L+W  Q+    +G++L   ++     PP  I   G  A   + NP +  WK+QDKL+ S ++G++S S+   +    +A +IW  L 
Subjt:  NKISTVKLSDENFLLWKFQILTALEGHDLDQHINDDCEPPPEKIQVSGNGAMVSKPNPAFKLWKKQDKLVSSWIVGSMSESILEQILHCKSAKEIWSCLL

Query:  QIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGAEYETMVSVITAKTGTQSVHDVVALLLTHESRIESKS
        +I+ +     + +++++L+   KG  ++ +Y+  +    D L+ +GK +D  + +  +L  L  EY+ ++  I AK    ++ ++   LL HES+I    
Subjt:  QIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGAEYETMVSVITAKTGTQSVHDVVALLLTHESRIESKS

Query:  AVNPDNVLP-SANLVVQNPIHNTVQNTSQNVQQQNFGNNRGRGRSNFGQNRGGRTW---NNRNRP---QCQICNKFGHTAIKCYSRVPMPGAYATQFSPS
        AV+   V+P +AN V     + T  N + N  + N  +NR    ++    +    +   NN+++P   +CQIC   GH+A +C        +  +Q  P 
Subjt:  AVNPDNVLP-SANLVVQNPIHNTVQNTSQNVQQQNFGNNRGRGRSNFGQNRGGRTW---NNRNRP---QCQICNKFGHTAIKCYSRVPMPGAYATQFSPS

Query:  GSAFSSGQTLGQQQVGGPFPQMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMSVSSDYPGNNQVLIGNGAGLPISNLGYASFTSPSK
         S F+  Q      +G P+                NW  DSGAT+H+T+  +N+S+   Y G + V++ +G+ +PIS+ G  S ++ S+
Subjt:  GSAFSSGQTLGQQQVGGPFPQMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMSVSSDYPGNNQVLIGNGAGLPISNLGYASFTSPSK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.3e-3127.91Show/hide
Query:  NKISTVKLSDENFLLWKFQILTALEGHDLDQHINDDCEPPPEKIQVSGNGAMVSKPNPAFKLWKKQDKLVSSWIVGSMSESILEQILHCKSAKEIWSCLL
        N  +  KL+  N+L+W  Q+    +G++L   ++     PP  I   G  A V + NP +  W++QDKL+ S I+G++S S+   +    +A +IW  L 
Subjt:  NKISTVKLSDENFLLWKFQILTALEGHDLDQHINDDCEPPPEKIQVSGNGAMVSKPNPAFKLWKKQDKLVSSWIVGSMSESILEQILHCKSAKEIWSCLL

Query:  QIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGAEYETMVSVITAKTGTQSVHDVVALLLTHESRIESKS
        +I+ +     + +++               +I++     D L+ +GK +D  + +  +L  L  +Y+ ++  I AK    S+ ++   L+  ES++    
Subjt:  QIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGAEYETMVSVITAKTGTQSVHDVVALLLTHESRIESKS

Query:  AVNPDNVLPSANLVVQNPIHNTVQNTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRP---QCQICNKFGHTAIKCYSRVPMPGAYATQFSPSGSAF
        A+N   V+P    VV +   NT +N +     +N+ NN  R  S    + G R+ N + +P   +CQIC+  GH+A +C               P    F
Subjt:  AVNPDNVLPSANLVVQNPIHNTVQNTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRP---QCQICNKFGHTAIKCYSRVPMPGAYATQFSPSGSAF

Query:  SSGQTLGQQQVGGPFP--QMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMSVSSDYPGNNQVLIGNGAGLPISNLGYASFTSPSK
         S  T  QQQ   PF   Q +A +A  +     NW  DSGAT+H+T+  +N+S    Y G + V+I +G+ +PI++ G AS  + S+
Subjt:  SSGQTLGQQQVGGPFP--QMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMSVSSDYPGNNQVLIGNGAGLPISNLGYASFTSPSK

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.1e-0722.22Show/hide
Query:  ISTVKLSDENFLLWKFQILTALEGHDLDQHINDDCEPPPEKIQVSGNGAMVSKPNPAFKLWKKQDKLVSSWIVGSMSESILEQILHCKSAKEIWSCLLQI
        I  +   ++N++ WK +  + L        I D   P P+              +P ++ W++ + +V  W++ SM++ +LE +++ ++A ++W  L ++
Subjt:  ISTVKLSDENFLLWKFQILTALEGHDLDQHINDDCEPPPEKIQVSGNGAMVSKPNPAFKLWKKQDKLVSSWIVGSMSESILEQILHCKSAKEIWSCLLQI

Query:  FNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKK
        F      +I +++ +L  +++GG S+ EY  K+ K
Subjt:  FNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKK

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.6e-0924.44Show/hide
Query:  LSDENFLLWKFQILTALEGHDLDQHINDDCEPPPEKIQVSGNGAMVSKPNPAFKLWKKQDKLVSSWIVGSMSESILE-QILHCKSAKEIWSCLLQIFNSR
        + + N+  W+   LT     D+  HI+    P                 N     W+K+D +V   + G+++    +   +   ++++IW  +   F + 
Subjt:  LSDENFLLWKFQILTALEGHDLDQHINDDCEPPPEKIQVSGNGAMVSKPNPAFKLWKKQDKLVSSWIVGSMSESILE-QILHCKSAKEIWSCLLQIFNSR

Query:  HLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGAEYETMVSVITAKTGTQSVHDVVALLLTHESRIESKSAVNPDN
          A+ +++ S+L+    G   +++Y  K+KK  D+L  +   V  ++ +MY+L+GL  +++ +++VI  +    S  D   +L   E R++     NP +
Subjt:  HLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGAEYETMVSVITAKTGTQSVHDVVALLLTHESRIESKSAVNPDN

Query:  VLPSANLVVQNPIHNTVQNTSQNVQQQNF----GNN---RGRGR-SNFGQNRGGR-------TWNNRNRP
        V  S++        +TV   S+     NF    GN    RGRGR +N  + RGGR       T+N+ NRP
Subjt:  VLPSANLVVQNPIHNTVQNTSQNVQQQNF----GNN---RGRGR-SNFGQNRGGR-------TWNNRNRP

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.4e-1329.44Show/hide
Query:  SKPNP-AFKLWKKQDKLVSSWIVGSMSESILEQILHCK-SAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDV
        S P P   K WK++D LV  WI G++++S+L+ I+    +A+++W  L  +F     A+ ++ +++L+       S+ EY  K+K   D L+ +   +  
Subjt:  SKPNP-AFKLWKKQDKLVSSWIVGSMSESILEQILHCK-SAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDV

Query:  QDHIMYILSGLGAEYETMVSVITAKTGTQSVHDVVALLLTHESRI--ESKSAVNPDNVLPSANLVVQNPIHNTVQNTSQNVQQQNFGNNRGRGRSNFGQN
        +  +M++L+GL  +Y+ +++VI  K+   S  +  ++LL  ESR+  +SKS+++  N    +N++   P     Q      +  N  +N GRGRS   +N
Subjt:  QDHIMYILSGLGAEYETMVSVITAKTGTQSVHDVVALLLTHESRI--ESKSAVNPDNVLPSANLVVQNPIHNTVQNTSQNVQQQNFGNNRGRGRSNFGQN

Query:  RGGRT----WNNRN
        RGG +    +NN N
Subjt:  RGGRT----WNNRN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTTTTCAGCTGCATTCGTCATTAGGGTTAGAGAGAGCCTCGTCTAACGACTCATTTTTGTTTGTTCGCATGGAGACCTCTGAGAATAACTCTGAGAATAACAACTC
ACAAGTTTTGAGCAGCTCGCAAAGTAGCCAAGTGGTCAATCCCGGGAACAAGATCTCGACTGTTAAGCTGTCCGATGAGAATTTTCTCCTTTGGAAGTTCCAAATTCTTA
CCGCACTTGAGGGGCATGATCTCGATCAGCATATCAATGACGATTGTGAACCACCGCCTGAGAAAATTCAGGTAAGTGGAAATGGTGCAATGGTCAGTAAACCTAACCCT
GCCTTTAAACTCTGGAAAAAGCAAGACAAACTCGTATCCTCATGGATTGTTGGGTCTATGTCTGAGTCTATTTTAGAACAGATACTTCACTGTAAATCGGCTAAGGAAAT
CTGGTCTTGCTTGCTTCAAATTTTTAATTCTAGACACTTGGCTCAGATTATGAAAATTAAGTCGAAACTCCAAAATATTCAAAAAGGAGGGTCCTCTATGAGTGAATACA
TTTCTAAAATTAAGAAATGTGTAGATGCCTTATCTGCAATAGGAAAAGAAGTTGATGTTCAAGACCATATTATGTATATTCTCTCTGGTTTAGGGGCTGAGTATGAGACT
ATGGTGTCTGTTATTACTGCTAAAACTGGTACACAGTCTGTTCATGATGTTGTAGCTCTATTATTAACTCATGAGAGTCGGATTGAAAGTAAAAGTGCTGTTAACCCTGA
TAATGTCCTACCCTCGGCTAATTTGGTTGTTCAAAATCCTATACATAACACTGTGCAAAACACCTCTCAGAATGTGCAACAGCAAAATTTTGGTAATAATAGGGGTAGAG
GTCGTTCAAATTTTGGTCAAAATAGAGGTGGAAGAACCTGGAATAATCGAAATCGACCTCAGTGTCAGATATGTAATAAGTTTGGTCATACTGCTATTAAATGTTACTCT
CGTGTTCCAATGCCTGGTGCTTATGCTACTCAGTTCAGTCCCTCTGGTTCTGCTTTTTCCTCTGGTCAAACTCTTGGCCAGCAACAAGTTGGTGGACCATTTCCACAAAT
GCAGGCTATGATGGCTACTCCCAATTATAATCAAGATTGTAACTGGTATCCTGACTCAGGAGCCACCAATCATTTGACCAACAGCCTGAGTAACATGTCTGTGAGTTCTG
ATTATCCTGGAAATAATCAGGTTCTGATTGGCAATGGTGCAGGTTTGCCTATCTCTAATCTTGGTTATGCCTCTTTTACTTCTCCAAGCAAACACCATGCTATGCCCTTT
TCTAGATCTACTACTACTTATTATGCACCTTTACAACTCATTGTAACCGATTTATGGGGTCCTTCTTACAAACTGTCCACTCATGGCTTTAGATATTACATTAGCTTTGT
GGATGCTTTTCTCGATATACATGGATTTATTTCCTTCAAACTAAGTCTGAAGCATTTCAAGCTTTCATTAAATTCAAAACGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTTTTCAGCTGCATTCGTCATTAGGGTTAGAGAGAGCCTCGTCTAACGACTCATTTTTGTTTGTTCGCATGGAGACCTCTGAGAATAACTCTGAGAATAACAACTC
ACAAGTTTTGAGCAGCTCGCAAAGTAGCCAAGTGGTCAATCCCGGGAACAAGATCTCGACTGTTAAGCTGTCCGATGAGAATTTTCTCCTTTGGAAGTTCCAAATTCTTA
CCGCACTTGAGGGGCATGATCTCGATCAGCATATCAATGACGATTGTGAACCACCGCCTGAGAAAATTCAGGTAAGTGGAAATGGTGCAATGGTCAGTAAACCTAACCCT
GCCTTTAAACTCTGGAAAAAGCAAGACAAACTCGTATCCTCATGGATTGTTGGGTCTATGTCTGAGTCTATTTTAGAACAGATACTTCACTGTAAATCGGCTAAGGAAAT
CTGGTCTTGCTTGCTTCAAATTTTTAATTCTAGACACTTGGCTCAGATTATGAAAATTAAGTCGAAACTCCAAAATATTCAAAAAGGAGGGTCCTCTATGAGTGAATACA
TTTCTAAAATTAAGAAATGTGTAGATGCCTTATCTGCAATAGGAAAAGAAGTTGATGTTCAAGACCATATTATGTATATTCTCTCTGGTTTAGGGGCTGAGTATGAGACT
ATGGTGTCTGTTATTACTGCTAAAACTGGTACACAGTCTGTTCATGATGTTGTAGCTCTATTATTAACTCATGAGAGTCGGATTGAAAGTAAAAGTGCTGTTAACCCTGA
TAATGTCCTACCCTCGGCTAATTTGGTTGTTCAAAATCCTATACATAACACTGTGCAAAACACCTCTCAGAATGTGCAACAGCAAAATTTTGGTAATAATAGGGGTAGAG
GTCGTTCAAATTTTGGTCAAAATAGAGGTGGAAGAACCTGGAATAATCGAAATCGACCTCAGTGTCAGATATGTAATAAGTTTGGTCATACTGCTATTAAATGTTACTCT
CGTGTTCCAATGCCTGGTGCTTATGCTACTCAGTTCAGTCCCTCTGGTTCTGCTTTTTCCTCTGGTCAAACTCTTGGCCAGCAACAAGTTGGTGGACCATTTCCACAAAT
GCAGGCTATGATGGCTACTCCCAATTATAATCAAGATTGTAACTGGTATCCTGACTCAGGAGCCACCAATCATTTGACCAACAGCCTGAGTAACATGTCTGTGAGTTCTG
ATTATCCTGGAAATAATCAGGTTCTGATTGGCAATGGTGCAGGTTTGCCTATCTCTAATCTTGGTTATGCCTCTTTTACTTCTCCAAGCAAACACCATGCTATGCCCTTT
TCTAGATCTACTACTACTTATTATGCACCTTTACAACTCATTGTAACCGATTTATGGGGTCCTTCTTACAAACTGTCCACTCATGGCTTTAGATATTACATTAGCTTTGT
GGATGCTTTTCTCGATATACATGGATTTATTTCCTTCAAACTAAGTCTGAAGCATTTCAAGCTTTCATTAAATTCAAAACGCTAG
Protein sequenceShow/hide protein sequence
MPFQLHSSLGLERASSNDSFLFVRMETSENNSENNNSQVLSSSQSSQVVNPGNKISTVKLSDENFLLWKFQILTALEGHDLDQHINDDCEPPPEKIQVSGNGAMVSKPNP
AFKLWKKQDKLVSSWIVGSMSESILEQILHCKSAKEIWSCLLQIFNSRHLAQIMKIKSKLQNIQKGGSSMSEYISKIKKCVDALSAIGKEVDVQDHIMYILSGLGAEYET
MVSVITAKTGTQSVHDVVALLLTHESRIESKSAVNPDNVLPSANLVVQNPIHNTVQNTSQNVQQQNFGNNRGRGRSNFGQNRGGRTWNNRNRPQCQICNKFGHTAIKCYS
RVPMPGAYATQFSPSGSAFSSGQTLGQQQVGGPFPQMQAMMATPNYNQDCNWYPDSGATNHLTNSLSNMSVSSDYPGNNQVLIGNGAGLPISNLGYASFTSPSKHHAMPF
SRSTTTYYAPLQLIVTDLWGPSYKLSTHGFRYYISFVDAFLDIHGFISFKLSLKHFKLSLNSKR