; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G000160 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G000160
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationCmo_Chr04:83740..85284
RNA-Seq ExpressionCmoCh04G000160
SyntenyCmoCh04G000160
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016628 - oxidoreductase activity, acting on the CH-CH group of donors, NAD or NADP as acceptor (molecular function)
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW69506.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]9.0e-13261Show/hide
Query:  MAESAKSS-FKISDVDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEAD
        MA S K+S  +   +D +HP YIHHSDQPG+ LVPIKLNG NYQSWSK+V+HAL AKKKIGF+DGT+EEPSQ+     FE WNQCNSMI+SWLTH+VE+D
Subjt:  MAESAKSS-FKISDVDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEAD

Query:  IAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLL
        IA+GIIHAKTA +VWVDL DQFSQKNAPA+FQIQ SIATMSQG           TM ++ YFTK+KALWDELE YR+P TCNQRQ H++QREED+LMQ L
Subjt:  IAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLL

Query:  MGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDR
        MGLN+SYK VRSNILMMSPLPNVRQAYSL+VQEEMQRQV+SEPTENFSIA+AV  K       + K C+HCN+SGHTI+ECR LKFHC FCD+RGHTEDR
Subjt:  MGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDR

Query:  CRQKNNSGRTR---QDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVA---------------------
        CR KN S       +  +   RG + SAN A  SQ  ++  S +++  F++EQ++++AQA+ A+NH  SGN D++ N A                     
Subjt:  CRQKNNSGRTR---QDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVA---------------------

Query:  -------GLGYGEDDWLG
               GLG+GEDDWLG
Subjt:  -------GLGYGEDDWLG

XP_023511453.1 uncharacterized protein LOC111776274 [Cucurbita pepo subsp. pepo]2.2e-15495Show/hide
Query:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRTPFTCNQRQIH
        MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQG           TMALSTYFTKLKALWDELEAYRTPFTCNQRQIH
Subjt:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRTPFTCNQRQIH

Query:  IDQREEDKLMQLLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFH
        IDQREEDKLMQLLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTE  SIASAVQKKTIYSKFAKDK CEHCNKSGHTINECRILKFH
Subjt:  IDQREEDKLMQLLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFH

Query:  CNFCDRRGHTEDRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG
        CNFCDRRGHTEDRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLR+IAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG
Subjt:  CNFCDRRGHTEDRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG

XP_023520712.1 uncharacterized protein LOC111784113 [Cucurbita pepo subsp. pepo]1.4e-15696Show/hide
Query:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRTPFTCNQRQIH
        MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQG           TMALSTYFTKLKALWDELEAYRTPFTCNQRQIH
Subjt:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRTPFTCNQRQIH

Query:  IDQREEDKLMQLLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFH
        IDQREEDKLMQLLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFH
Subjt:  IDQREEDKLMQLLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFH

Query:  CNFCDRRGHTEDRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG
        CNFCDRRGHTEDRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLR+IAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG
Subjt:  CNFCDRRGHTEDRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG

XP_023524327.1 uncharacterized protein LOC111788256 [Cucurbita pepo subsp. pepo]3.4e-15595.33Show/hide
Query:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRTPFTCNQRQIH
        MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQG           TMALSTYFTKLKALWDELEAYRTPFTCNQRQIH
Subjt:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRTPFTCNQRQIH

Query:  IDQREEDKLMQLLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFH
        IDQREEDKLMQLLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTE  SIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFH
Subjt:  IDQREEDKLMQLLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFH

Query:  CNFCDRRGHTEDRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG
        CNFCDRRGHTEDRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLR+IAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG
Subjt:  CNFCDRRGHTEDRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG

XP_023536128.1 uncharacterized protein LOC111797374 [Cucurbita pepo subsp. pepo]1.4e-15394.33Show/hide
Query:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRTPFTCNQRQIH
        MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQG           TMALSTYFTKLKALWDELEAYRTPFTCNQRQIH
Subjt:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRTPFTCNQRQIH

Query:  IDQREEDKLMQLLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFH
        IDQREEDKLMQLLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTE  SIASAVQKKTIYSKFAKDK CEHCNKSGHTINECRILKFH
Subjt:  IDQREEDKLMQLLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFH

Query:  CNFCDRRGHTEDRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG
        C FCDRRGHTEDRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLR+IAQALSAINHHPSGNSDNH+NVAGLGYGEDDWLG
Subjt:  CNFCDRRGHTEDRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG

TrEMBL top hitse value%identityAlignment
A0A438GBE7 Retrovirus-related Pol polyprotein from transposon RE14.3e-13261Show/hide
Query:  MAESAKSS-FKISDVDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEAD
        MA S K+S  +   +D +HP YIHHSDQPG+ LVPIKLNG NYQSWSK+V+HAL AKKKIGF+DGT+EEPSQ+     FE WNQCNSMI+SWLTH+VE+D
Subjt:  MAESAKSS-FKISDVDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEAD

Query:  IAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLL
        IA+GIIHAKTA +VWVDL DQFSQKNAPA+FQIQ SIATMSQG           TM ++ YFTK+KALWDELE YR+P TCNQRQ H++QREED+LMQ L
Subjt:  IAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLL

Query:  MGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDR
        MGLN+SYK VRSNILMMSPLPNVRQAYSL+VQEEMQRQV+SEPTENFSIA+AV  K       + K C+HCN+SGHTI+ECR LKFHC FCD+RGHTEDR
Subjt:  MGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDR

Query:  CRQKNNSGRTR---QDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVA---------------------
        CR KN S       +  +   RG + SAN A  SQ  ++  S +++  F++EQ++++AQA+ A+NH  SGN D++ N A                     
Subjt:  CRQKNNSGRTR---QDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVA---------------------

Query:  -------GLGYGEDDWLG
               GLG+GEDDWLG
Subjt:  -------GLGYGEDDWLG

A0A438GFQ0 Retrovirus-related Pol polyprotein from transposon RE14.1e-13064.57Show/hide
Query:  MAESAKSS-FKISDVDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEAD
        MA S K+S  +   +D +HP YIHHSDQPG+ LVPIKLNG NYQSWSK+V+HAL AKKKIGF++GT+EEPSQ+     FE WNQCNSMI+SWLTH+VE+D
Subjt:  MAESAKSS-FKISDVDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEAD

Query:  IAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLL
        IA+GIIHAKTA +VWVDL DQFSQKNAPA+FQIQ SIATMSQG           TM ++ YFTK+KALWDELE YR+P TCNQRQ H++QREED+LMQ L
Subjt:  IAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLL

Query:  MGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDR
        MGLN+SYK VRSNILMMSPLPNVRQAYSL++QEEMQRQV+SEPTENFSIA+AV  K       + K C+HCN+SGHTI+ECR LKFHC FCD+RGHTEDR
Subjt:  MGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDR

Query:  CRQKNNSGRTR---QDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAGL
        CR KN S       +  +   RG + SAN A  SQ  ++  S +++  F++EQ++++AQA+ A+NH  SGN D + NVAGL
Subjt:  CRQKNNSGRTR---QDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAGL

A0A438GTA3 Retrotran_gag_3 domain-containing protein4.5e-12960.29Show/hide
Query:  MAESAKSS-FKISDVDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEAD
        MA S K+S  +   +D +HP YIHHSDQP + LVPIKLNG NYQSWSK+V+HAL AKKKIGF+DGT+EEPSQ+     FE WNQCNSMI+SWLTH+VE+D
Subjt:  MAESAKSS-FKISDVDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEAD

Query:  IAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLL
        IA+GIIH+KT  +VWVDL DQFSQKNAP +FQIQ SIATMSQG           TM ++ YFTK+KALWDELE YR+P TCNQ Q H++QREED+LMQ L
Subjt:  IAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLL

Query:  MGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDR
        MGLN+SYK +RSNILMMSPLPNVRQAYSL+VQEEMQRQV+SEPTENFSIA+AV  K       + K C+HCN+SGHTI+ECR LKFHC FCD+RGHTEDR
Subjt:  MGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDR

Query:  CRQKNNS-GRTRQ--DNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVA---------------------
        CR KN S  +TRQ    +   RG + SAN A  SQ  ++  S +++  F++EQ++++AQA+ A+NH  SGN D + N A                     
Subjt:  CRQKNNS-GRTRQ--DNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVA---------------------

Query:  -------GLGYGEDDWLG
               GLG+GEDDWLG
Subjt:  -------GLGYGEDDWLG

A0A438K345 Retrovirus-related Pol polyprotein from transposon RE11.5e-12961.14Show/hide
Query:  VDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEADIAKGIIHAKTAHQV
        +D +HP YIHHSDQPG+ LVPIKLNG NYQSWSK+V+HAL AKKKIGF+DGT+EEPSQ+     FE WNQCNSMI+SWLTH VE+DIA+GIIHAKTA +V
Subjt:  VDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEADIAKGIIHAKTAHQV

Query:  WVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLLMGLNQSYKTVRSNI
        WVDL DQFSQKNAPA+FQIQ SIATMSQG           TM ++ YFTK+KALWDELE YR+P TCNQRQ H++QREED+LMQ LMGL++SYK VRSNI
Subjt:  WVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLLMGLNQSYKTVRSNI

Query:  LMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDRCRQKNNSGRTR---
        LMMSPLPNVRQAYSL+VQEEMQRQV+SEPTENFSIA+AV +K       + K C+HCN+SGH ++ECR LKFHC FCD+RGHTEDRCR KN S       
Subjt:  LMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDRCRQKNNSGRTR---

Query:  QDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVA----------------------------GLGYGED
        +  +   RG +  AN A  SQ  ++  S ++I  F++EQ++++AQA+ A+NH  SGN D + N A                            GLG+GED
Subjt:  QDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVA----------------------------GLGYGED

Query:  DWLG
        DWLG
Subjt:  DWLG

A5BNR5 Integrase catalytic domain-containing protein2.6e-12964.47Show/hide
Query:  MAESAKSS-FKISDVDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEAD
        MA S K+S  +   +D +HP YIHHSDQPG+ LVPIKLNG NYQSWSK+V+HAL  KKKIGF+DGT+EEPSQ+     FE WNQCNSMI+SWLTH+VE+D
Subjt:  MAESAKSS-FKISDVDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEAD

Query:  IAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLL
        IA+GIIHAKTA +VWVDL DQFSQKNAPA+FQIQ SIATMSQG           TM ++ YFTK+KALWDELE YR+P TCNQRQ H++QREED+LMQ L
Subjt:  IAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLL

Query:  MGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDR
        MGLN+SYK VRSNILMMSPLPNVRQAYSL+VQEEMQRQV+SEPTENFSIA+AV  K       + K C+HCN+SGHTI+ECR LKFHC FCD+RGHTEDR
Subjt:  MGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDR

Query:  CRQKNNSGRTR---QDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAG
        CR KN S       +  +   RG + SAN A  SQ  ++  S +++  F++EQ++++AQA+ A+NH  SGN D + N AG
Subjt:  CRQKNNSGRTR---QDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.7e-2430.49Show/hide
Query:  MAESAKSSFKISDVDLTHPYY----IHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSV
        MAE+ KS    SD D   PYY    IHH     +S+  +  +  NY +W       L   KK GFIDGT+ +P  D  S  ++ W QCN+M++ WL +S+
Subjt:  MAESAKSSFKISDVDLTHPYY----IHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSV

Query:  EADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRT-------PFTCNQRQIHIDQ
           + + +++A+TAH++W DL   F       I+Q++  +AT+ QG             ++  YF KL  +W EL  Y            C   +   + 
Subjt:  EADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRT-------PFTCNQRQIHIDQ

Query:  REEDKLMQLLMG--LNQSYKTVRSNILMMSPLPNVRQAYSLLVQEE
        RE+++  + LMG  LNQ ++ V + I+   P P++ +A++++   E
Subjt:  REEDKLMQLLMG--LNQSYKTVRSNILMMSPLPNVRQAYSLLVQEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAATCAGCCAAATCCAGCTTCAAAATTTCGGATGTTGATTTAACACATCCGTACTATATTCATCACTCTGATCAGCCAGGATATTCACTTGTTCCAATCAAATT
AAATGGAGCAAATTACCAATCCTGGAGTAAATCAGTTATGCATGCTCTTATTGCCAAGAAGAAAATTGGCTTCATTGATGGCACAATTGAGGAACCGTCCCAAGATGCAA
ATTCAACCGAATTCGAACTCTGGAATCAGTGCAACAGTATGATAATATCTTGGTTAACTCATTCCGTTGAAGCAGATATCGCTAAAGGCATTATTCACGCCAAGACAGCT
CATCAAGTGTGGGTTGATCTTCACGATCAATTCTCACAAAAGAATGCTCCAGCAATTTTTCAAATACAAAACTCGATAGCAACGATGTCACAAGGAACCATGGCGCTGTC
AACATATTTNNNNNNNNNNACCATGGCGCTGTCAACATATTTCACCAAGCTCAAAGCACTTTGGGATGAACTGGAAGCGTACCGCACACCATTTACCTGTAATCAACGTC
AAATACATATTGATCAACGCGAAGAAGACAAGTTGATGCAATTGCTCATGGGGCTTAATCAGTCTTATAAAACGGTGAGATCTAACATATTGATGATGTCTCCATTACCT
AATGTGAGGCAAGCCTATTCATTACTTGTACAAGAAGAGATGCAGCGTCAGGTAACTTCCGAACCTACTGAGAATTTCTCGATTGCATCAGCAGTGCAAAAGAAAACAAT
ATATTCAAAATTCGCCAAGGACAAAAAGTGTGAACACTGCAATAAAAGTGGTCATACAATCAATGAGTGTCGAATTCTTAAGTTTCACTGTAACTTTTGTGATAGAAGGG
GCCATACAGAAGATCGGTGTCGACAGAAAAATAATTCTGGAAGGACAAGACAAGACAATCAACACAATAACCGTGGATATCGATCATCTGCAAATATGGCCGATGTTTCA
CAGTTGAATACAGAAGAACAGTCACCTAATTCCATTCCAAATTTTTCTTCTGAGCAATTACGAGAGATAGCACAAGCCTTATCTGCAATCAATCATCACCCTTCTGGTAA
TTCTGACAATCACGTCAATGTTGCAGGACTTGGCTACGGGGAAGATGATTGGCTCGGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGAATCAGCCAAATCCAGCTTCAAAATTTCGGATGTTGATTTAACACATCCGTACTATATTCATCACTCTGATCAGCCAGGATATTCACTTGTTCCAATCAAATT
AAATGGAGCAAATTACCAATCCTGGAGTAAATCAGTTATGCATGCTCTTATTGCCAAGAAGAAAATTGGCTTCATTGATGGCACAATTGAGGAACCGTCCCAAGATGCAA
ATTCAACCGAATTCGAACTCTGGAATCAGTGCAACAGTATGATAATATCTTGGTTAACTCATTCCGTTGAAGCAGATATCGCTAAAGGCATTATTCACGCCAAGACAGCT
CATCAAGTGTGGGTTGATCTTCACGATCAATTCTCACAAAAGAATGCTCCAGCAATTTTTCAAATACAAAACTCGATAGCAACGATGTCACAAGGAACCATGGCGCTGTC
AACATATTTNNNNNNNNNNACCATGGCGCTGTCAACATATTTCACCAAGCTCAAAGCACTTTGGGATGAACTGGAAGCGTACCGCACACCATTTACCTGTAATCAACGTC
AAATACATATTGATCAACGCGAAGAAGACAAGTTGATGCAATTGCTCATGGGGCTTAATCAGTCTTATAAAACGGTGAGATCTAACATATTGATGATGTCTCCATTACCT
AATGTGAGGCAAGCCTATTCATTACTTGTACAAGAAGAGATGCAGCGTCAGGTAACTTCCGAACCTACTGAGAATTTCTCGATTGCATCAGCAGTGCAAAAGAAAACAAT
ATATTCAAAATTCGCCAAGGACAAAAAGTGTGAACACTGCAATAAAAGTGGTCATACAATCAATGAGTGTCGAATTCTTAAGTTTCACTGTAACTTTTGTGATAGAAGGG
GCCATACAGAAGATCGGTGTCGACAGAAAAATAATTCTGGAAGGACAAGACAAGACAATCAACACAATAACCGTGGATATCGATCATCTGCAAATATGGCCGATGTTTCA
CAGTTGAATACAGAAGAACAGTCACCTAATTCCATTCCAAATTTTTCTTCTGAGCAATTACGAGAGATAGCACAAGCCTTATCTGCAATCAATCATCACCCTTCTGGTAA
TTCTGACAATCACGTCAATGTTGCAGGACTTGGCTACGGGGAAGATGATTGGCTCGGGTAAACAATTTGGAGGTCTCTATCATATTTCTTCATCTCCAATCAAATCTTCA
GCTCATCAAGTATCT
Protein sequenceShow/hide protein sequence
MAESAKSSFKISDVDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEADIAKGIIHAKTA
HQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYXXXXTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLLMGLNQSYKTVRSNILMMSPLP
NVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDRCRQKNNSGRTRQDNQHNNRGYRSSANMADVS
QLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG