CuGenDBv2

CmoCh18G010010 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh18G010010
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: Cmo_Chr18:10906944..10908455
RNA-Seq Expression: CmoCh18G010010
Synteny: CmoCh18G010010
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
  GO:0016628 - oxidoreductase activity, acting on the CH-CH group of donors, NAD or NADP as acceptor (molecular function)
InterPro domains:
  IPR029472 - Retrotransposon Copia-like, N-terminal
  IPR036875 - Zinc finger, CCHC-type superfamily
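
The genome location above uses a "sequence:start..end" notation. As a minimal sketch, the string can be split into its parts and the spanned length computed as follows. The names Locus and parse_location are hypothetical helpers introduced only for this illustration, and the coordinates are assumed to be 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Inclusive coordinates: both end points count toward the length.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a 'seqid:start..end' string (hypothetical helper)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(match.group(1), int(match.group(2)), int(match.group(3)))


locus = parse_location("Cmo_Chr18:10906944..10908455")
print(locus.seqid, locus.start, locus.end, locus.span)
# Cmo_Chr18 10906944 10908455 1512
```

Under that assumption the locus spans 1512 bp, which is longer than the 1128 nt CDS listed under Sequences.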


Homology

GenBank top hits (e value, % identity, alignment)
RVW69506.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]  e value: 1.9e-134  % identity: 62.65
Query:  MAESAKSS-FKISDVDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEAD
        MA S K+S  +   +D +HP YIHHSDQPG+ LVPIKLNG NYQSWSK+V+HAL AKKKIGF+DGT+EEPSQ+     FE WNQCNSMI+SWLTH+VE+D
Subjt:  MAESAKSS-FKISDVDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEAD

Query:  IAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLLMGLNQSYKTVR
        IA+GIIHAKTA +VWVDL DQFSQKNAPA+FQIQ SIATMSQGTM ++ YFTK+KALWDELE YR+P TCNQRQ H++QREED+LMQ LMGLN+SYK VR
Subjt:  IAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLLMGLNQSYKTVR

Query:  SNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDRCRQKNNSGRTR
        SNILMMSPLPNVRQAYSL+VQEEMQRQV+SEPTENFSIA+AV  K       + K C+HCN+SGHTI+ECR LKFHC FCD+RGHTEDRCR KN S    
Subjt:  SNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDRCRQKNNSGRTR

Query:  ---QDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVA----------------------------GLGY
           +  +   RG + SAN A  SQ  ++  S +++  F++EQ++++AQA+ A+NH  SGN D++ N A                            GLG+
Subjt:  ---QDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVA----------------------------GLGY

Query:  GEDDWLG
        GEDDWLG
Subjt:  GEDDWLG

XP_023511453.1 uncharacterized protein LOC111776274 [Cucurbita pepo subsp. pepo]  e value: 4.6e-157  % identity: 98.62
Query:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQ
        MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQ
Subjt:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQ

Query:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTE
        LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTE  SIASAVQKKTIYSKFAKDK CEHCNKSGHTINECRILKFHCNFCDRRGHTE
Subjt:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTE

Query:  DRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG
        DRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLR+IAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG
Subjt:  DRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG

XP_023520712.1 uncharacterized protein LOC111784113 [Cucurbita pepo subsp. pepo]  e value: 2.9e-159  % identity: 99.65
Query:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQ
        MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQ
Subjt:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQ

Query:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTE
        LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTE
Subjt:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTE

Query:  DRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG
        DRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLR+IAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG
Subjt:  DRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG

XP_023524327.1 uncharacterized protein LOC111788256 [Cucurbita pepo subsp. pepo]  e value: 7.1e-158  % identity: 98.96
Query:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQ
        MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQ
Subjt:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQ

Query:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTE
        LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTE  SIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTE
Subjt:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTE

Query:  DRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG
        DRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLR+IAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG
Subjt:  DRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG

XP_023536128.1 uncharacterized protein LOC111797374 [Cucurbita pepo subsp. pepo]  e value: 3.0e-156  % identity: 97.92
Query:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQ
        MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQ
Subjt:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQ

Query:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTE
        LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTE  SIASAVQKKTIYSKFAKDK CEHCNKSGHTINECRILKFHC FCDRRGHTE
Subjt:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTE

Query:  DRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG
        DRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLR+IAQALSAINHHPSGNSDNH+NVAGLGYGEDDWLG
Subjt:  DRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG

TrEMBL top hits (e value, % identity, alignment)
A0A438GBE7 Retrovirus-related Pol polyprotein from transposon RE1  e value: 9.1e-135  % identity: 62.65
Query:  MAESAKSS-FKISDVDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEAD
        MA S K+S  +   +D +HP YIHHSDQPG+ LVPIKLNG NYQSWSK+V+HAL AKKKIGF+DGT+EEPSQ+     FE WNQCNSMI+SWLTH+VE+D
Subjt:  MAESAKSS-FKISDVDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEAD

Query:  IAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLLMGLNQSYKTVR
        IA+GIIHAKTA +VWVDL DQFSQKNAPA+FQIQ SIATMSQGTM ++ YFTK+KALWDELE YR+P TCNQRQ H++QREED+LMQ LMGLN+SYK VR
Subjt:  IAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLLMGLNQSYKTVR

Query:  SNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDRCRQKNNSGRTR
        SNILMMSPLPNVRQAYSL+VQEEMQRQV+SEPTENFSIA+AV  K       + K C+HCN+SGHTI+ECR LKFHC FCD+RGHTEDRCR KN S    
Subjt:  SNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDRCRQKNNSGRTR

Query:  ---QDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVA----------------------------GLGY
           +  +   RG + SAN A  SQ  ++  S +++  F++EQ++++AQA+ A+NH  SGN D++ N A                            GLG+
Subjt:  ---QDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVA----------------------------GLGY

Query:  GEDDWLG
        GEDDWLG
Subjt:  GEDDWLG

A0A438GFQ0 Retrovirus-related Pol polyprotein from transposon RE1  e value: 8.5e-133  % identity: 66.49
Query:  MAESAKSS-FKISDVDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEAD
        MA S K+S  +   +D +HP YIHHSDQPG+ LVPIKLNG NYQSWSK+V+HAL AKKKIGF++GT+EEPSQ+     FE WNQCNSMI+SWLTH+VE+D
Subjt:  MAESAKSS-FKISDVDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEAD

Query:  IAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLLMGLNQSYKTVR
        IA+GIIHAKTA +VWVDL DQFSQKNAPA+FQIQ SIATMSQGTM ++ YFTK+KALWDELE YR+P TCNQRQ H++QREED+LMQ LMGLN+SYK VR
Subjt:  IAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLLMGLNQSYKTVR

Query:  SNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDRCRQKNNSGRTR
        SNILMMSPLPNVRQAYSL++QEEMQRQV+SEPTENFSIA+AV  K       + K C+HCN+SGHTI+ECR LKFHC FCD+RGHTEDRCR KN S    
Subjt:  SNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDRCRQKNNSGRTR

Query:  ---QDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAGL
           +  +   RG + SAN A  SQ  ++  S +++  F++EQ++++AQA+ A+NH  SGN D + NVAGL
Subjt:  ---QDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAGL

A0A438GTA3 Retrotran_gag_3 domain-containing protein  e value: 9.4e-132  % identity: 61.92
Query:  MAESAKSS-FKISDVDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEAD
        MA S K+S  +   +D +HP YIHHSDQP + LVPIKLNG NYQSWSK+V+HAL AKKKIGF+DGT+EEPSQ+     FE WNQCNSMI+SWLTH+VE+D
Subjt:  MAESAKSS-FKISDVDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEAD

Query:  IAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLLMGLNQSYKTVR
        IA+GIIH+KT  +VWVDL DQFSQKNAP +FQIQ SIATMSQGTM ++ YFTK+KALWDELE YR+P TCNQ Q H++QREED+LMQ LMGLN+SYK +R
Subjt:  IAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLLMGLNQSYKTVR

Query:  SNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDRCRQKNNS-GRT
        SNILMMSPLPNVRQAYSL+VQEEMQRQV+SEPTENFSIA+AV  K       + K C+HCN+SGHTI+ECR LKFHC FCD+RGHTEDRCR KN S  +T
Subjt:  SNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDRCRQKNNS-GRT

Query:  RQ--DNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVA----------------------------GLGY
        RQ    +   RG + SAN A  SQ  ++  S +++  F++EQ++++AQA+ A+NH  SGN D + N A                            GLG+
Subjt:  RQ--DNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVA----------------------------GLGY

Query:  GEDDWLG
        GEDDWLG
Subjt:  GEDDWLG

A0A438K345 Retrovirus-related Pol polyprotein from transposon RE1  e value: 3.2e-132  % identity: 62.85
Query:  VDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEADIAKGIIHAKTAHQV
        +D +HP YIHHSDQPG+ LVPIKLNG NYQSWSK+V+HAL AKKKIGF+DGT+EEPSQ+     FE WNQCNSMI+SWLTH VE+DIA+GIIHAKTA +V
Subjt:  VDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEADIAKGIIHAKTAHQV

Query:  WVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLLMGLNQSYKTVRSNILMMSPLPNVRQ
        WVDL DQFSQKNAPA+FQIQ SIATMSQGTM ++ YFTK+KALWDELE YR+P TCNQRQ H++QREED+LMQ LMGL++SYK VRSNILMMSPLPNVRQ
Subjt:  WVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLLMGLNQSYKTVRSNILMMSPLPNVRQ

Query:  AYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDRCRQKNNSGRTR---QDNQHNNRGYR
        AYSL+VQEEMQRQV+SEPTENFSIA+AV +K       + K C+HCN+SGH ++ECR LKFHC FCD+RGHTEDRCR KN S       +  +   RG +
Subjt:  AYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDRCRQKNNSGRTR---QDNQHNNRGYR

Query:  SSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVA----------------------------GLGYGEDDWLG
          AN A  SQ  ++  S ++I  F++EQ++++AQA+ A+NH  SGN D + N A                            GLG+GEDDWLG
Subjt:  SSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVA----------------------------GLGYGEDDWLG

A5BNR5 Integrase catalytic domain-containing protein  e value: 5.5e-132  % identity: 66.4
Query:  MAESAKSS-FKISDVDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEAD
        MA S K+S  +   +D +HP YIHHSDQPG+ LVPIKLNG NYQSWSK+V+HAL  KKKIGF+DGT+EEPSQ+     FE WNQCNSMI+SWLTH+VE+D
Subjt:  MAESAKSS-FKISDVDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEAD

Query:  IAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLLMGLNQSYKTVR
        IA+GIIHAKTA +VWVDL DQFSQKNAPA+FQIQ SIATMSQGTM ++ YFTK+KALWDELE YR+P TCNQRQ H++QREED+LMQ LMGLN+SYK VR
Subjt:  IAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLLMGLNQSYKTVR

Query:  SNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDRCRQKNNSGRTR
        SNILMMSPLPNVRQAYSL+VQEEMQRQV+SEPTENFSIA+AV  K       + K C+HCN+SGHTI+ECR LKFHC FCD+RGHTEDRCR KN S    
Subjt:  SNILMMSPLPNVRQAYSLLVQEEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDRCRQKNNSGRTR

Query:  ---QDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAG
           +  +   RG + SAN A  SQ  ++  S +++  F++EQ++++AQA+ A+NH  SGN D + N AG
Subjt:  ---QDNQHNNRGYRSSANMADVSQLNTEEQSPNSIPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAG

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).  e value: 3.6e-27  % identity: 31.91
Query:  MAESAKSSFKISDVDLTHPYY----IHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSV
        MAE+ KS    SD D   PYY    IHH     +S+  +  +  NY +W       L   KK GFIDGT+ +P  D  S  ++ W QCN+M++ WL +S+
Subjt:  MAESAKSSFKISDVDLTHPYY----IHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSV

Query:  EADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRT-------PFTCNQRQIHIDQREEDKLMQLLM
           + + +++A+TAH++W DL   F       I+Q++  +AT+ QG  ++  YF KL  +W EL  Y            C   +   + RE+++  + LM
Subjt:  EADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRT-------PFTCNQRQIHIDQREEDKLMQLLM

Query:  G--LNQSYKTVRSNILMMSPLPNVRQAYSLLVQEE
        G  LNQ ++ V + I+   P P++ +A++++   E
Subjt:  G--LNQSYKTVRSNILMMSPLPNVRQAYSLLVQEE
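
In BLAST-style alignment blocks like those above, the unlabeled middle row conventionally repeats the residue where query and subject are identical, shows "+" for a conservative substitution, and is blank otherwise. The sketch below counts those marks for one row. The function names are hypothetical, and the percentage is computed only over the columns passed in, so it is not a reproduction of the whole-alignment % identity values reported for each hit.

```python
def midline_counts(midline: str) -> tuple[int, int]:
    """Return (identical, positive) column counts for a BLAST-style midline.

    Letters mark identical residues; '+' marks conservative substitutions.
    Positives include the identical columns, as in BLAST summaries.
    """
    identical = sum(ch.isalpha() for ch in midline)
    positives = identical + midline.count("+")
    return identical, positives


def identity_percent(query_row: str, midline: str) -> float:
    """Percent identity over the columns of one alignment row."""
    # Trailing blanks of the midline are often lost in plain text,
    # so the column count is taken from the query row instead.
    identical, _ = midline_counts(midline)
    return 100.0 * identical / len(query_row)


# First eight columns of the first GenBank alignment block above.
print(midline_counts("MA S K+S"))                # (5, 6)
print(identity_percent("MAESAKSS", "MA S K+S"))  # 62.5
```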


Sequences

CDS sequence
ATGGCGGAATCAGCCAAATCCAGCTTCAAAATTTCGGATGTTGATTTAACACATCCGTACTATATTCATCACTCTGATCAGCCAGGATATTCACTTGTTCCAATCAAATT
AAATGGAGCAAATTACCAATCCTGGAGTAAATCAGTTATGCATGCTCTTATTGCCAAGAAGAAAATTGGCTTCATTGATGGCACAATTGAGGAACCGTCCCAAGATGCAA
ATTCAACCGAATTCGAACTCTGGAATCAGTGCAACAGTATGATAATATCTTGGTTAACTCATTCCGTTGAAGCAGATATCGCTAAAGGCATTATTCACGCCAAGACAGCT
CATCAAGTGTGGGTTGATCTTCACGATCAATTCTCACAAAAGAATGCTCCAGCAATTTTTCAAATACAAAACTCGATAGCAACGATGTCACAAGGAACCATGGCGCTGTC
AACATATTTCACCAAGCTCAAAGCACTTTGGGATGAACTGGAAGCATACCGCACACCATTTACCTGTAATCAACGTCAAATACATATTGACCAACGCGAAGAAGACAAGT
TGATGCAATTGCTCATGGGGCTTAATCAGTCTTATAAAACGGTGAGATCTAACATATTGATGATGTCTCCATTACCTAATGTGAGGCAAGCCTATTCATTACTTGTACAA
GAAGAGATGCAGCGTCAGGTAACTTCCGAACCTACTGAGAATTTCTCGATTGCATCAGCAGTGCAAAAGAAAACAATATATTCAAAATTCGCCAAGGACAAAAAGTGTGA
ACACTGCAATAAAAGTGGTCATACAATCAATGAGTGTCGAATTCTTAAGTTTCACTGTAACTTTTGTGATAGAAGGGGCCATACAGAAGATCGGTGTCGACAGAAAAATA
ATTCTGGAAGGACAAGACAAGACAATCAACACAATAACCGTGGATATCGATCATCTGCAAATATGGCCGATGTTTCACAGTTGAATACAGAAGAACAGTCACCTAATTCC
ATTCCAAATTTTTCTTCTGAGCAATTACGAGAGATAGCACAAGCCTTATCTGCAATCAATCATCACCCTTCTGGTAATTCTGACAATCACGTCAATGTTGCAGGACTTGG
CTACGGGGAAGATGATTGGCTCGGGTAA
mRNA sequence
ATGGCGGAATCAGCCAAATCCAGCTTCAAAATTTCGGATGTTGATTTAACACATCCGTACTATATTCATCACTCTGATCAGCCAGGATATTCACTTGTTCCAATCAAATT
AAATGGAGCAAATTACCAATCCTGGAGTAAATCAGTTATGCATGCTCTTATTGCCAAGAAGAAAATTGGCTTCATTGATGGCACAATTGAGGAACCGTCCCAAGATGCAA
ATTCAACCGAATTCGAACTCTGGAATCAGTGCAACAGTATGATAATATCTTGGTTAACTCATTCCGTTGAAGCAGATATCGCTAAAGGCATTATTCACGCCAAGACAGCT
CATCAAGTGTGGGTTGATCTTCACGATCAATTCTCACAAAAGAATGCTCCAGCAATTTTTCAAATACAAAACTCGATAGCAACGATGTCACAAGGAACCATGGCGCTGTC
AACATATTTCACCAAGCTCAAAGCACTTTGGGATGAACTGGAAGCATACCGCACACCATTTACCTGTAATCAACGTCAAATACATATTGACCAACGCGAAGAAGACAAGT
TGATGCAATTGCTCATGGGGCTTAATCAGTCTTATAAAACGGTGAGATCTAACATATTGATGATGTCTCCATTACCTAATGTGAGGCAAGCCTATTCATTACTTGTACAA
GAAGAGATGCAGCGTCAGGTAACTTCCGAACCTACTGAGAATTTCTCGATTGCATCAGCAGTGCAAAAGAAAACAATATATTCAAAATTCGCCAAGGACAAAAAGTGTGA
ACACTGCAATAAAAGTGGTCATACAATCAATGAGTGTCGAATTCTTAAGTTTCACTGTAACTTTTGTGATAGAAGGGGCCATACAGAAGATCGGTGTCGACAGAAAAATA
ATTCTGGAAGGACAAGACAAGACAATCAACACAATAACCGTGGATATCGATCATCTGCAAATATGGCCGATGTTTCACAGTTGAATACAGAAGAACAGTCACCTAATTCC
ATTCCAAATTTTTCTTCTGAGCAATTACGAGAGATAGCACAAGCCTTATCTGCAATCAATCATCACCCTTCTGGTAATTCTGACAATCACGTCAATGTTGCAGGACTTGG
CTACGGGGAAGATGATTGGCTCGGGTAAACAATTTGGAGGTCTCTATCATATTTCTTCATCTCCAATCAAATCTTCAGCTCATCAAGTATCT
Protein sequence
MAESAKSSFKISDVDLTHPYYIHHSDQPGYSLVPIKLNGANYQSWSKSVMHALIAKKKIGFIDGTIEEPSQDANSTEFELWNQCNSMIISWLTHSVEADIAKGIIHAKTA
HQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQRQIHIDQREEDKLMQLLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQ
EEMQRQVTSEPTENFSIASAVQKKTIYSKFAKDKKCEHCNKSGHTINECRILKFHCNFCDRRGHTEDRCRQKNNSGRTRQDNQHNNRGYRSSANMADVSQLNTEEQSPNS
IPNFSSEQLREIAQALSAINHHPSGNSDNHVNVAGLGYGEDDWLG
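
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1); the translate helper is hypothetical, and only the first 24 and last 15 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped text
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 24 and last 15 nucleotides of the CDS sequence listed above.
print(translate("ATGGCGGAATCAGCCAAATCCAGC"))  # MAESAKSS
print(translate("GATTGGCTCGGGTAA"))           # DWLG*
```

Both fragments agree with the start (MAESAKSS) and end (DWLG) of the listed protein, with the final TAA read as the stop codon.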