; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh10G011030 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh10G011030
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase
Genome locationCmo_Chr10:6373599..6377084
RNA-Seq ExpressionCmoCh10G011030
SyntenyCmoCh10G011030
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041588 - Integrase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043669.1 pol protein [Cucumis melo var. makuwa]2.1e-18276.3Show/hide
Query:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY
        MHPG TKMYQDLK+ +WW++MKR+VA FVSKCLVCQQVKAPRQK AGLLQPLSIPEWKWEN++MDFI GLP+T +G+ VIWVVVDRLTKSAHF+PGK TY
Subjt:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY

Query:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA
        T   WAQLY+ EIVRLHGVPVSIVSDRD RFTS FW+GLQ A+GTRLDFSTAFHPQTDGQTE LNQ+LEDMLRAC L+F +SWDS LHLMEF+YNNS+QA
Subjt:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA

Query:  TIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRYGHKGKLSPKFIG
        TIGMAPFEALYG+ CRSP+CWDEVGE+ L+GPELV+ TNEAVQKIR+RM TAQSRQKSYADVRRK LEFEVGD VFLKVAPMKGVLR+  +GKLSP+F+G
Subjt:  TIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRYGHKGKLSPKFIG

Query:  PFEILERVGPVAYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYEPLKLNEDLSYEEKSVRILAREVKTLRNRSIAFIKELLGRKTPKKERRTTNKEE
        PFEILER+GPVAY+LALPP+LS VHDVFHVSMLRKY+ DP HV+DYEPL+++E+LSY E+ V +LAREVKTLRN+ I  +K L   +  + E  T  +E+
Subjt:  PFEILERVGPVAYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYEPLKLNEDLSYEEKSVRILAREVKTLRNRSIAFIKELLGRKTPKKERRTTNKEE

Query:  KGRKR
          R R
Subjt:  KGRKR

KAA0051368.1 pol protein [Cucumis melo var. makuwa]8.8e-18175.56Show/hide
Query:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY
        MHPG TKMYQDLK+ +WW++MKR+VA FVSKCLVCQQVKAPRQK AGLLQPLSIPEWKWEN++MDFI GLP+T +G+ VIWVVVDRLTKSAHF+PGK TY
Subjt:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY

Query:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA
        T   WAQLY+ EIVRLHGVPVSIVSDRD RFTS FW+GLQ A+GTRLDFSTAFHPQTDGQTE LNQ+LEDMLRAC L+F  SWDS LHLMEF+YNNS+QA
Subjt:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA

Query:  TIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRYGHKGKLSPKFIG
        TIGMAPFEALYG+ CRSP+CW EVGE+ L+GPELV+ TNEA+QKIR+RM TAQSRQKSYADVRRK LEFE+GD VFLKVAPMKGVLR+  +GKLSP+F+G
Subjt:  TIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRYGHKGKLSPKFIG

Query:  PFEILERVGPVAYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYEPLKLNEDLSYEEKSVRILAREVKTLRNRSIAFIKELLGRKTPKKERRTTNKEE
        PFEILER+GPVAY+LALPP+LS VHDVFHVSMLRKY+ DP HV+DYEPL+++E+LSY E+ V +LAREVKTLRN+ I  +K L   +  + E  T  +E+
Subjt:  PFEILERVGPVAYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYEPLKLNEDLSYEEKSVRILAREVKTLRNRSIAFIKELLGRKTPKKERRTTNKEE

Query:  KGRKR
          R R
Subjt:  KGRKR

KAA0060848.1 pol protein [Cucumis melo var. makuwa]5.1e-18175.56Show/hide
Query:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY
        MHPG TKMYQDLK+ +WW++MKR+VA FVSKCLVCQQVKAPRQK AGLLQPLSIPEWKWEN++MDFI+GLP+T +G+ VIWVVVDRLTKSAHF+PGK TY
Subjt:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY

Query:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA
        T   WAQLY+ EIVRLHGVPVSIVSDRD RFTS FW+GLQ A+GTRLDFSTAFHPQTDGQTE LNQ+LEDMLRAC L+F  SWDS LHLMEF+YNNS+QA
Subjt:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA

Query:  TIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRYGHKGKLSPKFIG
        TIGMAPFEALYGK CRSP+CWDEVGE+ L+GPELV+ TNEA+QKIR+RM TAQSRQKSYADVRRK LEFEVGD VFLKVAPM+GVLR+  +GKLSP+F+G
Subjt:  TIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRYGHKGKLSPKFIG

Query:  PFEILERVGPVAYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYEPLKLNEDLSYEEKSVRILAREVKTLRNRSIAFIKELLGRKTPKKERRTTNKEE
        PFEILER+GPVAY+LALPP+L+  HDVFHVSMLRKY+ DP HV+DYEPL+++E++SY E+ V ILAREVKTLRN+ I  +K L   +  + E  T  +E+
Subjt:  PFEILERVGPVAYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYEPLKLNEDLSYEEKSVRILAREVKTLRNRSIAFIKELLGRKTPKKERRTTNKEE

Query:  KGRKR
          R R
Subjt:  KGRKR

KAA0064005.1 pol protein [Cucumis melo var. makuwa]1.1e-18075.43Show/hide
Query:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY
        MHPG TKMYQDLK+ +WW++MKR+VA FVSKCLVCQQVKAPRQK AGLLQPLSIPEWKWEN++MDFI GLP+T +G+ VIWVVVDRLTKSAHF+PGK TY
Subjt:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY

Query:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA
        T   WAQLY+ EIVRLHGVPVSIVSDRD RFTS FW+GLQ A+GTRLDFSTAFHPQTDGQTE LNQ+LEDMLRAC L+F  SWDS LHLMEF+YNNS+QA
Subjt:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA

Query:  TIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRYGHKGKLSPKFIG
        TIGM PFEALYG+ CRSP+CW EVGE+ L+GPELV+ TNEA+QKIR+RM TAQSRQKSYADVRRK LEFE+GD VFLKVAPMKGVLR+  +GKLSP+F+G
Subjt:  TIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRYGHKGKLSPKFIG

Query:  PFEILERVGPVAYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYEPLKLNEDLSYEEKSVRILAREVKTLRNRSIAFIKELLGRKTPKKERRTTNKEE
        PFEILER+GPVAY+LALPP+LS VHDVFHVSMLRKY+ DP HV+DYEPL+++E+LSY E+ V +LAREVKTLRN+ I  +K L   +  ++E  T  +E+
Subjt:  PFEILERVGPVAYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYEPLKLNEDLSYEEKSVRILAREVKTLRNRSIAFIKELLGRKTPKKERRTTNKEE

Query:  KGR
          R
Subjt:  KGR

XP_022933231.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111440131 [Cucurbita moschata]6.3e-18782.6Show/hide
Query:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY
        MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQ+VKAPRQKTAGLLQPLSIPEWKWENI MDFIVGLPKTPKGY VIWVVVDRLTKSAHFLPGKVTY
Subjt:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY

Query:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA
        TVDNWAQLYVKEIVRLHGV VSIVSDRDPRFT AFWRGLQKALGTRLDFSTAFHPQ DGQTE LNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA
Subjt:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA

Query:  TIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRYGHKGKLSPKFIG
        TIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQ IRARMRT QSRQKSYADVRRKSLEFEVGDP+FLKVAPMKGVLR+GHKGKLSPKFIG
Subjt:  TIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRYGHKGKLSPKFIG

Query:  PFEILERVGPVAYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYEPLKLNEDLSYEEKSVRILAREVKTLRNRSIAFIKELLGRKTPKKERRTTNKEE
        PFEILERVGPVAYKLALPPALSGVHDV HV                                     EVKTLRNRSIAF+K +L R    +E     ++E
Subjt:  PFEILERVGPVAYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYEPLKLNEDLSYEEKSVRILAREVKTLRNRSIAFIKELLGRKTPKKERRTTNKEE

Query:  KGRKRTEI
           K  E+
Subjt:  KGRKRTEI

TrEMBL top hitse value%identityAlignment
A0A5A7TR61 Pol protein1.0e-18276.3Show/hide
Query:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY
        MHPG TKMYQDLK+ +WW++MKR+VA FVSKCLVCQQVKAPRQK AGLLQPLSIPEWKWEN++MDFI GLP+T +G+ VIWVVVDRLTKSAHF+PGK TY
Subjt:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY

Query:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA
        T   WAQLY+ EIVRLHGVPVSIVSDRD RFTS FW+GLQ A+GTRLDFSTAFHPQTDGQTE LNQ+LEDMLRAC L+F +SWDS LHLMEF+YNNS+QA
Subjt:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA

Query:  TIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRYGHKGKLSPKFIG
        TIGMAPFEALYG+ CRSP+CWDEVGE+ L+GPELV+ TNEAVQKIR+RM TAQSRQKSYADVRRK LEFEVGD VFLKVAPMKGVLR+  +GKLSP+F+G
Subjt:  TIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRYGHKGKLSPKFIG

Query:  PFEILERVGPVAYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYEPLKLNEDLSYEEKSVRILAREVKTLRNRSIAFIKELLGRKTPKKERRTTNKEE
        PFEILER+GPVAY+LALPP+LS VHDVFHVSMLRKY+ DP HV+DYEPL+++E+LSY E+ V +LAREVKTLRN+ I  +K L   +  + E  T  +E+
Subjt:  PFEILERVGPVAYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYEPLKLNEDLSYEEKSVRILAREVKTLRNRSIAFIKELLGRKTPKKERRTTNKEE

Query:  KGRKR
          R R
Subjt:  KGRKR

A0A5A7U7V9 Reverse transcriptase4.2e-18175.56Show/hide
Query:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY
        MHPG TKMYQDLK+ +WW++MKR+VA FVSKCLVCQQVKAPRQK AGLLQPLSIPEWKWEN++MDFI GLP+T +G+ VIWVVVDRLTKSAHF+PGK TY
Subjt:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY

Query:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA
        T   WAQLY+ EIVRLHGVPVSIVSDRD RFTS FW+GLQ A+GTRLDFSTAFHPQTDGQTE LNQ+LEDMLRAC L+F  SWDS LHLMEF+YNNS+QA
Subjt:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA

Query:  TIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRYGHKGKLSPKFIG
        TIGMAPFEALYG+ CRSP+CW EVGE+ L+GPELV+ TNEA+QKIR+RM TAQSRQKSYADVRRK LEFE+GD VFLKVAPMKGVLR+  +GKLSP+F+G
Subjt:  TIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRYGHKGKLSPKFIG

Query:  PFEILERVGPVAYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYEPLKLNEDLSYEEKSVRILAREVKTLRNRSIAFIKELLGRKTPKKERRTTNKEE
        PFEILER+GPVAY+LALPP+LS VHDVFHVSMLRKY+ DP HV+DYEPL+++E+LSY E+ V +LAREVKTLRN+ I  +K L   +  + E  T  +E+
Subjt:  PFEILERVGPVAYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYEPLKLNEDLSYEEKSVRILAREVKTLRNRSIAFIKELLGRKTPKKERRTTNKEE

Query:  KGRKR
          R R
Subjt:  KGRKR

A0A5A7UZZ4 Pol protein2.5e-18175.56Show/hide
Query:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY
        MHPG TKMYQDLK+ +WW++MKR+VA FVSKCLVCQQVKAPRQK AGLLQPLSIPEWKWEN++MDFI+GLP+T +G+ VIWVVVDRLTKSAHF+PGK TY
Subjt:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY

Query:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA
        T   WAQLY+ EIVRLHGVPVSIVSDRD RFTS FW+GLQ A+GTRLDFSTAFHPQTDGQTE LNQ+LEDMLRAC L+F  SWDS LHLMEF+YNNS+QA
Subjt:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA

Query:  TIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRYGHKGKLSPKFIG
        TIGMAPFEALYGK CRSP+CWDEVGE+ L+GPELV+ TNEA+QKIR+RM TAQSRQKSYADVRRK LEFEVGD VFLKVAPM+GVLR+  +GKLSP+F+G
Subjt:  TIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRYGHKGKLSPKFIG

Query:  PFEILERVGPVAYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYEPLKLNEDLSYEEKSVRILAREVKTLRNRSIAFIKELLGRKTPKKERRTTNKEE
        PFEILER+GPVAY+LALPP+L+  HDVFHVSMLRKY+ DP HV+DYEPL+++E++SY E+ V ILAREVKTLRN+ I  +K L   +  + E  T  +E+
Subjt:  PFEILERVGPVAYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYEPLKLNEDLSYEEKSVRILAREVKTLRNRSIAFIKELLGRKTPKKERRTTNKEE

Query:  KGRKR
          R R
Subjt:  KGRKR

A0A5A7VDP3 Pol protein5.5e-18175.43Show/hide
Query:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY
        MHPG TKMYQDLK+ +WW++MKR+VA FVSKCLVCQQVKAPRQK AGLLQPLSIPEWKWEN++MDFI GLP+T +G+ VIWVVVDRLTKSAHF+PGK TY
Subjt:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY

Query:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA
        T   WAQLY+ EIVRLHGVPVSIVSDRD RFTS FW+GLQ A+GTRLDFSTAFHPQTDGQTE LNQ+LEDMLRAC L+F  SWDS LHLMEF+YNNS+QA
Subjt:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA

Query:  TIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRYGHKGKLSPKFIG
        TIGM PFEALYG+ CRSP+CW EVGE+ L+GPELV+ TNEA+QKIR+RM TAQSRQKSYADVRRK LEFE+GD VFLKVAPMKGVLR+  +GKLSP+F+G
Subjt:  TIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRYGHKGKLSPKFIG

Query:  PFEILERVGPVAYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYEPLKLNEDLSYEEKSVRILAREVKTLRNRSIAFIKELLGRKTPKKERRTTNKEE
        PFEILER+GPVAY+LALPP+LS VHDVFHVSMLRKY+ DP HV+DYEPL+++E+LSY E+ V +LAREVKTLRN+ I  +K L   +  ++E  T  +E+
Subjt:  PFEILERVGPVAYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYEPLKLNEDLSYEEKSVRILAREVKTLRNRSIAFIKELLGRKTPKKERRTTNKEE

Query:  KGR
          R
Subjt:  KGR

A0A6J1EYH9 Reverse transcriptase3.0e-18782.6Show/hide
Query:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY
        MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQ+VKAPRQKTAGLLQPLSIPEWKWENI MDFIVGLPKTPKGY VIWVVVDRLTKSAHFLPGKVTY
Subjt:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY

Query:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA
        TVDNWAQLYVKEIVRLHGV VSIVSDRDPRFT AFWRGLQKALGTRLDFSTAFHPQ DGQTE LNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA
Subjt:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA

Query:  TIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRYGHKGKLSPKFIG
        TIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQ IRARMRT QSRQKSYADVRRKSLEFEVGDP+FLKVAPMKGVLR+GHKGKLSPKFIG
Subjt:  TIGMAPFEALYGKRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRYGHKGKLSPKFIG

Query:  PFEILERVGPVAYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYEPLKLNEDLSYEEKSVRILAREVKTLRNRSIAFIKELLGRKTPKKERRTTNKEE
        PFEILERVGPVAYKLALPPALSGVHDV HV                                     EVKTLRNRSIAF+K +L R    +E     ++E
Subjt:  PFEILERVGPVAYKLALPPALSGVHDVFHVSMLRKYITDPIHVIDYEPLKLNEDLSYEEKSVRILAREVKTLRNRSIAFIKELLGRKTPKKERRTTNKEE

Query:  KGRKRTEI
           K  E+
Subjt:  KGRKRTEI

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein3.5e-4732.65Show/hide
Query:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY
        +HPG   +   + + F WK +++ +  +V  C  CQ  K+   K  G LQP+   E  WE+++MDFI  LP++  GY  ++VVVDR +K A  +P   + 
Subjt:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY

Query:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA
        T +  A+++ + ++   G P  I++D D  FTS  W+         + FS  + PQTDGQTE  NQ +E +LR        +W   + L++ SYNN+  +
Subjt:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA

Query:  TIGMAPFEALYG-KRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSL-EFEVGDPVFLKVAPMKGVLRYGHK-GKLSPK
           M PFE ++      SPL      +K     E  + T +  Q ++  + T   + K Y D++ + + EF+ GD V +K    +    + HK  KL+P 
Subjt:  TIGMAPFEALYG-KRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSL-EFEVGDPVFLKVAPMKGVLRYGHK-GKLSPK

Query:  FIGPFEILERVGPVAYKLALPPALSGV-HDVFHVSMLRKY
        F GPF +L++ GP  Y+L LP ++  +    FHVS L KY
Subjt:  FIGPFEILERVGPVAYKLALPPALSGV-HDVFHVSMLRKY

P0CT35 Transposon Tf2-2 polyprotein3.5e-4732.65Show/hide
Query:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY
        +HPG   +   + + F WK +++ +  +V  C  CQ  K+   K  G LQP+   E  WE+++MDFI  LP++  GY  ++VVVDR +K A  +P   + 
Subjt:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY

Query:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA
        T +  A+++ + ++   G P  I++D D  FTS  W+         + FS  + PQTDGQTE  NQ +E +LR        +W   + L++ SYNN+  +
Subjt:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA

Query:  TIGMAPFEALYG-KRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSL-EFEVGDPVFLKVAPMKGVLRYGHK-GKLSPK
           M PFE ++      SPL      +K     E  + T +  Q ++  + T   + K Y D++ + + EF+ GD V +K    +    + HK  KL+P 
Subjt:  TIGMAPFEALYG-KRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSL-EFEVGDPVFLKVAPMKGVLRYGHK-GKLSPK

Query:  FIGPFEILERVGPVAYKLALPPALSGV-HDVFHVSMLRKY
        F GPF +L++ GP  Y+L LP ++  +    FHVS L KY
Subjt:  FIGPFEILERVGPVAYKLALPPALSGV-HDVFHVSMLRKY

P0CT36 Transposon Tf2-3 polyprotein3.5e-4732.65Show/hide
Query:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY
        +HPG   +   + + F WK +++ +  +V  C  CQ  K+   K  G LQP+   E  WE+++MDFI  LP++  GY  ++VVVDR +K A  +P   + 
Subjt:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY

Query:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA
        T +  A+++ + ++   G P  I++D D  FTS  W+         + FS  + PQTDGQTE  NQ +E +LR        +W   + L++ SYNN+  +
Subjt:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA

Query:  TIGMAPFEALYG-KRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSL-EFEVGDPVFLKVAPMKGVLRYGHK-GKLSPK
           M PFE ++      SPL      +K     E  + T +  Q ++  + T   + K Y D++ + + EF+ GD V +K    +    + HK  KL+P 
Subjt:  TIGMAPFEALYG-KRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSL-EFEVGDPVFLKVAPMKGVLRYGHK-GKLSPK

Query:  FIGPFEILERVGPVAYKLALPPALSGV-HDVFHVSMLRKY
        F GPF +L++ GP  Y+L LP ++  +    FHVS L KY
Subjt:  FIGPFEILERVGPVAYKLALPPALSGV-HDVFHVSMLRKY

P0CT41 Transposon Tf2-12 polyprotein3.5e-4732.65Show/hide
Query:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY
        +HPG   +   + + F WK +++ +  +V  C  CQ  K+   K  G LQP+   E  WE+++MDFI  LP++  GY  ++VVVDR +K A  +P   + 
Subjt:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY

Query:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA
        T +  A+++ + ++   G P  I++D D  FTS  W+         + FS  + PQTDGQTE  NQ +E +LR        +W   + L++ SYNN+  +
Subjt:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA

Query:  TIGMAPFEALYG-KRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSL-EFEVGDPVFLKVAPMKGVLRYGHK-GKLSPK
           M PFE ++      SPL      +K     E  + T +  Q ++  + T   + K Y D++ + + EF+ GD V +K    +    + HK  KL+P 
Subjt:  TIGMAPFEALYG-KRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSL-EFEVGDPVFLKVAPMKGVLRYGHK-GKLSPK

Query:  FIGPFEILERVGPVAYKLALPPALSGV-HDVFHVSMLRKY
        F GPF +L++ GP  Y+L LP ++  +    FHVS L KY
Subjt:  FIGPFEILERVGPVAYKLALPPALSGV-HDVFHVSMLRKY

Q9UR07 Transposon Tf2-11 polyprotein3.5e-4732.65Show/hide
Query:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY
        +HPG   +   + + F WK +++ +  +V  C  CQ  K+   K  G LQP+   E  WE+++MDFI  LP++  GY  ++VVVDR +K A  +P   + 
Subjt:  MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTY

Query:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA
        T +  A+++ + ++   G P  I++D D  FTS  W+         + FS  + PQTDGQTE  NQ +E +LR        +W   + L++ SYNN+  +
Subjt:  TVDNWAQLYVKEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQA

Query:  TIGMAPFEALYG-KRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSL-EFEVGDPVFLKVAPMKGVLRYGHK-GKLSPK
           M PFE ++      SPL      +K     E  + T +  Q ++  + T   + K Y D++ + + EF+ GD V +K    +    + HK  KL+P 
Subjt:  TIGMAPFEALYG-KRCRSPLCWDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSL-EFEVGDPVFLKVAPMKGVLRYGHK-GKLSPK

Query:  FIGPFEILERVGPVAYKLALPPALSGV-HDVFHVSMLRKY
        F GPF +L++ GP  Y+L LP ++  +    FHVS L KY
Subjt:  FIGPFEILERVGPVAYKLALPPALSGV-HDVFHVSMLRKY

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATCCAGGAGGTACTAAGATGTACCAAGATTTAAAACAACACTTTTGGTGGAAGAGCATGAAGAGGGATGTGGCCGGGTTTGTGAGCAAGTGCTTAGTTTGTCAACA
AGTGAAAGCTCCAAGACAAAAAACGGCGGGGTTGTTGCAGCCCCTAAGCATACCAGAGTGGAAGTGGGAAAACATAGCGATGGACTTCATAGTAGGTTTACCCAAAACGC
CCAAGGGCTATATAGTGATCTGGGTAGTTGTCGATAGGTTGACCAAGTCGGCACACTTTCTACCTGGGAAGGTTACATATACAGTTGACAATTGGGCACAACTGTACGTG
AAAGAAATAGTAAGACTACATGGAGTCCCAGTGTCTATAGTGTCGGATCGGGATCCACGCTTTACGTCAGCGTTTTGGCGCGGACTTCAAAAAGCACTGGGTACCCGCCT
CGACTTTAGTACCGCCTTTCACCCCCAAACAGATGGACAAACGGAGCATTTAAACCAAATTCTAGAGGACATGCTACGCGCTTGCGTACTAGATTTTAAGGAAAGTTGGG
ATTCCAAACTACACCTAATGGAATTCTCGTATAACAACAGTTTCCAAGCAACTATTGGAATGGCACCGTTTGAGGCCCTGTACGGGAAACGGTGTAGGTCCCCACTATGT
TGGGACGAGGTAGGAGAGAAAGAATTAGTAGGACCCGAGTTGGTTCGACTCACCAATGAGGCTGTCCAGAAAATTCGAGCGAGGATGCGTACCGCTCAAAGTAGACAGAA
AAGCTACGCCGATGTAAGGCGTAAAAGCCTGGAGTTTGAGGTGGGGGACCCAGTATTCCTCAAGGTGGCACCTATGAAAGGTGTGTTAAGATACGGACATAAGGGCAAGT
TAAGTCCTAAATTCATTGGACCATTCGAGATCCTAGAGCGGGTTGGTCCGGTAGCGTATAAGTTAGCCTTACCTCCAGCCCTCTCAGGAGTACATGACGTATTTCATGTG
TCGATGTTGAGGAAGTACATTACGGATCCTATCCACGTAATAGACTACGAACCACTCAAACTCAATGAAGATCTGAGCTACGAGGAAAAATCAGTAAGAATCTTAGCTAG
AGAAGTAAAAACCTTACGCAACAGGAGTATTGCGTTCATTAAGGAGCTTTTAGGCCGCAAAACTCCCAAAAAGGAAAGAAGGACGACGAACAAAGAGGAAAAGGGAAGGA
AGAGGACCGAAATCGGCAACGGAAACGCCGCGGGAAGGAGAGAGAAAACTCGCCAGAAAATCGGCCAAAGTCGACCTTCCACAGCCCGCGACCCGAACCACCACCCGAAC
CGGACTGCGACCCGCGCTTCGACCCGACACTACCGCCCGACGTGGCTTTCTCTGTTGCTGCCACGTGTCACACAGCGGCGCGGTAGTCCTCGGCTCGGCTCGGCGCAAAC
GGCTCAGGCTCCCTCCGGTGACGCTCCGGCGGCTCCTTCGGCGGCTCCAGCGGCTGTTCGGCTCGGCTCACCTGTTTTCAGCCCGGTTTCCACTGTTTCGACCCTCCCGA
ATCTGTTTTCGGCCCCAATTAAGGCCGTGTGCCACATCATTTTTCAGGTAAAGGCGAGGCGCACTTGTACAGAAGACGGCGGCATCGCGAGCAGAGACTGTGGCACGTGC
ATAGGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCATCCAGGAGGTACTAAGATGTACCAAGATTTAAAACAACACTTTTGGTGGAAGAGCATGAAGAGGGATGTGGCCGGGTTTGTGAGCAAGTGCTTAGTTTGTCAACA
AGTGAAAGCTCCAAGACAAAAAACGGCGGGGTTGTTGCAGCCCCTAAGCATACCAGAGTGGAAGTGGGAAAACATAGCGATGGACTTCATAGTAGGTTTACCCAAAACGC
CCAAGGGCTATATAGTGATCTGGGTAGTTGTCGATAGGTTGACCAAGTCGGCACACTTTCTACCTGGGAAGGTTACATATACAGTTGACAATTGGGCACAACTGTACGTG
AAAGAAATAGTAAGACTACATGGAGTCCCAGTGTCTATAGTGTCGGATCGGGATCCACGCTTTACGTCAGCGTTTTGGCGCGGACTTCAAAAAGCACTGGGTACCCGCCT
CGACTTTAGTACCGCCTTTCACCCCCAAACAGATGGACAAACGGAGCATTTAAACCAAATTCTAGAGGACATGCTACGCGCTTGCGTACTAGATTTTAAGGAAAGTTGGG
ATTCCAAACTACACCTAATGGAATTCTCGTATAACAACAGTTTCCAAGCAACTATTGGAATGGCACCGTTTGAGGCCCTGTACGGGAAACGGTGTAGGTCCCCACTATGT
TGGGACGAGGTAGGAGAGAAAGAATTAGTAGGACCCGAGTTGGTTCGACTCACCAATGAGGCTGTCCAGAAAATTCGAGCGAGGATGCGTACCGCTCAAAGTAGACAGAA
AAGCTACGCCGATGTAAGGCGTAAAAGCCTGGAGTTTGAGGTGGGGGACCCAGTATTCCTCAAGGTGGCACCTATGAAAGGTGTGTTAAGATACGGACATAAGGGCAAGT
TAAGTCCTAAATTCATTGGACCATTCGAGATCCTAGAGCGGGTTGGTCCGGTAGCGTATAAGTTAGCCTTACCTCCAGCCCTCTCAGGAGTACATGACGTATTTCATGTG
TCGATGTTGAGGAAGTACATTACGGATCCTATCCACGTAATAGACTACGAACCACTCAAACTCAATGAAGATCTGAGCTACGAGGAAAAATCAGTAAGAATCTTAGCTAG
AGAAGTAAAAACCTTACGCAACAGGAGTATTGCGTTCATTAAGGAGCTTTTAGGCCGCAAAACTCCCAAAAAGGAAAGAAGGACGACGAACAAAGAGGAAAAGGGAAGGA
AGAGGACCGAAATCGGCAACGGAAACGCCGCGGGAAGGAGAGAGAAAACTCGCCAGAAAATCGGCCAAAGTCGACCTTCCACAGCCCGCGACCCGAACCACCACCCGAAC
CGGACTGCGACCCGCGCTTCGACCCGACACTACCGCCCGACGTGGCTTTCTCTGTTGCTGCCACGTGTCACACAGCGGCGCGGTAGTCCTCGGCTCGGCTCGGCGCAAAC
GGCTCAGGCTCCCTCCGGTGACGCTCCGGCGGCTCCTTCGGCGGCTCCAGCGGCTGTTCGGCTCGGCTCACCTGTTTTCAGCCCGGTTTCCACTGTTTCGACCCTCCCGA
ATCTGTTTTCGGCCCCAATTAAGGCCGTGTGCCACATCATTTTTCAGGTAAAGGCGAGGCGCACTTGTACAGAAGACGGCGGCATCGCGAGCAGAGACTGTGGCACGTGC
ATAGGGTAG
Protein sequenceShow/hide protein sequence
MHPGGTKMYQDLKQHFWWKSMKRDVAGFVSKCLVCQQVKAPRQKTAGLLQPLSIPEWKWENIAMDFIVGLPKTPKGYIVIWVVVDRLTKSAHFLPGKVTYTVDNWAQLYV
KEIVRLHGVPVSIVSDRDPRFTSAFWRGLQKALGTRLDFSTAFHPQTDGQTEHLNQILEDMLRACVLDFKESWDSKLHLMEFSYNNSFQATIGMAPFEALYGKRCRSPLC
WDEVGEKELVGPELVRLTNEAVQKIRARMRTAQSRQKSYADVRRKSLEFEVGDPVFLKVAPMKGVLRYGHKGKLSPKFIGPFEILERVGPVAYKLALPPALSGVHDVFHV
SMLRKYITDPIHVIDYEPLKLNEDLSYEEKSVRILAREVKTLRNRSIAFIKELLGRKTPKKERRTTNKEEKGRKRTEIGNGNAAGRREKTRQKIGQSRPSTARDPNHHPN
RTATRASTRHYRPTWLSLLLPRVTQRRGSPRLGSAQTAQAPSGDAPAAPSAAPAAVRLGSPVFSPVSTVSTLPNLFSAPIKAVCHIIFQVKARRTCTEDGGIASRDCGTC
IG