; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0064541 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0064541
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionIntegrase
Genome locationCMiso1.1chr03:5480361..5481209
RNA-Seq ExpressionCmc03g0064541
SyntenyCmc03g0064541
Gene Ontology termsGO:0006397 - mRNA processing (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003729 - mRNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0060377.1 integrase [Cucumis melo var. makuwa]1.9e-11781.88Show/hide
Query:  STSDDEISPRRMRSIQQIYNATNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEK
        STS+DEISPRRMRSIQ+IYN TNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNK+ALGVK VYRTKLKSDGNVEK
Subjt:  STSDDEISPRRMRSIQQIYNATNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEK

Query:  YK---------------------------TIRLILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSR
        YK                           TIRLILSLA QNG KVYQ+ VKSAFL G+L EEIFVAQPLGYV+RGEEE VYKLKKALYGLKQAPRAWYSR
Subjt:  YK---------------------------TIRLILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSR

Query:  INSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEV
        I+SFFLKTGFRR PYEHALYVKEDKY KFLIVSLYVD+LLFT NDKFL DDFKNSMK +F MSDMGLIHYFLGIEV
Subjt:  INSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEV

TYJ95504.1 integrase [Cucumis melo var. makuwa]1.9e-11781.88Show/hide
Query:  STSDDEISPRRMRSIQQIYNATNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEK
        STS+DEISPRRMRSIQ+IYN TNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNK+ALGVK VYRTKLKSDGNVEK
Subjt:  STSDDEISPRRMRSIQQIYNATNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEK

Query:  YK---------------------------TIRLILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSR
        YK                           TIRLILSLA QNG KVYQ+ VKSAFL G+L EEIFVAQPLGYV+RGEEE VYKLKKALYGLKQAPRAWYSR
Subjt:  YK---------------------------TIRLILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSR

Query:  INSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEV
        I+SFFLKTGFRR PYEHALYVKEDKY KFLIVSLYVD+LLFT NDKFL DDFKNSMK +F MSDMGLIHYFLGIEV
Subjt:  INSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEV

TYK07359.1 integrase [Cucumis melo var. makuwa]4.9e-11882.25Show/hide
Query:  STSDDEISPRRMRSIQQIYNATNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEK
        STSDDEISPRRMRSIQ+IYN TNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNK+ALGVK VYRTKLKSDGNVEK
Subjt:  STSDDEISPRRMRSIQQIYNATNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEK

Query:  YK---------------------------TIRLILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSR
        YK                           TIRLILSLA QNG KVYQ+ VKSAFL G+L EEIFVAQPLGYV+RGEEE VYKLKKALYGLKQAPRAWYSR
Subjt:  YK---------------------------TIRLILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSR

Query:  INSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEV
        I+SFFLKTGFRR PYEHALYVKEDKY KFLIVSLYVD+LLFT NDKFL DDFKNSMK +F MSDMGLIHYFLGIEV
Subjt:  INSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEV

TYK08724.1 integrase [Cucumis melo var. makuwa]4.9e-11882.25Show/hide
Query:  STSDDEISPRRMRSIQQIYNATNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEK
        STS+DEISPRRMRSIQ+IYN TNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNK+ALGVK VYRTKLKSDGNVEK
Subjt:  STSDDEISPRRMRSIQQIYNATNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEK

Query:  YK---------------------------TIRLILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSR
        YK                           TIRLILSLA QNG KVYQ+ VKSAFL G+L EEIFVAQPLGYV+RGEEE VYKLKKALYGLKQAPRAWYSR
Subjt:  YK---------------------------TIRLILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSR

Query:  INSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEV
        I+SFFLKTGFRR PYEHALYVKEDKY KFLIVSLYVD+LLFT NDKFL DDFKNSMKK+F MSDMGLIHYFLGIEV
Subjt:  INSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEV

TYK30104.1 integrase [Cucumis melo var. makuwa]4.9e-11882.25Show/hide
Query:  STSDDEISPRRMRSIQQIYNATNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEK
        STS+DEISPRRMRSIQ+IYN TNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNK+ALGVK VYRTKLKSDGNVEK
Subjt:  STSDDEISPRRMRSIQQIYNATNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEK

Query:  YK---------------------------TIRLILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSR
        YK                           TIRLILSLA QNG KVYQ+ VKSAFL G+L EEIFVAQPLGYV+RGEEE VYKLKKALYGLKQAPRAWYSR
Subjt:  YK---------------------------TIRLILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSR

Query:  INSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEV
        I+SFFLKTGFRR PYEHALYVKEDKY KFLIVSLYVD+LLFT NDKFL DDFKNSMKK+F MSDMGLIHYFLGIEV
Subjt:  INSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEV

TrEMBL top hitse value%identityAlignment
A0A5D3BQ81 Integrase9.0e-11881.88Show/hide
Query:  STSDDEISPRRMRSIQQIYNATNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEK
        STS+DEISPRRMRSIQ+IYN TNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNK+ALGVK VYRTKLKSDGNVEK
Subjt:  STSDDEISPRRMRSIQQIYNATNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEK

Query:  YK---------------------------TIRLILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSR
        YK                           TIRLILSLA QNG KVYQ+ VKSAFL G+L EEIFVAQPLGYV+RGEEE VYKLKKALYGLKQAPRAWYSR
Subjt:  YK---------------------------TIRLILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSR

Query:  INSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEV
        I+SFFLKTGFRR PYEHALYVKEDKY KFLIVSLYVD+LLFT NDKFL DDFKNSMK +F MSDMGLIHYFLGIEV
Subjt:  INSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEV

A0A5D3CAM0 Integrase2.4e-11882.25Show/hide
Query:  STSDDEISPRRMRSIQQIYNATNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEK
        STSDDEISPRRMRSIQ+IYN TNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNK+ALGVK VYRTKLKSDGNVEK
Subjt:  STSDDEISPRRMRSIQQIYNATNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEK

Query:  YK---------------------------TIRLILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSR
        YK                           TIRLILSLA QNG KVYQ+ VKSAFL G+L EEIFVAQPLGYV+RGEEE VYKLKKALYGLKQAPRAWYSR
Subjt:  YK---------------------------TIRLILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSR

Query:  INSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEV
        I+SFFLKTGFRR PYEHALYVKEDKY KFLIVSLYVD+LLFT NDKFL DDFKNSMK +F MSDMGLIHYFLGIEV
Subjt:  INSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEV

A0A5D3CBW3 Integrase2.4e-11882.25Show/hide
Query:  STSDDEISPRRMRSIQQIYNATNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEK
        STS+DEISPRRMRSIQ+IYN TNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNK+ALGVK VYRTKLKSDGNVEK
Subjt:  STSDDEISPRRMRSIQQIYNATNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEK

Query:  YK---------------------------TIRLILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSR
        YK                           TIRLILSLA QNG KVYQ+ VKSAFL G+L EEIFVAQPLGYV+RGEEE VYKLKKALYGLKQAPRAWYSR
Subjt:  YK---------------------------TIRLILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSR

Query:  INSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEV
        I+SFFLKTGFRR PYEHALYVKEDKY KFLIVSLYVD+LLFT NDKFL DDFKNSMKK+F MSDMGLIHYFLGIEV
Subjt:  INSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEV

A0A5D3CLV1 Integrase9.0e-11881.88Show/hide
Query:  STSDDEISPRRMRSIQQIYNATNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEK
        STS+DEISPRRMRSIQ+IYN TNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNK+ALGVK VYRTKLKSDGNVEK
Subjt:  STSDDEISPRRMRSIQQIYNATNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEK

Query:  YK---------------------------TIRLILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSR
        YK                           TIRLILSLA QNG KVYQ+ VKSAFL G+L EEIFVAQPLGYV+RGEEE VYKLKKALYGLKQAPRAWYSR
Subjt:  YK---------------------------TIRLILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSR

Query:  INSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEV
        I+SFFLKTGFRR PYEHALYVKEDKY KFLIVSLYVD+LLFT NDKFL DDFKNSMK +F MSDMGLIHYFLGIEV
Subjt:  INSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEV

A0A5D3E2J1 Integrase2.4e-11882.25Show/hide
Query:  STSDDEISPRRMRSIQQIYNATNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEK
        STS+DEISPRRMRSIQ+IYN TNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNK+ALGVK VYRTKLKSDGNVEK
Subjt:  STSDDEISPRRMRSIQQIYNATNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEK

Query:  YK---------------------------TIRLILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSR
        YK                           TIRLILSLA QNG KVYQ+ VKSAFL G+L EEIFVAQPLGYV+RGEEE VYKLKKALYGLKQAPRAWYSR
Subjt:  YK---------------------------TIRLILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSR

Query:  INSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEV
        I+SFFLKTGFRR PYEHALYVKEDKY KFLIVSLYVD+LLFT NDKFL DDFKNSMKK+F MSDMGLIHYFLGIEV
Subjt:  INSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEV

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.1e-2331.12Show/hide
Query:  PVTFDE-AIQDEK--WKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEKYK---------------------------TIRLILSL
        P +FDE   +D+K  W+ A++ E++A + N TW + + P NK  +  + V+  K    GN  +YK                           + R ILSL
Subjt:  PVTFDE-AIQDEK--WKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEKYK---------------------------TIRLILSL

Query:  AGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSRINSFFLKTGFRRYPYEHALYV-KEDKYDKFLIVSLYV
          Q   KV+Q+ VK+AFL G L EEI++  P G     +   V KL KA+YGLKQA R W+        +  F     +  +Y+  +   ++ + V LYV
Subjt:  AGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSRINSFFLKTGFRRYPYEHALYV-KEDKYDKFLIVSLYV

Query:  DNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEVK
        D+++    D    ++FK  + +KFRM+D+  I +F+GI ++
Subjt:  DNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEVK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-2935.29Show/hide
Query:  AMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEKYK---------------------------TIRLILSLAGQNGCKVYQVHVKSAFL
        AM +E++++++N T++L+ELP  KR L  K V++ K   D  + +YK                           +IR ILSLA     +V Q+ VK+AFL
Subjt:  AMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEKYK---------------------------TIRLILSLAGQNGCKVYQVHVKSAFL

Query:  IGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSRINSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLFDDFKNS
         G L EEI++ QP G+   G++ +V KL K+LYGLKQAPR WY + +SF     + +   +  +Y K    + F+I+ LYVD++L    DK L    K  
Subjt:  IGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSRINSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLFDDFKNS

Query:  MKKKFRMSDMGLIHYFLGIEV
        + K F M D+G     LG+++
Subjt:  MKKKFRMSDMGLIHYFLGIEV

P25600 Putative transposon Ty5-1 protein YCL074W7.3e-1636.22Show/hide
Query:  VKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSRINSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLF
        V +AFL   + E I+V QP G+V     + V++L   +YGLKQAP  W   IN+   K GF R+  EH LY +    D  + +++YVD+LL       ++
Subjt:  VKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSRINSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKNDKFLF

Query:  DDFKNSMKKKFRMSDMGLIHYFLGIEV
        D  K  + K + M D+G +  FLG+ +
Subjt:  DDFKNSMKKKFRMSDMGLIHYFLGIEV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.2e-3232.65Show/hide
Query:  ALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELM-ELPTNKRALGVKLVYRTKLKSDGNVEKYK---------------------------TIR
        +L A  +P T  +A++DE+W+ AM  EI+A   N TW+L+   P++   +G + ++  K  SDG++ +YK                           +IR
Subjt:  ALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELM-ELPTNKRALGVKLVYRTKLKSDGNVEKYK---------------------------TIR

Query:  LILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSRINSFFLKTGFRRYPYEHALYVKEDKYDKFLIV
        ++L +A      + Q+ V +AFL G LT++++++QP G++ +     V KL+KALYGLKQAPRAWY  + ++ L  GF     + +L+V + +    + +
Subjt:  LILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSRINSFFLKTGFRRYPYEHALYVKEDKYDKFLIV

Query:  SLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEVK
         +YVD++L T ND  L  +  +++ ++F + D   +HYFLGIE K
Subjt:  SLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEVK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.1e-3131.84Show/hide
Query:  ALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELM-ELPTNKRALGVKLVYRTKLKSDGNVEKYK---------------------------TIR
        +L A  +P T  +A++D++W+ AM  EI+A   N TW+L+   P +   +G + ++  K  SDG++ +YK                           +IR
Subjt:  ALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELM-ELPTNKRALGVKLVYRTKLKSDGNVEKYK---------------------------TIR

Query:  LILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSRINSFFLKTGFRRYPYEHALYVKEDKYDKFLIV
        ++L +A      + Q+ V +AFL G LT+E++++QP G+V +   + V +L+KA+YGLKQAPRAWY  + ++ L  GF     + +L+V + +    + +
Subjt:  LILSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSRINSFFLKTGFRRYPYEHALYVKEDKYDKFLIV

Query:  SLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEVK
         +YVD++L T ND  L     +++ ++F + +   +HYFLGIE K
Subjt:  SLYVDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEVK

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 85.7e-3232.78Show/hide
Query:  DPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEKYK---------------------------TIRLILSLAG
        +P T++EA +   W  AMD EI A+    TWE+  LP NK+ +G K VY+ K  SDG +E+YK                           +++LIL+++ 
Subjt:  DPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEKYK---------------------------TIRLILSLAG

Query:  QNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEI----VYKLKKALYGLKQAPRAWYSRINSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLY
             ++Q+ + +AFL G L EEI++  P GY  R  + +    V  LKK++YGLKQA R W+ + +   +  GF +   +H  ++K      FL V +Y
Subjt:  QNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEI----VYKLKKALYGLKQAPRAWYSRINSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLY

Query:  VDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEV
        VD+++   N+    D+ K+ +K  F++ D+G + YFLG+E+
Subjt:  VDNLLFTKNDKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEV

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.3e-0936.25Show/hide
Query:  NRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEKYK
        N++N  +          +P +   A++D  W  AM +E+DA+ RN+TW L+  P N+  LG K V++TKL SDG +++ K
Subjt:  NRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEKYK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCATTTTCCACAAGTGATGATGAAATCTCACCAAGGAGAATGAGGAGCATTCAACAAATTTATAATGCAACTAACAGAATTAATGATGATCATTTTGCTAATTTTGC
ATTGTTTGCTGGTGTTGATCCTGTGACTTTTGATGAAGCCATCCAAGATGAGAAATGGAAGATTGCAATGGATCAAGAAATTGATGCGATAAGAAGAAATGAAACATGGG
AGTTGATGGAGCTTCCGACAAACAAACGAGCTCTTGGAGTAAAATTGGTGTACAGAACAAAGTTGAAGTCAGATGGTAATGTTGAAAAATACAAGACCATTCGATTGATT
TTGTCATTAGCTGGTCAAAATGGATGTAAAGTTTATCAAGTGCATGTAAAATCCGCTTTCTTGATTGGATACTTGACGGAAGAGATATTTGTTGCACAACCTTTGGGCTA
TGTGAAAAGGGGAGAAGAAGAAATAGTGTACAAGTTGAAAAAGGCCTTGTATGGATTAAAGCAAGCTCCGCGAGCTTGGTACAGTCGTATCAACAGTTTTTTTCTAAAGA
CAGGATTTCGAAGATATCCATATGAACATGCACTCTATGTCAAAGAAGACAAGTATGACAAATTTCTTATCGTTTCTCTTTACGTTGATAATTTACTTTTTACTAAAAAT
GATAAATTTTTGTTTGATGATTTTAAGAATTCCATGAAAAAGAAATTTAGGATGAGTGATATGGGTCTCATCCACTACTTTCTTGGAATTGAGGTTAAATCAAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCCATTTTCCACAAGTGATGATGAAATCTCACCAAGGAGAATGAGGAGCATTCAACAAATTTATAATGCAACTAACAGAATTAATGATGATCATTTTGCTAATTTTGC
ATTGTTTGCTGGTGTTGATCCTGTGACTTTTGATGAAGCCATCCAAGATGAGAAATGGAAGATTGCAATGGATCAAGAAATTGATGCGATAAGAAGAAATGAAACATGGG
AGTTGATGGAGCTTCCGACAAACAAACGAGCTCTTGGAGTAAAATTGGTGTACAGAACAAAGTTGAAGTCAGATGGTAATGTTGAAAAATACAAGACCATTCGATTGATT
TTGTCATTAGCTGGTCAAAATGGATGTAAAGTTTATCAAGTGCATGTAAAATCCGCTTTCTTGATTGGATACTTGACGGAAGAGATATTTGTTGCACAACCTTTGGGCTA
TGTGAAAAGGGGAGAAGAAGAAATAGTGTACAAGTTGAAAAAGGCCTTGTATGGATTAAAGCAAGCTCCGCGAGCTTGGTACAGTCGTATCAACAGTTTTTTTCTAAAGA
CAGGATTTCGAAGATATCCATATGAACATGCACTCTATGTCAAAGAAGACAAGTATGACAAATTTCTTATCGTTTCTCTTTACGTTGATAATTTACTTTTTACTAAAAAT
GATAAATTTTTGTTTGATGATTTTAAGAATTCCATGAAAAAGAAATTTAGGATGAGTGATATGGGTCTCATCCACTACTTTCTTGGAATTGAGGTTAAATCAAAATGA
Protein sequenceShow/hide protein sequence
MPFSTSDDEISPRRMRSIQQIYNATNRINDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKRALGVKLVYRTKLKSDGNVEKYKTIRLI
LSLAGQNGCKVYQVHVKSAFLIGYLTEEIFVAQPLGYVKRGEEEIVYKLKKALYGLKQAPRAWYSRINSFFLKTGFRRYPYEHALYVKEDKYDKFLIVSLYVDNLLFTKN
DKFLFDDFKNSMKKKFRMSDMGLIHYFLGIEVKSK