CuGenDBv2

Pay0005840 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0005840
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Reverse transcriptase Ty1/copia-type domain-containing protein
Genome location: chr01:25189710..25193021
RNA-Seq Expression: Pay0005840
Synteny: Pay0005840
Gene Ontology terms: NA
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology
GenBank top hits | e value | %identity | Alignment
KAA0051794.1 uncharacterized protein E6C27_scaffold60G002030 [Cucumis melo var. makuwa] | 1.5e-53 | 36.69
Query:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAW-----VIISDS---ESEDD-------VETAPVSSDEYGVG-FVDVLKQKA
        MDVKS FLNGY++EEVY+AQPKGF+D   P +VYKL+KALYGLKQAPRAW     + + +    + E D         T  +SS E G   +   ++   
Subjt:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAW-----VIISDS---ESEDD-------VETAPVSSDEYGVG-FVDVLKQKA

Query:  ISVKRFSPSGKFPSSL-QVP------RSSDEPIDDLFDEFQHVSIPHGEESSYAPPTVDSFSGSTLSQTTAAQEPPFFTPASSHPTDVP------DVTTH
          +   S S   PS+   VP      RS + P D++ D+ + V+     +      TVD           A QE P   P  S P          ++T  
Subjt:  ISVKRFSPSGKFPSSL-QVP------RSSDEPIDDLFDEFQHVSIPHGEESSYAPPTVDSFSGSTLSQTTAAQEPPFFTPASSHPTDVP------DVTTH

Query:  AP--------SSVPIDEISFHSKDYVHKWNFVVKLRIADEAVISNQHR------------------SSVGHFYPKFIREFIVNLPADLNDHGSPEYQKVH
        A          SV ID ISF+ ++ V  W  VV+ RIAD+  IS++H                   S+VG FYP+ ++EFIVN PA  ND  SP+YQ VH
Subjt:  AP--------SSVPIDEISFHSKDYVHKWNFVVKLRIADEAVISNQHR------------------SSVGHFYPKFIREFIVNLPADLNDHGSPEYQKVH

Query:  VRGKCFEISPTLINRFLKHSLPSDYSVTLPLPEQLTLQLSSGSVRQWPCDGYFPIHPTTHLSIVSTTLVNSLFLIGTGSKIDVGDFIFKHVLRHVDTFGV
        +RG  F ISPT+IN  L  S P +    + L  +  + L    +  W         P++H S VS TL   L+ I    K+D G FI+  +LRHV +FGV
Subjt:  VRGKCFEISPTLINRFLKHSLPSDYSVTLPLPEQLTLQLSSGSVRQWPCDGYFPIHPTTHLSIVSTTLVNSLFLIGTGSKIDVGDFIFKHVLRHVDTFGV

Query:  NVPIFFPQLLCAFLLSQHPKVLTYEDIQGPTLRTISLSYKLFQGSHL
         +PI  P+   + LL  +  VLT  D   P  +T++LSY+LFQGS++
Subjt:  NVPIFFPQLLCAFLLSQHPKVLTYEDIQGPTLRTISLSYKLFQGSHL

KAA0063048.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 3.9e-14 | 80
Query:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAW
        MDVKSAFLNGY+S EVY+ QPKGF+D VH +HVYKL KALYGLKQA RAW
Subjt:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAW

KAA0063048.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 6.8e-51 | 32.22
Query:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAWV-----------------------IISDSESEDDVETAPVSSDEYGVGFV
        MDVKSAFLNGY+SEEVY+AQPKGF+DPVH +HVYKL KALYGLK+APR W                        I   ++ + D  T+ + S ++ + ++
Subjt:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAWV-----------------------IISDSESEDDVETAPVSSDEYGVGFV

Query:  DVLKQKAISVKRFSPSG----------------KFPSSLQVPRSSDEPIDDLFDEFQHVSIPHGEESSYAP-----PTVDSFSGSTL----------SQT
                S  R S SG                K+PS +    SS+  +     E   ++ P    SS        PT   +    +          S  
Subjt:  DVLKQKAISVKRFSPSG----------------KFPSSLQVPRSSDEPIDDLFDEFQHVSIPHGEESSYAP-----PTVDSFSGSTL----------SQT

Query:  TAAQEPPFFTPASSHPTDVPDVTTHAPSSVPIDEISFHSKDYV------HKWNFVVK--------------LRIADEAVISNQHRS----------SVGH
        +       F P SS   + P V+     +V +D  S  S+D V      H+   V                + + +    S+ H S            G 
Subjt:  TAAQEPPFFTPASSHPTDVPDVTTHAPSSVPIDEISFHSKDYV------HKWNFVVK--------------LRIADEAVISNQHRS----------SVGH

Query:  F-YPKFIREFIVN--------------LPADLNDHGSPEYQKVHVRGKCFEISPTLINRFLKHSLPSDYSVTLPLPEQLTLQLSSGSVRQWPCDGYFPIH
        F  P  + E + +               P  L+D  + EYQKVH+RG CF +SP L+N +L  SLP+DY+V  P PE+L  +L+ G+V  WP DG  P+ 
Subjt:  F-YPKFIREFIVN--------------LPADLNDHGSPEYQKVHVRGKCFEISPTLINRFLKHSLPSDYSVTLPLPEQLTLQLSSGSVRQWPCDGYFPIH

Query:  ------------------PTTHLSIVSTTLVNSLFLIGTGSKIDVGDFIFKHVLRHVDTFGVNVPIFFPQLLCAFLLSQHPKVLTYEDIQGPTLRTISLS
                          P+ + SI+ST L + ++L+GTG K++VG+FIF H+L+HV+TF +++PI FP +L  FLL+Q   +LT  D  G + R I LS
Subjt:  ------------------PTTHLSIVSTTLVNSLFLIGTGSKIDVGDFIFKHVLRHVDTFGVNVPIFFPQLLCAFLLSQHPKVLTYEDIQGPTLRTISLS

Query:  YKLFQGSHL
          LFQG+H+
Subjt:  YKLFQGSHL

TYK16303.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 5.6e-53 | 38.82
Query:  SPSGKFPSSLQVPRSSDEPIDDLFDEFQHVSIPHGEESSYAPPTVDSFSGSTLSQTTAAQEPPFFTPASSH--PTDVPDVTTHAPSSVPIDEISFHSKDY
        SP    PS  +    +D+  +D     +  ++P  E +S +     +F     S   + +EP   T A     P +VP         VPID +SFHS++ 
Subjt:  SPSGKFPSSLQVPRSSDEPIDDLFDEFQHVSIPHGEESSYAPPTVDSFSGSTLSQTTAAQEPPFFTPASSH--PTDVPDVTTHAPSSVPIDEISFHSKDY

Query:  VHKWNFVVKLRIADEAVISNQHR------------------SSVGHFYPKFIREFIVNLPADLNDHGSPEYQKVHVRGKCFEISPTLINRFLKHSLPSDY
         HKWN+VVK RIADEA I +Q+                   S VG FYP+ +RE IVNLP+D ND  + EYQKVH+RG  F +SP L+N +L  SLP+DY
Subjt:  VHKWNFVVKLRIADEAVISNQHR------------------SSVGHFYPKFIREFIVNLPADLNDHGSPEYQKVHVRGKCFEISPTLINRFLKHSLPSDY

Query:  SVTLPLPEQLTLQLSSGSVRQWPCDG----------YFPIH--------PTTHLSIVSTTLVNSLFLIGTGSKIDVGDFIFKHVLRHVDTFGVNVPIFFP
        +V+ P PE+L  +L+ G++  WP DG          Y  +H        P+TH   +ST+L + ++L+GTG K++  +FIF H+LRHVDTF +++PI FP
Subjt:  SVTLPLPEQLTLQLSSGSVRQWPCDG----------YFPIH--------PTTHLSIVSTTLVNSLFLIGTGSKIDVGDFIFKHVLRHVDTFGVNVPIFFP

Query:  QLLCAFLLSQHPKVLTYEDIQGPTLRTISLSYKLFQGSHL
        ++L  FLL+Q    LT  D  G   R I L   LFQGS++
Subjt:  QLLCAFLLSQHPKVLTYEDIQGPTLRTISLSYKLFQGSHL

TYK16303.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 3.9e-14 | 80
Query:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAW
        MDVKSAFLNGY+S EVY+ QPKGF+D VH +HVYKL KALYGLKQA RAW
Subjt:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAW

TYK16303.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 2.8e-52 | 38.53
Query:  SPSGKFPSSLQVPRSSDEPIDDLFDEFQHVSIPHGEESSYAPPTVDSFSGSTLSQTTAAQEPPFFTPASSH--PTDVPDVTTHAPSSVPIDEISFHSKDY
        SP    PS  +    +D+  +D     +  ++P  E +S +     +F     S   + +EP   T A     P +VP         VPID +SFHS++ 
Subjt:  SPSGKFPSSLQVPRSSDEPIDDLFDEFQHVSIPHGEESSYAPPTVDSFSGSTLSQTTAAQEPPFFTPASSH--PTDVPDVTTHAPSSVPIDEISFHSKDY

Query:  VHKWNFVVKLRIADEAVISNQHR------------------SSVGHFYPKFIREFIVNLPADLNDHGSPEYQKVHVRGKCFEISPTLINRFLKHSLPSDY
         HKWN+VVK RIADEA I +Q+                   S VG FYP+ +RE IVNLP+D ND  + EYQKVH+RG  F +SP L+N +L  SLP+DY
Subjt:  VHKWNFVVKLRIADEAVISNQHR------------------SSVGHFYPKFIREFIVNLPADLNDHGSPEYQKVHVRGKCFEISPTLINRFLKHSLPSDY

Query:  SVTLPLPEQLTLQLSSGSVRQWPCDG----------YFPIH--------PTTHLSIVSTTLVNSLFLIGTGSKIDVGDFIFKHVLRHVDTFGVNVPIFFP
        +V+ P PE+L  +L+ G++  WP DG          Y  +H        P+TH   + T+L + ++L+GTG K++  +FIF H+LRHVDTF +++PI FP
Subjt:  SVTLPLPEQLTLQLSSGSVRQWPCDG----------YFPIH--------PTTHLSIVSTTLVNSLFLIGTGSKIDVGDFIFKHVLRHVDTFGVNVPIFFP

Query:  QLLCAFLLSQHPKVLTYEDIQGPTLRTISLSYKLFQGSHL
        ++L  FLL+Q    LT  D  G   R I L   LFQGS++
Subjt:  QLLCAFLLSQHPKVLTYEDIQGPTLRTISLSYKLFQGSHL

XP_008458113.1 PREDICTED: uncharacterized protein LOC103497643 [Cucumis melo] | 2.9e-243 | 93.51
Query:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAWVIISDSESEDDVETAPVSSDEYGVGFVDVLKQKAISVKRFSPSGKFPSSL
        MDVKSAFLNGY+SEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAWVIISDSESEDDVETAPVS DEYGV  VDVLKQKAISVKRFSPS KFPSSL
Subjt:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAWVIISDSESEDDVETAPVSSDEYGVGFVDVLKQKAISVKRFSPSGKFPSSL

Query:  QVPRSSDEPIDDLFDEFQHVSIPHGEESSYAPPTVDSFSGSTLSQTTAAQEPPFFTPASSHPTDVPDVTTHAPSSVPIDEISFHSKDYVHKWNFVVKLRI
        QVPRSSDEPIDDLFDEFQHVSIPHGEESSYAPPTVDSFSGSTLSQTTAAQEPPFFTPASSHPTDVPDVTTHAPSSVPIDEISFHSKD VHKWNFVVKLRI
Subjt:  QVPRSSDEPIDDLFDEFQHVSIPHGEESSYAPPTVDSFSGSTLSQTTAAQEPPFFTPASSHPTDVPDVTTHAPSSVPIDEISFHSKDYVHKWNFVVKLRI

Query:  ADEAVISNQHRSSVGHFYPKFIREFIVNLPADLNDHGSPEYQKVHVRGKCFEISPTLINRFLKHSLPSDYSVTLPLPEQLTLQLSSGSVRQWPCDGYFPI
        ADEAVISNQHRSSVGHFYPKFIREFIVNLPADLNDHGSPEYQKVHVRGKCFEISPTLINRFLKHSLP+DYSVTLPLPEQLTL+LSSGSVRQWPCDGYFPI
Subjt:  ADEAVISNQHRSSVGHFYPKFIREFIVNLPADLNDHGSPEYQKVHVRGKCFEISPTLINRFLKHSLPSDYSVTLPLPEQLTLQLSSGSVRQWPCDGYFPI

Query:  ------------------HPTTHLSIVSTTLVNSLFLIGTGSKIDVGDFIFKHVLRHVDTFGVNVPIFFPQLLCAFLLSQHPKVLTYEDIQGPTLRTISL
                          HPTTHLSIVSTTLVNSLFLIGTGSKIDVGDFIFKHVLRHVDTFGVNVP FFPQLLCAFLLSQHPKVLTYEDIQ PT RTISL
Subjt:  ------------------HPTTHLSIVSTTLVNSLFLIGTGSKIDVGDFIFKHVLRHVDTFGVNVPIFFPQLLCAFLLSQHPKVLTYEDIQGPTLRTISL

Query:  SYKLFQGSHLLSDELRSLSVAILDLDELIGGLTSRRVAVDIVIHVVQILFESFESSKSAPAS
        SYKLFQGSHLLSDELRSLSVAILDLDELIGGLTSRRVAVDIVIHVVQIL ESFESSKSAPAS
Subjt:  SYKLFQGSHLLSDELRSLSVAILDLDELIGGLTSRRVAVDIVIHVVQILFESFESSKSAPAS

TrEMBL top hits | e value | %identity | Alignment
A0A1S3C780 uncharacterized protein LOC103497643 | 1.4e-243 | 93.51
Query:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAWVIISDSESEDDVETAPVSSDEYGVGFVDVLKQKAISVKRFSPSGKFPSSL
        MDVKSAFLNGY+SEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAWVIISDSESEDDVETAPVS DEYGV  VDVLKQKAISVKRFSPS KFPSSL
Subjt:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAWVIISDSESEDDVETAPVSSDEYGVGFVDVLKQKAISVKRFSPSGKFPSSL

Query:  QVPRSSDEPIDDLFDEFQHVSIPHGEESSYAPPTVDSFSGSTLSQTTAAQEPPFFTPASSHPTDVPDVTTHAPSSVPIDEISFHSKDYVHKWNFVVKLRI
        QVPRSSDEPIDDLFDEFQHVSIPHGEESSYAPPTVDSFSGSTLSQTTAAQEPPFFTPASSHPTDVPDVTTHAPSSVPIDEISFHSKD VHKWNFVVKLRI
Subjt:  QVPRSSDEPIDDLFDEFQHVSIPHGEESSYAPPTVDSFSGSTLSQTTAAQEPPFFTPASSHPTDVPDVTTHAPSSVPIDEISFHSKDYVHKWNFVVKLRI

Query:  ADEAVISNQHRSSVGHFYPKFIREFIVNLPADLNDHGSPEYQKVHVRGKCFEISPTLINRFLKHSLPSDYSVTLPLPEQLTLQLSSGSVRQWPCDGYFPI
        ADEAVISNQHRSSVGHFYPKFIREFIVNLPADLNDHGSPEYQKVHVRGKCFEISPTLINRFLKHSLP+DYSVTLPLPEQLTL+LSSGSVRQWPCDGYFPI
Subjt:  ADEAVISNQHRSSVGHFYPKFIREFIVNLPADLNDHGSPEYQKVHVRGKCFEISPTLINRFLKHSLPSDYSVTLPLPEQLTLQLSSGSVRQWPCDGYFPI

Query:  ------------------HPTTHLSIVSTTLVNSLFLIGTGSKIDVGDFIFKHVLRHVDTFGVNVPIFFPQLLCAFLLSQHPKVLTYEDIQGPTLRTISL
                          HPTTHLSIVSTTLVNSLFLIGTGSKIDVGDFIFKHVLRHVDTFGVNVP FFPQLLCAFLLSQHPKVLTYEDIQ PT RTISL
Subjt:  ------------------HPTTHLSIVSTTLVNSLFLIGTGSKIDVGDFIFKHVLRHVDTFGVNVPIFFPQLLCAFLLSQHPKVLTYEDIQGPTLRTISL

Query:  SYKLFQGSHLLSDELRSLSVAILDLDELIGGLTSRRVAVDIVIHVVQILFESFESSKSAPAS
        SYKLFQGSHLLSDELRSLSVAILDLDELIGGLTSRRVAVDIVIHVVQIL ESFESSKSAPAS
Subjt:  SYKLFQGSHLLSDELRSLSVAILDLDELIGGLTSRRVAVDIVIHVVQILFESFESSKSAPAS

A0A5A7U8Y1 Reverse transcriptase Ty1/copia-type domain-containing protein | 7.1e-54 | 36.69
Query:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAW-----VIISDS---ESEDD-------VETAPVSSDEYGVG-FVDVLKQKA
        MDVKS FLNGY++EEVY+AQPKGF+D   P +VYKL+KALYGLKQAPRAW     + + +    + E D         T  +SS E G   +   ++   
Subjt:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAW-----VIISDS---ESEDD-------VETAPVSSDEYGVG-FVDVLKQKA

Query:  ISVKRFSPSGKFPSSL-QVP------RSSDEPIDDLFDEFQHVSIPHGEESSYAPPTVDSFSGSTLSQTTAAQEPPFFTPASSHPTDVP------DVTTH
          +   S S   PS+   VP      RS + P D++ D+ + V+     +      TVD           A QE P   P  S P          ++T  
Subjt:  ISVKRFSPSGKFPSSL-QVP------RSSDEPIDDLFDEFQHVSIPHGEESSYAPPTVDSFSGSTLSQTTAAQEPPFFTPASSHPTDVP------DVTTH

Query:  AP--------SSVPIDEISFHSKDYVHKWNFVVKLRIADEAVISNQHR------------------SSVGHFYPKFIREFIVNLPADLNDHGSPEYQKVH
        A          SV ID ISF+ ++ V  W  VV+ RIAD+  IS++H                   S+VG FYP+ ++EFIVN PA  ND  SP+YQ VH
Subjt:  AP--------SSVPIDEISFHSKDYVHKWNFVVKLRIADEAVISNQHR------------------SSVGHFYPKFIREFIVNLPADLNDHGSPEYQKVH

Query:  VRGKCFEISPTLINRFLKHSLPSDYSVTLPLPEQLTLQLSSGSVRQWPCDGYFPIHPTTHLSIVSTTLVNSLFLIGTGSKIDVGDFIFKHVLRHVDTFGV
        +RG  F ISPT+IN  L  S P +    + L  +  + L    +  W         P++H S VS TL   L+ I    K+D G FI+  +LRHV +FGV
Subjt:  VRGKCFEISPTLINRFLKHSLPSDYSVTLPLPEQLTLQLSSGSVRQWPCDGYFPIHPTTHLSIVSTTLVNSLFLIGTGSKIDVGDFIFKHVLRHVDTFGV

Query:  NVPIFFPQLLCAFLLSQHPKVLTYEDIQGPTLRTISLSYKLFQGSHL
         +PI  P+   + LL  +  VLT  D   P  +T++LSY+LFQGS++
Subjt:  NVPIFFPQLLCAFLLSQHPKVLTYEDIQGPTLRTISLSYKLFQGSHL

A0A5A7V603 Gag-pol polyprotein | 1.9e-14 | 80
Query:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAW
        MDVKSAFLNGY+S EVY+ QPKGF+D VH +HVYKL KALYGLKQA RAW
Subjt:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAW

A0A5A7V603 Gag-pol polyprotein | 3.3e-51 | 32.22
Query:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAWV-----------------------IISDSESEDDVETAPVSSDEYGVGFV
        MDVKSAFLNGY+SEEVY+AQPKGF+DPVH +HVYKL KALYGLK+APR W                        I   ++ + D  T+ + S ++ + ++
Subjt:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAWV-----------------------IISDSESEDDVETAPVSSDEYGVGFV

Query:  DVLKQKAISVKRFSPSG----------------KFPSSLQVPRSSDEPIDDLFDEFQHVSIPHGEESSYAP-----PTVDSFSGSTL----------SQT
                S  R S SG                K+PS +    SS+  +     E   ++ P    SS        PT   +    +          S  
Subjt:  DVLKQKAISVKRFSPSG----------------KFPSSLQVPRSSDEPIDDLFDEFQHVSIPHGEESSYAP-----PTVDSFSGSTL----------SQT

Query:  TAAQEPPFFTPASSHPTDVPDVTTHAPSSVPIDEISFHSKDYV------HKWNFVVK--------------LRIADEAVISNQHRS----------SVGH
        +       F P SS   + P V+     +V +D  S  S+D V      H+   V                + + +    S+ H S            G 
Subjt:  TAAQEPPFFTPASSHPTDVPDVTTHAPSSVPIDEISFHSKDYV------HKWNFVVK--------------LRIADEAVISNQHRS----------SVGH

Query:  F-YPKFIREFIVN--------------LPADLNDHGSPEYQKVHVRGKCFEISPTLINRFLKHSLPSDYSVTLPLPEQLTLQLSSGSVRQWPCDGYFPIH
        F  P  + E + +               P  L+D  + EYQKVH+RG CF +SP L+N +L  SLP+DY+V  P PE+L  +L+ G+V  WP DG  P+ 
Subjt:  F-YPKFIREFIVN--------------LPADLNDHGSPEYQKVHVRGKCFEISPTLINRFLKHSLPSDYSVTLPLPEQLTLQLSSGSVRQWPCDGYFPIH

Query:  ------------------PTTHLSIVSTTLVNSLFLIGTGSKIDVGDFIFKHVLRHVDTFGVNVPIFFPQLLCAFLLSQHPKVLTYEDIQGPTLRTISLS
                          P+ + SI+ST L + ++L+GTG K++VG+FIF H+L+HV+TF +++PI FP +L  FLL+Q   +LT  D  G + R I LS
Subjt:  ------------------PTTHLSIVSTTLVNSLFLIGTGSKIDVGDFIFKHVLRHVDTFGVNVPIFFPQLLCAFLLSQHPKVLTYEDIQGPTLRTISLS

Query:  YKLFQGSHL
          LFQG+H+
Subjt:  YKLFQGSHL

A0A5D3CWQ1 Gag-pol polyprotein | 2.7e-53 | 38.82
Query:  SPSGKFPSSLQVPRSSDEPIDDLFDEFQHVSIPHGEESSYAPPTVDSFSGSTLSQTTAAQEPPFFTPASSH--PTDVPDVTTHAPSSVPIDEISFHSKDY
        SP    PS  +    +D+  +D     +  ++P  E +S +     +F     S   + +EP   T A     P +VP         VPID +SFHS++ 
Subjt:  SPSGKFPSSLQVPRSSDEPIDDLFDEFQHVSIPHGEESSYAPPTVDSFSGSTLSQTTAAQEPPFFTPASSH--PTDVPDVTTHAPSSVPIDEISFHSKDY

Query:  VHKWNFVVKLRIADEAVISNQHR------------------SSVGHFYPKFIREFIVNLPADLNDHGSPEYQKVHVRGKCFEISPTLINRFLKHSLPSDY
         HKWN+VVK RIADEA I +Q+                   S VG FYP+ +RE IVNLP+D ND  + EYQKVH+RG  F +SP L+N +L  SLP+DY
Subjt:  VHKWNFVVKLRIADEAVISNQHR------------------SSVGHFYPKFIREFIVNLPADLNDHGSPEYQKVHVRGKCFEISPTLINRFLKHSLPSDY

Query:  SVTLPLPEQLTLQLSSGSVRQWPCDG----------YFPIH--------PTTHLSIVSTTLVNSLFLIGTGSKIDVGDFIFKHVLRHVDTFGVNVPIFFP
        +V+ P PE+L  +L+ G++  WP DG          Y  +H        P+TH   +ST+L + ++L+GTG K++  +FIF H+LRHVDTF +++PI FP
Subjt:  SVTLPLPEQLTLQLSSGSVRQWPCDG----------YFPIH--------PTTHLSIVSTTLVNSLFLIGTGSKIDVGDFIFKHVLRHVDTFGVNVPIFFP

Query:  QLLCAFLLSQHPKVLTYEDIQGPTLRTISLSYKLFQGSHL
        ++L  FLL+Q    LT  D  G   R I L   LFQGS++
Subjt:  QLLCAFLLSQHPKVLTYEDIQGPTLRTISLSYKLFQGSHL

A0A5D3CWQ1 Gag-pol polyprotein | 1.9e-14 | 80
Query:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAW
        MDVKSAFLNGY+S EVY+ QPKGF+D VH +HVYKL KALYGLKQA RAW
Subjt:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAW

A0A5D3CWQ1 Gag-pol polyprotein | 1.3e-52 | 38.53
Query:  SPSGKFPSSLQVPRSSDEPIDDLFDEFQHVSIPHGEESSYAPPTVDSFSGSTLSQTTAAQEPPFFTPASSH--PTDVPDVTTHAPSSVPIDEISFHSKDY
        SP    PS  +    +D+  +D     +  ++P  E +S +     +F     S   + +EP   T A     P +VP         VPID +SFHS++ 
Subjt:  SPSGKFPSSLQVPRSSDEPIDDLFDEFQHVSIPHGEESSYAPPTVDSFSGSTLSQTTAAQEPPFFTPASSH--PTDVPDVTTHAPSSVPIDEISFHSKDY

Query:  VHKWNFVVKLRIADEAVISNQHR------------------SSVGHFYPKFIREFIVNLPADLNDHGSPEYQKVHVRGKCFEISPTLINRFLKHSLPSDY
         HKWN+VVK RIADEA I +Q+                   S VG FYP+ +RE IVNLP+D ND  + EYQKVH+RG  F +SP L+N +L  SLP+DY
Subjt:  VHKWNFVVKLRIADEAVISNQHR------------------SSVGHFYPKFIREFIVNLPADLNDHGSPEYQKVHVRGKCFEISPTLINRFLKHSLPSDY

Query:  SVTLPLPEQLTLQLSSGSVRQWPCDG----------YFPIH--------PTTHLSIVSTTLVNSLFLIGTGSKIDVGDFIFKHVLRHVDTFGVNVPIFFP
        +V+ P PE+L  +L+ G++  WP DG          Y  +H        P+TH   + T+L + ++L+GTG K++  +FIF H+LRHVDTF +++PI FP
Subjt:  SVTLPLPEQLTLQLSSGSVRQWPCDG----------YFPIH--------PTTHLSIVSTTLVNSLFLIGTGSKIDVGDFIFKHVLRHVDTFGVNVPIFFP

Query:  QLLCAFLLSQHPKVLTYEDIQGPTLRTISLSYKLFQGSHL
        ++L  FLL+Q    LT  D  G   R I L   LFQGS++
Subjt:  QLLCAFLLSQHPKVLTYEDIQGPTLRTISLSYKLFQGSHL

SwissProt top hits | e value | %identity | Alignment
P04146 Copia protein | 2.2e-07 | 54
Query:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAW
        MDVK+AFLNG + EE+Y+  P+G     + ++V KL+KA+YGLKQA R W
Subjt:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAW

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 3.9e-09 | 53.57
Query:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAWVIISDS
        +DVK+AFL+G + EE+Y+ QP+GF      + V KL+K+LYGLKQAPR W +  DS
Subjt:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAWVIISDS

P25600 Putative transposon Ty5-1 protein YCL074W | 2.6e-08 | 48
Query:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAW
        MDV +AFLN  + E +Y+ QP GF++  +P++V++L+  +YGLKQAP  W
Subjt:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.7e-12 | 61.54
Query:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAWVI
        +DV +AFL G ++++VY++QP GFID   PN+V KL KALYGLKQAPRAW +
Subjt:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAWVI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 1.4e-11 | 55.77
Query:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAWVI
        +DV +AFL G +++EVY++QP GF+D   P++V +L KA+YGLKQAPRAW +
Subjt:  MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAWVI

Arabidopsis top hits | e value | %identity | Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 1.7e-07 | 44.64
Query:  MDVKSAFLNGYISEEVYIAQPKGFI----DPVHPNHVYKLHKALYGLKQAPRAWVI
        +D+ +AFLNG + EE+Y+  P G+     D + PN V  L K++YGLKQA R W +
Subjt:  MDVKSAFLNGYISEEVYIAQPKGFI----DPVHPNHVYKLHKALYGLKQAPRAWVI
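
The %identity values listed for the hits can be cross-checked against the alignment text. The following is a minimal Python sketch, assuming the middle line of each alignment block follows the usual BLAST convention (a letter for an identical residue, `+` for a conservative substitution, a space for a mismatch or gap) and that identity is the number of identical columns divided by the total number of alignment columns. The function name `percent_identity` is illustrative and is not part of CuGenDB.

```python
def percent_identity(midline: str) -> float:
    """Return the share of alignment columns marked as identical, in percent.

    Assumption: letters in the midline mark identical residues, while '+'
    and ' ' mark conservative substitutions, mismatches, or gaps.
    """
    if not midline:
        raise ValueError("midline must not be empty")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Midline of the 50-column alignment reported for KAA0063048.1
# (label padding removed, internal spaces preserved).
midline = "MDVKSAFLNGY+S EVY+ QPKGF+D VH +HVYKL KALYGLKQA RAW"
print(percent_identity(midline))
```

Applied to this midline the sketch gives 80.0, which agrees with the value listed for that hit. For hits whose alignment spans several blocks, the midlines would need to be concatenated first, with the 8-character label padding removed and trailing spaces preserved.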


Sequences
CDS sequence
ATGGATGTAAAAAGTGCATTTCTAAATGGCTATATATCCGAGGAAGTATATATTGCACAACCCAAAGGCTTCATTGATCCCGTACATCCTAATCATGTGTACAAA
CTGCATAAGGCTCTCTATGGGCTCAAACAAGCTCCACGTGCATGGGTAATCATCTCCGATTCAGAATCCGAAGATGATGTTGAAACTGCACCTGTGTCTTCCGAT
GAATATGGTGTAGGTTTCGTCGATGTACTTAAACAAAAGGCCATTTCAGTCAAGAGATTTTCTCCCTCTGGAAAATTTCCCTCATCCTTACAAGTTCCTCGCTCA
TCTGATGAACCTATTGATGATTTGTTTGATGAGTTTCAACATGTCTCTATTCCCCATGGAGAAGAATCCTCATATGCACCTCCTACTGTTGATTCGTTTTCTGGC
TCTACACTTTCGCAAACCACTGCTGCACAGGAGCCTCCATTTTTCACACCTGCCTCATCTCATCCCACTGATGTTCCTGATGTGACCACTCATGCTCCTTCATCT
GTTCCCATTGATGAGATTTCTTTTCATTCGAAAGACTATGTACATAAGTGGAATTTTGTTGTCAAGCTGCGCATTGCTGACGAAGCTGTCATCTCTAATCAACAT
CGCTCCTCTGTTGGGCATTTTTATCCAAAGTTTATTCGAGAGTTTATAGTTAATTTGCCTGCTGATCTCAATGACCATGGCTCTCCTGAATATCAAAAGGTACAT
GTTCGTGGTAAGTGCTTTGAAATTTCTCCTACACTTATAAATCGATTCTTGAAGCATTCGTTACCTTCTGATTATTCTGTCACTCTACCACTTCCTGAGCAACTT
ACTTTGCAGCTTTCTAGTGGTTCAGTTCGTCAATGGCCATGTGATGGATATTTTCCTATACATCCTACTACTCATTTGTCTATAGTTTCTACTACTTTGGTTAAT
TCTTTGTTTTTAATTGGTACTGGATCTAAAATAGATGTTGGAGACTTTATCTTTAAGCATGTGTTAAGACATGTTGATACTTTTGGGGTCAATGTCCCCATTTTT
TTTCCTCAATTACTCTGTGCATTTTTATTGTCTCAACATCCAAAGGTACTCACTTATGAGGATATTCAAGGCCCTACTCTTCGCACCATTTCTCTCAGTTATAAA
TTGTTTCAGGGATCGCATCTTCTGTCTGATGAATTGCGGTCTCTTAGTGTGGCGATTTTGGATCTCGATGAATTGATTGGTGGTCTTACTAGCAGACGTGTGGCT
GTTGATATAGTTATTCATGTTGTTCAGATTCTGTTTGAATCTTTCGAGTCTTCTAAGTCTGCTCCCGCTTCCTAA
mRNA sequence
ATGGATGTAAAAAGTGCATTTCTAAATGGCTATATATCCGAGGAAGTATATATTGCACAACCCAAAGGCTTCATTGATCCCGTACATCCTAATCATGTGTACAAA
CTGCATAAGGCTCTCTATGGGCTCAAACAAGCTCCACGTGCATGGGTAATCATCTCCGATTCAGAATCCGAAGATGATGTTGAAACTGCACCTGTGTCTTCCGAT
GAATATGGTGTAGGTTTCGTCGATGTACTTAAACAAAAGGCCATTTCAGTCAAGAGATTTTCTCCCTCTGGAAAATTTCCCTCATCCTTACAAGTTCCTCGCTCA
TCTGATGAACCTATTGATGATTTGTTTGATGAGTTTCAACATGTCTCTATTCCCCATGGAGAAGAATCCTCATATGCACCTCCTACTGTTGATTCGTTTTCTGGC
TCTACACTTTCGCAAACCACTGCTGCACAGGAGCCTCCATTTTTCACACCTGCCTCATCTCATCCCACTGATGTTCCTGATGTGACCACTCATGCTCCTTCATCT
GTTCCCATTGATGAGATTTCTTTTCATTCGAAAGACTATGTACATAAGTGGAATTTTGTTGTCAAGCTGCGCATTGCTGACGAAGCTGTCATCTCTAATCAACAT
CGCTCCTCTGTTGGGCATTTTTATCCAAAGTTTATTCGAGAGTTTATAGTTAATTTGCCTGCTGATCTCAATGACCATGGCTCTCCTGAATATCAAAAGGTACAT
GTTCGTGGTAAGTGCTTTGAAATTTCTCCTACACTTATAAATCGATTCTTGAAGCATTCGTTACCTTCTGATTATTCTGTCACTCTACCACTTCCTGAGCAACTT
ACTTTGCAGCTTTCTAGTGGTTCAGTTCGTCAATGGCCATGTGATGGATATTTTCCTATACATCCTACTACTCATTTGTCTATAGTTTCTACTACTTTGGTTAAT
TCTTTGTTTTTAATTGGTACTGGATCTAAAATAGATGTTGGAGACTTTATCTTTAAGCATGTGTTAAGACATGTTGATACTTTTGGGGTCAATGTCCCCATTTTT
TTTCCTCAATTACTCTGTGCATTTTTATTGTCTCAACATCCAAAGGTACTCACTTATGAGGATATTCAAGGCCCTACTCTTCGCACCATTTCTCTCAGTTATAAA
TTGTTTCAGGGATCGCATCTTCTGTCTGATGAATTGCGGTCTCTTAGTGTGGCGATTTTGGATCTCGATGAATTGATTGGTGGTCTTACTAGCAGACGTGTGGCT
GTTGATATAGTTATTCATGTTGTTCAGATTCTGTTTGAATCTTTCGAGTCTTCTAAGTCTGCTCCCGCTTCCTAA
Protein sequence
MDVKSAFLNGYISEEVYIAQPKGFIDPVHPNHVYKLHKALYGLKQAPRAWVIISDSESEDDVETAPVSSDEYGVGFVDVLKQKAISVKRFSPSGKFPSSLQVPRS
SDEPIDDLFDEFQHVSIPHGEESSYAPPTVDSFSGSTLSQTTAAQEPPFFTPASSHPTDVPDVTTHAPSSVPIDEISFHSKDYVHKWNFVVKLRIADEAVISNQH
RSSVGHFYPKFIREFIVNLPADLNDHGSPEYQKVHVRGKCFEISPTLINRFLKHSLPSDYSVTLPLPEQLTLQLSSGSVRQWPCDGYFPIHPTTHLSIVSTTLVN
SLFLIGTGSKIDVGDFIFKHVLRHVDTFGVNVPIFFPQLLCAFLLSQHPKVLTYEDIQGPTLRTISLSYKLFQGSHLLSDELRSLSVAILDLDELIGGLTSRRVA
VDIVIHVVQILFESFESSKSAPAS
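
The correspondence between the CDS and the protein sequence can be spot-checked by translating codons. The following is a minimal Python sketch, assuming the standard genetic code (NCBI translation table 1); `translate` is an illustrative helper, and only the first seven and the last three codons of the CDS are used here.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGGATGTAAAAAGTGCATTT"))  # first seven codons of the CDS
print(translate("GCTTCCTAA"))              # last three codons of the CDS
```

The first call yields MDVKSAF and the second yields AS, consistent with the start and end of the listed protein sequence. Passing the full CDS with line breaks removed should reproduce the whole protein under the same assumption.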