; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0107801 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0107801
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr04:26401149..26402279
RNA-Seq ExpressionCmc04g0107801
SyntenyCmc04g0107801
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037369.1 pol protein [Cucumis melo var. makuwa]3.3e-18789.1Show/hide
Query:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW
        MSFGLT A AVFMDLMN +FKDFLD+FVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFS+CEFWL+KVT L HVVSSEGV VDPAKIEAVT+W
Subjt:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW

Query:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYT
        PRPSTVSEIRSFLGLAGYY+RFVEDFSRIA+PLTQLTRKGIPFVWSPACESSFQELK KLV+APVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAY 
Subjt:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYT

Query:  SRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK
        SRQLK HEQNYPT DLEL A VFALKIWRHYLY EKIQI+ DHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK
Subjt:  SRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK

Query:  QAPLLRDFDRAEIAVSIREVTSQLAQLSMQPTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGLTFDGCL
        Q PLL+DF+RAEIAVS+ EVT+QLAQLS+QPTLRQ+II AQLNDPYLVEKR +VE  QGE FSISSDDGL F+G L
Subjt:  QAPLLRDFDRAEIAVSIREVTSQLAQLSMQPTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGLTFDGCL

KAA0040547.1 pol protein [Cucumis melo var. makuwa]5.6e-18789.36Show/hide
Query:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW
        MSFGLTNA AVFMDLMN VFKDFLD+FVIVFIDDILIYSKTEAEHEEHLHQVLETLRAN LYAKFS+CEFWL+KVT L HVVSSEGV VDPAKIEAVT+W
Subjt:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW

Query:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYT
        PRPSTVSEI+SFLGLAGYY+RFVEDFSRIASPLTQLTRKG PFVWSPACESSFQELK KLV+APVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAY 
Subjt:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYT

Query:  SRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK
        SRQLK +EQNYPT DLELVA VFALKIWRHYLY EKIQI+ DHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSA LITK
Subjt:  SRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK

Query:  QAPLLRDFDRAEIAVSIREVTSQLAQLSMQPTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGLTFDGCL
        Q PLLRDF+RAEIAVS+ EVTSQLAQLS+QPTLRQ+II AQLNDPYLVEKR +VE  QGE FSISSDDGL F+G L
Subjt:  QAPLLRDFDRAEIAVSIREVTSQLAQLSMQPTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGLTFDGCL

KAA0050760.1 pol protein [Cucumis melo var. makuwa]7.3e-18788.83Show/hide
Query:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW
        MSFGLTNA AVFMDLMN VFKDFLD+FVIVFIDDILIYSKTEA+HEEHLHQVLETLRANKLYAKFS+CEFWL+KVT L HVVSSEGV VDPAKIEAVT+W
Subjt:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW

Query:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYT
        PRPSTVSEIRSFLGLAGYY+RFVEDFSRIASPLTQLTRKG PFVWSPACE SFQELK KLV+APVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAY 
Subjt:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYT

Query:  SRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK
        SRQLK HEQNYPT DLEL A VFALKIWRHYLY EKIQI+ DHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK
Subjt:  SRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK

Query:  QAPLLRDFDRAEIAVSIREVTSQLAQLSMQPTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGLTFDGCL
        Q PLLRDF+RAEIAVS+ EVT+QLAQL++QPTLRQ+II AQL+DPYL EKR +VE EQGE FSISSDDGL F+G L
Subjt:  QAPLLRDFDRAEIAVSIREVTSQLAQLSMQPTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGLTFDGCL

KAA0063793.1 pol protein [Cucumis melo var. makuwa]1.1e-18789.36Show/hide
Query:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW
        MSFGLTNA AVFMDLMN VFKDFLD+FVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFS+CEFWL+KVT L HVVSSEGV VDPAKIEAVT+W
Subjt:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW

Query:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYT
        PRPSTVSEIRSFLGLAGYY+RFVEDFSRIASPLTQLTRKG PFVWSPACE SFQELK KLV+APVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAY 
Subjt:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYT

Query:  SRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK
        SRQLK HEQNYPT DLEL A VFALKIWRHYLY EKIQI+ DHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK
Subjt:  SRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK

Query:  QAPLLRDFDRAEIAVSIREVTSQLAQLSMQPTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGLTFDGCL
        Q PLLRDF+RAEIAVS+ EVT+QLAQL++QPTLRQ+II AQLNDPYL EKR +VE EQGE FSISSDDGL F+G L
Subjt:  QAPLLRDFDRAEIAVSIREVTSQLAQLSMQPTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGLTFDGCL

KAA0066951.1 pol protein [Cucumis melo var. makuwa]1.1e-18789.63Show/hide
Query:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW
        MSFGLTNA AVFMDLMN VFKDFLD+FVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFS+CEFWL+KVT L HVVSSEGV VDPAKIEAVT+W
Subjt:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW

Query:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYT
        PRPSTVSEIRSFLGLAGYY+RFVEDFSRIASPLTQLTRKG PFVWSPACESSFQELK KLV+APVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAY 
Subjt:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYT

Query:  SRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK
        SRQLK HEQNYPT DLEL A VFALKIWRHYLY EKIQI+ DHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK
Subjt:  SRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK

Query:  QAPLLRDFDRAEIAVSIREVTSQLAQLSMQPTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGLTFDGCL
        Q PLLRDF+RAEIAVS+ EVT+QLAQLS+QPTLRQ+II AQLNDPYLVEKR +VE  QGE FSIS DDGL F+G L
Subjt:  QAPLLRDFDRAEIAVSIREVTSQLAQLSMQPTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGLTFDGCL

TrEMBL top hitse value%identityAlignment
A0A5A7T7M0 Reverse transcriptase1.6e-18789.1Show/hide
Query:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW
        MSFGLT A AVFMDLMN +FKDFLD+FVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFS+CEFWL+KVT L HVVSSEGV VDPAKIEAVT+W
Subjt:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW

Query:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYT
        PRPSTVSEIRSFLGLAGYY+RFVEDFSRIA+PLTQLTRKGIPFVWSPACESSFQELK KLV+APVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAY 
Subjt:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYT

Query:  SRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK
        SRQLK HEQNYPT DLEL A VFALKIWRHYLY EKIQI+ DHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK
Subjt:  SRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK

Query:  QAPLLRDFDRAEIAVSIREVTSQLAQLSMQPTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGLTFDGCL
        Q PLL+DF+RAEIAVS+ EVT+QLAQLS+QPTLRQ+II AQLNDPYLVEKR +VE  QGE FSISSDDGL F+G L
Subjt:  QAPLLRDFDRAEIAVSIREVTSQLAQLSMQPTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGLTFDGCL

A0A5A7TG62 Reverse transcriptase2.7e-18789.36Show/hide
Query:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW
        MSFGLTNA AVFMDLMN VFKDFLD+FVIVFIDDILIYSKTEAEHEEHLHQVLETLRAN LYAKFS+CEFWL+KVT L HVVSSEGV VDPAKIEAVT+W
Subjt:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW

Query:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYT
        PRPSTVSEI+SFLGLAGYY+RFVEDFSRIASPLTQLTRKG PFVWSPACESSFQELK KLV+APVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAY 
Subjt:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYT

Query:  SRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK
        SRQLK +EQNYPT DLELVA VFALKIWRHYLY EKIQI+ DHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSA LITK
Subjt:  SRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK

Query:  QAPLLRDFDRAEIAVSIREVTSQLAQLSMQPTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGLTFDGCL
        Q PLLRDF+RAEIAVS+ EVTSQLAQLS+QPTLRQ+II AQLNDPYLVEKR +VE  QGE FSISSDDGL F+G L
Subjt:  QAPLLRDFDRAEIAVSIREVTSQLAQLSMQPTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGLTFDGCL

A0A5A7V6R2 Reverse transcriptase5.4e-18889.36Show/hide
Query:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW
        MSFGLTNA AVFMDLMN VFKDFLD+FVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFS+CEFWL+KVT L HVVSSEGV VDPAKIEAVT+W
Subjt:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW

Query:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYT
        PRPSTVSEIRSFLGLAGYY+RFVEDFSRIASPLTQLTRKG PFVWSPACE SFQELK KLV+APVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAY 
Subjt:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYT

Query:  SRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK
        SRQLK HEQNYPT DLEL A VFALKIWRHYLY EKIQI+ DHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK
Subjt:  SRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK

Query:  QAPLLRDFDRAEIAVSIREVTSQLAQLSMQPTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGLTFDGCL
        Q PLLRDF+RAEIAVS+ EVT+QLAQL++QPTLRQ+II AQLNDPYL EKR +VE EQGE FSISSDDGL F+G L
Subjt:  QAPLLRDFDRAEIAVSIREVTSQLAQLSMQPTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGLTFDGCL

A0A5A7VBY3 Reverse transcriptase3.5e-18789.36Show/hide
Query:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW
        MSFGLTNA AVFMDLMN VFKDFLD+FVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFS+CEFWL+KVT L HVVSSEGV VDPAKIEAVT+W
Subjt:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW

Query:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYT
        PRPSTVSEIRSFLGLAGYY+RFVEDFSR ASPLTQLTRKG PFVWSPACESSFQELK KLV+APVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAY 
Subjt:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYT

Query:  SRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK
        SRQLK HEQNYPT DLEL A VFALKIWR YLY EKIQIF DHKSLKYFFTQKELNMRQRRWL+LVKDYDCEI+YHP KANVVADALSRKVAHSAALITK
Subjt:  SRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK

Query:  QAPLLRDFDRAEIAVSIREVTSQLAQLSMQPTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGLTFDGCL
        Q PLLRDF+RAEIAVS+ EVTSQLAQLS+QPTLRQ+IIVAQLNDPYLVEKR +VE   GE FSIS DDGL F+GCL
Subjt:  QAPLLRDFDRAEIAVSIREVTSQLAQLSMQPTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGLTFDGCL

A0A5A7VMR4 Reverse transcriptase5.4e-18889.63Show/hide
Query:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW
        MSFGLTNA AVFMDLMN VFKDFLD+FVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFS+CEFWL+KVT L HVVSSEGV VDPAKIEAVT+W
Subjt:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW

Query:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYT
        PRPSTVSEIRSFLGLAGYY+RFVEDFSRIASPLTQLTRKG PFVWSPACESSFQELK KLV+APVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAY 
Subjt:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYT

Query:  SRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK
        SRQLK HEQNYPT DLEL A VFALKIWRHYLY EKIQI+ DHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK
Subjt:  SRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITK

Query:  QAPLLRDFDRAEIAVSIREVTSQLAQLSMQPTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGLTFDGCL
        Q PLLRDF+RAEIAVS+ EVT+QLAQLS+QPTLRQ+II AQLNDPYLVEKR +VE  QGE FSIS DDGL F+G L
Subjt:  QAPLLRDFDRAEIAVSIREVTSQLAQLSMQPTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGLTFDGCL

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.4e-5539.31Show/hide
Query:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW
        M FGL NA A F   MN + +  L+   +V++DDI+++S +  EH + L  V E L    L  +  +CEF  ++ T L HV++ +G+  +P KIEA+  +
Subjt:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW

Query:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPF-VWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAY
        P P+   EI++FLGL GYY++F+ +F+ IA P+T+  +K +     +P  +S+F++LK+ +   P+L VPD +  F + +DAS   LG VL Q G  ++Y
Subjt:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPF-VWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAY

Query:  TSRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSR
         SR L  HE NY T + EL+A V+A K +RHYL     +I +DH+ L + +  K+ N +  RW   + ++D +I Y  GK N VADALSR
Subjt:  TSRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSR

P0CT41 Transposon Tf2-12 polyprotein1.2e-4330.61Show/hide
Query:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW
        M +G++ A A F   +N +  +  ++ V+ ++DDILI+SK+E+EH +H+  VL+ L+   L    ++CEF   +V  + + +S +G       I+ V  W
Subjt:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW

Query:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----
         +P    E+R FLG   Y ++F+   S++  PL  L +K + + W+P    + + +K  LVS PVL   D S   ++ +DAS   +G VL Q+       
Subjt:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----

Query:  VVAYTSRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYS--EKIQIFNDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKV
         V Y S ++   + NY   D E++A + +LK WRHYL S  E  +I  DH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR  
Subjt:  VVAYTSRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYS--EKIQIFNDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKV

Query:  AHSAALITKQAPLLRDFDRAEIAVSIREVTSQLAQLSMQPTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGL
             ++ +  P+ +D +   I        + + Q+S+    + +++    ND  L+   LL   ++    +I   DGL
Subjt:  AHSAALITKQAPLLRDFDRAEIAVSIREVTSQLAQLSMQPTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGL

P10401 Retrovirus-related Pol polyprotein from transposon gypsy6.7e-5035.64Show/hide
Query:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW
        + FGL NA ++F   ++ V ++ +     V++DD++I+S+ E++H  H+  VL+ L    +     +  F+ + V  L  +VS +G   DP K++A+  +
Subjt:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW

Query:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTR-----------KGIPFVWSPACESSFQELKHKLVSAPV-LTVPDGSGSFVIYSDASKKGLGC
        P P  V ++RSFLGLA YY+ F++DF+ IA P+T + +           K IP  ++    ++FQ L++ L S  V L  PD    F + +DAS  G+G 
Subjt:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTR-----------KGIPFVWSPACESSFQELKHKLVSAPV-LTVPDGSGSFVIYSDASKKGLGC

Query:  VLMQQGKVVAYTSRQLKCHEQNYPTDDLELVAEVFALKIWRHYLY-SEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADAL
        VL Q+G+ +   SR LK  EQNY T++ EL+A V+AL   +++LY S +I IF DH+ L +    +  N + +RW   +  ++ ++ Y PGK N VADAL
Subjt:  VLMQQGKVVAYTSRQLKCHEQNYPTDDLELVAEVFALKIWRHYLY-SEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADAL

Query:  SRK
        SR+
Subjt:  SRK

P20825 Retrovirus-related Pol polyprotein from transposon 2973.9e-5037.24Show/hide
Query:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW
        M FGL NA A F   MN + +  L+   +V++DDI+I+S +  EH   +  V   L    L  +  +CEF  K+   L H+V+ +G+  +P K++A+ S+
Subjt:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW

Query:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPF-VWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAY
        P P+   EIR+FLGL GYY++F+ +++ IA P+T   +K             +F++LK  ++  P+L +PD    FV+ +DAS   LG VL Q G  +++
Subjt:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPF-VWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAY

Query:  TSRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSR
         SR L  HE NY   + EL+A V+A K +RHYL   +  I +DH+ L++    KE   +  RW   + +Y  +I Y  GK N VADALSR
Subjt:  TSRQLKCHEQNYPTDDLELVAEVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus9.6e-4933.44Show/hide
Query:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW
        + FGL NA A+F  +++ + ++ +     V+IDDI+++S+    H ++L  VL +L    L     +  F   +V  L ++V+++G+  DP K+ A++  
Subjt:  MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSW

Query:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTR-----------KGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCV
        P P++V E++ FLG+  YY++F++D++++A PLT LTR             +P         SF +LK  L S+ +L  P  +  F + +DAS   +G V
Subjt:  PRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTR-----------KGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCV

Query:  LMQ----QGKVVAYTSRQLKCHEQNYPTDDLELVAEVFALKIWRHYLY-SEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVA
        L Q    + + +AY SR L   E+NY T + E++A +++L   R YLY +  I+++ DH+ L +    +  N + +RW   +++Y+CE++Y PGK+NVVA
Subjt:  LMQ----QGKVVAYTSRQLKCHEQNYPTDDLELVAEVFALKIWRHYLY-SEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVA

Query:  DALSR
        DALSR
Subjt:  DALSR

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein8.4e-2439.69Show/hide
Query:  HLHQVLETLRANKLYAKFSQCEFWLKKVTCL--NHVVSSEGVFVDPAKIEAVTSWPRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVW
        HL  VL+    ++ YA   +C F   ++  L   H++S EGV  DPAK+EA+  WP P   +E+R FLGL GYY+RFV+++ +I  PLT+L +K     W
Subjt:  HLHQVLETLRANKLYAKFSQCEFWLKKVTCL--NHVVSSEGVFVDPAKIEAVTSWPRPSTVSEIRSFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVW

Query:  SPACESSFQELKHKLVSAPVLTVPDGSGSFV
        +     +F+ LK  + + PVL +PD    FV
Subjt:  SPACESSFQELKHKLVSAPVLTVPDGSGSFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTTTGGTTTGACTAATGCTCTTGCGGTATTCATGGACTTGATGAACATAGTGTTTAAGGATTTCTTAGACACGTTTGTTATAGTTTTCATTGACGACATTTTGAT
TTACTCCAAGACTGAGGCTGAGCATGAGGAGCATTTGCATCAGGTTTTGGAGACTCTTCGAGCTAATAAGTTGTATGCCAAATTCTCCCAGTGTGAGTTCTGGCTGAAGA
AGGTGACTTGTCTCAACCATGTGGTTTCTAGTGAGGGAGTTTTTGTGGATCCAGCAAAGATCGAAGCGGTTACCAGTTGGCCTCGACCGTCTACAGTTAGCGAGATTCGT
AGTTTCCTGGGTTTGGCAGGTTACTACAAGAGGTTCGTGGAAGACTTCTCTCGTATAGCCAGTCCCTTGACTCAGTTGACCAGGAAAGGGATTCCTTTTGTTTGGAGCCC
AGCTTGTGAGAGTAGCTTCCAGGAGCTTAAGCATAAGCTTGTGTCTGCACCAGTCCTTACAGTGCCAGATGGATCTGGAAGTTTCGTGATCTACAGTGATGCCTCAAAAA
AAGGACTGGGTTGTGTTTTGATGCAGCAAGGTAAGGTAGTTGCTTATACCTCCCGTCAGTTGAAGTGTCATGAGCAGAACTACCCTACTGATGACCTAGAGTTGGTAGCA
GAGGTTTTTGCACTGAAGATATGGAGACACTACCTGTACAGTGAGAAGATACAGATTTTCAATGACCATAAGAGCCTAAAGTACTTCTTCACCCAGAAGGAGTTGAACAT
GAGACAGAGAAGATGGCTTGAGTTGGTGAAGGATTATGACTGCGAGATTTTGTACCATCCAGGTAAGGCAAATGTAGTAGCTGATGCGCTGAGTAGGAAGGTTGCACATT
CAGCAGCGCTTATCACCAAGCAAGCTCCCTTGCTCAGAGATTTTGATAGAGCCGAGATTGCAGTCTCTATAAGGGAAGTTACCTCACAGTTGGCTCAGTTGTCAATGCAG
CCGACCTTGAGACAGAGGATTATTGTTGCTCAGCTAAATGATCCTTATTTGGTCGAGAAGCGTCTATTAGTAGAGGCAGAGCAAGGTGAGGCTTTCTCCATATCCTCTGA
TGATGGACTTACATTTGATGGATGTTTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTTTGGTTTGACTAATGCTCTTGCGGTATTCATGGACTTGATGAACATAGTGTTTAAGGATTTCTTAGACACGTTTGTTATAGTTTTCATTGACGACATTTTGAT
TTACTCCAAGACTGAGGCTGAGCATGAGGAGCATTTGCATCAGGTTTTGGAGACTCTTCGAGCTAATAAGTTGTATGCCAAATTCTCCCAGTGTGAGTTCTGGCTGAAGA
AGGTGACTTGTCTCAACCATGTGGTTTCTAGTGAGGGAGTTTTTGTGGATCCAGCAAAGATCGAAGCGGTTACCAGTTGGCCTCGACCGTCTACAGTTAGCGAGATTCGT
AGTTTCCTGGGTTTGGCAGGTTACTACAAGAGGTTCGTGGAAGACTTCTCTCGTATAGCCAGTCCCTTGACTCAGTTGACCAGGAAAGGGATTCCTTTTGTTTGGAGCCC
AGCTTGTGAGAGTAGCTTCCAGGAGCTTAAGCATAAGCTTGTGTCTGCACCAGTCCTTACAGTGCCAGATGGATCTGGAAGTTTCGTGATCTACAGTGATGCCTCAAAAA
AAGGACTGGGTTGTGTTTTGATGCAGCAAGGTAAGGTAGTTGCTTATACCTCCCGTCAGTTGAAGTGTCATGAGCAGAACTACCCTACTGATGACCTAGAGTTGGTAGCA
GAGGTTTTTGCACTGAAGATATGGAGACACTACCTGTACAGTGAGAAGATACAGATTTTCAATGACCATAAGAGCCTAAAGTACTTCTTCACCCAGAAGGAGTTGAACAT
GAGACAGAGAAGATGGCTTGAGTTGGTGAAGGATTATGACTGCGAGATTTTGTACCATCCAGGTAAGGCAAATGTAGTAGCTGATGCGCTGAGTAGGAAGGTTGCACATT
CAGCAGCGCTTATCACCAAGCAAGCTCCCTTGCTCAGAGATTTTGATAGAGCCGAGATTGCAGTCTCTATAAGGGAAGTTACCTCACAGTTGGCTCAGTTGTCAATGCAG
CCGACCTTGAGACAGAGGATTATTGTTGCTCAGCTAAATGATCCTTATTTGGTCGAGAAGCGTCTATTAGTAGAGGCAGAGCAAGGTGAGGCTTTCTCCATATCCTCTGA
TGATGGACTTACATTTGATGGATGTTTGTGA
Protein sequenceShow/hide protein sequence
MSFGLTNALAVFMDLMNIVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSQCEFWLKKVTCLNHVVSSEGVFVDPAKIEAVTSWPRPSTVSEIR
SFLGLAGYYKRFVEDFSRIASPLTQLTRKGIPFVWSPACESSFQELKHKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYTSRQLKCHEQNYPTDDLELVA
EVFALKIWRHYLYSEKIQIFNDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDFDRAEIAVSIREVTSQLAQLSMQ
PTLRQRIIVAQLNDPYLVEKRLLVEAEQGEAFSISSDDGLTFDGCL