CuGenDBv2

Cmc04g0107971 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc04g0107971
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: CMiso1.1chr04:26779709..26781160
RNA-Seq Expression: Cmc04g0107971
Synteny: Cmc04g0107971
Gene Ontology terms:
  GO:0009231 - riboflavin biosynthetic process (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0071897 - DNA biosynthetic process (biological process)
  GO:0016020 - membrane (cellular component)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
  GO:0004491 - methylmalonate-semialdehyde dehydrogenase (acylating) activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
  GO:0008686 - 3,4-dihydroxy-2-butanone-4-phosphate synthase activity (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily
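
The genome location in this record can be checked for internal consistency by computing the length of the interval. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention); `parse_location` and `span_length` are hypothetical helper names and are not part of any CuGenDBv2 tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into its parts (hypothetical helper)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Interval length, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("CMiso1.1chr04:26779709..26781160")
length = span_length(start, end)
# Print the sequence ID, the span in bp, and the remainder modulo 3.
print(seqid, length, length % 3)
```

Under that assumption the span is 1452 bp, a multiple of three (484 codons), which would fit a coding sequence without introns.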


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KYP36562.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]; e value: 9.6e-200; % identity: 69.25
Query:  MQGFLTTRTTNPNERFIFMGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFK
        MQGF T +T +PNE+F+FMGNRVKVP EAVGTYRL L+TGHHLDL +T YVPS+SRNL+SLSKLD   Y F FGN CFSLFK+N  IG+GILCD LYK  
Subjt:  MQGFLTTRTTNPNERFIFMGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFK

Query:  LDNVFAESLLTLHHNVGTKRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDV
        LD ++ E+LLTLHHN+GTKR   NE SA+LWH  LGHIS+ER++RLI NEILPNLDFTDL ICVDCIKGKQTKHT  K ATRS+QLLEI+HTDICG FDV
Subjt:  LDNVFAESLLTLHHNVGTKRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDV

Query:  PSFGGEKYFISFIDDFSRYVYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVA
         SFG EKYFI+FIDD+SRY Y+YLLHEKSQA+DAL++++NEVERQLD+ VK++RS+RGGEYYG+Y+  G+ PGPFAK L+  GICAQYTM GTPQQNGV+
Subjt:  PSFGGEKYFISFIDDFSRYVYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVA

Query:  ERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGC
        ERRNRTLM+MVRSML NS+LP+ LWMYAL+TA YLLNRVPSK+V  TPFELWTGR P+LRHLHVWGCQ E+RIYNP EKKLD+RT SG+FIGY EKSKG 
Subjt:  ERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGC

Query:  RFYCPNHSTKIVETGNIRFIENDIISGSLKPRKVEI--------LSSITSSQVVVLVVYYVNNPQEQQINGQTPHNDVVTNEPVTEGPQEI
         FYCPNH+ +IVETGN RFIEN  +SGS  P+KVE+        L+  + ++V V +    NN +E+Q N +     ++ NEP+ E PQEI
Subjt:  RFYCPNHSTKIVETGNIRFIENDIISGSLKPRKVEI--------LSSITSSQVVVLVVYYVNNPQEQQINGQTPHNDVVTNEPVTEGPQEI

KYP65984.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]; e value: 1.9e-208; % identity: 71.89
Query:  MQGFLTTRTTNPNERFIFMGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFK
        MQGFLTTRTT PNE+F+FMGNRVKVP EAVGTYRL LDTGHHLDLF+T YVPSISRNL+SLSKLD + Y  KFGN CFSL+K    IGSGILCD LYK  
Subjt:  MQGFLTTRTTNPNERFIFMGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFK

Query:  LDNVFAESLLTLHHNVGTKRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDV
        LDN+FAE+LLTLHHN+GTKRG  NE  AYLWH+ LGH+SKER++RL+ NEILP+LDFTDL +CVDCIKGKQTKHT  K ATRS+QLLEIIHTDICG FDV
Subjt:  LDNVFAESLLTLHHNVGTKRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDV

Query:  PSFGGEKYFISFIDDFSRYVYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVA
         SF  EKYFI+FIDD+SRY Y+YLLH+KSQAI+AL+++I EVERQLD  VKI+RS+RGGEYYG+YD  G+ PGPFAKFLE  GICAQYTM GTPQQNGV+
Subjt:  PSFGGEKYFISFIDDFSRYVYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVA

Query:  ERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGC
        ER NRTL++MVRSML NSSL +SLW YAL++A YLLNRVPSK+VP TPFELWTGRKP+LRHLHVWGC  ++R YNP EKKLD+RT +G+FIGY EKSKG 
Subjt:  ERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGC

Query:  RFYCPNHSTKIVETGNIRFIENDIISGSLKPRKVEI--------LSSITSSQVVVLVVYYVNNPQEQQINGQTPHNDVVTNEPVTEGPQEI
        RFYCPNH+T+I+ETGN RFIEN  +SGS  PRKVEI        L+ I++  VV  V+  +NN +EQQ N    HN+V  NEP+ E PQ+I
Subjt:  RFYCPNHSTKIVETGNIRFIENDIISGSLKPRKVEI--------LSSITSSQVVVLVVYYVNNPQEQQINGQTPHNDVVTNEPVTEGPQEI

RZC09906.1 B2 protein isoform D [Glycine soja]; e value: 5.1e-193; % identity: 74.19
Query:  GFLTTRTTNPNERFIFMGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFKLD
        GFLT +T +PN++F+FMGNRVK P EAVGTYRL LDTGHHLDL +T YVPS+SRNL+SLSKLD + Y F FGN CFSLFK N  IG+G+LCD LYK KLD
Subjt:  GFLTTRTTNPNERFIFMGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFKLD

Query:  NVFAESLLTLHHNVGTKRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDVPS
         ++ E++LTLHHNVGTKR   NE SA+LWH+ LGHIS+ERI+RLI NEILP+LDFTDL ICVDCIKGKQTKHT  K ATRS+QLLEI+HTDICG FDV S
Subjt:  NVFAESLLTLHHNVGTKRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDVPS

Query:  FGGEKYFISFIDDFSRYVYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVAER
        FG E+YFI+FIDD+SRY Y+YLLHEKSQA++AL++++NEVERQLDR VKI+RS+R GEYY +YD  G+  GPFAK L+  GICAQYTM GT QQNGV+ER
Subjt:  FGGEKYFISFIDDFSRYVYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVAER

Query:  RNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGCRF
        RNRTLM+MVRSMLINS+LPVSLWMYAL+TA YLLNRVPSK++P TPFELWT R P++RHLHVWGCQ E+RIYNP E+KLDSRT SG+FIGY EKSKG  F
Subjt:  RNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGCRF

Query:  YCPNHSTKIVETGNIRFIENDIISGSLKPRKVEI
        YCPNHS +IVETGN RFIEN  ISGS  PR+V+I
Subjt:  YCPNHSTKIVETGNIRFIENDIISGSLKPRKVEI

RZC12927.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A [Glycine soja]; e value: 2.4e-203; % identity: 70.59
Query:  MQGFLTTRTTNPNERFIFMGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFK
        MQGFLT +T +PNE+F+FMGNRVK P EAVGTYRL LDTGHHLDL +T YVPS+SRNL+SLSKLD + Y F FGN CFSLFK N  IG+G+LCD LYK K
Subjt:  MQGFLTTRTTNPNERFIFMGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFK

Query:  LDNVFAESLLTLHHNVGTKRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDV
        LD ++ E++LTLHHNVGTKR   NE SA+LWH+ LGHIS+ERI+RLI NEILP+LDFTDL ICVDCIKGKQTKHT  K ATRS+QLLEI+HTDICG FDV
Subjt:  LDNVFAESLLTLHHNVGTKRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDV

Query:  PSFGGEKYFISFIDDFSRYVYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVA
         SFG E+YFI+FIDD+SRY Y+YLLHEKSQA++AL++++NEVERQLDR VKI+RS+RGGEYYG+YD  G+ PGPFAK L+  GICAQYTM GTPQQNGV+
Subjt:  PSFGGEKYFISFIDDFSRYVYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVA

Query:  ERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGC
        ERRNRTLM+MVRSMLINS+LPVSLWMYAL+TA YLLN VPSK+VP TPFELWT R P++RHLHVWGCQ E+RIYNP E+KLD+RT SG+FIGY EKSKG 
Subjt:  ERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGC

Query:  RFYCPNHSTKIVETGNIRFIENDIISGSLKPRKVEI--------LSSITSSQVVVLVVYYVNNPQEQQINGQTPHND--VVTNEPVTEGPQEI
         FYCPNHST+IVETGN RFIEN  ISGS  PR+VEI        L+  +SS+V+   V   N+ +E Q      HND  ++ NEP+ E PQE+
Subjt:  RFYCPNHSTKIVETGNIRFIENDIISGSLKPRKVEI--------LSSITSSQVVVLVVYYVNNPQEQQINGQTPHND--VVTNEPVTEGPQEI

RZC25410.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]; e value: 1.3e-201; % identity: 70.18
Query:  MQGFLTTRTTNPNERFIFMGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFK
        MQGFLT +T +PNE+F+FMGNRVK P EAVGTYRL LDTGHHLDL +T YVPS+SRNL+SLSKLD + Y F FGN CFSLFK N  IG+G+LCD LYK K
Subjt:  MQGFLTTRTTNPNERFIFMGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFK

Query:  LDNVFAESLLTLHHNVGTKRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDV
        LD ++ E++LTLHHNVGTKR   NE SA+LWH+ LGHIS ERI+RLI NEILP+LDFTDL ICVDCIKGKQTKHT  K ATRS+QLLEI+HTDICG FDV
Subjt:  LDNVFAESLLTLHHNVGTKRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDV

Query:  PSFGGEKYFISFIDDFSRYVYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVA
         SFG E+YFI+FIDD+SRY Y+YLLHEKSQA++AL++++NEVERQLDR VKI+RS+RGGEYY +YD  G+ P PFAK L+  GICAQYTM GTPQQNGV+
Subjt:  PSFGGEKYFISFIDDFSRYVYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVA

Query:  ERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGC
        ERRN+TLM+MVRSMLINS+LPVSLWMYAL+TA YLLNRVPSK+VP TPFELWT R P++RHLHVWGCQ E+RIYNP E+KLD+RT SG+FIGY EKSKG 
Subjt:  ERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGC

Query:  RFYCPNHSTKIVETGNIRFIENDIISGSLKPRKVEI--------LSSITSSQVVVLVVYYVNNPQEQQINGQTPHND--VVTNEPVTEGPQEI
         FYCPNHST+IVETGN RFIEN  ISGS  PR+VEI        L+  +SS+V+   V   N+ +E Q      HND  ++ NEP+ E PQE+
Subjt:  RFYCPNHSTKIVETGNIRFIENDIISGSLKPRKVEI--------LSSITSSQVVVLVVYYVNNPQEQQINGQTPHND--VVTNEPVTEGPQEI

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A151R237 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 4.7e-200; % identity: 69.25
Query:  MQGFLTTRTTNPNERFIFMGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFK
        MQGF T +T +PNE+F+FMGNRVKVP EAVGTYRL L+TGHHLDL +T YVPS+SRNL+SLSKLD   Y F FGN CFSLFK+N  IG+GILCD LYK  
Subjt:  MQGFLTTRTTNPNERFIFMGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFK

Query:  LDNVFAESLLTLHHNVGTKRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDV
        LD ++ E+LLTLHHN+GTKR   NE SA+LWH  LGHIS+ER++RLI NEILPNLDFTDL ICVDCIKGKQTKHT  K ATRS+QLLEI+HTDICG FDV
Subjt:  LDNVFAESLLTLHHNVGTKRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDV

Query:  PSFGGEKYFISFIDDFSRYVYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVA
         SFG EKYFI+FIDD+SRY Y+YLLHEKSQA+DAL++++NEVERQLD+ VK++RS+RGGEYYG+Y+  G+ PGPFAK L+  GICAQYTM GTPQQNGV+
Subjt:  PSFGGEKYFISFIDDFSRYVYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVA

Query:  ERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGC
        ERRNRTLM+MVRSML NS+LP+ LWMYAL+TA YLLNRVPSK+V  TPFELWTGR P+LRHLHVWGCQ E+RIYNP EKKLD+RT SG+FIGY EKSKG 
Subjt:  ERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGC

Query:  RFYCPNHSTKIVETGNIRFIENDIISGSLKPRKVEI--------LSSITSSQVVVLVVYYVNNPQEQQINGQTPHNDVVTNEPVTEGPQEI
         FYCPNH+ +IVETGN RFIEN  +SGS  P+KVE+        L+  + ++V V +    NN +E+Q N +     ++ NEP+ E PQEI
Subjt:  RFYCPNHSTKIVETGNIRFIENDIISGSLKPRKVEI--------LSSITSSQVVVLVVYYVNNPQEQQINGQTPHNDVVTNEPVTEGPQEI

A0A151TG02 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 9.4e-209; % identity: 71.89
Query:  MQGFLTTRTTNPNERFIFMGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFK
        MQGFLTTRTT PNE+F+FMGNRVKVP EAVGTYRL LDTGHHLDLF+T YVPSISRNL+SLSKLD + Y  KFGN CFSL+K    IGSGILCD LYK  
Subjt:  MQGFLTTRTTNPNERFIFMGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFK

Query:  LDNVFAESLLTLHHNVGTKRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDV
        LDN+FAE+LLTLHHN+GTKRG  NE  AYLWH+ LGH+SKER++RL+ NEILP+LDFTDL +CVDCIKGKQTKHT  K ATRS+QLLEIIHTDICG FDV
Subjt:  LDNVFAESLLTLHHNVGTKRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDV

Query:  PSFGGEKYFISFIDDFSRYVYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVA
         SF  EKYFI+FIDD+SRY Y+YLLH+KSQAI+AL+++I EVERQLD  VKI+RS+RGGEYYG+YD  G+ PGPFAKFLE  GICAQYTM GTPQQNGV+
Subjt:  PSFGGEKYFISFIDDFSRYVYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVA

Query:  ERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGC
        ER NRTL++MVRSML NSSL +SLW YAL++A YLLNRVPSK+VP TPFELWTGRKP+LRHLHVWGC  ++R YNP EKKLD+RT +G+FIGY EKSKG 
Subjt:  ERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGC

Query:  RFYCPNHSTKIVETGNIRFIENDIISGSLKPRKVEI--------LSSITSSQVVVLVVYYVNNPQEQQINGQTPHNDVVTNEPVTEGPQEI
        RFYCPNH+T+I+ETGN RFIEN  +SGS  PRKVEI        L+ I++  VV  V+  +NN +EQQ N    HN+V  NEP+ E PQ+I
Subjt:  RFYCPNHSTKIVETGNIRFIENDIISGSLKPRKVEI--------LSSITSSQVVVLVVYYVNNPQEQQINGQTPHNDVVTNEPVTEGPQEI

A0A445KGB1 B2 protein isoform D; e value: 2.5e-193; % identity: 74.19
Query:  GFLTTRTTNPNERFIFMGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFKLD
        GFLT +T +PN++F+FMGNRVK P EAVGTYRL LDTGHHLDL +T YVPS+SRNL+SLSKLD + Y F FGN CFSLFK N  IG+G+LCD LYK KLD
Subjt:  GFLTTRTTNPNERFIFMGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFKLD

Query:  NVFAESLLTLHHNVGTKRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDVPS
         ++ E++LTLHHNVGTKR   NE SA+LWH+ LGHIS+ERI+RLI NEILP+LDFTDL ICVDCIKGKQTKHT  K ATRS+QLLEI+HTDICG FDV S
Subjt:  NVFAESLLTLHHNVGTKRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDVPS

Query:  FGGEKYFISFIDDFSRYVYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVAER
        FG E+YFI+FIDD+SRY Y+YLLHEKSQA++AL++++NEVERQLDR VKI+RS+R GEYY +YD  G+  GPFAK L+  GICAQYTM GT QQNGV+ER
Subjt:  FGGEKYFISFIDDFSRYVYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVAER

Query:  RNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGCRF
        RNRTLM+MVRSMLINS+LPVSLWMYAL+TA YLLNRVPSK++P TPFELWT R P++RHLHVWGCQ E+RIYNP E+KLDSRT SG+FIGY EKSKG  F
Subjt:  RNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGCRF

Query:  YCPNHSTKIVETGNIRFIENDIISGSLKPRKVEI
        YCPNHS +IVETGN RFIEN  ISGS  PR+V+I
Subjt:  YCPNHSTKIVETGNIRFIENDIISGSLKPRKVEI

A0A445KPR8 Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A; e value: 1.2e-203; % identity: 70.59
Query:  MQGFLTTRTTNPNERFIFMGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFK
        MQGFLT +T +PNE+F+FMGNRVK P EAVGTYRL LDTGHHLDL +T YVPS+SRNL+SLSKLD + Y F FGN CFSLFK N  IG+G+LCD LYK K
Subjt:  MQGFLTTRTTNPNERFIFMGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFK

Query:  LDNVFAESLLTLHHNVGTKRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDV
        LD ++ E++LTLHHNVGTKR   NE SA+LWH+ LGHIS+ERI+RLI NEILP+LDFTDL ICVDCIKGKQTKHT  K ATRS+QLLEI+HTDICG FDV
Subjt:  LDNVFAESLLTLHHNVGTKRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDV

Query:  PSFGGEKYFISFIDDFSRYVYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVA
         SFG E+YFI+FIDD+SRY Y+YLLHEKSQA++AL++++NEVERQLDR VKI+RS+RGGEYYG+YD  G+ PGPFAK L+  GICAQYTM GTPQQNGV+
Subjt:  PSFGGEKYFISFIDDFSRYVYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVA

Query:  ERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGC
        ERRNRTLM+MVRSMLINS+LPVSLWMYAL+TA YLLN VPSK+VP TPFELWT R P++RHLHVWGCQ E+RIYNP E+KLD+RT SG+FIGY EKSKG 
Subjt:  ERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGC

Query:  RFYCPNHSTKIVETGNIRFIENDIISGSLKPRKVEI--------LSSITSSQVVVLVVYYVNNPQEQQINGQTPHND--VVTNEPVTEGPQEI
         FYCPNHST+IVETGN RFIEN  ISGS  PR+VEI        L+  +SS+V+   V   N+ +E Q      HND  ++ NEP+ E PQE+
Subjt:  RFYCPNHSTKIVETGNIRFIENDIISGSLKPRKVEI--------LSSITSSQVVVLVVYYVNNPQEQQINGQTPHND--VVTNEPVTEGPQEI

A0A445LQ30 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 6.5e-202; % identity: 70.18
Query:  MQGFLTTRTTNPNERFIFMGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFK
        MQGFLT +T +PNE+F+FMGNRVK P EAVGTYRL LDTGHHLDL +T YVPS+SRNL+SLSKLD + Y F FGN CFSLFK N  IG+G+LCD LYK K
Subjt:  MQGFLTTRTTNPNERFIFMGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFK

Query:  LDNVFAESLLTLHHNVGTKRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDV
        LD ++ E++LTLHHNVGTKR   NE SA+LWH+ LGHIS ERI+RLI NEILP+LDFTDL ICVDCIKGKQTKHT  K ATRS+QLLEI+HTDICG FDV
Subjt:  LDNVFAESLLTLHHNVGTKRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDV

Query:  PSFGGEKYFISFIDDFSRYVYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVA
         SFG E+YFI+FIDD+SRY Y+YLLHEKSQA++AL++++NEVERQLDR VKI+RS+RGGEYY +YD  G+ P PFAK L+  GICAQYTM GTPQQNGV+
Subjt:  PSFGGEKYFISFIDDFSRYVYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVA

Query:  ERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGC
        ERRN+TLM+MVRSMLINS+LPVSLWMYAL+TA YLLNRVPSK+VP TPFELWT R P++RHLHVWGCQ E+RIYNP E+KLD+RT SG+FIGY EKSKG 
Subjt:  ERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGC

Query:  RFYCPNHSTKIVETGNIRFIENDIISGSLKPRKVEI--------LSSITSSQVVVLVVYYVNNPQEQQINGQTPHND--VVTNEPVTEGPQEI
         FYCPNHST+IVETGN RFIEN  ISGS  PR+VEI        L+  +SS+V+   V   N+ +E Q      HND  ++ NEP+ E PQE+
Subjt:  RFYCPNHSTKIVETGNIRFIENDIISGSLKPRKVEI--------LSSITSSQVVVLVVYYVNNPQEQQINGQTPHND--VVTNEPVTEGPQEI

SwissProt top hits (columns: hit, e value, % identity, alignment)
P04146 Copia protein; e value: 4.0e-39; % identity: 29.6
Query:  GTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFI--GSGILCDDLYKFKLDNVFAESLLTLHHNVGTKRGQTNESSA
        G  RL  D  H + L D  +    + NL+S+ +L  +    +F     ++ K  + +   SG+    L    + N  A S+   H N           + 
Subjt:  GTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFI--GSGILCDDLYKFKLDNVFAESLLTLHHNVGTKRGQTNESSA

Query:  YLWHEHLGHISKERI-----KRLINNEILPNLDFTDLEICVDCIKGKQTKHTVN--KKATRSSQLLEIIHTDICGSFDVPSFGGEKYFISFIDDFSRYVY
         LWHE  GHIS  ++     K + +++ L N      EIC  C+ GKQ +      K  T   + L ++H+D+CG     +   + YF+ F+D F+ Y  
Subjt:  YLWHEHLGHISKERI-----KRLINNEILPNLDFTDLEICVDCIKGKQTKHTVN--KKATRSSQLLEIIHTDICGSFDVPSFGGEKYFISFIDDFSRYVY

Query:  IYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVAERRNRTLMNMVRSMLINSSLP
         YL+  KS      + F+ + E   +  V  L  + G EY               +F    GI    T+  TPQ NGV+ER  RT+    R+M+  + L 
Subjt:  IYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVAERRNRTLMNMVRSMLINSSLP

Query:  VSLWMYALRTAQYLLNRVPSKSV---PNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGY
         S W  A+ TA YL+NR+PS+++     TP+E+W  +KP L+HL V+G  V V I N  + K D ++    F+GY
Subjt:  VSLWMYALRTAQYLLNRVPSKSV---PNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 5.3e-60; % identity: 31.08
Query:  MGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFKLDNVFAESLLTLHHNVGT
        MGN        +G   +  + G  L L D  +VP +  NLIS   LD   Y   F N+ + L K ++ I  G+    LY+   +              G 
Subjt:  MGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFKLDNVFAESLLTLHHNVGT

Query:  KRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDVPSFGGEKYFISFIDDFSR
             +E S  LWH+ +GH+S++ ++ L    ++     T ++ C  C+ GKQ + +    + R   +L+++++D+CG  ++ S GG KYF++FIDD SR
Subjt:  KRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDVPSFGGEKYFISFIDDFSR

Query:  YVYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVAERRNRTLMNMVRSMLINS
         +++Y+L  K Q     + F   VER+  R +K LRS+ GGEY  +          F ++  SHGI  + T+ GTPQ NGVAER NRT++  VRSML  +
Subjt:  YVYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVAERRNRTLMNMVRSMLINS

Query:  SLPVSLWMYALRTAQYLLNRVPSKSVP-NTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGCRFYCPNHSTKIVETGNI
         LP S W  A++TA YL+NR PS  +    P  +WT ++ +  HL V+GC+    +      KLD ++    FIGY ++  G R + P    K++ + ++
Subjt:  SLPVSLWMYALRTAQYLLNRVPSKSVP-NTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGCRFYCPNHSTKIVETGNI

Query:  RFIENDIISGSLKPRKVEILSSITSSQVVVLVVYYVNNP-------QEQQINGQTPHNDVVTNEPVTEGPQEI
         F E+++ + +    KV+  + I  +   V +    NNP        E    G+ P   +   E + EG +E+
Subjt:  RFIENDIISGSLKPRKVEILSSITSSQVVVLVVYYVNNP-------QEQQINGQTPHNDVVTNEPVTEGPQEI

Q12501 Transposon Ty2-OR2 Gag-Pol polyprotein; e value: 7.8e-27; % identity: 25.24
Query:  VPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFKLDNVFAESLLTLHHNVGTKRGQTN
        +P  A+G        G    +    + P+I+ +L+SLS+L   +    F             +   +   D Y      +    +  L  N   K    N
Subjt:  VPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFKLDNVFAESLLTLHHNVGTKRGQTN

Query:  ESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLE-------ICVDCIKGKQTKHTVNK----KATRSSQLLEIIHTDICGSFDVPSFGGEKYFISFI
        +    L H  LGH +   I++ +    +  L  +D+E        C DC+ GK TKH   K    K   S +  + +HTDI G           YFISF 
Subjt:  ESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLE-------ICVDCIKGKQTKHTVNK----KATRSSQLLEIIHTDICGSFDVPSFGGEKYFISFI

Query:  DDFSRYVYIYLLHEKSQ--AIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVAERRNRTLMNMV
        D+ +R+ ++Y LH++ +   ++     +  ++ Q +  V +++ +RG EY  K            KF  + GI A YT     + +GVAER NRTL+N  
Subjt:  DDFSRYVYIYLLHEKSQ--AIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVAERRNRTLMNMV

Query:  RSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGCRFYCPNHSTKI
        R++L  S LP  LW  A+  +  + N + S     +  +       ++  +  +G  V V  +NP + K+  R   G+ +     S G   Y P+   K 
Subjt:  RSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGCRFYCPNHSTKI

Query:  VETGNIRFIEND
        V+T N   ++N+
Subjt:  VETGNIRFIEND

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value: 1.3e-34; % identity: 26.42
Query:  VPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKL-DTSSYYFKFGNECFSLFKQN--IFIGSGILCDDLYKFKLDNVFAESLLTLHHNVGTKRG
        +P    G+  L+  +   L+L +  YVP+I +NLIS+ +L + +    +F    F +   N  + +  G   D+LY++ + +    SL            
Subjt:  VPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKL-DTSSYYFKFGNECFSLFKQN--IFIGSGILCDDLYKFKLDNVFAESLLTLHHNVGTKRG

Query:  QTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEI-CVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDVPSFGGEKYFISFIDDFSRYV
         +++++   WH  LGH +   +  +I+N  L  L+ +   + C DC+  K  K   ++    S++ LE I++D+  S  + S    +Y++ F+D F+RY 
Subjt:  QTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEI-CVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDVPSFGGEKYFISFIDDFSRYV

Query:  YIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVAERRNRTLMNMVRSMLINSSL
        ++Y L +KSQ  +    F N +E +    +    S+ GGE+   ++           +   HGI    +   TP+ NG++ER++R ++    ++L ++S+
Subjt:  YIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVAERRNRTLMNMVRSMLINSSL

Query:  PVSLWMYALRTAQYLLNRVPSKSVP-NTPFELWTGRKPNLRHLHVWGCQVE--VRIYNPHEKKLDSRTTSGFFIGYLEKSKGCRFYCPNHSTKIVETGNI
        P + W YA   A YL+NR+P+  +   +PF+   G  PN   L V+GC     +R YN H  KLD ++    F+GY   ++         ++++  + ++
Subjt:  PVSLWMYALRTAQYLLNRVPSKSVP-NTPFELWTGRKPNLRHLHVWGCQVE--VRIYNPHEKKLDSRTTSGFFIGYLEKSKGCRFYCPNHSTKIVETGNI

Query:  RFIEN
        RF EN
Subjt:  RFIEN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value: 6.4e-37; % identity: 30.73
Query:  LDLFDTFYVPSISRNLISLSKL-DTSSYYFKFGNECFSLFKQN--IFIGSGILCDDLYKFKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHEHLGHIS
        LDL    YVP+I +NLIS+ +L +T+    +F    F +   N  + +  G   D+LY++ + +  A S+              ++++   WH  LGH S
Subjt:  LDLFDTFYVPSISRNLISLSKL-DTSSYYFKFGNECFSLFKQN--IFIGSGILCDDLYKFKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHEHLGHIS

Query:  KERIKRLINNEILPNLDFT-DLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDVPSFGGEKYFISFIDDFSRYVYIYLLHEKSQAIDALKVF
           +  +I+N  LP L+ +  L  C DC   K  K   +     SS+ LE I++D+  S  + S    +Y++ F+D F+RY ++Y L +KSQ  D   +F
Subjt:  KERIKRLINNEILPNLDFT-DLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDVPSFGGEKYFISFIDDFSRYVYIYLLHEKSQAIDALKVF

Query:  INEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNR
         + VE +    +  L S+ GGE+    D           +L  HGI    +   TP+ NG++ER++R ++ M  ++L ++S+P + W YA   A YL+NR
Subjt:  INEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNR

Query:  VPSKSVP-NTPFELWTGRKPNLRHLHVWGCQVE--VRIYNPHEKKLDSRTTSGFFIGY
        +P+  +   +PF+   G+ PN   L V+GC     +R YN H  KL+ ++    F+GY
Subjt:  VPSKSVP-NTPFELWTGRKPNLRHLHVWGCQVE--VRIYNPHEKKLDSRTTSGFFIGY

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein; e value: 6.6e-05; % identity: 33.72
Query:  NRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVP-NTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSG
        NRT++  VRSML    LP +    A  TA +++N+ PS ++  + P E+W    P   +L  +GC   V   +  E KL  R   G
Subjt:  NRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVP-NTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSG
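
The % identity reported for a hit can be recomputed from its alignment block. The sketch below assumes the usual BLAST-style middle line, where a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap, and it takes the alignment length to be the length of the query row including gap characters. `percent_identity` is a hypothetical helper name. It is applied to the ATMG00710.1 alignment above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Identical positions (letters in the midline) over the alignment length."""
    identical = sum(ch.isalpha() for ch in midline)
    return round(100 * identical / len(query), 2)


# Query row and middle row copied from the ATMG00710.1 alignment block.
query = "NRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVP-NTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSG"
midline = "NRT++  VRSML    LP +    A  TA +++N+ PS ++  + P E+W    P   +L  +GC   V   +  E KL  R   G"

print(percent_identity(query, midline))  # 29 identical positions over 86 columns
```

Under these assumptions the result matches the 33.72 listed for this hit.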


Sequences
CDS sequence
ATGCAGGGATTCCTTACGACTCGAACAACAAACCCAAATGAGAGATTCATTTTTATGGGAAACAGAGTCAAAGTTCCAGCTGAAGCTGTGGGAACCTATCGTTTAACTTT
AGACACTGGACATCATTTAGACCTTTTTGATACCTTTTATGTTCCTTCTATTTCTCGTAATTTGATTTCCTTGTCAAAACTTGATACTTCAAGTTATTACTTTAAATTTG
GAAATGAGTGTTTTAGTTTATTCAAACAGAACATTTTTATTGGTTCTGGTATTCTTTGTGATGACTTATATAAATTCAAGCTTGATAATGTTTTTGCTGAGAGCTTGTTA
ACTCTACATCATAATGTTGGTACTAAACGTGGTCAAACAAATGAATCGTCGGCTTACTTGTGGCATGAACATTTAGGTCACATATCCAAAGAAAGAATTAAAAGATTGAT
AAATAATGAAATTCTTCCAAATTTGGATTTTACTGACCTTGAAATTTGTGTGGATTGTATTAAAGGAAAACAAACAAAACACACAGTTAATAAAAAAGCTACAAGAAGCT
CACAACTTCTTGAAATTATACACACGGATATTTGTGGGTCTTTTGATGTTCCATCTTTTGGTGGAGAAAAGTATTTTATCAGCTTTATTGATGATTTCTCACGTTATGTT
TATATATATTTATTGCATGAGAAATCTCAAGCAATAGATGCCTTAAAAGTATTTATAAATGAAGTTGAAAGGCAATTAGATAGAAACGTGAAAATCTTAAGATCTAATAG
AGGTGGTGAGTATTATGGAAAATATGACAACAATGGACGATGCCCCGGTCCCTTCGCTAAATTCCTAGAAAGCCATGGCATATGTGCTCAATACACAATGGCAGGAACAC
CACAACAAAATGGTGTTGCAGAAAGACGAAATCGTACATTAATGAATATGGTTAGAAGCATGTTAATTAATTCATCTTTACCTGTGTCCTTGTGGATGTATGCATTAAGA
ACCGCTCAATATTTATTAAACAGGGTTCCTAGTAAGTCAGTTCCAAATACACCTTTTGAACTGTGGACAGGAAGGAAACCTAATTTAAGACACCTACATGTTTGGGGTTG
TCAAGTGGAAGTAAGAATTTATAATCCACATGAAAAGAAACTGGATTCAAGAACAACTAGTGGTTTCTTCATTGGTTATCTAGAAAAATCAAAAGGGTGTAGATTTTATT
GTCCTAACCACAGTACGAAAATAGTTGAAACTGGAAATATAAGGTTCATTGAGAATGACATAATTAGTGGGAGTTTGAAACCACGTAAAGTGGAAATTCTTTCATCTATA
ACTTCTTCTCAAGTTGTTGTTCTTGTAGTTTACTATGTTAACAATCCACAAGAACAACAAATTAATGGTCAAACACCACATAATGATGTTGTAACAAATGAACCTGTAAC
TGAGGGACCACAAGAAATATAG
mRNA sequence
ATGCAGGGATTCCTTACGACTCGAACAACAAACCCAAATGAGAGATTCATTTTTATGGGAAACAGAGTCAAAGTTCCAGCTGAAGCTGTGGGAACCTATCGTTTAACTTT
AGACACTGGACATCATTTAGACCTTTTTGATACCTTTTATGTTCCTTCTATTTCTCGTAATTTGATTTCCTTGTCAAAACTTGATACTTCAAGTTATTACTTTAAATTTG
GAAATGAGTGTTTTAGTTTATTCAAACAGAACATTTTTATTGGTTCTGGTATTCTTTGTGATGACTTATATAAATTCAAGCTTGATAATGTTTTTGCTGAGAGCTTGTTA
ACTCTACATCATAATGTTGGTACTAAACGTGGTCAAACAAATGAATCGTCGGCTTACTTGTGGCATGAACATTTAGGTCACATATCCAAAGAAAGAATTAAAAGATTGAT
AAATAATGAAATTCTTCCAAATTTGGATTTTACTGACCTTGAAATTTGTGTGGATTGTATTAAAGGAAAACAAACAAAACACACAGTTAATAAAAAAGCTACAAGAAGCT
CACAACTTCTTGAAATTATACACACGGATATTTGTGGGTCTTTTGATGTTCCATCTTTTGGTGGAGAAAAGTATTTTATCAGCTTTATTGATGATTTCTCACGTTATGTT
TATATATATTTATTGCATGAGAAATCTCAAGCAATAGATGCCTTAAAAGTATTTATAAATGAAGTTGAAAGGCAATTAGATAGAAACGTGAAAATCTTAAGATCTAATAG
AGGTGGTGAGTATTATGGAAAATATGACAACAATGGACGATGCCCCGGTCCCTTCGCTAAATTCCTAGAAAGCCATGGCATATGTGCTCAATACACAATGGCAGGAACAC
CACAACAAAATGGTGTTGCAGAAAGACGAAATCGTACATTAATGAATATGGTTAGAAGCATGTTAATTAATTCATCTTTACCTGTGTCCTTGTGGATGTATGCATTAAGA
ACCGCTCAATATTTATTAAACAGGGTTCCTAGTAAGTCAGTTCCAAATACACCTTTTGAACTGTGGACAGGAAGGAAACCTAATTTAAGACACCTACATGTTTGGGGTTG
TCAAGTGGAAGTAAGAATTTATAATCCACATGAAAAGAAACTGGATTCAAGAACAACTAGTGGTTTCTTCATTGGTTATCTAGAAAAATCAAAAGGGTGTAGATTTTATT
GTCCTAACCACAGTACGAAAATAGTTGAAACTGGAAATATAAGGTTCATTGAGAATGACATAATTAGTGGGAGTTTGAAACCACGTAAAGTGGAAATTCTTTCATCTATA
ACTTCTTCTCAAGTTGTTGTTCTTGTAGTTTACTATGTTAACAATCCACAAGAACAACAAATTAATGGTCAAACACCACATAATGATGTTGTAACAAATGAACCTGTAAC
TGAGGGACCACAAGAAATATAG
Protein sequence
MQGFLTTRTTNPNERFIFMGNRVKVPAEAVGTYRLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSSYYFKFGNECFSLFKQNIFIGSGILCDDLYKFKLDNVFAESLL
TLHHNVGTKRGQTNESSAYLWHEHLGHISKERIKRLINNEILPNLDFTDLEICVDCIKGKQTKHTVNKKATRSSQLLEIIHTDICGSFDVPSFGGEKYFISFIDDFSRYV
YIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSNRGGEYYGKYDNNGRCPGPFAKFLESHGICAQYTMAGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALR
TAQYLLNRVPSKSVPNTPFELWTGRKPNLRHLHVWGCQVEVRIYNPHEKKLDSRTTSGFFIGYLEKSKGCRFYCPNHSTKIVETGNIRFIENDIISGSLKPRKVEILSSI
TSSQVVVLVVYYVNNPQEQQINGQTPHNDVVTNEPVTEGPQEI
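
The protein sequence can be related back to the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code (the record does not name a translation table); `translate` is a hypothetical helper name. For brevity it is run only on the first 60 and the last 15 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGCAGGGATTCCTTACGACTCGAACAACAAACCCAAATGAGAGATTCATTTTTATGGGA"
cds_end = "CCACAAGAAATATAG"  # final five codons, ending in the TAG stop

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first 20 residues (MQGFLTTRTTNPNERFIFMG) and the last four residues (PQEI) of the protein sequence above.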