; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh14G012400 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh14G012400
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrotransposon protein, putative, Ty1-copia subclass
Genome locationCmo_Chr14:10483084..10486094
RNA-Seq ExpressionCmoCh14G012400
SyntenyCmoCh14G012400
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAN06870.1 Putative gag-pol polyprotein [Oryza sativa Japonica Group]3.6e-19156.48Show/hide
Query:  ASKLPQQSSASLPCESALVVPPMIEASAPPPADDIAQCPVESSVAGQ------PTAIA-SVAPLATADTAVPSNVDHAPTTHPY----GTRLKHNIKKPK
        A+  P  S A+        V    E+S  P    +   P  S VA Q      PTA + S  P  + D +V   VD     H       TRL+  I+K K
Subjt:  ASKLPQQSSASLPCESALVVPPMIEASAPPPADDIAQCPVESSVAGQ------PTAIA-SVAPLATADTAVPSNVDHAPTTHPY----GTRLKHNIKKPK

Query:  VRTDGTVTYLVA-RSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVDYDDT
        V TDGTV Y     +   EP +   A++    ++AM+ EF ALQ NKTWHLVPP+ G NVIDCKWV+K+K+K DGS+DRYKARLVAKGFKQ+YG+DY+DT
Subjt:  VRTDGTVTYLVA-RSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVDYDDT

Query:  FSPVVKLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDVSLFI
        FSPVVK  TIR++LS+A+S GW +RQ+D+QNAFLHG+L E+VYMKQPPG+ DS  PGY+CKLDK+LYG KQAP AW+SRLS KL  + F  SK D SLF 
Subjt:  FSPVVKLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDVSLFI

Query:  FNKTGIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLLNGGE
        +NK  + +++LIYVDDII+ SS   A   LL  LQ +FA+KDLG L YFLGIEV   S G++LTQ KY+ DLL R NM   K V TP+  SEKL +N G+
Subjt:  FNKTGIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLLNGGE

Query:  KLSPEDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFG
         L P D T YRSVVGALQYL+LTRPDI+F VN+VCQF+ +PT+ HWAAVKRIL YL     +GL L+KS + L+SA+SDADWAG+ DDRRSTGG+ +F G
Subjt:  KLSPEDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFG

Query:  GNLISWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRLISSK
         NL+SWS+RKQ+TVSRSSTE+EYKA+ANATAE++W+Q LL EL +       LWCDN+GA YLSANP+FH RTKH+EVDYHFVRERVS + L++  + + 
Subjt:  GNLISWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRLISSK

Query:  DQLADIMTKPLPASSFSYFRRNLNL
        DQ+AD  TK L       F+ NLNL
Subjt:  DQLADIMTKPLPASSFSYFRRNLNL

BAH94406.1 Os08g0544300 [Oryza sativa Japonica Group]6.6e-19358.19Show/hide
Query:  ESSVAGQPTAIASVAPLATADTAVPSNV-----DHAPTTHPYG--------TRLKHNIKKPKVRTDGTVTYLVARSSASEPTSHITAMEHPLCRQAMNDE
        E  V   P   +S +P   +   V  +V     D      P          TRL+  I+K KV TDGTV +L   SS  EP S   A+ +   ++AM+ E
Subjt:  ESSVAGQPTAIASVAPLATADTAVPSNV-----DHAPTTHPYG--------TRLKHNIKKPKVRTDGTVTYLVARSSASEPTSHITAMEHPLCRQAMNDE

Query:  FQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVDYDDTFSPVVKLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLN
        + AL +NKTWHLVPP+ G NVIDCKWV+K+K+K DGS+DRYKARLVAKGFKQ+YG+DY+DTFSPVVK  TIR++LSLA+S GW++RQ+D++NAFLHG+L 
Subjt:  FQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVDYDDTFSPVVKLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLN

Query:  EDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDVSLFIFNKTGIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFA
        E+VYM+QPPG+     P Y+CKLDK+LYG KQAP AW+SRLS+KL +L F PSK D SLF + K  + +++LIYVDDII+ SS   AT  LL +L  DFA
Subjt:  EDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDVSLFIFNKTGIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFA

Query:  VKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLLNGGEKLSPEDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMS
        +KDLG L YFLGIEV     GL+L+Q KY  DLL R  M   K V TP+  SEKL +N G  L P+D+T+YRSVVGALQYL+LTRPDISF +N+VCQF+ 
Subjt:  VKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLLNGGEKLSPEDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMS

Query:  SPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFGGNLISWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVL
        +PT+ HWAAVKRIL Y+  T+D GL   ++ + L+S FSDADWAG+PDDRRSTGG+ +F G NL+SWS+RKQ+TVSRSSTEAEYKA+ANATAE++W+Q L
Subjt:  SPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFGGNLISWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVL

Query:  LRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRLISSKDQLADIMTKPLPASSFSYFRRNLNL
        L+ELG+   RA  LWCDN+GA YLSANPIFH RTKH+EVD+HFVRERV+ + L++  IS+KDQ+AD  TK +P      F+ NLNL
Subjt:  LRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRLISSKDQLADIMTKPLPASSFSYFRRNLNL

pir|T02087| gag/pol polyprotein - maize retrotransposon Hopscotch [Zea mays]7.0e-19558.56Show/hide
Query:  STSASKLPQQSSASLPCESALVVPPMIEASAPPPADDIAQC----PVESSVAGQPTAIASVAPLATAD---TAVPSNVDH-APTTHPYG-----TRLKHN
        + +A  LP      +    ALV   ++ A++P P    A        +S  +G P A  SV  +  AD    A  S+V H  P + P       TRL+H 
Subjt:  STSASKLPQQSSASLPCESALVVPPMIEASAPPPADDIAQC----PVESSVAGQPTAIASVAPLATAD---TAVPSNVDH-APTTHPYG-----TRLKHN

Query:  IKKPKVRTDGTVTYLVARSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVD
        I KPK  TDGTV Y  A +  +EP+S   A+  P  R AM  EFQALQKN TW LVPP    N+IDCKWVFK+K   DGSIDR KARLVAKGFKQQYG+D
Subjt:  IKKPKVRTDGTVTYLVARSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVD

Query:  YDDTFSPVVKLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDV
        YDDTFSPVVK +TIRL+LSLA+S  W++RQ+D+QNAFLHG+L E VYMKQPPGF D+ HP Y C L KSLYG KQ P AW+SRLS KL  L F PSK DV
Subjt:  YDDTFSPVVKLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDV

Query:  SLFIFNKTGIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLL
        SLFI+N     +YIL+YVDDIII  SS  A + +L +L+DDFA+KDLG L YFLGIEV     GL+L Q KY RDLL R  M   K V TP+  SEKL  
Subjt:  SLFIFNKTGIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLL

Query:  NGGEKLSPEDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYV
        + G  LSPE+TT+YRSVVGALQYL+LTRPD+S+ +NRVCQF+ +PT +HW AVKRIL  +  TI +GL +  S + +LSAFSDADWAG PDDR+STGGY 
Subjt:  NGGEKLSPEDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYV

Query:  IFFGGNLISWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRL
        +F G NLISW+S+KQSTVSRSSTEAEYKA+ANATAE+IW+Q LL ELGI     P LWCDN+GATYLS+ PIF+ RTKH+EVD+HFVR+RV +++LD+RL
Subjt:  IFFGGNLISWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRL

Query:  ISSKDQLADIMTKPLPASSFSYFRR
        IS+ DQ+AD  TK L     + FRR
Subjt:  ISSKDQLADIMTKPLPASSFSYFRR

QCC26836.1 Hopscotch gagpol polyprotein [Zea mays]8.9e-19858.98Show/hide
Query:  STSASKLPQQSSASLPCESALVVPPMIEASAPPPADDIAQC----PVESSVAGQPTAIASVAPLATAD---TAVPSNVDH-APTTHPYG-----TRLKHN
        + +A  LP      +    ALV   ++ A++P P    A        +S  +G P A  SV  +  AD    A  S+V H  P + P       TRL+H 
Subjt:  STSASKLPQQSSASLPCESALVVPPMIEASAPPPADDIAQC----PVESSVAGQPTAIASVAPLATAD---TAVPSNVDH-APTTHPYG-----TRLKHN

Query:  IKKPKVRTDGTVTYLVARSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVD
        I KPK  TDGTV Y  A +  +EP+S   A+  P  R AM  EFQALQKN TW LVPP    N+IDCKWVFK+K   DGSIDR KARLVAKGFKQQYG+D
Subjt:  IKKPKVRTDGTVTYLVARSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVD

Query:  YDDTFSPVVKLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDV
        YDDTFSPVVK +TIRL+LSLA+S  W++RQ+D+QNAFLHG+L E VYMKQPPGF D+ HP Y C L KSLYG KQAP AW+SRLS KL  L F PSK DV
Subjt:  YDDTFSPVVKLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDV

Query:  SLFIFNKTGIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLL
        SLFI+N     +YIL+YVDDIII  SS  A + +L +L+DDFA+KDLG L YFLGIEV     GL+L Q KY RDLL R  M   K V TP+  SEKL  
Subjt:  SLFIFNKTGIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLL

Query:  NGGEKLSPEDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYV
        + G  LSPE+TT+YRSVVGALQYL+LTRPD+S+ +NRVCQF+ +PT +HW AVKRIL  +  TI +GL +  S + +LSAFSDADWAG PDDR+STGGY 
Subjt:  NGGEKLSPEDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYV

Query:  IFFGGNLISWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRL
        +F G NLISW+S+KQSTVSRSSTEAEYKA+ANATAE+IW+Q LL ELGI     P LWCDN+GATYLS+ PIF+ RTKH+EVD+HFVR+RV +++LD+RL
Subjt:  IFFGGNLISWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRL

Query:  ISSKDQLADIMTKPLPASSFSYFRRNLNL
        IS+ DQ+AD  TK L     + FRRNLNL
Subjt:  ISSKDQLADIMTKPLPASSFSYFRRNLNL

RLM69625.1 putative polyprotein [Panicum miliaceum]7.8e-19457.17Show/hide
Query:  STSASKLPQQSSASLPCESALVVPPMIEASAPPPADDIA-----QCPVESSVAGQPTAIASVAPLATADTAVPSNVDHAPTTHPYGTRLKHNIKKPKVRT
        + S S + Q S  + P +   VVPP       P +   A        + SS A   T +   AP A    A PS V       P  TRL+  I+KPKV T
Subjt:  STSASKLPQQSSASLPCESALVVPPMIEASAPPPADDIA-----QCPVESSVAGQPTAIASVAPLATADTAVPSNVDHAPTTHPYGTRLKHNIKKPKVRT

Query:  DGTVTYLVARSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVDYDDTFSPV
        DGTV Y    +++ EP S   A+     + AM+ E+ AL  NKTWHLVPP+ G NVIDCKWV+K+K+K DGS+DRYKARLVAKGF+Q+YG+DY+DTFSPV
Subjt:  DGTVTYLVARSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVDYDDTFSPV

Query:  VKLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDVSLFIFNKT
        VK  TIR +L +A+S GW++R++D+QNAFLHG L EDVYMKQPPG+ D    GY+CKLDK+LYG K+AP AW+SRLS+KL QL F  SK D SLF +NK 
Subjt:  VKLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDVSLFIFNKT

Query:  GIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLLNGGEKLSP
         + +YILIYVDDII+ SS+  AT  LL  L+ +FA+KDLG L +FLGIEV+  ++G++LTQ KY +D+L R +M+  K V +P+  SEKL  + G+ L P
Subjt:  GIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLLNGGEKLSP

Query:  EDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFGGNLI
        +D T YRS+VG LQYL LTRPDISF VN+VCQ++ +PT++HWA VKRIL YL  T+++GL + KS + L+SAFSDADWAG+ DDRRSTGG+ +F G NLI
Subjt:  EDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFGGNLI

Query:  SWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRLISSKDQLA
        SWS+RKQSTVSRSSTEAEYKAVANATAE++WIQ LL ELGI   +   LWCDNIGA YLSANP+FH RTKH+EVDYHFVRERV+ R LD+  IS++DQ+A
Subjt:  SWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRLISSKDQLA

Query:  DIMTKPLPASSFSYFRRNLNL
        +  TKPL   +   F+ NLNL
Subjt:  DIMTKPLPASSFSYFRRNLNL

TrEMBL top hitse value%identityAlignment
A0A3L6Q0W7 Putative polyprotein3.8e-19457.17Show/hide
Query:  STSASKLPQQSSASLPCESALVVPPMIEASAPPPADDIA-----QCPVESSVAGQPTAIASVAPLATADTAVPSNVDHAPTTHPYGTRLKHNIKKPKVRT
        + S S + Q S  + P +   VVPP       P +   A        + SS A   T +   AP A    A PS V       P  TRL+  I+KPKV T
Subjt:  STSASKLPQQSSASLPCESALVVPPMIEASAPPPADDIA-----QCPVESSVAGQPTAIASVAPLATADTAVPSNVDHAPTTHPYGTRLKHNIKKPKVRT

Query:  DGTVTYLVARSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVDYDDTFSPV
        DGTV Y    +++ EP S   A+     + AM+ E+ AL  NKTWHLVPP+ G NVIDCKWV+K+K+K DGS+DRYKARLVAKGF+Q+YG+DY+DTFSPV
Subjt:  DGTVTYLVARSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVDYDDTFSPV

Query:  VKLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDVSLFIFNKT
        VK  TIR +L +A+S GW++R++D+QNAFLHG L EDVYMKQPPG+ D    GY+CKLDK+LYG K+AP AW+SRLS+KL QL F  SK D SLF +NK 
Subjt:  VKLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDVSLFIFNKT

Query:  GIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLLNGGEKLSP
         + +YILIYVDDII+ SS+  AT  LL  L+ +FA+KDLG L +FLGIEV+  ++G++LTQ KY +D+L R +M+  K V +P+  SEKL  + G+ L P
Subjt:  GIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLLNGGEKLSP

Query:  EDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFGGNLI
        +D T YRS+VG LQYL LTRPDISF VN+VCQ++ +PT++HWA VKRIL YL  T+++GL + KS + L+SAFSDADWAG+ DDRRSTGG+ +F G NLI
Subjt:  EDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFGGNLI

Query:  SWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRLISSKDQLA
        SWS+RKQSTVSRSSTEAEYKAVANATAE++WIQ LL ELGI   +   LWCDNIGA YLSANP+FH RTKH+EVDYHFVRERV+ R LD+  IS++DQ+A
Subjt:  SWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRLISSKDQLA

Query:  DIMTKPLPASSFSYFRRNLNL
        +  TKPL   +   F+ NLNL
Subjt:  DIMTKPLPASSFSYFRRNLNL

A0A4D6GKR5 Hopscotch gagpol polyprotein4.3e-19858.98Show/hide
Query:  STSASKLPQQSSASLPCESALVVPPMIEASAPPPADDIAQC----PVESSVAGQPTAIASVAPLATAD---TAVPSNVDH-APTTHPYG-----TRLKHN
        + +A  LP      +    ALV   ++ A++P P    A        +S  +G P A  SV  +  AD    A  S+V H  P + P       TRL+H 
Subjt:  STSASKLPQQSSASLPCESALVVPPMIEASAPPPADDIAQC----PVESSVAGQPTAIASVAPLATAD---TAVPSNVDH-APTTHPYG-----TRLKHN

Query:  IKKPKVRTDGTVTYLVARSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVD
        I KPK  TDGTV Y  A +  +EP+S   A+  P  R AM  EFQALQKN TW LVPP    N+IDCKWVFK+K   DGSIDR KARLVAKGFKQQYG+D
Subjt:  IKKPKVRTDGTVTYLVARSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVD

Query:  YDDTFSPVVKLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDV
        YDDTFSPVVK +TIRL+LSLA+S  W++RQ+D+QNAFLHG+L E VYMKQPPGF D+ HP Y C L KSLYG KQAP AW+SRLS KL  L F PSK DV
Subjt:  YDDTFSPVVKLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDV

Query:  SLFIFNKTGIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLL
        SLFI+N     +YIL+YVDDIII  SS  A + +L +L+DDFA+KDLG L YFLGIEV     GL+L Q KY RDLL R  M   K V TP+  SEKL  
Subjt:  SLFIFNKTGIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLL

Query:  NGGEKLSPEDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYV
        + G  LSPE+TT+YRSVVGALQYL+LTRPD+S+ +NRVCQF+ +PT +HW AVKRIL  +  TI +GL +  S + +LSAFSDADWAG PDDR+STGGY 
Subjt:  NGGEKLSPEDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYV

Query:  IFFGGNLISWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRL
        +F G NLISW+S+KQSTVSRSSTEAEYKA+ANATAE+IW+Q LL ELGI     P LWCDN+GATYLS+ PIF+ RTKH+EVD+HFVR+RV +++LD+RL
Subjt:  IFFGGNLISWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRL

Query:  ISSKDQLADIMTKPLPASSFSYFRRNLNL
        IS+ DQ+AD  TK L     + FRRNLNL
Subjt:  ISSKDQLADIMTKPLPASSFSYFRRNLNL

C7J5P9 Os08g0544300 protein3.2e-19358.19Show/hide
Query:  ESSVAGQPTAIASVAPLATADTAVPSNV-----DHAPTTHPYG--------TRLKHNIKKPKVRTDGTVTYLVARSSASEPTSHITAMEHPLCRQAMNDE
        E  V   P   +S +P   +   V  +V     D      P          TRL+  I+K KV TDGTV +L   SS  EP S   A+ +   ++AM+ E
Subjt:  ESSVAGQPTAIASVAPLATADTAVPSNV-----DHAPTTHPYG--------TRLKHNIKKPKVRTDGTVTYLVARSSASEPTSHITAMEHPLCRQAMNDE

Query:  FQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVDYDDTFSPVVKLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLN
        + AL +NKTWHLVPP+ G NVIDCKWV+K+K+K DGS+DRYKARLVAKGFKQ+YG+DY+DTFSPVVK  TIR++LSLA+S GW++RQ+D++NAFLHG+L 
Subjt:  FQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVDYDDTFSPVVKLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLN

Query:  EDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDVSLFIFNKTGIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFA
        E+VYM+QPPG+     P Y+CKLDK+LYG KQAP AW+SRLS+KL +L F PSK D SLF + K  + +++LIYVDDII+ SS   AT  LL +L  DFA
Subjt:  EDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDVSLFIFNKTGIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFA

Query:  VKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLLNGGEKLSPEDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMS
        +KDLG L YFLGIEV     GL+L+Q KY  DLL R  M   K V TP+  SEKL +N G  L P+D+T+YRSVVGALQYL+LTRPDISF +N+VCQF+ 
Subjt:  VKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLLNGGEKLSPEDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMS

Query:  SPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFGGNLISWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVL
        +PT+ HWAAVKRIL Y+  T+D GL   ++ + L+S FSDADWAG+PDDRRSTGG+ +F G NL+SWS+RKQ+TVSRSSTEAEYKA+ANATAE++W+Q L
Subjt:  SPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFGGNLISWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVL

Query:  LRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRLISSKDQLADIMTKPLPASSFSYFRRNLNL
        L+ELG+   RA  LWCDN+GA YLSANPIFH RTKH+EVD+HFVRERV+ + L++  IS+KDQ+AD  TK +P      F+ NLNL
Subjt:  LRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRLISSKDQLADIMTKPLPASSFSYFRRNLNL

Q10RF3 Putative gag-pol polyprotein1.7e-19156.48Show/hide
Query:  ASKLPQQSSASLPCESALVVPPMIEASAPPPADDIAQCPVESSVAGQ------PTAIA-SVAPLATADTAVPSNVDHAPTTHPY----GTRLKHNIKKPK
        A+  P  S A+        V    E+S  P    +   P  S VA Q      PTA + S  P  + D +V   VD     H       TRL+  I+K K
Subjt:  ASKLPQQSSASLPCESALVVPPMIEASAPPPADDIAQCPVESSVAGQ------PTAIA-SVAPLATADTAVPSNVDHAPTTHPY----GTRLKHNIKKPK

Query:  VRTDGTVTYLVA-RSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVDYDDT
        V TDGTV Y     +   EP +   A++    ++AM+ EF ALQ NKTWHLVPP+ G NVIDCKWV+K+K+K DGS+DRYKARLVAKGFKQ+YG+DY+DT
Subjt:  VRTDGTVTYLVA-RSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVDYDDT

Query:  FSPVVKLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDVSLFI
        FSPVVK  TIR++LS+A+S GW +RQ+D+QNAFLHG+L E+VYMKQPPG+ DS  PGY+CKLDK+LYG KQAP AW+SRLS KL  + F  SK D SLF 
Subjt:  FSPVVKLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDVSLFI

Query:  FNKTGIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLLNGGE
        +NK  + +++LIYVDDII+ SS   A   LL  LQ +FA+KDLG L YFLGIEV   S G++LTQ KY+ DLL R NM   K V TP+  SEKL +N G+
Subjt:  FNKTGIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLLNGGE

Query:  KLSPEDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFG
         L P D T YRSVVGALQYL+LTRPDI+F VN+VCQF+ +PT+ HWAAVKRIL YL     +GL L+KS + L+SA+SDADWAG+ DDRRSTGG+ +F G
Subjt:  KLSPEDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFG

Query:  GNLISWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRLISSK
         NL+SWS+RKQ+TVSRSSTE+EYKA+ANATAE++W+Q LL EL +       LWCDN+GA YLSANP+FH RTKH+EVDYHFVRERVS + L++  + + 
Subjt:  GNLISWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRLISSK

Query:  DQLADIMTKPLPASSFSYFRRNLNL
        DQ+AD  TK L       F+ NLNL
Subjt:  DQLADIMTKPLPASSFSYFRRNLNL

V9GZT4 Copia-like retrotransposon Hopscotch polyprotein3.4e-19558.56Show/hide
Query:  STSASKLPQQSSASLPCESALVVPPMIEASAPPPADDIAQC----PVESSVAGQPTAIASVAPLATAD---TAVPSNVDH-APTTHPYG-----TRLKHN
        + +A  LP      +    ALV   ++ A++P P    A        +S  +G P A  SV  +  AD    A  S+V H  P + P       TRL+H 
Subjt:  STSASKLPQQSSASLPCESALVVPPMIEASAPPPADDIAQC----PVESSVAGQPTAIASVAPLATAD---TAVPSNVDH-APTTHPYG-----TRLKHN

Query:  IKKPKVRTDGTVTYLVARSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVD
        I KPK  TDGTV Y  A +  +EP+S   A+  P  R AM  EFQALQKN TW LVPP    N+IDCKWVFK+K   DGSIDR KARLVAKGFKQQYG+D
Subjt:  IKKPKVRTDGTVTYLVARSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVD

Query:  YDDTFSPVVKLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDV
        YDDTFSPVVK +TIRL+LSLA+S  W++RQ+D+QNAFLHG+L E VYMKQPPGF D+ HP Y C L KSLYG KQ P AW+SRLS KL  L F PSK DV
Subjt:  YDDTFSPVVKLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDV

Query:  SLFIFNKTGIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLL
        SLFI+N     +YIL+YVDDIII  SS  A + +L +L+DDFA+KDLG L YFLGIEV     GL+L Q KY RDLL R  M   K V TP+  SEKL  
Subjt:  SLFIFNKTGIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLL

Query:  NGGEKLSPEDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYV
        + G  LSPE+TT+YRSVVGALQYL+LTRPD+S+ +NRVCQF+ +PT +HW AVKRIL  +  TI +GL +  S + +LSAFSDADWAG PDDR+STGGY 
Subjt:  NGGEKLSPEDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYV

Query:  IFFGGNLISWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRL
        +F G NLISW+S+KQSTVSRSSTEAEYKA+ANATAE+IW+Q LL ELGI     P LWCDN+GATYLS+ PIF+ RTKH+EVD+HFVR+RV +++LD+RL
Subjt:  IFFGGNLISWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRL

Query:  ISSKDQLADIMTKPLPASSFSYFRR
        IS+ DQ+AD  TK L     + FRR
Subjt:  ISSKDQLADIMTKPLPASSFSYFRR

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.6e-9637.92Show/hide
Query:  QAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVDYDDTFSPVVKLTTIRLLLSLAISCGWAIRQIDIQNAF
        +A+N E  A + N TW +       N++D +WVF +K    G+  RYKARLVA+GF Q+Y +DY++TF+PV ++++ R +LSL I     + Q+D++ AF
Subjt:  QAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVDYDDTFSPVVKLTTIRLLLSLAISCGWAIRQIDIQNAF

Query:  LHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDVSLFIFNKTGI--QMYILIYVDDIIIISSSSTATEKLL
        L+G L E++YM+ P G   S +   +CKL+K++YG KQA   WF      L + +F  S VD  ++I +K  I   +Y+L+YVDD++I +   T      
Subjt:  LHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDVSLFIFNKTGI--QMYILIYVDDIIIISSSSTATEKLL

Query:  TQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSE--KLLLNGGEKLSPEDTTRYRSVVGALQYLSL-TRPDIS
          L + F + DL  + +F+GI +      + L+Q  Y++ +L++ NM     V TP LPS+    LLN  E    +  T  RS++G L Y+ L TRPD++
Subjt:  TQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSE--KLLLNGGEKLSPEDTTRYRSVVGALQYLSL-TRPDIS

Query:  FCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSST--DLLSAFSDADWAGNPDDRRSTGGYVI-FFGGNLISWSSRKQSTVSRSSTEAEYKA
          VN + ++ S   S  W  +KR+L YL  TIDM L   K+    + +  + D+DWAG+  DR+ST GY+   F  NLI W++++Q++V+ SSTEAEY A
Subjt:  FCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSST--DLLSAFSDADWAGNPDDRRSTGGYVI-FFGGNLISWSSRKQSTVSRSSTEAEYKA

Query:  VANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRLISSKDQLADIMTKPLPASSFSYFRRNLNL
        +  A  E +W++ LL  + I       ++ DN G   ++ NP  H+R KH+++ YHF RE+V    + +  I +++QLADI TKPLPA+ F   R  L L
Subjt:  VANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRLISSKDQLADIMTKPLPASSFSYFRRNLNL

Query:  V
        +
Subjt:  V

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-9940.77Show/hide
Query:  TYLVARSSASEPTSHITAMEHPLCRQ---AMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVDYDDTFSPVV
        T  V  S   EP S    + HP   Q   AM +E ++LQKN T+ LV    G   + CKWVFKLK+  D  + RYKARLV KGF+Q+ G+D+D+ FSPVV
Subjt:  TYLVARSSASEPTSHITAMEHPLCRQ---AMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVDYDDTFSPVV

Query:  KLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDVSLFI--FNK
        K+T+IR +LSLA S    + Q+D++ AFLHG L E++YM+QP GF  +     +CKL+KSLYG KQAP  W+ +  S +    +  +  D  ++   F++
Subjt:  KLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDVSLFI--FNK

Query:  TGIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEV--RHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLP----SEKLLLN
            + +L+YVDD++I+        KL   L   F +KDLG     LG+++    TS  L L+Q KYI  +L R NM  +K V TP+      S+K+   
Subjt:  TGIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEV--RHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLP----SEKLLLN

Query:  GGEKLSPEDTTRYRSVVGALQY-LSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYV
          E+        Y S VG+L Y +  TRPDI+  V  V +F+ +P   HW AVK IL YL  T    LC    S  +L  ++DAD AG+ D+R+S+ GY+
Subjt:  GGEKLSPEDTTRYRSVVGALQY-LSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYV

Query:  IFFGGNLISWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRL
          F G  ISW S+ Q  V+ S+TEAEY A      E+IW++  L+ELG+ Q +   ++CD+  A  LS N ++H RTKH++V YH++RE V    L V  
Subjt:  IFFGGNLISWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRL

Query:  ISSKDQLADIMTKPLPASSF
        IS+ +  AD++TK +P + F
Subjt:  ISSKDQLADIMTKPLPASSF

P92519 Uncharacterized mitochondrial protein AtMg008109.9e-5148.47Show/hide
Query:  MYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLLNGGEKLSP-ED
        MY+L+YVDDI++  SS+T    L+ QL   F++KDLG + YFLGI+++   SGL L+Q KY   +L    ML  K + TP+     L LN     +   D
Subjt:  MYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLLNGGEKLSP-ED

Query:  TTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFGGNLISW
         + +RS+VGALQYL+LTRPDIS+ VN VCQ M  PT   +  +KR+L Y+  TI  GL + K+S   + AF D+DWAG    RRST G+  F G N+ISW
Subjt:  TTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFGGNLISW

Query:  SSRKQSTVSRSSTEAEYKAVANATAELIW
        S+++Q TVSRSSTE EY+A+A   AEL W
Subjt:  SSRKQSTVSRSSTEAEYKAVANATAELIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.2e-15046.58Show/hide
Query:  PTNSLDAENLVSTSASKLPQQSSASLPCESALVVPPMIEASAPPPADDIAQCPVESSVAGQPTAIASVAPLATADTAVPSNVDHAP-TTHPYGTRLKHNI
        PT +    +    ++   P   S S   +S L  P    +S+P P    +     SS +  P +I    P   A   + +N + AP  TH  GTR K  I
Subjt:  PTNSLDAENLVSTSASKLPQQSSASLPCESALVVPPMIEASAPPPADDIAQCPVESSVAGQPTAIASVAPLATADTAVPSNVDHAP-TTHPYGTRLKHNI

Query:  KKPKVRTDGTVTYLVARSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLV-PPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVD
         KP  +     +  V+ ++ SEP + I A++    R AM  E  A   N TW LV PP +   ++ C+W+F  K   DGS++RYKARLVAKG+ Q+ G+D
Subjt:  KKPKVRTDGTVTYLVARSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLV-PPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVD

Query:  YDDTFSPVVKLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDV
        Y +TFSPV+K T+IR++L +A+   W IRQ+D+ NAFL G L +DVYM QPPGF+D   P Y+CKL K+LYG KQAP AW+  L + LL + F  S  D 
Subjt:  YDDTFSPVVKLTTIRLLLSLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDV

Query:  SLFIFNKTGIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLL
        SLF+  +    +Y+L+YVDDI+I  +  T     L  L   F+VKD   L YFLGIE +   +GL L+Q +YI DLLARTNM+T+K V TPM PS KL L
Subjt:  SLFIFNKTGIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLL

Query:  NGGEKLSPEDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYV
          G KL+  D T YR +VG+LQYL+ TRPDIS+ VNR+ QFM  PT  H  A+KRIL YL  T + G+ L K +T  L A+SDADWAG+ DD  ST GY+
Subjt:  NGGEKLSPEDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYV

Query:  IFFGGNLISWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRL
        ++ G + ISWSS+KQ  V RSSTEAEY++VAN ++E+ WI  LL ELGI   R P ++CDN+GATYL ANP+FH R KH+ +DYHF+R +V +  L V  
Subjt:  IFFGGNLISWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRL

Query:  ISSKDQLADIMTKPLPASSFSYFRRNLNL
        +S+ DQLAD +TKPL  ++F  F   + +
Subjt:  ISSKDQLADIMTKPLPASSFSYFRRNLNL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.5e-15046.24Show/hide
Query:  PQQSSASLPCE-SALVVPPMIEASAPPPADDIAQCPVESSVAGQPTAIASVAPLATADTAVPSNVDHAPTTHPYGTRLKHNIKKPKVRTDGTVTYLVARS
        P   S + P + S L   P+     P P+  I++    +S +   T+   + P+  A   +  N      TH   TR K  I+KP  +     +Y  + +
Subjt:  PQQSSASLPCE-SALVVPPMIEASAPPPADDIAQCPVESSVAGQPTAIASVAPLATADTAVPSNVDHAPTTHPYGTRLKHNIKKPKVRTDGTVTYLVARS

Query:  SASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLV-PPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVDYDDTFSPVVKLTTIRLLL
        + SEP + I AM+    RQAM  E  A   N TW LV PP     ++ C+W+F  K   DGS++RYKARLVAKG+ Q+ G+DY +TFSPV+K T+IR++L
Subjt:  SASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLV-PPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVDYDDTFSPVVKLTTIRLLL

Query:  SLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDVSLFIFNKTGIQMYILIYV
         +A+   W IRQ+D+ NAFL G L ++VYM QPPGFVD   P Y+C+L K++YG KQAP AW+  L + LL + F  S  D SLF+  +    +Y+L+YV
Subjt:  SLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDVSLFIFNKTGIQMYILIYV

Query:  DDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLLNGGEKLSPEDTTRYRSVV
        DDI+I  + +   +  L  L   F+VK+   L YFLGIE +    GL L+Q +Y  DLLARTNMLT+K V TPM  S KL L+ G KL   D T YR +V
Subjt:  DDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLLNGGEKLSPEDTTRYRSVV

Query:  GALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFGGNLISWSSRKQSTV
        G+LQYL+ TRPD+S+ VNR+ Q+M  PT  HW A+KR+L YL  T D G+ L K +T  L A+SDADWAG+ DD  ST GY+++ G + ISWSS+KQ  V
Subjt:  GALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFGGNLISWSSRKQSTV

Query:  SRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRLISSKDQLADIMTKPLPAS
         RSSTEAEY++VAN ++EL WI  LL ELGI  +  P ++CDN+GATYL ANP+FH R KH+ +DYHF+R +V +  L V  +S+ DQLAD +TKPL   
Subjt:  SRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRLISSKDQLADIMTKPLPAS

Query:  SFSYFRRNLNLV
        +F  F R + ++
Subjt:  SFSYFRRNLNLV

Arabidopsis top hitse value%identityAlignment
AT3G12050.1 Aha1 domain-containing protein3.0e-1039.81Show/hide
Query:  VEARRVTTVCEKANFGKKGGGDLVGKEASVELRFR-------GKSLKKVDSLLEILYIFDENVDEDHEVMVFVNVEGKIGKKIKEAILVKGKPIGLEKVR
        ++  ++  V  +A    + G  + G E +V L +        GK+L K D L+++ YI DEN DED E+   V  EG IG+ +KEA++ KGK I LEKVR
Subjt:  VEARRVTTVCEKANFGKKGGGDLVGKEASVELRFR-------GKSLKKVDSLLEILYIFDENVDEDHEVMVFVNVEGKIGKKIKEAILVKGKPIGLEKVR

Query:  LYV
        +YV
Subjt:  LYV

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.4e-11142.74Show/hide
Query:  TYLVARSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVDYDDTFSPVVKLT
        ++LV  + A EP+++  A E  +   AM+DE  A++   TW +         I CKWV+K+K   DG+I+RYKARLVAKG+ QQ G+D+ +TFSPV KLT
Subjt:  TYLVARSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVDYDDTFSPVVKLT

Query:  TIRLLLSLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFV----DSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDVSLFIFNKT
        +++L+L+++    + + Q+DI NAFL+G L+E++YMK PPG+     DS  P  +C L KS+YG KQA   WF + S  L+   F  S  D + F+    
Subjt:  TIRLLLSLAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFV----DSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDVSLFIFNKT

Query:  GIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLLNGGEKLSP
         + + +L+YVDDIII S++  A ++L +QL+  F ++DLG L YFLG+E+  +++G+ + Q KY  DLL  T +L  K    PM PS     + G     
Subjt:  GIQMYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLLNGGEKLSP

Query:  EDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFGGNLI
         D   YR ++G L YL +TR DISF VN++ QF  +P   H  AV +ILHY+  T+  GL  +  +   L  FSDA +    D RRST GY +F G +LI
Subjt:  EDTTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFGGNLI

Query:  SWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRER
        SW S+KQ  VS+SS EAEY+A++ AT E++W+    REL +  ++   L+CDN  A +++ N +FH RTKH+E D H VRER
Subjt:  SWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGISQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRER

ATMG00240.1 Gag-Pol-related retrotransposon family protein1.0e-1345.45Show/hide
Query:  YLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGY
        YL++TRPD++F VNR+ QF S+  +    AV ++LHY+  T+  GL  + +S   L AF+D+DWA  PD RRS  G+
Subjt:  YLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGY

ATMG00810.1 DNA/RNA polymerases superfamily protein7.1e-5248.47Show/hide
Query:  MYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLLNGGEKLSP-ED
        MY+L+YVDDI++  SS+T    L+ QL   F++KDLG + YFLGI+++   SGL L+Q KY   +L    ML  K + TP+     L LN     +   D
Subjt:  MYILIYVDDIIIISSSSTATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLLNGGEKLSP-ED

Query:  TTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFGGNLISW
         + +RS+VGALQYL+LTRPDIS+ VN VCQ M  PT   +  +KR+L Y+  TI  GL + K+S   + AF D+DWAG    RRST G+  F G N+ISW
Subjt:  TTRYRSVVGALQYLSLTRPDISFCVNRVCQFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFGGNLISW

Query:  SSRKQSTVSRSSTEAEYKAVANATAELIW
        S+++Q TVSRSSTE EY+A+A   AEL W
Subjt:  SSRKQSTVSRSSTEAEYKAVANATAELIW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.5e-2547.24Show/hide
Query:  TRLKHNIKKPKVRTDGTVTYLVARSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFK
        TR K  I K   +   T+T  + +    EP S I A++ P   QAM +E  AL +NKTW LVPP    N++ CKWVFK K   DG++DR KARLVAKGF 
Subjt:  TRLKHNIKKPKVRTDGTVTYLVARSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFK

Query:  QQYGVDYDDTFSPVVKLTTIRLLLSLA
        Q+ G+ + +T+SPVV+  TIR +L++A
Subjt:  QQYGVDYDDTFSPVVKLTTIRLLLSLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGGGCCTACTAACTCTTTGGATGCAGAAAATTTGGTGTCTACATCAGCTTCGAAATTGCCGCAACAATCCTCCGCGTCGCTGCCATGCGAATCGGCGTTGGTTGT
TCCGCCAATGATTGAGGCCTCGGCTCCTCCGCCAGCAGATGATATTGCGCAATGCCCGGTCGAATCCTCGGTCGCTGGTCAACCAACTGCTATAGCATCGGTTGCTCCCC
TCGCAACGGCTGATACGGCCGTCCCCTCAAATGTGGATCATGCACCTACTACTCATCCGTATGGTACGCGATTGAAGCACAATATCAAGAAACCCAAGGTGCGTACAGAT
GGAACAGTAACATATCTTGTAGCTCGGTCTTCTGCCTCTGAACCTACTTCACATATTACTGCTATGGAGCATCCCCTCTGCCGTCAGGCAATGAATGATGAATTTCAGGC
ACTTCAAAAAAATAAGACATGGCACTTAGTTCCTCCTCGTGCTGGTTTTAACGTTATTGATTGCAAATGGGTTTTCAAACTCAAACAAAAGCCAGATGGCTCAATTGATC
GCTACAAAGCACGCCTGGTTGCTAAAGGTTTTAAACAGCAGTATGGCGTTGATTATGATGATACCTTTAGTCCAGTTGTTAAGCTCACTACCATTCGGCTCTTATTATCT
CTTGCTATTTCTTGTGGTTGGGCTATTCGGCAGATTGATATTCAAAATGCTTTTCTTCATGGCCTTCTTAATGAAGATGTTTATATGAAGCAGCCTCCTGGATTTGTGGA
TTCTCAACACCCTGGTTATCTCTGCAAGCTGGATAAGTCGCTTTATGGCTTTAAACAAGCTCCGCATGCCTGGTTTTCTCGCCTTAGCTCCAAACTATTACAGCTGGATT
TTACACCTTCAAAGGTTGATGTCTCTCTTTTTATTTTTAACAAAACGGGCATTCAGATGTATATCCTCATCTACGTTGATGATATTATTATCATCAGCTCATCTTCTACG
GCTACTGAGAAACTTCTTACACAGCTTCAGGATGATTTTGCCGTCAAGGATCTTGGTATTTTGAGTTATTTTCTTGGGATTGAGGTCCGCCATACTTCTAGTGGACTTAT
TCTCACACAACATAAATACATTCGAGATTTATTAGCCAGAACCAATATGCTCACCTCCAAAGGTGTGCCCACACCTATGCTTCCCAGTGAGAAGTTGTTATTGAATGGTG
GTGAAAAGCTCTCACCTGAGGATACTACTCGCTATCGAAGTGTCGTTGGTGCTCTCCAATATTTGTCTCTGACACGTCCTGATATATCCTTCTGTGTCAACAGAGTGTGT
CAGTTCATGTCCTCTCCGACTTCTATACATTGGGCGGCAGTCAAACGAATTCTCCATTATCTACATGACACTATTGATATGGGTTTGTGTCTTACAAAGTCCAGCACTGA
TTTGTTGAGTGCCTTTTCAGATGCTGATTGGGCTGGTAATCCTGATGATCGTCGAAGCACTGGAGGCTATGTGATCTTCTTTGGTGGCAATCTTATCTCTTGGAGTTCGA
GGAAACAATCGACAGTTTCTCGTTCTAGTACGGAAGCCGAATATAAGGCGGTTGCTAATGCCACTGCCGAATTAATTTGGATCCAAGTTCTCTTGCGTGAGCTCGGGATC
TCGCAAGCGCGAGCGCCTAGCCTATGGTGTGACAACATTGGTGCCACCTACCTATCCGCCAATCCAATCTTTCATCGACGGACGAAGCATGTTGAGGTTGATTATCACTT
CGTTCGTGAACGAGTATCGACTCGTCAGCTTGATGTTCGACTCATATCTTCCAAGGATCAGCTCGCCGATATCATGACAAAGCCACTGCCAGCTTCTTCTTTTAGCTATT
TTAGGCGCAATCTGAACTTAGTAGTACATCTTGAAGGGACCGTGGGGGAGAGGTCGAGAAACACTTTTCATGAAGATGTCAACAACCTGGTTCGGAGGAGACGTCGAGAA
ACACTTTCTTGCCCATCGAAAACTGTAAAGGGATCGTCGCTTAGTGGCGATGCTGCTGCCTTGGCTTCGGGGAGAATATGTCTAACCCATGAAAAGGGATCGTCGCTCAG
CGACGATGCTACCTTGGCTTCGAAGGGGAAACCTAGAGCAAGGGACAAAGACAATGTTGAAGCCAGAAGAGTCACCACAGTGTGCGAAAAAGCCAATTTTGGCAAGAAGG
GAGGCGGTGACTTGGTTGGCAAAGAGGCCAGTGTGGAACTCAGATTTCGAGGTAAATCTCTGAAGAAGGTCGATAGCCTTCTCGAGATTCTGTATATTTTCGATGAGAAT
GTTGATGAGGATCATGAGGTGATGGTTTTTGTGAACGTTGAAGGGAAGATTGGGAAGAAAATAAAAGAGGCTATATTGGTTAAGGGAAAACCAATTGGGTTGGAAAAGGT
GAGGTTATATGTATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTGGGCCTACTAACTCTTTGGATGCAGAAAATTTGGTGTCTACATCAGCTTCGAAATTGCCGCAACAATCCTCCGCGTCGCTGCCATGCGAATCGGCGTTGGTTGT
TCCGCCAATGATTGAGGCCTCGGCTCCTCCGCCAGCAGATGATATTGCGCAATGCCCGGTCGAATCCTCGGTCGCTGGTCAACCAACTGCTATAGCATCGGTTGCTCCCC
TCGCAACGGCTGATACGGCCGTCCCCTCAAATGTGGATCATGCACCTACTACTCATCCGTATGGTACGCGATTGAAGCACAATATCAAGAAACCCAAGGTGCGTACAGAT
GGAACAGTAACATATCTTGTAGCTCGGTCTTCTGCCTCTGAACCTACTTCACATATTACTGCTATGGAGCATCCCCTCTGCCGTCAGGCAATGAATGATGAATTTCAGGC
ACTTCAAAAAAATAAGACATGGCACTTAGTTCCTCCTCGTGCTGGTTTTAACGTTATTGATTGCAAATGGGTTTTCAAACTCAAACAAAAGCCAGATGGCTCAATTGATC
GCTACAAAGCACGCCTGGTTGCTAAAGGTTTTAAACAGCAGTATGGCGTTGATTATGATGATACCTTTAGTCCAGTTGTTAAGCTCACTACCATTCGGCTCTTATTATCT
CTTGCTATTTCTTGTGGTTGGGCTATTCGGCAGATTGATATTCAAAATGCTTTTCTTCATGGCCTTCTTAATGAAGATGTTTATATGAAGCAGCCTCCTGGATTTGTGGA
TTCTCAACACCCTGGTTATCTCTGCAAGCTGGATAAGTCGCTTTATGGCTTTAAACAAGCTCCGCATGCCTGGTTTTCTCGCCTTAGCTCCAAACTATTACAGCTGGATT
TTACACCTTCAAAGGTTGATGTCTCTCTTTTTATTTTTAACAAAACGGGCATTCAGATGTATATCCTCATCTACGTTGATGATATTATTATCATCAGCTCATCTTCTACG
GCTACTGAGAAACTTCTTACACAGCTTCAGGATGATTTTGCCGTCAAGGATCTTGGTATTTTGAGTTATTTTCTTGGGATTGAGGTCCGCCATACTTCTAGTGGACTTAT
TCTCACACAACATAAATACATTCGAGATTTATTAGCCAGAACCAATATGCTCACCTCCAAAGGTGTGCCCACACCTATGCTTCCCAGTGAGAAGTTGTTATTGAATGGTG
GTGAAAAGCTCTCACCTGAGGATACTACTCGCTATCGAAGTGTCGTTGGTGCTCTCCAATATTTGTCTCTGACACGTCCTGATATATCCTTCTGTGTCAACAGAGTGTGT
CAGTTCATGTCCTCTCCGACTTCTATACATTGGGCGGCAGTCAAACGAATTCTCCATTATCTACATGACACTATTGATATGGGTTTGTGTCTTACAAAGTCCAGCACTGA
TTTGTTGAGTGCCTTTTCAGATGCTGATTGGGCTGGTAATCCTGATGATCGTCGAAGCACTGGAGGCTATGTGATCTTCTTTGGTGGCAATCTTATCTCTTGGAGTTCGA
GGAAACAATCGACAGTTTCTCGTTCTAGTACGGAAGCCGAATATAAGGCGGTTGCTAATGCCACTGCCGAATTAATTTGGATCCAAGTTCTCTTGCGTGAGCTCGGGATC
TCGCAAGCGCGAGCGCCTAGCCTATGGTGTGACAACATTGGTGCCACCTACCTATCCGCCAATCCAATCTTTCATCGACGGACGAAGCATGTTGAGGTTGATTATCACTT
CGTTCGTGAACGAGTATCGACTCGTCAGCTTGATGTTCGACTCATATCTTCCAAGGATCAGCTCGCCGATATCATGACAAAGCCACTGCCAGCTTCTTCTTTTAGCTATT
TTAGGCGCAATCTGAACTTAGTAGTACATCTTGAAGGGACCGTGGGGGAGAGGTCGAGAAACACTTTTCATGAAGATGTCAACAACCTGGTTCGGAGGAGACGTCGAGAA
ACACTTTCTTGCCCATCGAAAACTGTAAAGGGATCGTCGCTTAGTGGCGATGCTGCTGCCTTGGCTTCGGGGAGAATATGTCTAACCCATGAAAAGGGATCGTCGCTCAG
CGACGATGCTACCTTGGCTTCGAAGGGGAAACCTAGAGCAAGGGACAAAGACAATGTTGAAGCCAGAAGAGTCACCACAGTGTGCGAAAAAGCCAATTTTGGCAAGAAGG
GAGGCGGTGACTTGGTTGGCAAAGAGGCCAGTGTGGAACTCAGATTTCGAGGTAAATCTCTGAAGAAGGTCGATAGCCTTCTCGAGATTCTGTATATTTTCGATGAGAAT
GTTGATGAGGATCATGAGGTGATGGTTTTTGTGAACGTTGAAGGGAAGATTGGGAAGAAAATAAAAGAGGCTATATTGGTTAAGGGAAAACCAATTGGGTTGGAAAAGGT
GAGGTTATATGTATAG
Protein sequenceShow/hide protein sequence
MSGPTNSLDAENLVSTSASKLPQQSSASLPCESALVVPPMIEASAPPPADDIAQCPVESSVAGQPTAIASVAPLATADTAVPSNVDHAPTTHPYGTRLKHNIKKPKVRTD
GTVTYLVARSSASEPTSHITAMEHPLCRQAMNDEFQALQKNKTWHLVPPRAGFNVIDCKWVFKLKQKPDGSIDRYKARLVAKGFKQQYGVDYDDTFSPVVKLTTIRLLLS
LAISCGWAIRQIDIQNAFLHGLLNEDVYMKQPPGFVDSQHPGYLCKLDKSLYGFKQAPHAWFSRLSSKLLQLDFTPSKVDVSLFIFNKTGIQMYILIYVDDIIIISSSST
ATEKLLTQLQDDFAVKDLGILSYFLGIEVRHTSSGLILTQHKYIRDLLARTNMLTSKGVPTPMLPSEKLLLNGGEKLSPEDTTRYRSVVGALQYLSLTRPDISFCVNRVC
QFMSSPTSIHWAAVKRILHYLHDTIDMGLCLTKSSTDLLSAFSDADWAGNPDDRRSTGGYVIFFGGNLISWSSRKQSTVSRSSTEAEYKAVANATAELIWIQVLLRELGI
SQARAPSLWCDNIGATYLSANPIFHRRTKHVEVDYHFVRERVSTRQLDVRLISSKDQLADIMTKPLPASSFSYFRRNLNLVVHLEGTVGERSRNTFHEDVNNLVRRRRRE
TLSCPSKTVKGSSLSGDAAALASGRICLTHEKGSSLSDDATLASKGKPRARDKDNVEARRVTTVCEKANFGKKGGGDLVGKEASVELRFRGKSLKKVDSLLEILYIFDEN
VDEDHEVMVFVNVEGKIGKKIKEAILVKGKPIGLEKVRLYV