CuGenDBv2

Tan0012922 (gene) of Snake gourd v1 genome

Gene ID: Tan0012922
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG10:27478421..27480955
RNA-Seq Expression: Tan0012922
Synteny: Tan0012922
Gene Ontology terms:
  GO:0006807 - nitrogen compound metabolic process (biological process)
  GO:0043170 - macromolecule metabolic process (biological process)
  GO:0044238 - primary metabolic process (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
  IPR012337 - Ribonuclease H-like superfamily
  IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
  IPR036397 - Ribonuclease H superfamily
  IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (hit, e-value, % identity, alignment)
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa] | e-value: 1.1e-215 | % identity: 44.96
Query:  TVLFNNLQTFQSLVQTRGQEAETSVA--LRPFHRGSTSGIKFVDSSGPKGKKEKLK---EVKVDRVVALKGKEIKDVARKCFYSNGDEHLKKSCPKFLDE
        T L N LQTF+SL++ +GQ+ E +VA   R FHRGSTSG K + SS    K +K K     K +   A   K+ K     CF+ N + H K++CPK+L E
Subjt:  TVLFNNLQTFQSLVQTRGQEAETSVA--LRPFHRGSTSGIKFVDSSGPKGKKEKLK---EVKVDRVVALKGKEIKDVARKCFYSNGDEHLKKSCPKFLDE

Query:  KKMT--GKCDLLVTETCLVESNDSAWILDSGATNHVCSSFQGLSPWQRMQEGEITLRVGSEEVVSAAAIG------------------------------
        KK    GK DLLV ETCLVE++DSAWI+DSGATNHVCSSFQG+S W++++ GE+T+RVG+  VVSA A+G                              
Subjt:  KKMT--GKCDLLVTETCLVESNDSAWILDSGATNHVCSSFQGLSPWQRMQEGEITLRVGSEEVVSAAAIG------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------TKNGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCETPFELWHGRKTSLCHFRIWGCP
                                   +NGVSERRNRTLLDMVRSMMSYA LP+SFW YAV+T VYILN V SKSV ETP +LW+GRK SL HFRIWGCP
Subjt:  --------------------------TKNGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCETPFELWHGRKTSLCHFRIWGCP

Query:  THVLVSNLKKLEPRSKLCLFVGYPKETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLNEMDNTSVRVADGAI-------------------R
         HVL +N KKLEPRSKLCLFVGYPK TRGGYFYDPKDN+V VSTNATF EEDHIR+H PRSKIVLNE+   +   +   +                   +
Subjt:  THVLVSNLKKLEPRSKLCLFVGYPKETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLNEMDNTSVRVADGAI-------------------R

Query:  SQKLGMPRRSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQT
         Q L  PRRSGRV   P RYM L ET  +  D D EDPLT+ +AM D+DKDEWIKAM+ E+ESMYFNSVW+LVDQ DGVKPIGCKWIYKRKRG DGKVQT
Subjt:  SQKLGMPRRSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQT

Query:  FKARLVAKGFTQVEGVDYEETFSPVAM--------------------MDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIR
        FKARLVAKG+TQVEGVDYEETFSPVAM                    MDVKTAFLNGNL+E IYM QP+GFI  GQEQ++C+L RSIYGLKQASRSWNIR
Subjt:  FKARLVAKGFTQVEGVDYEETFSPVAM--------------------MDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIR

Query:  FDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIR--------------------------
        FD AIKSYGFDQ VDEPCVYK+I+N  VAFL+LYVDDILLIGN++G LTDIK+WLA+QFQ KDLGEAQ+VLGI+                          
Subjt:  FDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIR--------------------------

Query:  CELQKKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIP-------------------------MLQLQSSSILGE-----------RGTTD--LVYGSGDLI
         +  K+GL PF HGV L ++QCPKTPQ+VE+MR IP                         + + QS+  L             R T D  LVYGS DLI
Subjt:  CELQKKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIP-------------------------MLQLQSSSILGE-----------RGTTD--LVYGSGDLI

Query:  LHGYTDSTFGHDKDSRKSTSGSPSFSNGGVVV
        L GYTDS F  D+DSRKSTSGS    NGG VV
Subjt:  LHGYTDSTFGHDKDSRKSTSGSPSFSNGGVVV

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa] | e-value: 6.3e-216 | % identity: 44.96
Query:  TVLFNNLQTFQSLVQTRGQEAETSVA--LRPFHRGSTSGIKFVDSSGPKGKKEKLK---EVKVDRVVALKGKEIKDVARKCFYSNGDEHLKKSCPKFLDE
        T L N LQTF+SL++ +GQ+ E +VA   R FHRGSTSG K + SS    K +K K     K +   A   K+ K     CF+ N + H K++CPK+L E
Subjt:  TVLFNNLQTFQSLVQTRGQEAETSVA--LRPFHRGSTSGIKFVDSSGPKGKKEKLK---EVKVDRVVALKGKEIKDVARKCFYSNGDEHLKKSCPKFLDE

Query:  KKMT--GKCDLLVTETCLVESNDSAWILDSGATNHVCSSFQGLSPWQRMQEGEITLRVGSEEVVSAAAIG------------------------------
        KK    GK DLLV ETCLVE++DSAWI+DSGATNHVCSSFQG+S W++++ GE+T+RVG+  VVSA A+G                              
Subjt:  KKMT--GKCDLLVTETCLVESNDSAWILDSGATNHVCSSFQGLSPWQRMQEGEITLRVGSEEVVSAAAIG------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------TKNGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCETPFELWHGRKTSLCHFRIWGCP
                                   +NGVSERRNRTLLDMVRSMMSYA LP+SFW YAV+T VYILN V SKSV ETP +LW+GRK SL HFRIWGCP
Subjt:  --------------------------TKNGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCETPFELWHGRKTSLCHFRIWGCP

Query:  THVLVSNLKKLEPRSKLCLFVGYPKETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLNEMDNTSVRVADGAI-------------------R
         HVL +N KKLEPRSKLCLFVGYPK TRGGYFYDPKDN+V VSTNATF EEDHIR+H PRSKIVLNE+   +   +   +                   +
Subjt:  THVLVSNLKKLEPRSKLCLFVGYPKETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLNEMDNTSVRVADGAI-------------------R

Query:  SQKLGMPRRSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQT
         Q L  PRRSGRV   P RYM L ET  +  D D EDPLT+ +AM D+DKDEWIKAM+ E+ESMYFNSVW+LVDQ DGVKPIGCKWIYKRKRG DGKVQT
Subjt:  SQKLGMPRRSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQT

Query:  FKARLVAKGFTQVEGVDYEETFSPVAM--------------------MDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIR
        FKARLVAKG+TQVEGVDYEETFSPVAM                    MDVKTAFLNGNL+E IYM QP+GFI  GQEQ++C+L RSIYGLKQASRSWNIR
Subjt:  FKARLVAKGFTQVEGVDYEETFSPVAM--------------------MDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIR

Query:  FDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIR--------------------------
        FD AIKSYGFDQ VDEPCVYK+I+N  VAFL+LYVDDILLIGN++G LTDIK+WLA+QFQ KDLGEAQ+VLGI+                          
Subjt:  FDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIR--------------------------

Query:  CELQKKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIP-------------------------MLQLQSSSILGE-----------RGTTD--LVYGSGDLI
         +  K+GL PF HGV L ++QCPKTPQ+VE+MR IP                         + + QS+  L             R T D  LVYGS DLI
Subjt:  CELQKKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIP-------------------------MLQLQSSSILGE-----------RGTTD--LVYGSGDLI

Query:  LHGYTDSTFGHDKDSRKSTSGSPSFSNGGVVV
        L GYTDS F  D+DSRKSTSGS    NGG VV
Subjt:  LHGYTDSTFGHDKDSRKSTSGSPSFSNGGVVV

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa] | e-value: 1.1e-215 | % identity: 44.96
Query:  TVLFNNLQTFQSLVQTRGQEAETSVA--LRPFHRGSTSGIKFVDSSGPKGKKEKLK---EVKVDRVVALKGKEIKDVARKCFYSNGDEHLKKSCPKFLDE
        T L N LQTF+SL++ +GQ+ E +VA   R FHRGSTSG K + SS    K +K K     K +   A   K+ K     CF+ N + H K++CPK+L E
Subjt:  TVLFNNLQTFQSLVQTRGQEAETSVA--LRPFHRGSTSGIKFVDSSGPKGKKEKLK---EVKVDRVVALKGKEIKDVARKCFYSNGDEHLKKSCPKFLDE

Query:  KKMT--GKCDLLVTETCLVESNDSAWILDSGATNHVCSSFQGLSPWQRMQEGEITLRVGSEEVVSAAAIG------------------------------
        KK    GK DLLV ETCLVE++DSAWI+DSGATNHVCSSFQG+S W++++ GE+T+RVG+  VVSA A+G                              
Subjt:  KKMT--GKCDLLVTETCLVESNDSAWILDSGATNHVCSSFQGLSPWQRMQEGEITLRVGSEEVVSAAAIG------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------TKNGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCETPFELWHGRKTSLCHFRIWGCP
                                   +NGVSERRNRTLLDMVRSMMSYA LP+SFW YAV+T VYILN V SKSV ETP +LW+GRK SL HFRIWGCP
Subjt:  --------------------------TKNGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCETPFELWHGRKTSLCHFRIWGCP

Query:  THVLVSNLKKLEPRSKLCLFVGYPKETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLNEMDNTSVRVADGAI-------------------R
         HVL +N KKLEPRSKLCLFVGYPK TRGGYFYDPKDN+V VSTNATF EEDHIR+H PRSKIVLNE+   +   +   +                   +
Subjt:  THVLVSNLKKLEPRSKLCLFVGYPKETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLNEMDNTSVRVADGAI-------------------R

Query:  SQKLGMPRRSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQT
         Q L  PRRSGRV   P RYM L ET  +  D D EDPLT+ +AM D+DKDEWIKAM+ E+ESMYFNSVW+LVDQ DGVKPIGCKWIYKRKRG DGKVQT
Subjt:  SQKLGMPRRSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQT

Query:  FKARLVAKGFTQVEGVDYEETFSPVAM--------------------MDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIR
        FKARLVAKG+TQVEGVDYEETFSPVAM                    MDVKTAFLNGNL+E IYM QP+GFI  GQEQ++C+L RSIYGLKQASRSWNIR
Subjt:  FKARLVAKGFTQVEGVDYEETFSPVAM--------------------MDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIR

Query:  FDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIR--------------------------
        FD AIKSYGFDQ VDEPCVYK+I+N  VAFL+LYVDDILLIGN++G LTDIK+WLA+QFQ KDLGEAQ+VLGI+                          
Subjt:  FDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIR--------------------------

Query:  CELQKKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIP-------------------------MLQLQSSSILGE-----------RGTTD--LVYGSGDLI
         +  K+GL PF HGV L ++QCPKTPQ+VE+MR IP                         + + QS+  L             R T D  LVYGS DLI
Subjt:  CELQKKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIP-------------------------MLQLQSSSILGE-----------RGTTD--LVYGSGDLI

Query:  LHGYTDSTFGHDKDSRKSTSGSPSFSNGGVVV
        L GYTDS F  D+DSRKSTSGS    NGG VV
Subjt:  LHGYTDSTFGHDKDSRKSTSGSPSFSNGGVVV

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa] | e-value: 1.1e-215 | % identity: 44.96
Query:  TVLFNNLQTFQSLVQTRGQEAETSVA--LRPFHRGSTSGIKFVDSSGPKGKKEKLK---EVKVDRVVALKGKEIKDVARKCFYSNGDEHLKKSCPKFLDE
        T L N LQTF+SL++ +GQ+ E +VA   R FHRGSTSG K + SS    K +K K     K +   A   K+ K     CF+ N + H K++CPK+L E
Subjt:  TVLFNNLQTFQSLVQTRGQEAETSVA--LRPFHRGSTSGIKFVDSSGPKGKKEKLK---EVKVDRVVALKGKEIKDVARKCFYSNGDEHLKKSCPKFLDE

Query:  KKMT--GKCDLLVTETCLVESNDSAWILDSGATNHVCSSFQGLSPWQRMQEGEITLRVGSEEVVSAAAIG------------------------------
        KK    GK DLLV ETCLVE++DSAWI+DSGATNHVCSSFQG+S W++++ GE+T+RVG+  VVSA A+G                              
Subjt:  KKMT--GKCDLLVTETCLVESNDSAWILDSGATNHVCSSFQGLSPWQRMQEGEITLRVGSEEVVSAAAIG------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------TKNGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCETPFELWHGRKTSLCHFRIWGCP
                                   +NGVSERRNRTLLDMVRSMMSYA LP+SFW YAV+T VYILN V SKSV ETP +LW+GRK SL HFRIWGCP
Subjt:  --------------------------TKNGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCETPFELWHGRKTSLCHFRIWGCP

Query:  THVLVSNLKKLEPRSKLCLFVGYPKETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLNEMDNTSVRVADGAI-------------------R
         HVL +N KKLEPRSKLCLFVGYPK TRGGYFYDPKDN+V VSTNATF EEDHIR+H PRSKIVLNE+   +   +   +                   +
Subjt:  THVLVSNLKKLEPRSKLCLFVGYPKETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLNEMDNTSVRVADGAI-------------------R

Query:  SQKLGMPRRSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQT
         Q L  PRRSGRV   P RYM L ET  +  D D EDPLT+ +AM D+DKDEWIKAM+ E+ESMYFNSVW+LVDQ DGVKPIGCKWIYKRKRG DGKVQT
Subjt:  SQKLGMPRRSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQT

Query:  FKARLVAKGFTQVEGVDYEETFSPVAM--------------------MDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIR
        FKARLVAKG+TQVEGVDYEETFSPVAM                    MDVKTAFLNGNL+E IYM QP+GFI  GQEQ++C+L RSIYGLKQASRSWNIR
Subjt:  FKARLVAKGFTQVEGVDYEETFSPVAM--------------------MDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIR

Query:  FDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIR--------------------------
        FD AIKSYGFDQ VDEPCVYK+I+N  VAFL+LYVDDILLIGN++G LTDIK+WLA+QFQ KDLGEAQ+VLGI+                          
Subjt:  FDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIR--------------------------

Query:  CELQKKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIP-------------------------MLQLQSSSILGE-----------RGTTD--LVYGSGDLI
         +  K+GL PF HGV L ++QCPKTPQ+VE+MR IP                         + + QS+  L             R T D  LVYGS DLI
Subjt:  CELQKKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIP-------------------------MLQLQSSSILGE-----------RGTTD--LVYGSGDLI

Query:  LHGYTDSTFGHDKDSRKSTSGSPSFSNGGVVV
        L GYTDS F  D+DSRKSTSGS    NGG VV
Subjt:  LHGYTDSTFGHDKDSRKSTSGSPSFSNGGVVV

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] | e-value: 1.1e-215 | % identity: 44.96
Query:  TVLFNNLQTFQSLVQTRGQEAETSVA--LRPFHRGSTSGIKFVDSSGPKGKKEKLK---EVKVDRVVALKGKEIKDVARKCFYSNGDEHLKKSCPKFLDE
        T L N LQTF+SL++ +GQ+ E +VA   R FHRGSTSG K + SS    K +K K     K +   A   K+ K     CF+ N + H K++CPK+L E
Subjt:  TVLFNNLQTFQSLVQTRGQEAETSVA--LRPFHRGSTSGIKFVDSSGPKGKKEKLK---EVKVDRVVALKGKEIKDVARKCFYSNGDEHLKKSCPKFLDE

Query:  KKMT--GKCDLLVTETCLVESNDSAWILDSGATNHVCSSFQGLSPWQRMQEGEITLRVGSEEVVSAAAIG------------------------------
        KK    GK DLLV ETCLVE++DSAWI+DSGATNHVCSSFQG+S W++++ GE+T+RVG+  VVSA A+G                              
Subjt:  KKMT--GKCDLLVTETCLVESNDSAWILDSGATNHVCSSFQGLSPWQRMQEGEITLRVGSEEVVSAAAIG------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------TKNGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCETPFELWHGRKTSLCHFRIWGCP
                                   +NGVSERRNRTLLDMVRSMMSYA LP+SFW YAV+T VYILN V SKSV ETP +LW+GRK SL HFRIWGCP
Subjt:  --------------------------TKNGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCETPFELWHGRKTSLCHFRIWGCP

Query:  THVLVSNLKKLEPRSKLCLFVGYPKETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLNEMDNTSVRVADGAI-------------------R
         HVL +N KKLEPRSKLCLFVGYPK TRGGYFYDPKDN+V VSTNATF EEDHIR+H PRSKIVLNE+   +   +   +                   +
Subjt:  THVLVSNLKKLEPRSKLCLFVGYPKETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLNEMDNTSVRVADGAI-------------------R

Query:  SQKLGMPRRSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQT
         Q L  PRRSGRV   P RYM L ET  +  D D EDPLT+ +AM D+DKDEWIKAM+ E+ESMYFNSVW+LVDQ DGVKPIGCKWIYKRKRG DGKVQT
Subjt:  SQKLGMPRRSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQT

Query:  FKARLVAKGFTQVEGVDYEETFSPVAM--------------------MDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIR
        FKARLVAKG+TQVEGVDYEETFSPVAM                    MDVKTAFLNGNL+E IYM QP+GFI  GQEQ++C+L RSIYGLKQASRSWNIR
Subjt:  FKARLVAKGFTQVEGVDYEETFSPVAM--------------------MDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIR

Query:  FDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIR--------------------------
        FD AIKSYGFDQ VDEPCVYK+I+N  VAFL+LYVDDILLIGN++G LTDIK+WLA+QFQ KDLGEAQ+VLGI+                          
Subjt:  FDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIR--------------------------

Query:  CELQKKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIP-------------------------MLQLQSSSILGE-----------RGTTD--LVYGSGDLI
         +  K+GL PF HGV L ++QCPKTPQ+VE+MR IP                         + + QS+  L             R T D  LVYGS DLI
Subjt:  CELQKKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIP-------------------------MLQLQSSSILGE-----------RGTTD--LVYGSGDLI

Query:  LHGYTDSTFGHDKDSRKSTSGSPSFSNGGVVV
        L GYTDS F  D+DSRKSTSGS    NGG VV
Subjt:  LHGYTDSTFGHDKDSRKSTSGSPSFSNGGVVV

TrEMBL top hits (hit, e-value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein | e-value: 5.2e-216 | % identity: 44.96
Query:  TVLFNNLQTFQSLVQTRGQEAETSVA--LRPFHRGSTSGIKFVDSSGPKGKKEKLK---EVKVDRVVALKGKEIKDVARKCFYSNGDEHLKKSCPKFLDE
        T L N LQTF+SL++ +GQ+ E +VA   R FHRGSTSG K + SS    K +K K     K +   A   K+ K     CF+ N + H K++CPK+L E
Subjt:  TVLFNNLQTFQSLVQTRGQEAETSVA--LRPFHRGSTSGIKFVDSSGPKGKKEKLK---EVKVDRVVALKGKEIKDVARKCFYSNGDEHLKKSCPKFLDE

Query:  KKMT--GKCDLLVTETCLVESNDSAWILDSGATNHVCSSFQGLSPWQRMQEGEITLRVGSEEVVSAAAIG------------------------------
        KK    GK DLLV ETCLVE++DSAWI+DSGATNHVCSSFQG+S W++++ GE+T+RVG+  VVSA A+G                              
Subjt:  KKMT--GKCDLLVTETCLVESNDSAWILDSGATNHVCSSFQGLSPWQRMQEGEITLRVGSEEVVSAAAIG------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------TKNGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCETPFELWHGRKTSLCHFRIWGCP
                                   +NGVSERRNRTLLDMVRSMMSYA LP+SFW YAV+T VYILN V SKSV ETP +LW+GRK SL HFRIWGCP
Subjt:  --------------------------TKNGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCETPFELWHGRKTSLCHFRIWGCP

Query:  THVLVSNLKKLEPRSKLCLFVGYPKETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLNEMDNTSVRVADGAI-------------------R
         HVL +N KKLEPRSKLCLFVGYPK TRGGYFYDPKDN+V VSTNATF EEDHIR+H PRSKIVLNE+   +   +   +                   +
Subjt:  THVLVSNLKKLEPRSKLCLFVGYPKETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLNEMDNTSVRVADGAI-------------------R

Query:  SQKLGMPRRSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQT
         Q L  PRRSGRV   P RYM L ET  +  D D EDPLT+ +AM D+DKDEWIKAM+ E+ESMYFNSVW+LVDQ DGVKPIGCKWIYKRKRG DGKVQT
Subjt:  SQKLGMPRRSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQT

Query:  FKARLVAKGFTQVEGVDYEETFSPVAM--------------------MDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIR
        FKARLVAKG+TQVEGVDYEETFSPVAM                    MDVKTAFLNGNL+E IYM QP+GFI  GQEQ++C+L RSIYGLKQASRSWNIR
Subjt:  FKARLVAKGFTQVEGVDYEETFSPVAM--------------------MDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIR

Query:  FDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIR--------------------------
        FD AIKSYGFDQ VDEPCVYK+I+N  VAFL+LYVDDILLIGN++G LTDIK+WLA+QFQ KDLGEAQ+VLGI+                          
Subjt:  FDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIR--------------------------

Query:  CELQKKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIP-------------------------MLQLQSSSILGE-----------RGTTD--LVYGSGDLI
         +  K+GL PF HGV L ++QCPKTPQ+VE+MR IP                         + + QS+  L             R T D  LVYGS DLI
Subjt:  CELQKKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIP-------------------------MLQLQSSSILGE-----------RGTTD--LVYGSGDLI

Query:  LHGYTDSTFGHDKDSRKSTSGSPSFSNGGVVV
        L GYTDS F  D+DSRKSTSGS    NGG VV
Subjt:  LHGYTDSTFGHDKDSRKSTSGSPSFSNGGVVV

A0A5A7TWB9 Gag/pol protein | e-value: 3.0e-216 | % identity: 44.96
Query:  TVLFNNLQTFQSLVQTRGQEAETSVA--LRPFHRGSTSGIKFVDSSGPKGKKEKLK---EVKVDRVVALKGKEIKDVARKCFYSNGDEHLKKSCPKFLDE
        T L N LQTF+SL++ +GQ+ E +VA   R FHRGSTSG K + SS    K +K K     K +   A   K+ K     CF+ N + H K++CPK+L E
Subjt:  TVLFNNLQTFQSLVQTRGQEAETSVA--LRPFHRGSTSGIKFVDSSGPKGKKEKLK---EVKVDRVVALKGKEIKDVARKCFYSNGDEHLKKSCPKFLDE

Query:  KKMT--GKCDLLVTETCLVESNDSAWILDSGATNHVCSSFQGLSPWQRMQEGEITLRVGSEEVVSAAAIG------------------------------
        KK    GK DLLV ETCLVE++DSAWI+DSGATNHVCSSFQG+S W++++ GE+T+RVG+  VVSA A+G                              
Subjt:  KKMT--GKCDLLVTETCLVESNDSAWILDSGATNHVCSSFQGLSPWQRMQEGEITLRVGSEEVVSAAAIG------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------TKNGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCETPFELWHGRKTSLCHFRIWGCP
                                   +NGVSERRNRTLLDMVRSMMSYA LP+SFW YAV+T VYILN V SKSV ETP +LW+GRK SL HFRIWGCP
Subjt:  --------------------------TKNGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCETPFELWHGRKTSLCHFRIWGCP

Query:  THVLVSNLKKLEPRSKLCLFVGYPKETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLNEMDNTSVRVADGAI-------------------R
         HVL +N KKLEPRSKLCLFVGYPK TRGGYFYDPKDN+V VSTNATF EEDHIR+H PRSKIVLNE+   +   +   +                   +
Subjt:  THVLVSNLKKLEPRSKLCLFVGYPKETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLNEMDNTSVRVADGAI-------------------R

Query:  SQKLGMPRRSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQT
         Q L  PRRSGRV   P RYM L ET  +  D D EDPLT+ +AM D+DKDEWIKAM+ E+ESMYFNSVW+LVDQ DGVKPIGCKWIYKRKRG DGKVQT
Subjt:  SQKLGMPRRSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQT

Query:  FKARLVAKGFTQVEGVDYEETFSPVAM--------------------MDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIR
        FKARLVAKG+TQVEGVDYEETFSPVAM                    MDVKTAFLNGNL+E IYM QP+GFI  GQEQ++C+L RSIYGLKQASRSWNIR
Subjt:  FKARLVAKGFTQVEGVDYEETFSPVAM--------------------MDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIR

Query:  FDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIR--------------------------
        FD AIKSYGFDQ VDEPCVYK+I+N  VAFL+LYVDDILLIGN++G LTDIK+WLA+QFQ KDLGEAQ+VLGI+                          
Subjt:  FDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIR--------------------------

Query:  CELQKKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIP-------------------------MLQLQSSSILGE-----------RGTTD--LVYGSGDLI
         +  K+GL PF HGV L ++QCPKTPQ+VE+MR IP                         + + QS+  L             R T D  LVYGS DLI
Subjt:  CELQKKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIP-------------------------MLQLQSSSILGE-----------RGTTD--LVYGSGDLI

Query:  LHGYTDSTFGHDKDSRKSTSGSPSFSNGGVVV
        L GYTDS F  D+DSRKSTSGS    NGG VV
Subjt:  LHGYTDSTFGHDKDSRKSTSGSPSFSNGGVVV

A0A5A7TZD7 Gag/pol protein | e-value: 5.2e-216 | % identity: 44.96
Query:  TVLFNNLQTFQSLVQTRGQEAETSVA--LRPFHRGSTSGIKFVDSSGPKGKKEKLK---EVKVDRVVALKGKEIKDVARKCFYSNGDEHLKKSCPKFLDE
        T L N LQTF+SL++ +GQ+ E +VA   R FHRGSTSG K + SS    K +K K     K +   A   K+ K     CF+ N + H K++CPK+L E
Subjt:  TVLFNNLQTFQSLVQTRGQEAETSVA--LRPFHRGSTSGIKFVDSSGPKGKKEKLK---EVKVDRVVALKGKEIKDVARKCFYSNGDEHLKKSCPKFLDE

Query:  KKMT--GKCDLLVTETCLVESNDSAWILDSGATNHVCSSFQGLSPWQRMQEGEITLRVGSEEVVSAAAIG------------------------------
        KK    GK DLLV ETCLVE++DSAWI+DSGATNHVCSSFQG+S W++++ GE+T+RVG+  VVSA A+G                              
Subjt:  KKMT--GKCDLLVTETCLVESNDSAWILDSGATNHVCSSFQGLSPWQRMQEGEITLRVGSEEVVSAAAIG------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------TKNGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCETPFELWHGRKTSLCHFRIWGCP
                                   +NGVSERRNRTLLDMVRSMMSYA LP+SFW YAV+T VYILN V SKSV ETP +LW+GRK SL HFRIWGCP
Subjt:  --------------------------TKNGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCETPFELWHGRKTSLCHFRIWGCP

Query:  THVLVSNLKKLEPRSKLCLFVGYPKETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLNEMDNTSVRVADGAI-------------------R
         HVL +N KKLEPRSKLCLFVGYPK TRGGYFYDPKDN+V VSTNATF EEDHIR+H PRSKIVLNE+   +   +   +                   +
Subjt:  THVLVSNLKKLEPRSKLCLFVGYPKETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLNEMDNTSVRVADGAI-------------------R

Query:  SQKLGMPRRSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQT
         Q L  PRRSGRV   P RYM L ET  +  D D EDPLT+ +AM D+DKDEWIKAM+ E+ESMYFNSVW+LVDQ DGVKPIGCKWIYKRKRG DGKVQT
Subjt:  SQKLGMPRRSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQT

Query:  FKARLVAKGFTQVEGVDYEETFSPVAM--------------------MDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIR
        FKARLVAKG+TQVEGVDYEETFSPVAM                    MDVKTAFLNGNL+E IYM QP+GFI  GQEQ++C+L RSIYGLKQASRSWNIR
Subjt:  FKARLVAKGFTQVEGVDYEETFSPVAM--------------------MDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIR

Query:  FDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIR--------------------------
        FD AIKSYGFDQ VDEPCVYK+I+N  VAFL+LYVDDILLIGN++G LTDIK+WLA+QFQ KDLGEAQ+VLGI+                          
Subjt:  FDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIR--------------------------

Query:  CELQKKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIP-------------------------MLQLQSSSILGE-----------RGTTD--LVYGSGDLI
         +  K+GL PF HGV L ++QCPKTPQ+VE+MR IP                         + + QS+  L             R T D  LVYGS DLI
Subjt:  CELQKKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIP-------------------------MLQLQSSSILGE-----------RGTTD--LVYGSGDLI

Query:  LHGYTDSTFGHDKDSRKSTSGSPSFSNGGVVV
        L GYTDS F  D+DSRKSTSGS    NGG VV
Subjt:  LHGYTDSTFGHDKDSRKSTSGSPSFSNGGVVV

A0A5D3CPJ6 Gag/pol protein | e-value: 5.2e-216 | % identity: 44.96
Query:  TVLFNNLQTFQSLVQTRGQEAETSVA--LRPFHRGSTSGIKFVDSSGPKGKKEKLK---EVKVDRVVALKGKEIKDVARKCFYSNGDEHLKKSCPKFLDE
        T L N LQTF+SL++ +GQ+ E +VA   R FHRGSTSG K + SS    K +K K     K +   A   K+ K     CF+ N + H K++CPK+L E
Subjt:  TVLFNNLQTFQSLVQTRGQEAETSVA--LRPFHRGSTSGIKFVDSSGPKGKKEKLK---EVKVDRVVALKGKEIKDVARKCFYSNGDEHLKKSCPKFLDE

Query:  KKMT--GKCDLLVTETCLVESNDSAWILDSGATNHVCSSFQGLSPWQRMQEGEITLRVGSEEVVSAAAIG------------------------------
        KK    GK DLLV ETCLVE++DSAWI+DSGATNHVCSSFQG+S W++++ GE+T+RVG+  VVSA A+G                              
Subjt:  KKMT--GKCDLLVTETCLVESNDSAWILDSGATNHVCSSFQGLSPWQRMQEGEITLRVGSEEVVSAAAIG------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------TKNGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCETPFELWHGRKTSLCHFRIWGCP
                                   +NGVSERRNRTLLDMVRSMMSYA LP+SFW YAV+T VYILN V SKSV ETP +LW+GRK SL HFRIWGCP
Subjt:  --------------------------TKNGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCETPFELWHGRKTSLCHFRIWGCP

Query:  THVLVSNLKKLEPRSKLCLFVGYPKETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLNEMDNTSVRVADGAI-------------------R
         HVL +N KKLEPRSKLCLFVGYPK TRGGYFYDPKDN+V VSTNATF EEDHIR+H PRSKIVLNE+   +   +   +                   +
Subjt:  THVLVSNLKKLEPRSKLCLFVGYPKETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLNEMDNTSVRVADGAI-------------------R

Query:  SQKLGMPRRSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQT
         Q L  PRRSGRV   P RYM L ET  +  D D EDPLT+ +AM D+DKDEWIKAM+ E+ESMYFNSVW+LVDQ DGVKPIGCKWIYKRKRG DGKVQT
Subjt:  SQKLGMPRRSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQT

Query:  FKARLVAKGFTQVEGVDYEETFSPVAM--------------------MDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIR
        FKARLVAKG+TQVEGVDYEETFSPVAM                    MDVKTAFLNGNL+E IYM QP+GFI  GQEQ++C+L RSIYGLKQASRSWNIR
Subjt:  FKARLVAKGFTQVEGVDYEETFSPVAM--------------------MDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIR

Query:  FDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIR--------------------------
        FD AIKSYGFDQ VDEPCVYK+I+N  VAFL+LYVDDILLIGN++G LTDIK+WLA+QFQ KDLGEAQ+VLGI+                          
Subjt:  FDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIR--------------------------

Query:  CELQKKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIP-------------------------MLQLQSSSILGE-----------RGTTD--LVYGSGDLI
         +  K+GL PF HGV L ++QCPKTPQ+VE+MR IP                         + + QS+  L             R T D  LVYGS DLI
Subjt:  CELQKKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIP-------------------------MLQLQSSSILGE-----------RGTTD--LVYGSGDLI

Query:  LHGYTDSTFGHDKDSRKSTSGSPSFSNGGVVV
        L GYTDS F  D+DSRKSTSGS    NGG VV
Subjt:  LHGYTDSTFGHDKDSRKSTSGSPSFSNGGVVV

A0A5D3CSZ6  Gag/pol protein  5.2e-216  44.96
Query:  TVLFNNLQTFQSLVQTRGQEAETSVA--LRPFHRGSTSGIKFVDSSGPKGKKEKLK---EVKVDRVVALKGKEIKDVARKCFYSNGDEHLKKSCPKFLDE
        T L N LQTF+SL++ +GQ+ E +VA   R FHRGSTSG K + SS    K +K K     K +   A   K+ K     CF+ N + H K++CPK+L E
Subjt:  TVLFNNLQTFQSLVQTRGQEAETSVA--LRPFHRGSTSGIKFVDSSGPKGKKEKLK---EVKVDRVVALKGKEIKDVARKCFYSNGDEHLKKSCPKFLDE

Query:  KKMT--GKCDLLVTETCLVESNDSAWILDSGATNHVCSSFQGLSPWQRMQEGEITLRVGSEEVVSAAAIG------------------------------
        KK    GK DLLV ETCLVE++DSAWI+DSGATNHVCSSFQG+S W++++ GE+T+RVG+  VVSA A+G                              
Subjt:  KKMT--GKCDLLVTETCLVESNDSAWILDSGATNHVCSSFQGLSPWQRMQEGEITLRVGSEEVVSAAAIG------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------TKNGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCETPFELWHGRKTSLCHFRIWGCP
                                   +NGVSERRNRTLLDMVRSMMSYA LP+SFW YAV+T VYILN V SKSV ETP +LW+GRK SL HFRIWGCP
Subjt:  --------------------------TKNGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCETPFELWHGRKTSLCHFRIWGCP

Query:  THVLVSNLKKLEPRSKLCLFVGYPKETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLNEMDNTSVRVADGAI-------------------R
         HVL +N KKLEPRSKLCLFVGYPK TRGGYFYDPKDN+V VSTNATF EEDHIR+H PRSKIVLNE+   +   +   +                   +
Subjt:  THVLVSNLKKLEPRSKLCLFVGYPKETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLNEMDNTSVRVADGAI-------------------R

Query:  SQKLGMPRRSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQT
         Q L  PRRSGRV   P RYM L ET  +  D D EDPLT+ +AM D+DKDEWIKAM+ E+ESMYFNSVW+LVDQ DGVKPIGCKWIYKRKRG DGKVQT
Subjt:  SQKLGMPRRSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQT

Query:  FKARLVAKGFTQVEGVDYEETFSPVAM--------------------MDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIR
        FKARLVAKG+TQVEGVDYEETFSPVAM                    MDVKTAFLNGNL+E IYM QP+GFI  GQEQ++C+L RSIYGLKQASRSWNIR
Subjt:  FKARLVAKGFTQVEGVDYEETFSPVAM--------------------MDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIR

Query:  FDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIR--------------------------
        FD AIKSYGFDQ VDEPCVYK+I+N  VAFL+LYVDDILLIGN++G LTDIK+WLA+QFQ KDLGEAQ+VLGI+                          
Subjt:  FDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIR--------------------------

Query:  CELQKKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIP-------------------------MLQLQSSSILGE-----------RGTTD--LVYGSGDLI
         +  K+GL PF HGV L ++QCPKTPQ+VE+MR IP                         + + QS+  L             R T D  LVYGS DLI
Subjt:  CELQKKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIP-------------------------MLQLQSSSILGE-----------RGTTD--LVYGSGDLI

Query:  LHGYTDSTFGHDKDSRKSTSGSPSFSNGGVVV
        L GYTDS F  D+DSRKSTSGS    NGG VV
Subjt:  LHGYTDSTFGHDKDSRKSTSGSPSFSNGGVVV

SwissProt top hits  e value  %identity  Alignment
P04146  Copia protein  1.1e-48  26.8
Query:  NGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCE---TPFELWHGRKTSLCHFRIWGCPTHVLVSNLK-KLEPRSKLCLFVGYP
        NGVSER  RT+ +  R+M+S A+L  SFW  AV T  Y++N + S+++ +   TP+E+WH +K  L H R++G   +V + N + K + +S   +FVGY 
Subjt:  NGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCE---TPFELWHGRKTSLCHFRIWGCPTHVLVSNLK-KLEPRSKLCLFVGYP

Query:  KETRGGYFYD-------------------------------------------PKDNRVLVST----------NATFFEE--------------------
         E  G   +D                                           P D+R ++ T          N  F ++                    
Subjt:  KETRGGYFYD-------------------------------------------PKDNRVLVST----------NATFFEE--------------------

Query:  -----------DHIRDHLPRSKIVLNEM----------------DNTSVRVADGAIRSQKLGMP------------RRSGRVVRQPERYMGLAETQ----
                     ++D    +K  LNE                 +    R ++ A   +++G+             RRS R+  +P+      +      
Subjt:  -----------DHIRDHLPRSKIVLNEM----------------DNTSVRVADGAIRSQKLGMP------------RRSGRVVRQPERYMGLAETQ----

Query:  VLTLDDDYED-PLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVA
        VL     + D P ++D+     DK  W +A++ E+ +   N+ W +  + +    +  +W++  K    G    +KARLVA+GFTQ   +DYEETF+PVA
Subjt:  VLTLDDDYED-PLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVA

Query:  --------------------MMDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIRFDEAIKSYGFDQNVDEPCVY---KKI
                             MDVKTAFLNG L E IYM  P+G         VC+L ++IYGLKQA+R W   F++A+K   F  +  + C+Y   K  
Subjt:  --------------------MMDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIRFDEAIKSYGFDQNVDEPCVY---KKI

Query:  VNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIRCELQK
        +N  + +++LYVDD+++   ++  + + K +L  +F+  DL E ++ +GIR E+Q+
Subjt:  VNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIRCELQK

P10978  Retrovirus-related Pol polyprotein from transposon TNT 1-94  2.7e-76  30.68
Query:  NGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVC-ETPFELWHGRKTSLCHFRIWGCP--THVLVSNLKKLEPRSKLCLFVGYPK
        NGV+ER NRT+++ VRSM+  A+LP SFW  AV+T  Y++N   S  +  E P  +W  ++ S  H +++GC    HV      KL+ +S  C+F+GY  
Subjt:  NGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVC-ETPFELWHGRKTSLCHFRIWGCP--THVLVSNLKKLEPRSKLCLFVGYPK

Query:  ETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLN----------------------------------------EMDNTSVRVADGAIRSQKL
        E  G   +DP   +V+ S +   F E  +R     S+ V N                                        E  +  V   +   + ++ 
Subjt:  ETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLN----------------------------------------EMDNTSVRVADGAIRSQKL

Query:  GMP-RRSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQTFKA
          P RRS R   +  RY     T+ + + DD E P +  + ++  +K++ +KAM +EMES+  N  ++LV+   G +P+ CKW++K K+  D K+  +KA
Subjt:  GMP-RRSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQTFKA

Query:  RLVAKGFTQVEGVDYEETFSPVAMM--------------------DVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIRFDE
        RLV KGF Q +G+D++E FSPV  M                    DVKTAFL+G+L+E IYM+QP+GF   G++  VC+L +S+YGLKQA R W ++FD 
Subjt:  RLVAKGFTQVEGVDYEETFSPVAMM--------------------DVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIRFDE

Query:  AIKSYGFDQNVDEPCVY-KKIVNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIRCELQ------------------------
         +KS  + +   +PCVY K+   +    L+LYVDD+L++G + G +  +K  L+  F  KDLG AQ +LG++   +                        
Subjt:  AIKSYGFDQNVDEPCVY-KKIVNSIVAFLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIRCELQ------------------------

Query:  --KKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIPMLQLQSSSILGE------------------------------------RGTTD--LVYGSGDLILH
          K   +P    + L +  CP T +E  +M ++P      S +                                       RGTT   L +G  D IL 
Subjt:  --KKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIPMLQLQSSSILGE------------------------------------RGTTD--LVYGSGDLILH

Query:  GYTDSTFGHDKDSRKSTSGSPSFSNGGVV
        GYTD+    D D+RKS++G     +GG +
Subjt:  GYTDSTFGHDKDSRKSTSGSPSFSNGGVV

P25600  Putative transposon Ty5-1 protein YCL074W  2.0e-15  33.58
Query:  MDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGF
        MDV TAFLN  +DE IY+ QP GF+ +     V  L   +YGLKQA   WN   +  +K  GF ++  E  +Y +  +    ++ +YVDD+L+       
Subjt:  MDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGF

Query:  LTDIKEWLASQFQTKDLGEAQYVLGIRCELQKKG
           +K+ L   +  KDLG+    LG+       G
Subjt:  LTDIKEWLASQFQTKDLGEAQYVLGIRCELQKKG

Q94HW2  Retrovirus-related Pol polyprotein from transposon RE1  8.4e-38  35.22
Query:  DPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDG-VKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPV----------
        +P T  QA+ D   + W  AM  E+ +   N  W+LV      V  +GC+WI+ +K   DG +  +KARLVAKG+ Q  G+DY ETFSPV          
Subjt:  DPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDG-VKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPV----------

Query:  ----------AMMDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYV
                    +DV  AFL G L + +YM QP GFI + +   VC+L++++YGLKQA R+W +     + + GF  +V +  ++       + ++++YV
Subjt:  ----------AMMDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYV

Query:  DDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIRCELQKKGL
        DDIL+ GN+   L +  + L+ +F  KD  E  Y LGI  +    GL
Subjt:  DDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIRCELQKKGL

Q9ZT94  Retrovirus-related Pol polyprotein from transposon RE2  3.8e-38  35.22
Query:  DPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELV-DQSDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPV----------
        +P T  QAM D   D W +AM  E+ +   N  W+LV      V  +GC+WI+ +K   DG +  +KARLVAKG+ Q  G+DY ETFSPV          
Subjt:  DPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELV-DQSDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPV----------

Query:  ----------AMMDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYV
                    +DV  AFL G L + +YM QP GF+ + +   VCRL+++IYGLKQA R+W +     + + GF  ++ +  ++       + ++++YV
Subjt:  ----------AMMDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYV

Query:  DDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIRCELQKKGL
        DDIL+ GN+   L    + L+ +F  K+  +  Y LGI  +   +GL
Subjt:  DDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIRCELQKKGL

Arabidopsis top hits  e value  %identity  Alignment
AT4G23160.1  cysteine-rich RLK (RECEPTOR-like protein kinase) 8  3.3e-45  38.89
Query:  EDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVA---------
        ++P TY++A   +    W  AMD E+ +M     WE+       KPIGCKW+YK K   DG ++ +KARLVAKG+TQ EG+D+ ETFSPV          
Subjt:  EDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVA---------

Query:  -----------MMDVKTAFLNGNLDEIIYMDQPKGFIA-QGQE---QRVCRLKRSIYGLKQASRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVNSIVAFL
                    +D+  AFLNG+LDE IYM  P G+ A QG       VC LK+SIYGLKQASR W ++F   +  +GF Q+  +   + KI  ++   +
Subjt:  -----------MMDVKTAFLNGNLDEIIYMDQPKGFIA-QGQE---QRVCRLKRSIYGLKQASRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVNSIVAFL

Query:  ILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIRCELQKKGLS
        ++YVDDI++  N    + ++K  L S F+ +DLG  +Y LG+       G++
Subjt:  ILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIRCELQKKGLS

ATMG00710.1  Polynucleotidyl transferase, ribonuclease H-like superfamily protein  1.9e-05  32.93
Query:  NRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSV-CETPFELWHGRKTSLCHFRIWGCPTHVLVSNLKKLEPRSK
        NRT+++ VRSM+    LP +F   A  T V+I+N   S ++    P E+W     +  + R +GC  ++      KL+PR+K
Subjt:  NRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSV-CETPFELWHGRKTSLCHFRIWGCPTHVLVSNLKKLEPRSK

ATMG00810.1  DNA/RNA polymerases superfamily protein  2.1e-04  45.28
Query:  FLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIRCELQKKGL
        +L+LYVDDILL G+    L  +   L+S F  KDLG   Y LGI+ +    GL
Subjt:  FLILYVDDILLIGNEVGFLTDIKEWLASQFQTKDLGEAQYVLGIRCELQKKGL

ATMG00820.1  Reverse transcriptase (RNA-dependent DNA polymerase)  3.3e-13  41.67
Query:  WIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVAMMDVKTAFLN
        W +AM +E++++  N  W LV        +GCKW++K K   DG +   KARLVAKGF Q EG+ + ET+SPV         LN
Subjt:  WIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVAMMDVKTAFLN


Sequences
CDS sequence
ATGCACGGTTCAACAGTCCTATTCAACAATTTACAAACCTTCCAGTCTTTGGTGCAAACTAGGGGACAAGAAGCTGAGACAAGTGTTGCCTTAAGACCTTTTCACAGGGG
TTCGACCTCTGGGATAAAGTTTGTAGATTCTTCAGGCCCTAAGGGGAAGAAGGAGAAGTTGAAGGAAGTTAAGGTTGACCGCGTTGTCGCCCTAAAAGGCAAAGAAATCA
AGGATGTTGCCAGAAAGTGTTTCTACTCCAACGGGGACGAACACCTAAAAAAGAGTTGTCCCAAGTTCCTTGATGAAAAGAAGATGACAGGTAAATGTGATTTACTTGTC
ACAGAAACTTGTTTAGTGGAGAGTAATGACTCGGCTTGGATATTGGATTCAGGCGCCACTAACCATGTTTGCTCTTCTTTTCAAGGATTAAGTCCCTGGCAGCGGATGCA
AGAGGGTGAGATAACACTCAGGGTTGGATCCGAAGAAGTTGTCTCGGCTGCAGCAATAGGCACGAAGAATGGTGTATCGGAGAGGAGAAATAGAACATTGTTGGACATGG
TTCGGTCGATGATGAGCTATGCTCGTCTCCCTGATTCCTTTTGGGATTACGCAGTGGAGACTACGGTTTACATTTTGAACACAGTTTCGTCGAAAAGTGTTTGTGAAACA
CCTTTCGAACTCTGGCATGGTCGTAAAACCAGTTTATGTCATTTCAGAATTTGGGGATGCCCGACCCATGTGTTGGTGTCAAACCTGAAGAAGCTCGAACCCCGTTCAAA
GTTGTGCCTCTTTGTAGGTTACCCAAAAGAGACTAGAGGTGGTTACTTTTATGATCCTAAGGATAATAGGGTACTTGTGTCGACAAACGCCACTTTCTTTGAGGAAGACC
ACATCAGGGATCACTTACCAAGGAGTAAGATTGTGTTAAATGAAATGGACAATACATCAGTAAGAGTTGCTGATGGTGCTATCCGTTCCCAAAAGTTGGGAATGCCTCGA
CGTAGTGGGAGGGTTGTGAGACAGCCTGAACGTTACATGGGTTTAGCTGAAACCCAAGTCCTCACCCTTGATGATGACTACGAGGATCCATTGACCTATGATCAGGCAAT
GGCTGACATTGACAAAGACGAATGGATTAAAGCTATGGACCAGGAAATGGAGTCGATGTACTTCAATTCCGTCTGGGAGCTTGTGGACCAATCGGATGGGGTAAAACCTA
TTGGTTGCAAGTGGATCTACAAGCGTAAACGTGGTGTAGATGGGAAGGTGCAGACCTTCAAAGCTCGACTAGTGGCAAAGGGTTTTACCCAGGTGGAAGGGGTTGACTAT
GAGGAAACCTTTTCACCTGTTGCCATGATGGATGTCAAGACCGCCTTTCTGAATGGCAACCTTGACGAGATCATCTACATGGACCAGCCCAAGGGGTTCATTGCCCAAGG
CCAAGAGCAAAGAGTTTGCCGGCTTAAAAGGTCTATTTATGGACTGAAACAAGCCTCGAGGTCTTGGAATATAAGGTTTGATGAGGCGATCAAATCTTATGGCTTTGATC
AAAATGTTGACGAGCCTTGTGTTTACAAGAAAATCGTTAACAGTATCGTCGCATTTCTTATATTGTATGTGGATGATATCCTTCTCATTGGGAATGAGGTAGGATTTCTT
ACTGACATTAAGGAATGGTTGGCTTCGCAATTCCAAACTAAAGATTTGGGAGAGGCACAGTATGTTCTAGGTATAAGATGCGAACTCCAGAAGAAGGGCTTGTCGCCTTT
CGGGCATGGGGTTCACCTGCCTGAGGATCAATGTCCTAAGACGCCTCAAGAGGTTGAGGATATGAGACGAATCCCTATGCTTCAGTTGCAATCCTCAAGTATCTTAGGAG
AACGAGGAACTACTGACCTTGTGTATGGGAGTGGGGATTTGATCCTTCACGGATACACAGATTCGACTTTCGGGCACGATAAGGATTCTAGGAAATCCACTTCGGGGTCA
CCTTCATTCTCGAATGGAGGAGTCGTAGTGTGTGGCGAAGCATCAAACAAGGATGCATCGATTCCACGATGGAAGTCGAGTATGTTACGGCTTGTGAGCGCGCGAAGGAA
GTCGTTTGCCTCGGGAAGTTCATGA
mRNA sequence
ATGCACGGTTCAACAGTCCTATTCAACAATTTACAAACCTTCCAGTCTTTGGTGCAAACTAGGGGACAAGAAGCTGAGACAAGTGTTGCCTTAAGACCTTTTCACAGGGG
TTCGACCTCTGGGATAAAGTTTGTAGATTCTTCAGGCCCTAAGGGGAAGAAGGAGAAGTTGAAGGAAGTTAAGGTTGACCGCGTTGTCGCCCTAAAAGGCAAAGAAATCA
AGGATGTTGCCAGAAAGTGTTTCTACTCCAACGGGGACGAACACCTAAAAAAGAGTTGTCCCAAGTTCCTTGATGAAAAGAAGATGACAGGTAAATGTGATTTACTTGTC
ACAGAAACTTGTTTAGTGGAGAGTAATGACTCGGCTTGGATATTGGATTCAGGCGCCACTAACCATGTTTGCTCTTCTTTTCAAGGATTAAGTCCCTGGCAGCGGATGCA
AGAGGGTGAGATAACACTCAGGGTTGGATCCGAAGAAGTTGTCTCGGCTGCAGCAATAGGCACGAAGAATGGTGTATCGGAGAGGAGAAATAGAACATTGTTGGACATGG
TTCGGTCGATGATGAGCTATGCTCGTCTCCCTGATTCCTTTTGGGATTACGCAGTGGAGACTACGGTTTACATTTTGAACACAGTTTCGTCGAAAAGTGTTTGTGAAACA
CCTTTCGAACTCTGGCATGGTCGTAAAACCAGTTTATGTCATTTCAGAATTTGGGGATGCCCGACCCATGTGTTGGTGTCAAACCTGAAGAAGCTCGAACCCCGTTCAAA
GTTGTGCCTCTTTGTAGGTTACCCAAAAGAGACTAGAGGTGGTTACTTTTATGATCCTAAGGATAATAGGGTACTTGTGTCGACAAACGCCACTTTCTTTGAGGAAGACC
ACATCAGGGATCACTTACCAAGGAGTAAGATTGTGTTAAATGAAATGGACAATACATCAGTAAGAGTTGCTGATGGTGCTATCCGTTCCCAAAAGTTGGGAATGCCTCGA
CGTAGTGGGAGGGTTGTGAGACAGCCTGAACGTTACATGGGTTTAGCTGAAACCCAAGTCCTCACCCTTGATGATGACTACGAGGATCCATTGACCTATGATCAGGCAAT
GGCTGACATTGACAAAGACGAATGGATTAAAGCTATGGACCAGGAAATGGAGTCGATGTACTTCAATTCCGTCTGGGAGCTTGTGGACCAATCGGATGGGGTAAAACCTA
TTGGTTGCAAGTGGATCTACAAGCGTAAACGTGGTGTAGATGGGAAGGTGCAGACCTTCAAAGCTCGACTAGTGGCAAAGGGTTTTACCCAGGTGGAAGGGGTTGACTAT
GAGGAAACCTTTTCACCTGTTGCCATGATGGATGTCAAGACCGCCTTTCTGAATGGCAACCTTGACGAGATCATCTACATGGACCAGCCCAAGGGGTTCATTGCCCAAGG
CCAAGAGCAAAGAGTTTGCCGGCTTAAAAGGTCTATTTATGGACTGAAACAAGCCTCGAGGTCTTGGAATATAAGGTTTGATGAGGCGATCAAATCTTATGGCTTTGATC
AAAATGTTGACGAGCCTTGTGTTTACAAGAAAATCGTTAACAGTATCGTCGCATTTCTTATATTGTATGTGGATGATATCCTTCTCATTGGGAATGAGGTAGGATTTCTT
ACTGACATTAAGGAATGGTTGGCTTCGCAATTCCAAACTAAAGATTTGGGAGAGGCACAGTATGTTCTAGGTATAAGATGCGAACTCCAGAAGAAGGGCTTGTCGCCTTT
CGGGCATGGGGTTCACCTGCCTGAGGATCAATGTCCTAAGACGCCTCAAGAGGTTGAGGATATGAGACGAATCCCTATGCTTCAGTTGCAATCCTCAAGTATCTTAGGAG
AACGAGGAACTACTGACCTTGTGTATGGGAGTGGGGATTTGATCCTTCACGGATACACAGATTCGACTTTCGGGCACGATAAGGATTCTAGGAAATCCACTTCGGGGTCA
CCTTCATTCTCGAATGGAGGAGTCGTAGTGTGTGGCGAAGCATCAAACAAGGATGCATCGATTCCACGATGGAAGTCGAGTATGTTACGGCTTGTGAGCGCGCGAAGGAA
GTCGTTTGCCTCGGGAAGTTCATGA
Protein sequence
MHGSTVLFNNLQTFQSLVQTRGQEAETSVALRPFHRGSTSGIKFVDSSGPKGKKEKLKEVKVDRVVALKGKEIKDVARKCFYSNGDEHLKKSCPKFLDEKKMTGKCDLLV
TETCLVESNDSAWILDSGATNHVCSSFQGLSPWQRMQEGEITLRVGSEEVVSAAAIGTKNGVSERRNRTLLDMVRSMMSYARLPDSFWDYAVETTVYILNTVSSKSVCET
PFELWHGRKTSLCHFRIWGCPTHVLVSNLKKLEPRSKLCLFVGYPKETRGGYFYDPKDNRVLVSTNATFFEEDHIRDHLPRSKIVLNEMDNTSVRVADGAIRSQKLGMPR
RSGRVVRQPERYMGLAETQVLTLDDDYEDPLTYDQAMADIDKDEWIKAMDQEMESMYFNSVWELVDQSDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDY
EETFSPVAMMDVKTAFLNGNLDEIIYMDQPKGFIAQGQEQRVCRLKRSIYGLKQASRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVNSIVAFLILYVDDILLIGNEVGFL
TDIKEWLASQFQTKDLGEAQYVLGIRCELQKKGLSPFGHGVHLPEDQCPKTPQEVEDMRRIPMLQLQSSSILGERGTTDLVYGSGDLILHGYTDSTFGHDKDSRKSTSGS
PSFSNGGVVVCGEASNKDASIPRWKSSMLRLVSARRKSFASGSS
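
The CDS and protein sequences above can be cross-checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the function name `translate_cds` is hypothetical and is not part of the database or of any library.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. Using this table is an assumption.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped sequence
    lines can be pasted in directly. Codons containing characters other
    than T, C, A, G are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First six codons of the CDS listed above.
print(translate_cds("ATGCACGGTTCAACAGTC"))  # MHGSTV
```

The six residues printed match the start of the protein sequence (MHGSTV...). Passing the full CDS text to the same function and comparing the result with the protein sequence gives a complete consistency check.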