; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G12920 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G12920
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr5:12327065..12330005
RNA-Seq ExpressionCSPI05G12920
SyntenyCSPI05G12920
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]2.0e-25952.89Show/hide
Query:  ANSTPVASTVALGNTKGLLTTSSKWVIDSGVTAHMTGNSHLFSRPLSPAPFLSVTLANDFTSSVLGSGTIHLTPSFSLSS--------------------
        ++STP+ASTVA GN K LLT+S+KWVIDSG TAHMTGNSHLFSRPLSPAPF SVTLA+  TSSVLGSGTIHLTPSFSLSS                    
Subjt:  ANSTPVASTVALGNTKGLLTTSSKWVIDSGVTAHMTGNSHLFSRPLSPAPFLSVTLANDFTSSVLGSGTIHLTPSFSLSS--------------------

Query:  ---------------DHVTKKIIDRGYELGDLYLFDHQVSQVVACPVVPSPFEIHCCLGHPSLFMLKKLYQEFRSLSSLNCDLCQFAKYHRLRSSPRVDK
                       D VTKKII RGYE G LYLFDHQVSQ VACPVVPSPFE+HC LGHPSLF+LKKLY EFRSLSSLNCD CQFAK+HRL SSPRVDK
Subjt:  ---------------DHVTKKIIDRGYELGDLYLFDHQVSQVVACPVVPSPFEIHCCLGHPSLFMLKKLYQEFRSLSSLNCDLCQFAKYHRLRSSPRVDK

Query:  RAIAPFDL---------PVLT-----------------------------------LHLKM--------------------------------VLH----
        RAIAPF+L         PV++                                    H ++                                ++H    
Subjt:  RAIAPFDL---------PVLT-----------------------------------LHLKM--------------------------------VLH----

Query:  ----------NEKNRHLLETARALSFQMHVPKTFWSDVI-----------------------------LFPIAPKIFGCVCFVRDVRPQHTKLDPKSLMC
                    KNRHLLETARALSFQMHV K FW D +                             LFPIAPKIFGCVCFVRDVRP HTKLDPKSL C
Subjt:  ----------NEKNRHLLETARALSFQMHVPKTFWSDVI-----------------------------LFPIAPKIFGCVCFVRDVRPQHTKLDPKSLMC

Query:  IFLGYSRVQKVYCCYCPTLKRYLVSPDVAFFVDTPFTSSPSNLCQEEDDNLFIYETHVLHQCFLHHVIRGQVMI--------------------------
        IFLGYSRVQK Y CYCPTLKRYLVSPDV FF DTPFTSSPS+LCQ EDDNLFIYE           V   + +I                          
Subjt:  IFLGYSRVQKVYCCYCPTLKRYLVSPDVAFFVDTPFTSSPSNLCQEEDDNLFIYETHVLHQCFLHHVIRGQVMI--------------------------

Query:  ------FPLLFAK------IPV---ISYHQLSPSTYAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDN----------GKKVIGCKWVFAVKMN
               P+   K       PV   ISYHQLSPSTYAFITSLESTSIPNSVH  LSH  WQN MIEEMTALDDN          GKK IGCKWVFAVKMN
Subjt:  ------FPLLFAK------IPV---ISYHQLSPSTYAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDN----------GKKVIGCKWVFAVKMN

Query:  PDGL---------------IEGSSY--TFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQGESDK-------------S
        PDG                I G+ Y  TFSPVAKLTSIRLFLS+AAT+KW LHQLDIKN FLHGDLQEEVY+EQPPG+V QGESDK             S
Subjt:  PDGL---------------IEGSSY--TFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQGESDK-------------S

Query:  PRAWFGKFSQALVCF-------------------------------------------------------------------------------------
        PRAWFGKFSQALVCF                                                                                     
Subjt:  PRAWFGKFSQALVCF-------------------------------------------------------------------------------------

Query:  ------------------DQQLVKEGELRKDTERYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIVE
                          +QQLVKEGEL KD ERYRRLVGKL YLTVTRPDIAY VSVVSQFMSS TVDHWA VEQILCYLKAAPGRGILYKDHGHT VE
Subjt:  ------------------DQQLVKEGELRKDTERYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIVE

Query:  CFSDADWAGSREDRRSTSGYCVF--------------------AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIASIP--------
        CFSDADWAGSREDRRSTSGYCVF                    AESE RA+AQSVCEIVWIHQLLS+ GFSITV AKLWC NQAALHIAS P        
Subjt:  CFSDADWAGSREDRRSTSGYCVF--------------------AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIASIP--------

Query:  -----------------------TGEQLGDILTKALNGARIIYLCNSLGTIDIFALA
                               TGEQLGDILTKALNG RI YLCN LG IDIFA A
Subjt:  -----------------------TGEQLGDILTKALNGARIIYLCNSLGTIDIFALA

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus]1.5e-22751.67Show/hide
Query:  GSGTIHLTPSFSLS-SDHVTKKIIDRGYELGDLYLFDHQVSQVVACPVVPSPFEIHCCLGHPSLFMLKKLYQEFRSLSSLNCDLCQFAKYHRLRSSPRVD
        G G+ H   +  +   D VTKKII RGYE G LYLFDHQVSQ VACPVVPSPFE+HC LGHPSLF+LKKLY EFRSLSSLNCD CQFAK+HRL SSPRVD
Subjt:  GSGTIHLTPSFSLS-SDHVTKKIIDRGYELGDLYLFDHQVSQVVACPVVPSPFEIHCCLGHPSLFMLKKLYQEFRSLSSLNCDLCQFAKYHRLRSSPRVD

Query:  KRAIAPFDL---------PVLT-----------------------------------LHLKM--------------------------------VLH---
        KRAIAPF+L         PV++                                    H ++                                ++H   
Subjt:  KRAIAPFDL---------PVLT-----------------------------------LHLKM--------------------------------VLH---

Query:  -----------NEKNRHLLETARALSFQMHVPKTFWSDVI-----------------------------LFPIAPKIFGCVCFVRDVRPQHTKLDPKSLM
                     KNRHLLETARALSFQMHV K FW D +                             LFPIAPKIFGCVCFVRDVRP HTKLDPKSL 
Subjt:  -----------NEKNRHLLETARALSFQMHVPKTFWSDVI-----------------------------LFPIAPKIFGCVCFVRDVRPQHTKLDPKSLM

Query:  CIFLGYSRVQKVYCCYCPTLKRYLVSPDVAFFVDTPFTSSPSNLCQEEDDNLFIYETHVLHQCFLHHVIRGQVMI-------------------------
        CIFLGYSRVQK Y CYCPTLKRYLVSPDV FF DTPFTSSPS+LCQ EDDNLFIYE           V   + +I                         
Subjt:  CIFLGYSRVQKVYCCYCPTLKRYLVSPDVAFFVDTPFTSSPSNLCQEEDDNLFIYETHVLHQCFLHHVIRGQVMI-------------------------

Query:  -------FPLLFAK------IPV---ISYHQLSPSTYAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDN----------GKKVIGCKWVFAVKM
                P+   K       PV   ISYHQLSPSTYAFITSLESTSIPNSVH  LSH  WQN MIEEMTALDDN          GKK IGCKWVFAVKM
Subjt:  -------FPLLFAK------IPV---ISYHQLSPSTYAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDN----------GKKVIGCKWVFAVKM

Query:  NPDGL---------------IEGSSY--TFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQGESDK-------------
        NPDG                I G+ Y  TFSPVAKLTSIRLFLS+AAT+KW LHQLDIKN FLHGDLQEEVY+EQPPG+V QGESDK             
Subjt:  NPDGL---------------IEGSSY--TFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQGESDK-------------

Query:  SPRAWFGKFSQALVCF------------------------------------------------------------------------------------
        SPRAWFGKFSQALVCF                                                                                    
Subjt:  SPRAWFGKFSQALVCF------------------------------------------------------------------------------------

Query:  -------------------DQQLVKEGELRKDTERYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIV
                           +QQLVKEGEL KD ERYRRLVGKL YLTVTRPDIAY VSVVSQFMSS TVDHWA VEQILCYLKAAPGRGILYKDHGHT V
Subjt:  -------------------DQQLVKEGELRKDTERYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIV

Query:  ECFSDADWAGSREDRRSTSGYCVF--------------------AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIASIP-------
        ECFSDADWAGSREDRRSTSGYCVF                    AESE RA+AQSVCEIVWIHQLLS+ GFSITV AKLWC NQAALHIAS P       
Subjt:  ECFSDADWAGSREDRRSTSGYCVF--------------------AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIASIP-------

Query:  ------------------------TGEQLGDILTKALNGARIIYLCNSLGTIDIFALA
                                TGEQLGDILTKALNG RI YLCN LG IDIFA A
Subjt:  ------------------------TGEQLGDILTKALNGARIIYLCNSLGTIDIFALA

XP_031744755.1 uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus]1.5e-22751.67Show/hide
Query:  GSGTIHLTPSFSLS-SDHVTKKIIDRGYELGDLYLFDHQVSQVVACPVVPSPFEIHCCLGHPSLFMLKKLYQEFRSLSSLNCDLCQFAKYHRLRSSPRVD
        G G+ H   +  +   D VTKKII RGYE G LYLFDHQVSQ VACPVVPSPFE+HC LGHPSLF+LKKLY EFRSLSSLNCD CQFAK+HRL SSPRVD
Subjt:  GSGTIHLTPSFSLS-SDHVTKKIIDRGYELGDLYLFDHQVSQVVACPVVPSPFEIHCCLGHPSLFMLKKLYQEFRSLSSLNCDLCQFAKYHRLRSSPRVD

Query:  KRAIAPFDL---------PVLT-----------------------------------LHLKM--------------------------------VLH---
        KRAIAPF+L         PV++                                    H ++                                ++H   
Subjt:  KRAIAPFDL---------PVLT-----------------------------------LHLKM--------------------------------VLH---

Query:  -----------NEKNRHLLETARALSFQMHVPKTFWSDVI-----------------------------LFPIAPKIFGCVCFVRDVRPQHTKLDPKSLM
                     KNRHLLETARALSFQMHV K FW D +                             LFPIAPKIFGCVCFVRDVRP HTKLDPKSL 
Subjt:  -----------NEKNRHLLETARALSFQMHVPKTFWSDVI-----------------------------LFPIAPKIFGCVCFVRDVRPQHTKLDPKSLM

Query:  CIFLGYSRVQKVYCCYCPTLKRYLVSPDVAFFVDTPFTSSPSNLCQEEDDNLFIYETHVLHQCFLHHVIRGQVMI-------------------------
        CIFLGYSRVQK Y CYCPTLKRYLVSPDV FF DTPFTSSPS+LCQ EDDNLFIYE           V   + +I                         
Subjt:  CIFLGYSRVQKVYCCYCPTLKRYLVSPDVAFFVDTPFTSSPSNLCQEEDDNLFIYETHVLHQCFLHHVIRGQVMI-------------------------

Query:  -------FPLLFAK------IPV---ISYHQLSPSTYAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDN----------GKKVIGCKWVFAVKM
                P+   K       PV   ISYHQLSPSTYAFITSLESTSIPNSVH  LSH  WQN MIEEMTALDDN          GKK IGCKWVFAVKM
Subjt:  -------FPLLFAK------IPV---ISYHQLSPSTYAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDN----------GKKVIGCKWVFAVKM

Query:  NPDGL---------------IEGSSY--TFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQGESDK-------------
        NPDG                I G+ Y  TFSPVAKLTSIRLFLS+AAT+KW LHQLDIKN FLHGDLQEEVY+EQPPG+V QGESDK             
Subjt:  NPDGL---------------IEGSSY--TFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQGESDK-------------

Query:  SPRAWFGKFSQALVCF------------------------------------------------------------------------------------
        SPRAWFGKFSQALVCF                                                                                    
Subjt:  SPRAWFGKFSQALVCF------------------------------------------------------------------------------------

Query:  -------------------DQQLVKEGELRKDTERYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIV
                           +QQLVKEGEL KD ERYRRLVGKL YLTVTRPDIAY VSVVSQFMSS TVDHWA VEQILCYLKAAPGRGILYKDHGHT V
Subjt:  -------------------DQQLVKEGELRKDTERYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIV

Query:  ECFSDADWAGSREDRRSTSGYCVF--------------------AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIASIP-------
        ECFSDADWAGSREDRRSTSGYCVF                    AESE RA+AQSVCEIVWIHQLLS+ GFSITV AKLWC NQAALHIAS P       
Subjt:  ECFSDADWAGSREDRRSTSGYCVF--------------------AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIASIP-------

Query:  ------------------------TGEQLGDILTKALNGARIIYLCNSLGTIDIFALA
                                TGEQLGDILTKALNG RI YLCN LG IDIFA A
Subjt:  ------------------------TGEQLGDILTKALNGARIIYLCNSLGTIDIFALA

XP_031744756.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus]1.5e-22751.67Show/hide
Query:  GSGTIHLTPSFSLS-SDHVTKKIIDRGYELGDLYLFDHQVSQVVACPVVPSPFEIHCCLGHPSLFMLKKLYQEFRSLSSLNCDLCQFAKYHRLRSSPRVD
        G G+ H   +  +   D VTKKII RGYE G LYLFDHQVSQ VACPVVPSPFE+HC LGHPSLF+LKKLY EFRSLSSLNCD CQFAK+HRL SSPRVD
Subjt:  GSGTIHLTPSFSLS-SDHVTKKIIDRGYELGDLYLFDHQVSQVVACPVVPSPFEIHCCLGHPSLFMLKKLYQEFRSLSSLNCDLCQFAKYHRLRSSPRVD

Query:  KRAIAPFDL---------PVLT-----------------------------------LHLKM--------------------------------VLH---
        KRAIAPF+L         PV++                                    H ++                                ++H   
Subjt:  KRAIAPFDL---------PVLT-----------------------------------LHLKM--------------------------------VLH---

Query:  -----------NEKNRHLLETARALSFQMHVPKTFWSDVI-----------------------------LFPIAPKIFGCVCFVRDVRPQHTKLDPKSLM
                     KNRHLLETARALSFQMHV K FW D +                             LFPIAPKIFGCVCFVRDVRP HTKLDPKSL 
Subjt:  -----------NEKNRHLLETARALSFQMHVPKTFWSDVI-----------------------------LFPIAPKIFGCVCFVRDVRPQHTKLDPKSLM

Query:  CIFLGYSRVQKVYCCYCPTLKRYLVSPDVAFFVDTPFTSSPSNLCQEEDDNLFIYETHVLHQCFLHHVIRGQVMI-------------------------
        CIFLGYSRVQK Y CYCPTLKRYLVSPDV FF DTPFTSSPS+LCQ EDDNLFIYE           V   + +I                         
Subjt:  CIFLGYSRVQKVYCCYCPTLKRYLVSPDVAFFVDTPFTSSPSNLCQEEDDNLFIYETHVLHQCFLHHVIRGQVMI-------------------------

Query:  -------FPLLFAK------IPV---ISYHQLSPSTYAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDN----------GKKVIGCKWVFAVKM
                P+   K       PV   ISYHQLSPSTYAFITSLESTSIPNSVH  LSH  WQN MIEEMTALDDN          GKK IGCKWVFAVKM
Subjt:  -------FPLLFAK------IPV---ISYHQLSPSTYAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDN----------GKKVIGCKWVFAVKM

Query:  NPDGL---------------IEGSSY--TFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQGESDK-------------
        NPDG                I G+ Y  TFSPVAKLTSIRLFLS+AAT+KW LHQLDIKN FLHGDLQEEVY+EQPPG+V QGESDK             
Subjt:  NPDGL---------------IEGSSY--TFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQGESDK-------------

Query:  SPRAWFGKFSQALVCF------------------------------------------------------------------------------------
        SPRAWFGKFSQALVCF                                                                                    
Subjt:  SPRAWFGKFSQALVCF------------------------------------------------------------------------------------

Query:  -------------------DQQLVKEGELRKDTERYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIV
                           +QQLVKEGEL KD ERYRRLVGKL YLTVTRPDIAY VSVVSQFMSS TVDHWA VEQILCYLKAAPGRGILYKDHGHT V
Subjt:  -------------------DQQLVKEGELRKDTERYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIV

Query:  ECFSDADWAGSREDRRSTSGYCVF--------------------AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIASIP-------
        ECFSDADWAGSREDRRSTSGYCVF                    AESE RA+AQSVCEIVWIHQLLS+ GFSITV AKLWC NQAALHIAS P       
Subjt:  ECFSDADWAGSREDRRSTSGYCVF--------------------AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIASIP-------

Query:  ------------------------TGEQLGDILTKALNGARIIYLCNSLGTIDIFALA
                                TGEQLGDILTKALNG RI YLCN LG IDIFA A
Subjt:  ------------------------TGEQLGDILTKALNGARIIYLCNSLGTIDIFALA

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus]6.8e-22852.17Show/hide
Query:  LSSDHVTKKIIDRGYELGDLYLFDHQVSQVVACPVVPSPFEIHCCLGHPSLFMLKKLYQEFRSLSSLNCDLCQFAKYHRLRSSPRVDKRAIAPFDL----
        L  D VTKKII RGYE G LYLFDHQVSQ VACPVVPSPFE+HC LGHPSLF+LKKLY EFRSLSSLNCD CQFAK+HRL SSPRVDKRAIAPF+L    
Subjt:  LSSDHVTKKIIDRGYELGDLYLFDHQVSQVVACPVVPSPFEIHCCLGHPSLFMLKKLYQEFRSLSSLNCDLCQFAKYHRLRSSPRVDKRAIAPFDL----

Query:  -----PVLT-----------------------------------LHLKM--------------------------------VLH--------------NE
             PV++                                    H ++                                ++H                
Subjt:  -----PVLT-----------------------------------LHLKM--------------------------------VLH--------------NE

Query:  KNRHLLETARALSFQMHVPKTFWSDVI-----------------------------LFPIAPKIFGCVCFVRDVRPQHTKLDPKSLMCIFLGYSRVQKVY
        KNRHLLETARALSFQMHV K FW D +                             LFPIAPKIFGCVCFVRDVRP HTKLDPKSL CIFLGYSRVQK Y
Subjt:  KNRHLLETARALSFQMHVPKTFWSDVI-----------------------------LFPIAPKIFGCVCFVRDVRPQHTKLDPKSLMCIFLGYSRVQKVY

Query:  CCYCPTLKRYLVSPDVAFFVDTPFTSSPSNLCQEEDDNLFIYETHVLHQCFLHHVIRGQVMI--------------------------------FPLLFA
         CYCPTLKRYLVSPDV FF DTPFTSSPS+LCQ EDDNLFIYE           V   + +I                                 P+   
Subjt:  CCYCPTLKRYLVSPDVAFFVDTPFTSSPSNLCQEEDDNLFIYETHVLHQCFLHHVIRGQVMI--------------------------------FPLLFA

Query:  K------IPV---ISYHQLSPSTYAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDN----------GKKVIGCKWVFAVKMNPDGL--------
        K       PV   ISYHQLSPSTYAFITSLESTSIPNSVH  LSH  WQN MIEEMTALDDN          GKK IGCKWVFAVKMNPDG         
Subjt:  K------IPV---ISYHQLSPSTYAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDN----------GKKVIGCKWVFAVKMNPDGL--------

Query:  -------IEGSSY--TFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQGESDK-------------SPRAWFGKFSQAL
               I G+ Y  TFSPVAKLTSIRLFLS+AAT+KW LHQLDIKN FLHGDLQEEVY+EQPPG+V QGESDK             SPRAWFGKFSQAL
Subjt:  -------IEGSSY--TFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQGESDK-------------SPRAWFGKFSQAL

Query:  VCF-------------------------------------------------------------------------------------------------
        VCF                                                                                                 
Subjt:  VCF-------------------------------------------------------------------------------------------------

Query:  ------DQQLVKEGELRKDTERYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIVECFSDADWAGSRE
              +QQLVKEGEL KD ERYRRLVGKL YLTVTRPDIAY VSVVSQFMSS TVDHWA VEQILCYLKAAPGRGILYKDHGHT VECFSDADWAGSRE
Subjt:  ------DQQLVKEGELRKDTERYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIVECFSDADWAGSRE

Query:  DRRSTSGYCVF--------------------AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIASIP--------------------
        DRRSTSGYCVF                    AESE RA+AQSVCEIVWIHQLLS+ GFSITV AKLWC NQAALHIAS P                    
Subjt:  DRRSTSGYCVF--------------------AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIASIP--------------------

Query:  -----------TGEQLGDILTKALNGARIIYLCNSLGTIDIFALA
                   TGEQLGDILTKALNG RI YLCN LG IDIFA A
Subjt:  -----------TGEQLGDILTKALNGARIIYLCNSLGTIDIFALA

TrEMBL top hitse value%identityAlignment
A0A438DE60 Retrovirus-related Pol polyprotein from transposon TNT 1-949.8e-15638.19Show/hide
Query:  STPVASTVALGNTKGLLTTSSKWVIDSGVTAHMTGNSHLFS--RPLSPAPFLSVTLANDFTSSVLGSGTIHLTPSFSLSS--------------------
        STPV++    G T  L+++S+KW+IDS  T HMTGN   FS  R  S  P   VT+A+  T  + GSGT+  T S +LSS                    
Subjt:  STPVASTVALGNTKGLLTTSSKWVIDSGVTAHMTGNSHLFS--RPLSPAPFLSVTLANDFTSSVLGSGTIHLTPSFSLSS--------------------

Query:  ---------------DHVTKKIIDRGYELGDLYLFDHQVSQVVACPVVPSPFEIHCCLGHPSLFMLKKLYQEFRSLSSLNCDLCQFAKYHRLRSSPRVDK
                       D +TK+   +G+    LY+ +  V + VAC    SP E HC LGHPSL +LKKL  +F +L SL+C+ C FAK+HR    PR++K
Subjt:  ---------------DHVTKKIIDRGYELGDLYLFDHQVSQVVACPVVPSPFEIHCCLGHPSLFMLKKLYQEFRSLSSLNCDLCQFAKYHRLRSSPRVDK

Query:  RAIAPFDL---------PVLT---------------------------------------------LHLKM----------------------VLH----
        RA + F+L         PV +                                             + +K+                      +LH    
Subjt:  RAIAPFDL---------PVLT---------------------------------------------LHLKM----------------------VLH----

Query:  ----------NEKNRHLLETARALSFQMHVPKTFWSDVI-----------------------------LFPIAPKIFGCVCFVRDVRPQHTKLDPKSLMC
                    KNRHLLETARAL FQM VPK FW+D +                             LFP+AP+IFGC C+VRD RP  TKLDPK+L C
Subjt:  ----------NEKNRHLLETARALSFQMHVPKTFWSDVI-----------------------------LFPIAPKIFGCVCFVRDVRPQHTKLDPKSLMC

Query:  IFLGYSRVQKVYCCYCPTLKRYLVSPDVAFFVDTPFTSSPSNLCQEEDDNLFIYE-----------THVLHQCFLHHVIRGQVMIFPLLFAKIPVISYHQ
        +FLGYSR+QK Y C+ P L +YLVS DV F  DT F SSP++   EED+   +Y+           + V     L H     V+  P   AK P++  + 
Subjt:  IFLGYSRVQKVYCCYCPTLKRYLVSPDVAFFVDTPFTSSPSNLCQEEDDNLFIYE-----------THVLHQCFLHHVIRGQVMIFPLLFAKIPVISYHQ

Query:  LSPSTYAFITSLESTSIPNSVHGT--------LSHLDWQNEMIEEMTALDDN----------GKKVIGCKWVFAVKMNPDGLIE---------------G
          P T     +   +S   SV  T        L+H  W+N M+EE+ AL+DN          GKKV+GCKWVFAVK++PDG +                G
Subjt:  LSPSTYAFITSLESTSIPNSVHGT--------LSHLDWQNEMIEEMTALDDN----------GKKVIGCKWVFAVKMNPDGLIE---------------G

Query:  SSY--TFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQGESDK-------------SPRAWFGKFSQALVCFDQQLVKE
          Y  TFSPVAKL S+RLF+SIAA+ +W +HQLDIKN FLHGDL+EEVY+EQPPG+V QGE  K             SPRAWFGKFS+ +  F     ++
Subjt:  SSY--TFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQGESDK-------------SPRAWFGKFSQALVCFDQQLVKE

Query:  -------------------------------------------------GELR--------------------------KDT---------ERYRRLVGK
                                                         GEL+                          K+T         ERYRR+VGK
Subjt:  -------------------------------------------------GELR--------------------------KDT---------ERYRRLVGK

Query:  L-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGYCVF------------------
        L YLTVTRPDIAY VSVVSQF S+ T+ HWA +EQILCYLK APG GILY   GHT +ECFSDADWAGS+ DRRST+GY VF                  
Subjt:  L-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGYCVF------------------

Query:  --AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIASIP-------------------------------TGEQLGDILTKALNGARI
          AESE RA+AQ+ CEI+WIHQLL + G   T+ AKLWC NQAALHIA+ P                               TGEQLGDI TKALNG R+
Subjt:  --AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIASIP-------------------------------TGEQLGDILTKALNGARI

Query:  IYLCNSLGTIDIFALA
         Y CN LG I+I+A A
Subjt:  IYLCNSLGTIDIFALA

A0A438GAA6 Retrovirus-related Pol polyprotein from transposon TNT 1-945.7e-15636.43Show/hide
Query:  STPVASTVALGNTKGLLTTSSKWVIDSGVTAHMTGNSHLFS--RPLSPAPFLSVTLANDFTSSVLGSGTIHLTPSFSLSS--------------------
        STPV++    G T  L+++S+KW+IDSG T HMTGN   FS  R  S  P   VT+A+  T  + GSGT+  T S +LSS                    
Subjt:  STPVASTVALGNTKGLLTTSSKWVIDSGVTAHMTGNSHLFS--RPLSPAPFLSVTLANDFTSSVLGSGTIHLTPSFSLSS--------------------

Query:  ---------------DHVTKKIIDRGYELGDLYLFDHQVSQVVACPVVPSPFEIHCCLGHPSLFMLKKLYQEFRSLSSLNCDLCQFAKYHRLRSSPRVDK
                       D +TK+   +G+    LY+ D  V + VAC    SP E HC LGHPSL +LKKL  +F +L SL+C+ C FAK+HR    PR++K
Subjt:  ---------------DHVTKKIIDRGYELGDLYLFDHQVSQVVACPVVPSPFEIHCCLGHPSLFMLKKLYQEFRSLSSLNCDLCQFAKYHRLRSSPRVDK

Query:  RAIAPFDL---------PVLT---------------------------------------------LHLKM----------------------VLH----
        RA + F+L         PV +                                             + +K+                      +LH    
Subjt:  RAIAPFDL---------PVLT---------------------------------------------LHLKM----------------------VLH----

Query:  ----------NEKNRHLLETARALSFQMHVPKTFWSDVI-----------------------------LFPIAPKIFGCVCFVRDVRPQHTKLDPKSLMC
                    KNRHLLETARAL FQM VPK FW+D +                             LFP+AP+IFGC C+VRD RP   KLDPK+L C
Subjt:  ----------NEKNRHLLETARALSFQMHVPKTFWSDVI-----------------------------LFPIAPKIFGCVCFVRDVRPQHTKLDPKSLMC

Query:  IFLGYSRVQKVYCCYCPTLKRYLVSPDVAFFVDTPFTSSPSNLCQEEDDNLFIYETHVLHQCFLHHVI---------RGQVMIFPLLFAKIPV-------
        +FLGYSR+QK Y C+ P L +YLVS DV F  DT F SSP++   EED+   +Y+            +          G V+  P   AK P+       
Subjt:  IFLGYSRVQKVYCCYCPTLKRYLVSPDVAFFVDTPFTSSPSNLCQEEDDNLFIYETHVLHQCFLHHVI---------RGQVMIFPLLFAKIPV-------

Query:  -------------------------------------------ISYHQLSPSTYAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDN--------
                                                   +SY  LS S+   + S++S S+P +V   L+H  W+N M+EE+ AL+DN        
Subjt:  -------------------------------------------ISYHQLSPSTYAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDN--------

Query:  --GKKVIGCKWVFAVKMNPDGLIE---------------GSSY--TFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQG
          GKKV+GCKWVFAVK+NPDG +                G  Y  TFSPVAKL S+RLF+SIAA+ +W +HQLDIKN FLHGDL+EEVY+EQPPG+V QG
Subjt:  --GKKVIGCKWVFAVKMNPDGLIE---------------GSSY--TFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQG

Query:  ESDK-------------SPRAWFGKFSQALVCFDQQLVKE-------------------------------------------------GELR-------
        E  K             SPRAWFGKFS+ +  F     ++                                                 GEL+       
Subjt:  ESDK-------------SPRAWFGKFSQALVCFDQQLVKE-------------------------------------------------GELR-------

Query:  -------------------KDT-----------------------------ERYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYL
                           K+T                             ERYRR+VGKL YLTVTRPDIAY VSVVSQF S+ T+ HWA +EQILCYL
Subjt:  -------------------KDT-----------------------------ERYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYL

Query:  KAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGYCVF--------------------AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCA
        K APG GILY   GHT +ECFSDADWAGS+ DRRST+GYCVF                    AESE RA++Q+ CEI+WIHQLL + G   T+ AKLWC 
Subjt:  KAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGYCVF--------------------AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCA

Query:  NQAALHIASIP-------------------------------TGEQLGDILTKALNGARIIYLCNSLGTIDIFALA
        NQAALHIA+ P                               TGEQLGDI TKALNG R+ Y CN LG I+I+A A
Subjt:  NQAALHIASIP-------------------------------TGEQLGDILTKALNGARIIYLCNSLGTIDIFALA

A0A438HEX0 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-15638.16Show/hide
Query:  STPVASTVALGNTKGLLTTSSKWVIDSGVTAHMTGNSHLFS--RPLSPAPFLSVTLANDFTSSVLGSGTIHLTPS-------------FSLSSDHVTKKI
        STPV++    G T  L+++S+KW+IDSG T HMTGN   FS  R  S  P   VT+A+  T  + GSGT+  T S             F+L SD +TK+ 
Subjt:  STPVASTVALGNTKGLLTTSSKWVIDSGVTAHMTGNSHLFS--RPLSPAPFLSVTLANDFTSSVLGSGTIHLTPS-------------FSLSSDHVTKKI

Query:  IDRGYELGDLYLFDHQVSQVVACPVVPSPFEIHCCLGHPSLFMLKKLYQEFRSLSSLNCDLCQFAKYHRLRSSPRVDKRAIAPFDL---------PVLT-
          +G+    LY+ D  V + VAC    SP E HC LGHPSL +LKKL  +F +L SL+C+ C FAK+HR    PR++KRA + F+L         PV + 
Subjt:  IDRGYELGDLYLFDHQVSQVVACPVVPSPFEIHCCLGHPSLFMLKKLYQEFRSLSSLNCDLCQFAKYHRLRSSPRVDKRAIAPFDL---------PVLT-

Query:  -----------------------------LHLKMVLH--------------NEKNRHLLETARALSFQMHVPKTFWSDVI--------------------
                                     +    +LH                KNRHLLETARAL FQM VPK FW+D +                    
Subjt:  -----------------------------LHLKMVLH--------------NEKNRHLLETARALSFQMHVPKTFWSDVI--------------------

Query:  ---------LFPIAPKIFGCVCFVRDVRPQHTKLDPKSLMCIFLGYSRVQKVYCCYCPTLKRYLVSPDVAFFVDTPFTSSPSNLCQEEDDNLFIYETHVL
                 LF +AP+IFGC C+VRD RP  TKLDPK+L C+FLGYSR+QK Y C+ P L +YLVS DV F  DT F SSP++   EED+   +Y+    
Subjt:  ---------LFPIAPKIFGCVCFVRDVRPQHTKLDPKSLMCIFLGYSRVQKVYCCYCPTLKRYLVSPDVAFFVDTPFTSSPSNLCQEEDDNLFIYETHVL

Query:  HQCFLHHVI---------RGQVMIFPLLFAKIPV--------------------------------------------------ISYHQLSPSTYAFITS
                +          G V+  P   AK P+                                                  +SY  LS S+   + S
Subjt:  HQCFLHHVI---------RGQVMIFPLLFAKIPV--------------------------------------------------ISYHQLSPSTYAFITS

Query:  LESTSIPNSVHGTLSHLDWQNEMIEEMTALDDN----------GKKVIGCKWVFAVKMNPDGLIE---------------GSSY--TFSPVAKLTSIRLF
        ++S S+P +V   L+H  W+N M+EE+ AL DN          GKKV+GCKWVFAVK+NPDG +                G  Y  TFSPVAKL S+RLF
Subjt:  LESTSIPNSVHGTLSHLDWQNEMIEEMTALDDN----------GKKVIGCKWVFAVKMNPDGLIE---------------GSSY--TFSPVAKLTSIRLF

Query:  LSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQGESDK-------------SPRAWFGKFSQALVCFDQQLVKE-------------------
        +SIAA+ +W +HQLDIKN FLHGDL+EEVY+EQPPG+V QGE  K             SPRAWFGKFS+ +  F     ++                   
Subjt:  LSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQGESDK-------------SPRAWFGKFSQALVCFDQQLVKE-------------------

Query:  ------------------------------GELR--------------------------KDT-----------------------------ERYRRLVG
                                      GEL+                          K+T                             ERYRR+VG
Subjt:  ------------------------------GELR--------------------------KDT-----------------------------ERYRRLVG

Query:  KL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGYCVF-----------------
        KL YLTVTRPDIAY VSVVSQF S+ T+ HWA +EQILCYLK APG GILY   GHT +ECFSDADWAGS+ DRRST+GYCVF                 
Subjt:  KL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGYCVF-----------------

Query:  ---AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIASIP-------------------------------TGEQLGDILTKALNGAR
           AESE RA++Q+ CEI+WIHQLL + G   T+ AKLWC NQAALHIA+ P                               TGEQLGDI TKALNG R
Subjt:  ---AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIASIP-------------------------------TGEQLGDILTKALNGAR

Query:  I
        +
Subjt:  I

A0A438IRR9 Retrovirus-related Pol polyprotein from transposon TNT 1-947.5e-15636.43Show/hide
Query:  STPVASTVALGNTKGLLTTSSKWVIDSGVTAHMTGNSHLFS--RPLSPAPFLSVTLANDFTSSVLGSGTIHLTPSFSLSS--------------------
        STPV++    G T  L+++S+KW+IDSG T HMTGN   FS  R  S  P   VT+A+  T  + GSGT+  T S +LSS                    
Subjt:  STPVASTVALGNTKGLLTTSSKWVIDSGVTAHMTGNSHLFS--RPLSPAPFLSVTLANDFTSSVLGSGTIHLTPSFSLSS--------------------

Query:  ---------------DHVTKKIIDRGYELGDLYLFDHQVSQVVACPVVPSPFEIHCCLGHPSLFMLKKLYQEFRSLSSLNCDLCQFAKYHRLRSSPRVDK
                       D +TK+   +G+    LY+ D  V + VAC    SP E HC LGHPSL +LKKL  +F +L SL+C+ C FAK+HR    PR++K
Subjt:  ---------------DHVTKKIIDRGYELGDLYLFDHQVSQVVACPVVPSPFEIHCCLGHPSLFMLKKLYQEFRSLSSLNCDLCQFAKYHRLRSSPRVDK

Query:  RAIAPFDL---------PVLT-------------------------------------------------------------------LHLKMVLH----
        RA + F+L         PV +                                                                   +    +LH    
Subjt:  RAIAPFDL---------PVLT-------------------------------------------------------------------LHLKMVLH----

Query:  ----------NEKNRHLLETARALSFQMHVPKTFWSDVI-----------------------------LFPIAPKIFGCVCFVRDVRPQHTKLDPKSLMC
                    KNRHLLETARAL FQM VPK FW+D +                             LFP+AP+IFGC C+VRD RP  TKLDPK+L C
Subjt:  ----------NEKNRHLLETARALSFQMHVPKTFWSDVI-----------------------------LFPIAPKIFGCVCFVRDVRPQHTKLDPKSLMC

Query:  IFLGYSRVQKVYCCYCPTLKRYLVSPDVAFFVDTPFTSSPSNLCQEEDDNLFIYETHVLHQCFLHHVI---------RGQVMIFPLLFAKIPV-------
        +FLGYSR+QK Y C+ P L +YLVS DV F  DT F SSP++   EED+   +Y+            +          G V+  P   AK P+       
Subjt:  IFLGYSRVQKVYCCYCPTLKRYLVSPDVAFFVDTPFTSSPSNLCQEEDDNLFIYETHVLHQCFLHHVI---------RGQVMIFPLLFAKIPV-------

Query:  -------------------------------------------ISYHQLSPSTYAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDN--------
                                                   +SY  LS S+   + S++S S+P +V   L+H  W+N M+EE+ AL+DN        
Subjt:  -------------------------------------------ISYHQLSPSTYAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDN--------

Query:  --GKKVIGCKWVFAVKMNPDGLIE---------------GSSY--TFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQG
          GKKV+GCKWVFAVK+N DG +                G  Y  TFSPVAKL S+RLF+SIAA+ +W +HQLDIKN FLHGDL+EEVY+EQPPG+V QG
Subjt:  --GKKVIGCKWVFAVKMNPDGLIE---------------GSSY--TFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQG

Query:  ESDK-------------SPRAWFGKFSQALVCFDQQLVKE-------------------------------------------------GELR-------
        E  K             SPRAWFGKFS+ +  F     ++                                                 GEL+       
Subjt:  ESDK-------------SPRAWFGKFSQALVCFDQQLVKE-------------------------------------------------GELR-------

Query:  -------------------KDT-----------------------------ERYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYL
                           K+T                             ERYRR+VGKL YLTVTRPDIAY VSVVSQF S+ T+ HWA +EQILCYL
Subjt:  -------------------KDT-----------------------------ERYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYL

Query:  KAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGYCVF--------------------AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCA
        K APG GILY   GHT +ECFSDADWAGS+ DRRST+GYCVF                    AESE RA+AQ+ CEI+WIHQLL + G   T+ AKLWC 
Subjt:  KAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGYCVF--------------------AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCA

Query:  NQAALHIASIP-------------------------------TGEQLGDILTKALNGARIIYLCNSLGTIDIFALA
        NQAALHIA+ P                               TGEQLGDI TKALNG R+ Y CN LG I+I+A A
Subjt:  NQAALHIASIP-------------------------------TGEQLGDILTKALNGARIIYLCNSLGTIDIFALA

B0FBS2 Uncharacterized protein1.2e-15636.52Show/hide
Query:  STPVASTVALGNTKGLLTTSSKWVIDSGVTAHMTGNSHLFS--RPLSPAPFLSVTLANDFTSSVLGSGTIHLTPSFSLSS--------------------
        STPV++    G T  L+++S+KW+IDSG T HMTGN   FS  R  S  P   VT+A+  T  + GSGT+  T S +LSS                    
Subjt:  STPVASTVALGNTKGLLTTSSKWVIDSGVTAHMTGNSHLFS--RPLSPAPFLSVTLANDFTSSVLGSGTIHLTPSFSLSS--------------------

Query:  ---------------DHVTKKIIDRGYELGDLYLFDHQVSQVVACPVVPSPFEIHCCLGHPSLFMLKKLYQEFRSLSSLNCDLCQFAKYHRLRSSPRVDK
                       D +TK+   +G+    LY+ D  V + VAC    SP E HC LGHPSL +LKKL  +F +L SL+C+ C FAK+HR    PR++K
Subjt:  ---------------DHVTKKIIDRGYELGDLYLFDHQVSQVVACPVVPSPFEIHCCLGHPSLFMLKKLYQEFRSLSSLNCDLCQFAKYHRLRSSPRVDK

Query:  RAIAPFDL---------PVLT---------------------------------------------LHLKM----------------------VLH----
        RA + F+L         PV +                                             + +K+                      +LH    
Subjt:  RAIAPFDL---------PVLT---------------------------------------------LHLKM----------------------VLH----

Query:  ----------NEKNRHLLETARALSFQMHVPKTFWSDVI-----------------------------LFPIAPKIFGCVCFVRDVRPQHTKLDPKSLMC
                    KNRHLLETARAL FQM VPK FW+D +                             LFP+AP+IFGC C+VRD RP  TKLDPK+L C
Subjt:  ----------NEKNRHLLETARALSFQMHVPKTFWSDVI-----------------------------LFPIAPKIFGCVCFVRDVRPQHTKLDPKSLMC

Query:  IFLGYSRVQKVYCCYCPTLKRYLVSPDVAFFVDTPFTSSPSNLCQEEDDNLFIYETHVLHQCFLHHVI---------RGQVMIFPLLFAKIPV-------
        +FLGYSR+QK Y C+ P L +YLVS DV F  DT F SSP++   EED+   +Y+            +          G V+  P   AK P+       
Subjt:  IFLGYSRVQKVYCCYCPTLKRYLVSPDVAFFVDTPFTSSPSNLCQEEDDNLFIYETHVLHQCFLHHVI---------RGQVMIFPLLFAKIPV-------

Query:  -------------------------------------------ISYHQLSPSTYAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDN--------
                                                   +SY  LS S+   + S++S S+P +V   L+H  W+N M+EE+ AL+DN        
Subjt:  -------------------------------------------ISYHQLSPSTYAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDN--------

Query:  --GKKVIGCKWVFAVKMNPDGLIE---------------GSSY--TFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQG
          GKKV+GCKWVFAVK+NPDG +                G  Y  TFSPVAKL S+RLF+SIAA+ +W +HQLDIKN FLHGDL+EEVY+EQPPG+V QG
Subjt:  --GKKVIGCKWVFAVKMNPDGLIE---------------GSSY--TFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQG

Query:  ESDK-------------SPRAWFGKFSQALVCFDQQLVKE-------------------------------------------------GELR-------
        E  K             SPRAWFGKFS+ +  F     ++                                                 GEL+       
Subjt:  ESDK-------------SPRAWFGKFSQALVCFDQQLVKE-------------------------------------------------GELR-------

Query:  -------------------KDT-----------------------------ERYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYL
                           K+T                             ERYRR+VGKL YLTVTRPDIAY VSVVSQF S+ T+ HWA +EQILCYL
Subjt:  -------------------KDT-----------------------------ERYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYL

Query:  KAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGYCVF--------------------AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCA
        K APG GILY   GHT +ECFSDADWAGS+ DRRST+GYCVF                    AESE RA++Q+ CEI+WIHQLL + G   T+ AKLWC 
Subjt:  KAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGYCVF--------------------AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCA

Query:  NQAALHIASIP-------------------------------TGEQLGDILTKALNGARIIYLCNSLGTIDIFALA
        NQAALHIA+ P                               TGEQLGDI TKALNG R+ Y CN LG I+I+A A
Subjt:  NQAALHIASIP-------------------------------TGEQLGDILTKALNGARIIYLCNSLGTIDIFALA

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.0e-1720.67Show/hide
Query:  PVISYHQLSPSTYAFITSLES--TSIPNSVHGTLSHLD---WQNEMIEEMTALDDN----------GKKVIGCKWVFAVKMNPDG---------LIEGSS
        P ISY++   S    + +  +    +PNS        D   W+  +  E+ A   N           K ++  +WVF+VK N  G         +  G +
Subjt:  PVISYHQLSPSTYAFITSLES--TSIPNSVHGTLSHLD---WQNEMIEEMTALDDN----------GKKVIGCKWVFAVKMNPDG---------LIEGSS

Query:  Y--------TFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQGES-----------DKSPRAWFGKFSQA---------
                 TF+PVA+++S R  LS+   +   +HQ+D+K  FL+G L+EE+Y+  P G     ++            ++ R WF  F QA         
Subjt:  Y--------TFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQGES-----------DKSPRAWFGKFSQA---------

Query:  --------------------LVCFDQQLVKEGELRK---------------DTERYRRLVG--------KLYLT--------------------------
                            L+  D  ++  G++ +               D    +  +G        K+YL+                          
Subjt:  --------------------LVCFDQQLVKEGELRK---------------DTERYRRLVG--------KLYLT--------------------------

Query:  -----------------------------VTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDH--GHTIVECFSDADWAGSREDRR
                                      TRPD+   V+++S++ S    + W  ++++L YLK      +++K +      +  + D+DWAGS  DR+
Subjt:  -----------------------------VTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDH--GHTIVECFSDADWAGSREDRR

Query:  STSGY---------------------CVFAESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIAS------------------------
        ST+GY                         E+E  A+ ++V E +W+  LL+     +    K++  NQ  + IA+                        
Subjt:  STSGY---------------------CVFAESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIAS------------------------

Query:  -------IPTGEQLGDILTKALNGARIIYLCNSLGTI
               IPT  QL DI TK L  AR + L + LG +
Subjt:  -------IPTGEQLGDILTKALNGARIIYLCNSLGTI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.2e-3123.2Show/hide
Query:  HLLETARALSFQMHVPKTFWSDVILFPIAPKIFGCVCFVRDVRPQHTKLDPKSLMCIFLGYSRVQKVYCCYCPTLKRYLVSPDVAF--------------
        +L+  + ++     +P+  W++  +     K+FGC  F    + Q TKLD KS+ CIF+GY   +  Y  + P  K+ + S DV F              
Subjt:  HLLETARALSFQMHVPKTFWSDVILFPIAPKIFGCVCFVRDVRPQHTKLDPKSLMCIFLGYSRVQKVYCCYCPTLKRYLVSPDVAF--------------

Query:  --------FVDTPFTSSPSNLCQEEDDNL---------FIYETHVLHQCF--LHHVIRGQVMIFPLLFAKIPVISYHQLSPSTYAFITSLESTSIPNSVH
                FV  P TS+     +   D +          I +   L +    + H  +G+    PL  ++ P +   +   + Y  I+       P S+ 
Subjt:  --------FVDTPFTSSPSNLCQEEDDNL---------FIYETHVLHQCF--LHHVIRGQVMIFPLLFAKIPVISYHQLSPSTYAFITSLESTSIPNSVH

Query:  GTLSHLD---WQNEMIEEMTALDDN----------GKKVIGCKWVFAVKMNPDGLI---------------EGSSY--TFSPVAKLTSIRLFLSIAATHK
          LSH +       M EEM +L  N          GK+ + CKWVF +K + D  +               +G  +   FSPV K+TSIR  LS+AA+  
Subjt:  GTLSHLD---WQNEMIEEMTALDDN----------GKKVIGCKWVFAVKMNPDGLI---------------EGSSY--TFSPVAKLTSIRLFLSIAATHK

Query:  WHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQGES-------------DKSPRAWFGKFSQAL-------------------------------------
          + QLD+K  FLHGDL+EE+Y+EQP G+   G+               ++PR W+ KF   +                                     
Subjt:  WHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQGES-------------DKSPRAWFGKFSQAL-------------------------------------

Query:  ---------------VCFD-------QQLVKEGELRKDTER------------------------------------------------------YRRLV
                         FD       QQ++    +R+ T R                                                      Y   V
Subjt:  ---------------VCFD-------QQLVKEGELRKDTER------------------------------------------------------YRRLV

Query:  GKLY--LTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGY-----------------C
        G L   +  TRPDIA+ V VVS+F+ +   +HW  V+ IL YL+   G  + +      I++ ++DAD AG  ++R+S++GY                 C
Subjt:  GKLY--LTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGY-----------------C

Query:  V---FAESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIA
        V     E+E  A  ++  E++W+ + L + G        ++C +Q+A+ ++
Subjt:  V---FAESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIA

P92519 Uncharacterized mitochondrial protein AtMg008105.2e-1333.85Show/hide
Query:  DTERYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGYCVF-------
        D   +R +VG L YLT+TRPDI+Y V++V Q M   T+  + +++++L Y+K     G+    +    V+ F D+DWAG    RRST+G+C F       
Subjt:  DTERYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGYCVF-------

Query:  -------------AESESRAIAQSVCEIVW
                      E+E RA+A +  E+ W
Subjt:  -------------AESESRAIAQSVCEIVW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.2e-3125.49Show/hide
Query:  YAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDNGK-----------KVIGCKWVFAVKMNPDGLIE---------------GSSY--TFSPVAK
        Y+   SL + S P +    L    W+N M  E+ A   N              ++GC+W+F  K N DG +                G  Y  TFSPV K
Subjt:  YAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDNGK-----------KVIGCKWVFAVKMNPDGLIE---------------GSSY--TFSPVAK

Query:  LTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQGESD-------------KSPRAWF--------------------------GK-
         TSIR+ L +A    W + QLD+ N FL G L ++VY+ QPPG++ +   +             ++PRAW+                          GK 
Subjt:  LTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQGESD-------------KSPRAWF--------------------------GK-

Query:  FSQALVCFDQQLVK-----------------------------------------------------------------------------EGELRKDTE
            LV  D  L+                                                                               G    D  
Subjt:  FSQALVCFDQQLVK-----------------------------------------------------------------------------EGELRKDTE

Query:  RYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGYCVF----------
         YR +VG L YL  TRPDI+Y V+ +SQFM   T +H   +++IL YL   P  GI  K      +  +SDADWAG ++D  ST+GY V+          
Subjt:  RYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGYCVF----------

Query:  ----------AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIASIP
                   E+E R++A +  E+ WI  LL++ G  +T    ++C N  A ++ + P
Subjt:  ----------AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIASIP

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.3e-0732.52Show/hide
Query:  LHNEKNRHLLETARALSFQMHVPKTFWSDVI-----------------------LFPIAP-----KIFGCVCFVRDVRP--QHTKLDPKSLMCIFLGYSR
        L   K+RH++ET   L     +PKT+W                           LF  +P     ++FGC C+   +RP  QH KLD KS  C+FLGYS 
Subjt:  LHNEKNRHLLETARALSFQMHVPKTFWSDVI-----------------------LFPIAP-----KIFGCVCFVRDVRP--QHTKLDPKSLMCIFLGYSR

Query:  VQKVYCCYCPTLKRYLVSPDVAF
         Q  Y C      R  +S  V F
Subjt:  VQKVYCCYCPTLKRYLVSPDVAF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.6e-3324.62Show/hide
Query:  YAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDN-----------GKKVIGCKWVFAVKMNPDGLIE---------------GSSY--TFSPVAK
        Y++ TSL + S P +    +    W+  M  E+ A   N              ++GC+W+F  K N DG +                G  Y  TFSPV K
Subjt:  YAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDN-----------GKKVIGCKWVFAVKMNPDGLIE---------------GSSY--TFSPVAK

Query:  LTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQGESD-------------KSPRAWFGKFSQALVCF-------------------
         TSIR+ L +A    W + QLD+ N FL G L +EVY+ QPPG+V +   D             ++PRAW+ +    L+                     
Subjt:  LTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVTQGESD-------------KSPRAWFGKFSQALVCF-------------------

Query:  ----------------DQQLVK---------------------------------------------------------------------EGELRKDTE
                        D  L+K                                                                      G    D  
Subjt:  ----------------DQQLVK---------------------------------------------------------------------EGELRKDTE

Query:  RYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGYCVF----------
         YR +VG L YL  TRPD++Y V+ +SQ+M   T DHW  ++++L YL   P  GI  K      +  +SDADWAG  +D  ST+GY V+          
Subjt:  RYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGYCVF----------

Query:  ----------AESESRAIAQSVCEIVWIHQLLSKKGFSIT---------VAAKLWCAN----------------------QAALHIASIPTGEQLGDILT
                   E+E R++A +  E+ WI  LL++ G  ++         V A   CAN                        AL +  + T +QL D LT
Subjt:  ----------AESESRAIAQSVCEIVWIHQLLSKKGFSIT---------VAAKLWCAN----------------------QAALHIASIPTGEQLGDILT

Query:  KALNGARIIYLCNSLGTIDI
        K L+          +G I +
Subjt:  KALNGARIIYLCNSLGTIDI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.3e-3927.02Show/hide
Query:  ISYHQLSPSTYAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDN----------GKKVIGCKWVFAVKMNPDGLI---------------EGSSY
        +SY ++SP  ++F+  +     P++ +     L W   M +E+ A++             KK IGCKWV+ +K N DG I               EG  +
Subjt:  ISYHQLSPSTYAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDN----------GKKVIGCKWVFAVKMNPDGLI---------------EGSSY

Query:  --TFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVT-QGES----------------DKSPRAWFGKFSQALVCF------
          TFSPV KLTS++L L+I+A + + LHQLDI N FL+GDL EE+Y++ PPGY   QG+S                 ++ R WF KFS  L+ F      
Subjt:  --TFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVFLHGDLQEEVYIEQPPGYVT-QGES----------------DKSPRAWFGKFSQALVCF------

Query:  -------------------------------------DQQLVKEGELRK---------------------------------------------------
                                               QL    +LR                                                    
Subjt:  -------------------------------------DQQLVKEGELRK---------------------------------------------------

Query:  ----------DTERYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGY
                  D + YRRL+G+L YL +TR DI++ V+ +SQF  +  + H   V +IL Y+K   G+G+ Y       ++ FSDA +   ++ RRST+GY
Subjt:  ----------DTERYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGY

Query:  CVF--------------------AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIAS
        C+F                    AE+E RA++ +  E++W+ Q   +    ++    L+C N AA+HIA+
Subjt:  CVF--------------------AESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIAS

ATMG00240.1 Gag-Pol-related retrotransposon family protein6.5e-1136.71Show/hide
Query:  LYLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGYC
        +YLT+TRPD+ + V+ +SQF S+        V ++L Y+K   G+G+ Y       ++ F+D+DWA   + RRS +G+C
Subjt:  LYLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGYC

ATMG00810.1 DNA/RNA polymerases superfamily protein3.7e-1433.85Show/hide
Query:  DTERYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGYCVF-------
        D   +R +VG L YLT+TRPDI+Y V++V Q M   T+  + +++++L Y+K     G+    +    V+ F D+DWAG    RRST+G+C F       
Subjt:  DTERYRRLVGKL-YLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRGILYKDHGHTIVECFSDADWAGSREDRRSTSGYCVF-------

Query:  -------------AESESRAIAQSVCEIVW
                      E+E RA+A +  E+ W
Subjt:  -------------AESESRAIAQSVCEIVW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAACTCTACTCCGGTTGCATCCACTGTTGCCCTAGGTAATACAAAGGGTCTTCTTACAACATCTTCCAAATGGGTCATAGACTCTGGTGTCACAGCTCATATGAC
AGGTAATTCTCACCTATTTTCTAGACCATTGTCCCCGGCTCCTTTTCTATCTGTTACATTGGCCAATGACTTCACATCTTCTGTTCTTGGCTCTGGCACTATTCACCTTA
CCCCATCATTTTCTCTCTCTTCTGATCATGTGACGAAGAAGATTATTGATAGAGGCTATGAGTTGGGAGACCTTTATCTCTTTGATCATCAAGTATCACAAGTTGTGGCG
TGTCCTGTCGTTCCCTCTCCTTTTGAAATCCATTGTTGTTTAGGTCATCCATCTTTGTTCATGTTGAAGAAACTTTATCAAGAATTTAGGTCTTTGTCCTCTTTAAATTG
TGATTTGTGTCAATTTGCTAAATATCATCGTCTTAGATCGAGTCCTCGAGTCGATAAACGAGCAATTGCTCCATTTGATCTTCCTGTGCTGACACTCCATCTTAAAATGG
TGTTGCATAACGAAAAAAATAGGCATTTACTTGAAACTGCCCGTGCTTTATCGTTTCAAATGCATGTTCCAAAAACCTTTTGGTCGGATGTTATTTTGTTTCCTATTGCT
CCTAAGATATTTGGTTGTGTTTGTTTTGTTCGCGACGTTCGTCCTCAACATACTAAGTTAGATCCCAAATCCTTGATGTGTATCTTCTTGGGTTATTCACGTGTTCAAAA
GGTTTATTGTTGTTATTGTCCTACCCTTAAAAGGTATTTGGTTTCGCCTGATGTTGCATTTTTTGTGGATACACCATTTACTTCATCACCATCGAATTTGTGTCAGGAGG
AGGATGACAATCTTTTTATATATGAGACTCATGTCCTCCATCAGTGCTTCCTTCATCATGTGATCCGGGGCCAAGTGATGATCTTCCCATTGCTCTTCGCGAAGATTCCA
GTTATTTCCTATCACCAGTTATCTCCCTCCACATATGCTTTTATTACGTCTCTTGAGTCCACATCTATTCCTAACTCTGTTCATGGAACTTTATCTCATCTTGATTGGCA
AAATGAAATGATTGAGGAGATGACTGCTTTAGATGATAATGGAAAGAAGGTCATTGGTTGTAAATGGGTGTTTGCTGTCAAGATGAATCCTGATGGCTTGATTGAAGGCT
CTTCTTATACATTCTCTCCAGTTGCCAAATTAACTTCCATTCGCCTATTTCTTTCCATTGCTGCTACCCATAAATGGCATTTGCATCAACTTGACATTAAGAATGTTTTT
CTTCACGGTGATCTTCAAGAGGAAGTTTATATTGAACAACCACCTGGGTATGTTACTCAGGGGGAGAGTGATAAAAGTCCTCGTGCGTGGTTTGGTAAGTTTAGTCAAGC
TCTTGTATGCTTTGATCAACAACTTGTTAAAGAAGGAGAATTACGTAAAGATACTGAGAGATATAGGAGACTAGTTGGAAAGTTGTACTTAACAGTGACTCGACCAGACA
TTGCCTATTTCGTAAGTGTTGTAAGTCAATTCATGTCTTCCTATACAGTGGATCATTGGGCTGTAGTAGAGCAGATTTTGTGTTATCTAAAAGCTGCTCCTGGACGTGGG
ATCTTATACAAAGATCATGGACATACGATAGTTGAATGTTTTTCTGATGCTGATTGGGCAGGATCTCGTGAGGATAGAAGATCAACTTCTGGATATTGTGTCTTTGCTGA
GTCAGAATCTAGAGCTATTGCACAATCTGTGTGTGAAATAGTATGGATTCACCAACTATTATCTAAGAAAGGCTTCAGTATTACCGTGGCAGCTAAATTATGGTGTGCTA
ATCAAGCTGCACTTCATATTGCATCTATTCCAACCGGAGAACAATTGGGAGATATTCTAACTAAAGCTTTAAATGGAGCAAGAATAATATATCTATGCAACAGTCTGGGC
ACGATCGACATATTTGCTCTGGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCAACTCTACTCCGGTTGCATCCACTGTTGCCCTAGGTAATACAAAGGGTCTTCTTACAACATCTTCCAAATGGGTCATAGACTCTGGTGTCACAGCTCATATGAC
AGGTAATTCTCACCTATTTTCTAGACCATTGTCCCCGGCTCCTTTTCTATCTGTTACATTGGCCAATGACTTCACATCTTCTGTTCTTGGCTCTGGCACTATTCACCTTA
CCCCATCATTTTCTCTCTCTTCTGATCATGTGACGAAGAAGATTATTGATAGAGGCTATGAGTTGGGAGACCTTTATCTCTTTGATCATCAAGTATCACAAGTTGTGGCG
TGTCCTGTCGTTCCCTCTCCTTTTGAAATCCATTGTTGTTTAGGTCATCCATCTTTGTTCATGTTGAAGAAACTTTATCAAGAATTTAGGTCTTTGTCCTCTTTAAATTG
TGATTTGTGTCAATTTGCTAAATATCATCGTCTTAGATCGAGTCCTCGAGTCGATAAACGAGCAATTGCTCCATTTGATCTTCCTGTGCTGACACTCCATCTTAAAATGG
TGTTGCATAACGAAAAAAATAGGCATTTACTTGAAACTGCCCGTGCTTTATCGTTTCAAATGCATGTTCCAAAAACCTTTTGGTCGGATGTTATTTTGTTTCCTATTGCT
CCTAAGATATTTGGTTGTGTTTGTTTTGTTCGCGACGTTCGTCCTCAACATACTAAGTTAGATCCCAAATCCTTGATGTGTATCTTCTTGGGTTATTCACGTGTTCAAAA
GGTTTATTGTTGTTATTGTCCTACCCTTAAAAGGTATTTGGTTTCGCCTGATGTTGCATTTTTTGTGGATACACCATTTACTTCATCACCATCGAATTTGTGTCAGGAGG
AGGATGACAATCTTTTTATATATGAGACTCATGTCCTCCATCAGTGCTTCCTTCATCATGTGATCCGGGGCCAAGTGATGATCTTCCCATTGCTCTTCGCGAAGATTCCA
GTTATTTCCTATCACCAGTTATCTCCCTCCACATATGCTTTTATTACGTCTCTTGAGTCCACATCTATTCCTAACTCTGTTCATGGAACTTTATCTCATCTTGATTGGCA
AAATGAAATGATTGAGGAGATGACTGCTTTAGATGATAATGGAAAGAAGGTCATTGGTTGTAAATGGGTGTTTGCTGTCAAGATGAATCCTGATGGCTTGATTGAAGGCT
CTTCTTATACATTCTCTCCAGTTGCCAAATTAACTTCCATTCGCCTATTTCTTTCCATTGCTGCTACCCATAAATGGCATTTGCATCAACTTGACATTAAGAATGTTTTT
CTTCACGGTGATCTTCAAGAGGAAGTTTATATTGAACAACCACCTGGGTATGTTACTCAGGGGGAGAGTGATAAAAGTCCTCGTGCGTGGTTTGGTAAGTTTAGTCAAGC
TCTTGTATGCTTTGATCAACAACTTGTTAAAGAAGGAGAATTACGTAAAGATACTGAGAGATATAGGAGACTAGTTGGAAAGTTGTACTTAACAGTGACTCGACCAGACA
TTGCCTATTTCGTAAGTGTTGTAAGTCAATTCATGTCTTCCTATACAGTGGATCATTGGGCTGTAGTAGAGCAGATTTTGTGTTATCTAAAAGCTGCTCCTGGACGTGGG
ATCTTATACAAAGATCATGGACATACGATAGTTGAATGTTTTTCTGATGCTGATTGGGCAGGATCTCGTGAGGATAGAAGATCAACTTCTGGATATTGTGTCTTTGCTGA
GTCAGAATCTAGAGCTATTGCACAATCTGTGTGTGAAATAGTATGGATTCACCAACTATTATCTAAGAAAGGCTTCAGTATTACCGTGGCAGCTAAATTATGGTGTGCTA
ATCAAGCTGCACTTCATATTGCATCTATTCCAACCGGAGAACAATTGGGAGATATTCTAACTAAAGCTTTAAATGGAGCAAGAATAATATATCTATGCAACAGTCTGGGC
ACGATCGACATATTTGCTCTGGCTTGA
Protein sequenceShow/hide protein sequence
MANSTPVASTVALGNTKGLLTTSSKWVIDSGVTAHMTGNSHLFSRPLSPAPFLSVTLANDFTSSVLGSGTIHLTPSFSLSSDHVTKKIIDRGYELGDLYLFDHQVSQVVA
CPVVPSPFEIHCCLGHPSLFMLKKLYQEFRSLSSLNCDLCQFAKYHRLRSSPRVDKRAIAPFDLPVLTLHLKMVLHNEKNRHLLETARALSFQMHVPKTFWSDVILFPIA
PKIFGCVCFVRDVRPQHTKLDPKSLMCIFLGYSRVQKVYCCYCPTLKRYLVSPDVAFFVDTPFTSSPSNLCQEEDDNLFIYETHVLHQCFLHHVIRGQVMIFPLLFAKIP
VISYHQLSPSTYAFITSLESTSIPNSVHGTLSHLDWQNEMIEEMTALDDNGKKVIGCKWVFAVKMNPDGLIEGSSYTFSPVAKLTSIRLFLSIAATHKWHLHQLDIKNVF
LHGDLQEEVYIEQPPGYVTQGESDKSPRAWFGKFSQALVCFDQQLVKEGELRKDTERYRRLVGKLYLTVTRPDIAYFVSVVSQFMSSYTVDHWAVVEQILCYLKAAPGRG
ILYKDHGHTIVECFSDADWAGSREDRRSTSGYCVFAESESRAIAQSVCEIVWIHQLLSKKGFSITVAAKLWCANQAALHIASIPTGEQLGDILTKALNGARIIYLCNSLG
TIDIFALA