; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011136 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011136
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr1:15414301..15421760
RNA-Seq ExpressionLag0011136
SyntenyLag0011136
Gene Ontology termsNA
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN63649.1 hypothetical protein VITISV_037657 [Vitis vinifera]1.8e-14931.53Show/hide
Query:  PNPNLFTPNPYPTLPQPLVVKLNVSNFLLWKNQLLNVVLANGLYGFLDGSIPAPPKF-------------------------------------------
        PN + F  +  P+L Q   V+L+ SN+LLW+ Q+LN+++ANGL   + G I AP +F                                           
Subjt:  PNPNLFTPNPYPTLPQPLVVKLNVSNFLLWKNQLLNVVLANGLYGFLDGSIPAPPKF-------------------------------------------

Query:  --------------------------DAPTLEDVRSLLLAYEARLERQTTVEQLNLAQANLSSHNIHHSNRRS-----------FSKPQPQFNHFS----
                                  +  +LE++ S+LL +E RLE+Q T E+ NL QAN+++ NI   N+++            ++ Q QFNH +    
Subjt:  --------------------------DAPTLEDVRSLLLAYEARLERQTTVEQLNLAQANLSSHNIHHSNRRS-----------FSKPQPQFNHFS----

Query:  KSSFTSSNQQSPFSPSILGKPQPNTSWTSK-------PNNSKPQCQICGNFGHTALICHHRTNLTYQTPPPQPQALLHTSQPAVLATPTTFSDTSSVNQA
        +     +N    F          N S++S+        +N KPQCQ+CG +GH A+ C+HR + TY       Q  ++    A++ATP+T          
Subjt:  KSSFTSSNQQSPFSPSILGKPQPNTSWTSK-------PNNSKPQCQICGNFGHTALICHHRTNLTYQTPPPQPQALLHTSQPAVLATPTTFSDTSSVNQA

Query:  DFTHPDESWFPDFGATHHMTSDISSLCNPMAYNGGDQITIGNGKQISISHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFP
             DESW+ D GATHH+T +++ L +   + G D++ +GNG +++IS+IG S ISS+S+ + L+N+LH P +T  L+SV++LC DN   V+F+++ F 
Subjt:  DFTHPDESWFPDFGATHHMTSDISSLCNPMAYNGGDQITIGNGKQISISHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFP

Query:  VKDLQTKTILLWGKLEDGLYKLSSSF----------NTVVSSSS-KSPTTFLSSVQVLPN----WHLQLGH-------------------LAASTLKQ--
        VKD  +K  LL G L  GLYKLSSS           N +   +S  +    +SS   L N    WH +LGH                   L+ S   Q  
Subjt:  VKDLQTKTILLWGKLEDGLYKLSSSF----------NTVVSSSS-KSPTTFLSSVQVLPN----WHLQLGH-------------------LAASTLKQ--

Query:  ------------VLSSCG----VSFVDDFTRFTWMYMLKSKDETFQCFLSFKKLVEVQF-----------------------------------------
                    V+ + G    V FVDD TRF+W+Y+L  KD+    FL FK ++E QF                                         
Subjt:  ------------VLSSCG----VSFVDDFTRFTWMYMLKSKDETFQCFLSFKKLVEVQF-----------------------------------------

Query:  --------------------GLPLSFWSFAFQTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSN
                             +PL +W FAFQ+A+Y+INRLP++VL+  SPY  LY+CLPNYS  LRV+ C C+PFLRPFN HK  +RS +C FIG+SS 
Subjt:  --------------------GLPLSFWSFAFQTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSN

Query:  HKGYLCLDPTTDRLFVSRHVFNEAFFPCVNSDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTIL
        HKGYLCL+ +  ++ +SRHV                              AP             P   P+S+   +P  P         +  R  N I 
Subjt:  HKGYLCLDPTTDRLFVSRHVFNEAFFPCVNSDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTIL

Query:  MSSLHSSPISSISPASLMALILSLTPTPSNRLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGSNHPIFGELF
           +  SP  +I             P       ++  WQ+A                                                           
Subjt:  MSSLHSSPISSISPASLMALILSLTPTPSNRLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGSNHPIFGELF

Query:  PSSIERKSEKNCGEKGLWIKITWAKLFPFYACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETF
                                          +H ++  L+ N TW+LVP      ++ C WVY++K K DG+VERYK R +A+ F QT   DYFETF
Subjt:  PSSIERKSEKNCGEKGLWIKITWAKLFPFYACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETF

Query:  SLVIKLTTIQILLSLVVRFGSVIKQLDVHNVFLNGDLHED----------VYMVQPPGFIDKDKSNHVCKLQK--------------------TLYNGEA
        S V+K TTI+++LSL +     I+QLDVHN FLNGDL E           + ++     +   ++ ++C L +                    ++ +G  
Subjt:  SLVIKLTTIQILLSLVVRFGSVIKQLDVHNVFLNGDLHED----------VYMVQPPGFIDKDKSNHVCKLQK--------------------TLYNGEA

Query:  LPNPKQYRSIVEALQYCIITKPDLSFEAN--CPNDLCNTSGYFIFHGSNLVSWSSAKQKVVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLL
        L +P  YRS+V ALQYC IT+PD+++  N  C      TS +       L    ++KQKVVSRSS E EYRGL+NAAA+L WIQSLL EL +    PP+L
Subjt:  LPNPKQYRSIVEALQYCIITKPDLSFEAN--CPNDLCNTSGYFIFHGSNLVSWSSAKQKVVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLL

Query:  LCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIVRFVPSEDQLANILTKPLPTARFPSLHTKLTVQSR
          DN++ TYLA+NP+ HSR+KH++IDYHF+RE+V+ K L VRF+PS+DQ+ +ILTK L T RF  L +KLTV SR
Subjt:  LCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIVRFVPSEDQLANILTKPLPTARFPSLHTKLTVQSR

CAN73924.1 hypothetical protein VITISV_041509 [Vitis vinifera]1.7e-13930.94Show/hide
Query:  DAPTLEDVRSLLLAYEARLERQ-TTVEQLNLAQANLSS-----HNIHHSNRRSFSKPQPQFNHFSKSSFTSSNQQSPFSPSILGKPQPNTSWTSKPN--N
        D  +L  V S+LL +E RL  Q ++    + A A+++S      N  H  R      +PQ     ++S +S+   + F P       P  S  +KP+  +
Subjt:  DAPTLEDVRSLLLAYEARLERQ-TTVEQLNLAQANLSS-----HNIHHSNRRSFSKPQPQFNHFSKSSFTSSNQQSPFSPSILGKPQPNTSWTSKPN--N

Query:  SKPQCQICGNFGHTALICHHRTNLTYQTPPPQPQALLHTSQPAVLATPTTFSDTSSVNQADFTHPDESWFPDFGATHHMTSDISSLCNPMAYNGGDQITI
        ++PQCQ+CG FGHTA+ C+HR ++ YQ     P A    S   + A P               H D SWF D GATHH++    +L     Y+G DQ+TI
Subjt:  SKPQCQICGNFGHTALICHHRTNLTYQTPPPQPQALLHTSQPAVLATPTTFSDTSSVNQADFTHPDESWFPDFGATHHMTSDISSLCNPMAYNGGDQITI

Query:  GNGKQISISHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKDLQTKTILLWGKLEDGLYKLSSSFNTVVSSSSKSPTTF
        G+G  + I + G+      SK   L  VLH P ++  L+SVSK   DN  + + +SS F VKD  TK ILL G L DGLY+          SSS  P  F
Subjt:  GNGKQISISHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKDLQTKTILLWGKLEDGLYKLSSSFNTVVSSSSKSPTTF

Query:  LSSVQVLPN--WHLQLGHLAASTLKQVLSSCGVS-----------------------------------------------------------FVDDFTR
        +++        WH +LGH A   L + L+SC  S                                                           F+DD++R
Subjt:  LSSVQVLPN--WHLQLGHLAASTLKQVLSSCGVS-----------------------------------------------------------FVDDFTR

Query:  FTWMYMLKSKDETFQCFLSFKKLVE-------------------------------------------------------------VQFGLPLSFWSFAF
         TW+Y L +KD+  Q F++F+K+VE                                                              Q  LP  +W++AF
Subjt:  FTWMYMLKSKDETFQCFLSFKKLVE-------------------------------------------------------------VQFGLPLSFWSFAF

Query:  QTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSNHKGYLCLDPTTDRLFVSRHV-FNEAFFPCVN
        QTAVY+IN LP  +LH +SP   L++ LPNY   LRVF C CFP LRP+  HKL +RST C+F+G++  HKGYLCLD +T+R+++SR+V F+E+ FP   
Subjt:  QTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSNHKGYLCLDPTTDRLFVSRHV-FNEAFFPCVN

Query:  SDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTILMSSLHSSPISSISPASLMALILSLTPTPSN
            F S  SSPP     SP+P H+PS + +++ SP++S        PS+P   SP             +++S    P+  +  A+      S  P P N
Subjt:  SDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTILMSSLHSSPISSISPASLMALILSLTPTPSN

Query:  RLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGSNHPIFGELFPSSIERKSEKNCGEKGLWIKITWAKLFPFY
            +     AK  +  ++      T+ PR Y                         SQ S +                            +W       
Subjt:  RLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGSNHPIFGELFPSSIERKSEKNCGEKGLWIKITWAKLFPFY

Query:  ACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVIKLTTIQILLSLVVRFGSVIKQLDVHN
          + ++++Y+ L++NNTW LVP   +  +V C+W+Y++K + DGS++R+K R +A+ F QT GIDYF+TFS V+K  TI+++L+L V F   ++QLDV N
Subjt:  ACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVIKLTTIQILLSLVVRFGSVIKQLDVHN

Query:  VFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQKTLY----------------------------------------------------------------
         FLNGDL E+V+M QP GF++     +VCKL K LY                                                                
Subjt:  VFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQKTLY----------------------------------------------------------------

Query:  ------------------------------------------------------------------NGEALPNPKQYRSIVEALQYCIITKPDLSFEAN-
                                                                          +G +L +P +YR  V ALQY  +T+PD++F  N 
Subjt:  ------------------------------------------------------------------NGEALPNPKQYRSIVEALQYCIITKPDLSFEAN-

Query:  ---------------------------------------------------CPNDLCNTSGYFIFHGSNLVSWSSAKQKVVSRSSVEFEYRGLSNAAAEL
                                                           CP+D  +TSGY +F GSNL+SWSS+KQ++VS+SS E EYRGL +  AEL
Subjt:  ---------------------------------------------------CPNDLCNTSGYFIFHGSNLVSWSSAKQKVVSRSSVEFEYRGLSNAAAEL

Query:  VWIQSLLSELGLCFP-SPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIVRFVPSEDQLANILTKPLPTARFPSLHTKLTV
        VWIQSLL E  LC P SPP+L CDN +A +LA+NP+FHSRSKHI++D HFIREKV+R++L + +VPS DQLA+I TK LP  +F +L +KLTV
Subjt:  VWIQSLLSELGLCFP-SPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIVRFVPSEDQLANILTKPLPTARFPSLHTKLTV

RVW64314.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]3.7e-14730.25Show/hide
Query:  KNQLLNVVLANGL-YGFLDGSIPAPPKFDAPTLEDVRSLLLAYEARLERQTTVEQLNLAQANLSSHNIHHSNRRSFSKPQPQFNHFSKSSFTSSNQQSPF
        ++Q+L ++   G  Y  +  S+ A  + D  +L  V S+LL +E RL  Q +V + N+  ANL++               PQ+ HF+    +  N+QS F
Subjt:  KNQLLNVVLANGL-YGFLDGSIPAPPKFDAPTLEDVRSLLLAYEARLERQTTVEQLNLAQANLSSHNIHHSNRRSFSKPQPQFNHFSKSSFTSSNQQSPF

Query:  SPSILGKPQPNTSWTSKPNNSKPQCQICGNFGHTALICHHRTNLTYQTPPPQPQALLHTSQP-------AVLATPTTFSDTSSVNQADFTHPDESWFPDF
        +       +      S+ +  +PQCQ+CG FGHT + C+HR ++ +Q   P     + T++P       A++A+P+T S             DE+WF D 
Subjt:  SPSILGKPQPNTSWTSKPNNSKPQCQICGNFGHTALICHHRTNLTYQTPPPQPQALLHTSQP-------AVLATPTTFSDTSSVNQADFTHPDESWFPDF

Query:  GATHHMTSDISSLCNPMAYNGGDQITIGNGKQISISHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKDLQTKTILLWG
        GATHH++  I  L +   Y G D++ +GNGK + I H G++   S SK   L+ VLH P I   L+SVS+ C DN  + +F+  FF VKD  TK ILL G
Subjt:  GATHHMTSDISSLCNPMAYNGGDQITIGNGKQISISHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKDLQTKTILLWG

Query:  KLEDGLYKLSSSF---NTVVSSSSKSPTTFLSSVQVLPNWHLQLGHLAASTLKQVLSSCGVS--------------------------------------
         LE GLY+  + F        SSS   ++ LS       WH +LGH A + LK +L+SC +S                                      
Subjt:  KLEDGLYKLSSSF---NTVVSSSSKSPTTFLSSVQVLPNWHLQLGHLAASTLKQVLSSCGVS--------------------------------------

Query:  -------------------FVDDFTRFTWMYMLKSKDETFQCFLSFKKLVEVQF----------------------------------------------
                           FVDDF+RF+W+Y L SKD+    F+ FK LVE QF                                              
Subjt:  -------------------FVDDFTRFTWMYMLKSKDETFQCFLSFKKLVEVQF----------------------------------------------

Query:  ---------------GLPLSFWSFAFQTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSNHKGYL
                        LP  FW +AF TA+++INRLPT VL+ +SP+ +L+   PNY +  ++F C C+P++RP+N +KL +RS++C+F+G+SSNHKGY+
Subjt:  ---------------GLPLSFWSFAFQTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSNHKGYL

Query:  CLDPTTDRLFVSRH-VFNEAFFPCVNSDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTILMSSL
        CL+P T RL+V+RH VF+E  FP             S PD    S +   IP+ +     SP +S +  S  TPS       +S  L +   +TI +  L
Subjt:  CLDPTTDRLFVSRH-VFNEAFFPCVNSDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTILMSSL

Query:  HSSPISSISPASLMALILSLTPTPSNRLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGSNHPIFGELFPSSI
           P + IS +          P P+N+   +     AK  +  ++   ++  S P  +                      +   + SN            
Subjt:  HSSPISSISPASLMALILSLTPTPSNRLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGSNHPIFGELFPSSI

Query:  ERKSEKNCGEKGLWIKITWAKLFPFYACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVI
                     W+           A EK   ++  L +NNTW LVP      ++ CKWVY++K K DG+V+RYK R +A+ F QT G+DYFETFS V+
Subjt:  ERKSEKNCGEKGLWIKITWAKLFPFYACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVI

Query:  KLTTIQILLSLVVRFGSVIKQLDVHNVFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQKTLY--------------------------------------
        K +TI+I+L++ + F   + QLDV N FL+GDL E V+M QPPGFI+    +HVCKL K LY                                      
Subjt:  KLTTIQILLSLVVRFGSVIKQLDVHNVFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQKTLY--------------------------------------

Query:  --------------------------------------------------------------------------------------------NGEALPNP
                                                                                                    +GE   + 
Subjt:  --------------------------------------------------------------------------------------------NGEALPNP

Query:  KQYRSIVEALQYCIITKPDLSFEAN----------------------------------------------------CPNDLCNTSGYFIFHGSNLVSWS
          YRS V ALQY  +T+PD+SF  N                                                    CP+D  +T GY IF G NLVSWS
Subjt:  KQYRSIVEALQYCIITKPDLSFEAN----------------------------------------------------CPNDLCNTSGYFIFHGSNLVSWS

Query:  SAKQKVVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIVRFVPSEDQLANIL
        S KQKVVSRSS E EYR L++A +E++WIQ +L EL L   SPPLL CDN +A +LA+NP+FH+R+KHI++D HFIR+ V+RKQL+++++PS +Q+A+I 
Subjt:  SAKQKVVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIVRFVPSEDQLANIL

Query:  TKPLPTARFPSLHTKLTVQSRSGFLARTANDRRRWRDRRRES
        TK + +++F S  TKL+V      ++   +DRR   D++  +
Subjt:  TKPLPTARFPSLHTKLTVQSRSGFLARTANDRRRWRDRRRES

RVX06084.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.1e-14630.18Show/hide
Query:  KNQLLNVVLANGL-YGFLDGSIPAPPKFDAPTLEDVRSLLLAYEARLERQTTVEQLNLAQANLSSHNIHHSNRRSFSKPQPQFNHFSKSSFTSSNQQSPF
        ++Q+L ++   G  Y  +  S+ A  + D  +L  V S+LL +E RL  Q +V + N+  ANL++               PQ+ HF+    +  N+QS F
Subjt:  KNQLLNVVLANGL-YGFLDGSIPAPPKFDAPTLEDVRSLLLAYEARLERQTTVEQLNLAQANLSSHNIHHSNRRSFSKPQPQFNHFSKSSFTSSNQQSPF

Query:  SPSILGKPQPNTSWTSKPNNSKPQCQICGNFGHTALICHHRTNLTYQTPPPQPQALLHTSQP-------AVLATPTTFSDTSSVNQADFTHPDESWFPDF
        +       +      S+ +  +PQCQ+CG FGHT + C+HR ++ +Q   P     + T++P       A++A+P+T S             DE+WF D 
Subjt:  SPSILGKPQPNTSWTSKPNNSKPQCQICGNFGHTALICHHRTNLTYQTPPPQPQALLHTSQP-------AVLATPTTFSDTSSVNQADFTHPDESWFPDF

Query:  GATHHMTSDISSLCNPMAYNGGDQITIGNGKQISISHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKDLQTKTILLWG
        GATHH++  I  L +   Y G D++ +GNGK + I H G++   S SK   L+ VLH P I   L+SVS+ C DN  + +F+  FF VKD  TK ILL G
Subjt:  GATHHMTSDISSLCNPMAYNGGDQITIGNGKQISISHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKDLQTKTILLWG

Query:  KLEDGLYKLSSSF---NTVVSSSSKSPTTFLSSVQVLPNWHLQLGHLAASTLKQVLSSCGVS--------------------------------------
         LE GLY+  + F        SSS   ++ LS       WH +LGH A + LK +L+SC +S                                      
Subjt:  KLEDGLYKLSSSF---NTVVSSSSKSPTTFLSSVQVLPNWHLQLGHLAASTLKQVLSSCGVS--------------------------------------

Query:  -------------------FVDDFTRFTWMYMLKSKDETFQCFLSFKKLVEVQF----------------------------------------------
                           FVDDF+RF+W+Y L SKD+    F+ FK LVE QF                                              
Subjt:  -------------------FVDDFTRFTWMYMLKSKDETFQCFLSFKKLVEVQF----------------------------------------------

Query:  ---------------GLPLSFWSFAFQTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSNHKGYL
                        LP  FW +AF T +++INRLPT VL+ +SP+ +L+   PNY +  ++F C C+P++RP+N +KL +RS++C+F+G+SSNHKGY+
Subjt:  ---------------GLPLSFWSFAFQTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSNHKGYL

Query:  CLDPTTDRLFVSRH-VFNEAFFPCVNSDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTILMSSL
        CL+P T RL+V+RH VF+E  FP             S PD    S +   IP+ +     SP +S +  S  TPS       +S  L +   +TI +  L
Subjt:  CLDPTTDRLFVSRH-VFNEAFFPCVNSDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTILMSSL

Query:  HSSPISSISPASLMALILSLTPTPSNRLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGSNHPIFGELFPSSI
           P + IS +          P P+N+   +     AK  +  ++   ++  S P  +                      +   + SN            
Subjt:  HSSPISSISPASLMALILSLTPTPSNRLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGSNHPIFGELFPSSI

Query:  ERKSEKNCGEKGLWIKITWAKLFPFYACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVI
                     W+           A EK   ++  L +NNTW LVP      ++ CKWVY++K K DG+V+RYK R +A+ F QT G+DYFETFS V+
Subjt:  ERKSEKNCGEKGLWIKITWAKLFPFYACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVI

Query:  KLTTIQILLSLVVRFGSVIKQLDVHNVFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQKTLY--------------------------------------
        K +TI+I+L++ + F   + QLDV N FL+GDL E V+M QPPGFI+    +HVCKL K LY                                      
Subjt:  KLTTIQILLSLVVRFGSVIKQLDVHNVFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQKTLY--------------------------------------

Query:  --------------------------------------------------------------------------------------------NGEALPNP
                                                                                                    +GE   + 
Subjt:  --------------------------------------------------------------------------------------------NGEALPNP

Query:  KQYRSIVEALQYCIITKPDLSFEAN----------------------------------------------------CPNDLCNTSGYFIFHGSNLVSWS
          YRS V ALQY  +T+PD+SF  N                                                    CP+D  +T GY IF G NLVSWS
Subjt:  KQYRSIVEALQYCIITKPDLSFEAN----------------------------------------------------CPNDLCNTSGYFIFHGSNLVSWS

Query:  SAKQKVVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIVRFVPSEDQLANIL
        S KQKVVSRSS E EYR L++A +E++WIQ +L EL L   SPPLL CDN +A +LA+NP+FH+R+KHI++D HFIR+ V+RKQL+++++PS +Q+A+I 
Subjt:  SAKQKVVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIVRFVPSEDQLANIL

Query:  TKPLPTARFPSLHTKLTVQSRSGFLARTANDRRRWRDRRRES
        TK + +++F S  TKL+V      ++   +DRR   D++  +
Subjt:  TKPLPTARFPSLHTKLTVQSRSGFLARTANDRRRWRDRRRES

RVX14515.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]2.3e-14433.42Show/hide
Query:  DFGATHHMTSDISSLCNPMAYNGGDQITIGNGKQISISHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKDLQTKTILL
        D GATHH+T +++ L +   + G D++ +GNG +++IS+IG S ISS S+ + L+N+LH P +T  L+SV++LC DN   V+F+++ F VKD  +K  LL
Subjt:  DFGATHHMTSDISSLCNPMAYNGGDQITIGNGKQISISHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKDLQTKTILL

Query:  WGKLEDGLYKLSSSF----------NTVVSSSS-KSPTTFLSSVQVLPN----WHLQLGHLAASTLKQVLSSCG--------------------------
         G L  GLYKLSSS           N +   +S  +    +SS   L N    WH +LGH A   + +VLS+C                           
Subjt:  WGKLEDGLYKLSSSF----------NTVVSSSS-KSPTTFLSSVQVLPN----WHLQLGHLAASTLKQVLSSCG--------------------------

Query:  ------------------------------VSFVDDFTRFTWMYMLKSKDETFQCFLSFKKLVEVQF---------------------------------
                                      V FVDD TRF+W+Y+L SKD+    FL FK ++E QF                                 
Subjt:  ------------------------------VSFVDDFTRFTWMYMLKSKDETFQCFLSFKKLVEVQF---------------------------------

Query:  ----------------------------GLPLSFWSFAFQTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTEC
                                     +PL +W FAFQ+A+Y+INRLP++VL+  SPY  LY+CLPNYS  LRV+ C C+PFLRPFN HK  +RS +C
Subjt:  ----------------------------GLPLSFWSFAFQTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTEC

Query:  LFIGHSSNHKGYLCLDPTTDRLFVSRHV-FNEAFFPCVNSDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDL
         FIG++S HKGYLCL+ +  ++ +SRHV F E  FP     F   S   SP      SP    IP      L SP I+P S+   + S+P      SS+L
Subjt:  LFIGHSSNHKGYLCLDPTTDRLFVSRHV-FNEAFFPCVNSDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDL

Query:  VHRQCNTILMSSLHSSPISSISPASLMALILSLTPTPSNRLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGS
        V     ++ +SS   +PI +  P+S      S   +P + +                      TT A               +   F+P    SP     
Subjt:  VHRQCNTILMSSLHSSPISSISPASLMALILSLTPTPSNRLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGS

Query:  NHPIFGELFPSSIERKSEKNCGEKGLWIKITWAKLFPFYACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQT
        N  IF  L   +  +++ K          + W         + +H ++  L+ N TW+LVP      ++ C+WVY++K K DG+VERYK R +A+ F QT
Subjt:  NHPIFGELFPSSIERKSEKNCGEKGLWIKITWAKLFPFYACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQT

Query:  QGIDYFETFSLVIKLTTIQILLSLVVRFGSVIKQLDVHNVFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQKTLY-------------------------
         G DYFETFS V+K TTI+++LSL +     I+QLDVHN FLNGDL E V+M QPPGF+D  K N VCKL K LY                         
Subjt:  QGIDYFETFSLVIKLTTIQILLSLVVRFGSVIKQLDVHNVFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQKTLY-------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----NGEALPNPKQYRSIVEALQYCIITKPDLSFEAN----------------------------------------------------CPNDLCNTSG
             +G  L +P  YRS+V ALQYC IT+PD+++  N                                                    CP+D  +TSG
Subjt:  -----NGEALPNPKQYRSIVEALQYCIITKPDLSFEAN----------------------------------------------------CPNDLCNTSG

Query:  YFIFHGSNLVSWSSAKQKVVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIV
        Y IF G NLVSWS++KQKVVSRSS E EYRGL+NA AEL WIQSLL EL +    PP+L CDN++ TYLA+NP+ HSR+KH++IDYHF+RE+V++K L V
Subjt:  YFIFHGSNLVSWSSAKQKVVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIV

Query:  RFVPSEDQLANILTKPLPTARFPSLHTKLTVQSR
        RF+PSEDQ+A+IL K L T RF  L +KLTV SR
Subjt:  RFVPSEDQLANILTKPLPTARFPSLHTKLTVQSR

TrEMBL top hitse value%identityAlignment
A0A438FWJ3 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-14730.25Show/hide
Query:  KNQLLNVVLANGL-YGFLDGSIPAPPKFDAPTLEDVRSLLLAYEARLERQTTVEQLNLAQANLSSHNIHHSNRRSFSKPQPQFNHFSKSSFTSSNQQSPF
        ++Q+L ++   G  Y  +  S+ A  + D  +L  V S+LL +E RL  Q +V + N+  ANL++               PQ+ HF+    +  N+QS F
Subjt:  KNQLLNVVLANGL-YGFLDGSIPAPPKFDAPTLEDVRSLLLAYEARLERQTTVEQLNLAQANLSSHNIHHSNRRSFSKPQPQFNHFSKSSFTSSNQQSPF

Query:  SPSILGKPQPNTSWTSKPNNSKPQCQICGNFGHTALICHHRTNLTYQTPPPQPQALLHTSQP-------AVLATPTTFSDTSSVNQADFTHPDESWFPDF
        +       +      S+ +  +PQCQ+CG FGHT + C+HR ++ +Q   P     + T++P       A++A+P+T S             DE+WF D 
Subjt:  SPSILGKPQPNTSWTSKPNNSKPQCQICGNFGHTALICHHRTNLTYQTPPPQPQALLHTSQP-------AVLATPTTFSDTSSVNQADFTHPDESWFPDF

Query:  GATHHMTSDISSLCNPMAYNGGDQITIGNGKQISISHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKDLQTKTILLWG
        GATHH++  I  L +   Y G D++ +GNGK + I H G++   S SK   L+ VLH P I   L+SVS+ C DN  + +F+  FF VKD  TK ILL G
Subjt:  GATHHMTSDISSLCNPMAYNGGDQITIGNGKQISISHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKDLQTKTILLWG

Query:  KLEDGLYKLSSSF---NTVVSSSSKSPTTFLSSVQVLPNWHLQLGHLAASTLKQVLSSCGVS--------------------------------------
         LE GLY+  + F        SSS   ++ LS       WH +LGH A + LK +L+SC +S                                      
Subjt:  KLEDGLYKLSSSF---NTVVSSSSKSPTTFLSSVQVLPNWHLQLGHLAASTLKQVLSSCGVS--------------------------------------

Query:  -------------------FVDDFTRFTWMYMLKSKDETFQCFLSFKKLVEVQF----------------------------------------------
                           FVDDF+RF+W+Y L SKD+    F+ FK LVE QF                                              
Subjt:  -------------------FVDDFTRFTWMYMLKSKDETFQCFLSFKKLVEVQF----------------------------------------------

Query:  ---------------GLPLSFWSFAFQTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSNHKGYL
                        LP  FW +AF TA+++INRLPT VL+ +SP+ +L+   PNY +  ++F C C+P++RP+N +KL +RS++C+F+G+SSNHKGY+
Subjt:  ---------------GLPLSFWSFAFQTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSNHKGYL

Query:  CLDPTTDRLFVSRH-VFNEAFFPCVNSDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTILMSSL
        CL+P T RL+V+RH VF+E  FP             S PD    S +   IP+ +     SP +S +  S  TPS       +S  L +   +TI +  L
Subjt:  CLDPTTDRLFVSRH-VFNEAFFPCVNSDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTILMSSL

Query:  HSSPISSISPASLMALILSLTPTPSNRLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGSNHPIFGELFPSSI
           P + IS +          P P+N+   +     AK  +  ++   ++  S P  +                      +   + SN            
Subjt:  HSSPISSISPASLMALILSLTPTPSNRLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGSNHPIFGELFPSSI

Query:  ERKSEKNCGEKGLWIKITWAKLFPFYACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVI
                     W+           A EK   ++  L +NNTW LVP      ++ CKWVY++K K DG+V+RYK R +A+ F QT G+DYFETFS V+
Subjt:  ERKSEKNCGEKGLWIKITWAKLFPFYACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVI

Query:  KLTTIQILLSLVVRFGSVIKQLDVHNVFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQKTLY--------------------------------------
        K +TI+I+L++ + F   + QLDV N FL+GDL E V+M QPPGFI+    +HVCKL K LY                                      
Subjt:  KLTTIQILLSLVVRFGSVIKQLDVHNVFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQKTLY--------------------------------------

Query:  --------------------------------------------------------------------------------------------NGEALPNP
                                                                                                    +GE   + 
Subjt:  --------------------------------------------------------------------------------------------NGEALPNP

Query:  KQYRSIVEALQYCIITKPDLSFEAN----------------------------------------------------CPNDLCNTSGYFIFHGSNLVSWS
          YRS V ALQY  +T+PD+SF  N                                                    CP+D  +T GY IF G NLVSWS
Subjt:  KQYRSIVEALQYCIITKPDLSFEAN----------------------------------------------------CPNDLCNTSGYFIFHGSNLVSWS

Query:  SAKQKVVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIVRFVPSEDQLANIL
        S KQKVVSRSS E EYR L++A +E++WIQ +L EL L   SPPLL CDN +A +LA+NP+FH+R+KHI++D HFIR+ V+RKQL+++++PS +Q+A+I 
Subjt:  SAKQKVVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIVRFVPSEDQLANIL

Query:  TKPLPTARFPSLHTKLTVQSRSGFLARTANDRRRWRDRRRES
        TK + +++F S  TKL+V      ++   +DRR   D++  +
Subjt:  TKPLPTARFPSLHTKLTVQSRSGFLARTANDRRRWRDRRRES

A0A438JAU4 Retrovirus-related Pol polyprotein from transposon TNT 1-945.3e-14730.18Show/hide
Query:  KNQLLNVVLANGL-YGFLDGSIPAPPKFDAPTLEDVRSLLLAYEARLERQTTVEQLNLAQANLSSHNIHHSNRRSFSKPQPQFNHFSKSSFTSSNQQSPF
        ++Q+L ++   G  Y  +  S+ A  + D  +L  V S+LL +E RL  Q +V + N+  ANL++               PQ+ HF+    +  N+QS F
Subjt:  KNQLLNVVLANGL-YGFLDGSIPAPPKFDAPTLEDVRSLLLAYEARLERQTTVEQLNLAQANLSSHNIHHSNRRSFSKPQPQFNHFSKSSFTSSNQQSPF

Query:  SPSILGKPQPNTSWTSKPNNSKPQCQICGNFGHTALICHHRTNLTYQTPPPQPQALLHTSQP-------AVLATPTTFSDTSSVNQADFTHPDESWFPDF
        +       +      S+ +  +PQCQ+CG FGHT + C+HR ++ +Q   P     + T++P       A++A+P+T S             DE+WF D 
Subjt:  SPSILGKPQPNTSWTSKPNNSKPQCQICGNFGHTALICHHRTNLTYQTPPPQPQALLHTSQP-------AVLATPTTFSDTSSVNQADFTHPDESWFPDF

Query:  GATHHMTSDISSLCNPMAYNGGDQITIGNGKQISISHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKDLQTKTILLWG
        GATHH++  I  L +   Y G D++ +GNGK + I H G++   S SK   L+ VLH P I   L+SVS+ C DN  + +F+  FF VKD  TK ILL G
Subjt:  GATHHMTSDISSLCNPMAYNGGDQITIGNGKQISISHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKDLQTKTILLWG

Query:  KLEDGLYKLSSSF---NTVVSSSSKSPTTFLSSVQVLPNWHLQLGHLAASTLKQVLSSCGVS--------------------------------------
         LE GLY+  + F        SSS   ++ LS       WH +LGH A + LK +L+SC +S                                      
Subjt:  KLEDGLYKLSSSF---NTVVSSSSKSPTTFLSSVQVLPNWHLQLGHLAASTLKQVLSSCGVS--------------------------------------

Query:  -------------------FVDDFTRFTWMYMLKSKDETFQCFLSFKKLVEVQF----------------------------------------------
                           FVDDF+RF+W+Y L SKD+    F+ FK LVE QF                                              
Subjt:  -------------------FVDDFTRFTWMYMLKSKDETFQCFLSFKKLVEVQF----------------------------------------------

Query:  ---------------GLPLSFWSFAFQTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSNHKGYL
                        LP  FW +AF T +++INRLPT VL+ +SP+ +L+   PNY +  ++F C C+P++RP+N +KL +RS++C+F+G+SSNHKGY+
Subjt:  ---------------GLPLSFWSFAFQTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSNHKGYL

Query:  CLDPTTDRLFVSRH-VFNEAFFPCVNSDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTILMSSL
        CL+P T RL+V+RH VF+E  FP             S PD    S +   IP+ +     SP +S +  S  TPS       +S  L +   +TI +  L
Subjt:  CLDPTTDRLFVSRH-VFNEAFFPCVNSDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTILMSSL

Query:  HSSPISSISPASLMALILSLTPTPSNRLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGSNHPIFGELFPSSI
           P + IS +          P P+N+   +     AK  +  ++   ++  S P  +                      +   + SN            
Subjt:  HSSPISSISPASLMALILSLTPTPSNRLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGSNHPIFGELFPSSI

Query:  ERKSEKNCGEKGLWIKITWAKLFPFYACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVI
                     W+           A EK   ++  L +NNTW LVP      ++ CKWVY++K K DG+V+RYK R +A+ F QT G+DYFETFS V+
Subjt:  ERKSEKNCGEKGLWIKITWAKLFPFYACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVI

Query:  KLTTIQILLSLVVRFGSVIKQLDVHNVFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQKTLY--------------------------------------
        K +TI+I+L++ + F   + QLDV N FL+GDL E V+M QPPGFI+    +HVCKL K LY                                      
Subjt:  KLTTIQILLSLVVRFGSVIKQLDVHNVFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQKTLY--------------------------------------

Query:  --------------------------------------------------------------------------------------------NGEALPNP
                                                                                                    +GE   + 
Subjt:  --------------------------------------------------------------------------------------------NGEALPNP

Query:  KQYRSIVEALQYCIITKPDLSFEAN----------------------------------------------------CPNDLCNTSGYFIFHGSNLVSWS
          YRS V ALQY  +T+PD+SF  N                                                    CP+D  +T GY IF G NLVSWS
Subjt:  KQYRSIVEALQYCIITKPDLSFEAN----------------------------------------------------CPNDLCNTSGYFIFHGSNLVSWS

Query:  SAKQKVVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIVRFVPSEDQLANIL
        S KQKVVSRSS E EYR L++A +E++WIQ +L EL L   SPPLL CDN +A +LA+NP+FH+R+KHI++D HFIR+ V+RKQL+++++PS +Q+A+I 
Subjt:  SAKQKVVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIVRFVPSEDQLANIL

Query:  TKPLPTARFPSLHTKLTVQSRSGFLARTANDRRRWRDRRRES
        TK + +++F S  TKL+V      ++   +DRR   D++  +
Subjt:  TKPLPTARFPSLHTKLTVQSRSGFLARTANDRRRWRDRRRES

A5AQ25 Reverse transcriptase Ty1/copia-type domain-containing protein1.4e-13934.28Show/hide
Query:  DAPTLEDVRSLLLAYEARLERQTTVEQLNLAQANLSSHNIHHSNRRSFSKPQPQFNHFSKSSFTSSNQQSPFSPSILGKPQPNTSWTSKPNNSKPQCQIC
        D  +LE + S+LLA++  LE+Q+++EQ++   AN +S     SN R   +   +FN      +T +N    +     G         +   + KPQCQ+C
Subjt:  DAPTLEDVRSLLLAYEARLERQTTVEQLNLAQANLSSHNIHHSNRRSFSKPQPQFNHFSKSSFTSSNQQSPFSPSILGKPQPNTSWTSKPNNSKPQCQIC

Query:  GNFGHTALICHHRTNLTYQTPPPQPQALLHTSQPAVLATPTTFSDTSSVNQADFTHPDESWFPDFGATHHMTSDISSLCNPMAYNGGDQITIGNGKQISI
        G FGHTA IC+HR ++++Q         L+      + T         V  A  +  DESW+ D G +HH+T ++ +L +   Y G D++TIGNGK +SI
Subjt:  GNFGHTALICHHRTNLTYQTPPPQPQALLHTSQPAVLATPTTFSDTSSVNQADFTHPDESWFPDFGATHHMTSDISSLCNPMAYNGGDQITIGNGKQISI

Query:  SHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKDLQTKTILLWGKLEDGLYKLSSSFN----TVVSSSSKSPTTFLSSV
        S+IGS  + S +    L+ V H P I+  L+SV+K   DN A ++F+S+ F VKD  TK +L  GKLE+GLYK     N    + ++++S     F S+V
Subjt:  SHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKDLQTKTILLWGKLEDGLYKLSSSFN----TVVSSSSKSPTTFLSSV

Query:  QVLPN-WHLQLGHLAASTLKQVLSSCGVSFVDDFTRFTWMYMLKSKDETFQCFLSFKKLVEVQF----------------GLPLSFWSFAF---------
        +     WH +LGH+A+  + +V+++       D++ +TW Y L++KD+  + F  FK  +E QF                 L L F              
Subjt:  QVLPN-WHLQLGHLAASTLKQVLSSCGVSFVDDFTRFTWMYMLKSKDETFQCFLSFKKLVEVQF----------------GLPLSFWSFAF---------

Query:  ---QTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSNHKGYLCLDPTTDRLFVSRH-VFNEAFFP
            TA ++INR+P+ VL   SPY  L+   P+Y   LRVF C C+PF+RP+N HKL +RS +CLF+G+S NHKG+LCLD  T R++++ H VF+E+ FP
Subjt:  ---QTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSNHKGYLCLDPTTDRLFVSRH-VFNEAFFP

Query:  CVNSDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTILMSSLHSSPISSISPASLMALILSLTPT
           S     S  +S    +     P   P+  +  L +  IS AS  S + S      P +S                SSP+ + S +  + L     P 
Subjt:  CVNSDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTILMSSLHSSPISSISPASLMALILSLTPT

Query:  PSNRLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGSNHPIFGELFPSSIERKSEKNCGEKGLWIKITWAKLF
        P                          T  APR      +R  R + ++                     +LF   +   SE +   K  +    W K  
Subjt:  PSNRLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGSNHPIFGELFPSSIERKSEKNCGEKGLWIKITWAKLF

Query:  PFYACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVIKLTTIQILLSLVVRFGSVIKQLD
               +  +   L +N TW LV       ++ CKWVY++K K DGS+ERYK R +A+ ++QT G+DYFETFS V+K  TI+I+L++ + F   I+QLD
Subjt:  PFYACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVIKLTTIQILLSLVVRFGSVIKQLD

Query:  VHNVFLNGDLHE----DVYMVQPPGFIDKDKSNHVCKLQKTLYNGEALPNPKQYRSIVEALQYCIITKPDLSFEAN--C-----PNDL------------
        VHN FLN  LH     D    + PG + K+ S          ++G+ + +   YRS+V ALQY  +T+PD++F  N  C     P  +            
Subjt:  VHNVFLNGDLHE----DVYMVQPPGFIDKDKSNHVCKLQKTLYNGEALPNPKQYRSIVEALQYCIITKPDLSFEAN--C-----PNDL------------

Query:  --------------CNTSGYFIFHGSNLVSWSSAKQKVVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKH
                       ++SGY ++ G NLVSWSS KQKVVS SS EFEYRGL  A AE+VW+Q+LL EL +  P+ PLL  DN +A ++A NP+FH+R+KH
Subjt:  --------------CNTSGYFIFHGSNLVSWSSAKQKVVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKH

Query:  IKIDYHFIREKVMRKQLIVRFVPSEDQLANILTKPLPTARFPSLHTKLTVQSR
        IKID HFIR++VMR ++ + FVP+E+Q  ++LTK L ++RF SL ++L +  R
Subjt:  IKIDYHFIREKVMRKQLIVRFVPSEDQLANILTKPLPTARFPSLHTKLTVQSR

A5AYB0 Integrase catalytic domain-containing protein8.1e-14030.94Show/hide
Query:  DAPTLEDVRSLLLAYEARLERQ-TTVEQLNLAQANLSS-----HNIHHSNRRSFSKPQPQFNHFSKSSFTSSNQQSPFSPSILGKPQPNTSWTSKPN--N
        D  +L  V S+LL +E RL  Q ++    + A A+++S      N  H  R      +PQ     ++S +S+   + F P       P  S  +KP+  +
Subjt:  DAPTLEDVRSLLLAYEARLERQ-TTVEQLNLAQANLSS-----HNIHHSNRRSFSKPQPQFNHFSKSSFTSSNQQSPFSPSILGKPQPNTSWTSKPN--N

Query:  SKPQCQICGNFGHTALICHHRTNLTYQTPPPQPQALLHTSQPAVLATPTTFSDTSSVNQADFTHPDESWFPDFGATHHMTSDISSLCNPMAYNGGDQITI
        ++PQCQ+CG FGHTA+ C+HR ++ YQ     P A    S   + A P               H D SWF D GATHH++    +L     Y+G DQ+TI
Subjt:  SKPQCQICGNFGHTALICHHRTNLTYQTPPPQPQALLHTSQPAVLATPTTFSDTSSVNQADFTHPDESWFPDFGATHHMTSDISSLCNPMAYNGGDQITI

Query:  GNGKQISISHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKDLQTKTILLWGKLEDGLYKLSSSFNTVVSSSSKSPTTF
        G+G  + I + G+      SK   L  VLH P ++  L+SVSK   DN  + + +SS F VKD  TK ILL G L DGLY+          SSS  P  F
Subjt:  GNGKQISISHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKDLQTKTILLWGKLEDGLYKLSSSFNTVVSSSSKSPTTF

Query:  LSSVQVLPN--WHLQLGHLAASTLKQVLSSCGVS-----------------------------------------------------------FVDDFTR
        +++        WH +LGH A   L + L+SC  S                                                           F+DD++R
Subjt:  LSSVQVLPN--WHLQLGHLAASTLKQVLSSCGVS-----------------------------------------------------------FVDDFTR

Query:  FTWMYMLKSKDETFQCFLSFKKLVE-------------------------------------------------------------VQFGLPLSFWSFAF
         TW+Y L +KD+  Q F++F+K+VE                                                              Q  LP  +W++AF
Subjt:  FTWMYMLKSKDETFQCFLSFKKLVE-------------------------------------------------------------VQFGLPLSFWSFAF

Query:  QTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSNHKGYLCLDPTTDRLFVSRHV-FNEAFFPCVN
        QTAVY+IN LP  +LH +SP   L++ LPNY   LRVF C CFP LRP+  HKL +RST C+F+G++  HKGYLCLD +T+R+++SR+V F+E+ FP   
Subjt:  QTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSNHKGYLCLDPTTDRLFVSRHV-FNEAFFPCVN

Query:  SDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTILMSSLHSSPISSISPASLMALILSLTPTPSN
            F S  SSPP     SP+P H+PS + +++ SP++S        PS+P   SP             +++S    P+  +  A+      S  P P N
Subjt:  SDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTILMSSLHSSPISSISPASLMALILSLTPTPSN

Query:  RLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGSNHPIFGELFPSSIERKSEKNCGEKGLWIKITWAKLFPFY
            +     AK  +  ++      T+ PR Y                         SQ S +                            +W       
Subjt:  RLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGSNHPIFGELFPSSIERKSEKNCGEKGLWIKITWAKLFPFY

Query:  ACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVIKLTTIQILLSLVVRFGSVIKQLDVHN
          + ++++Y+ L++NNTW LVP   +  +V C+W+Y++K + DGS++R+K R +A+ F QT GIDYF+TFS V+K  TI+++L+L V F   ++QLDV N
Subjt:  ACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVIKLTTIQILLSLVVRFGSVIKQLDVHN

Query:  VFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQKTLY----------------------------------------------------------------
         FLNGDL E+V+M QP GF++     +VCKL K LY                                                                
Subjt:  VFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQKTLY----------------------------------------------------------------

Query:  ------------------------------------------------------------------NGEALPNPKQYRSIVEALQYCIITKPDLSFEAN-
                                                                          +G +L +P +YR  V ALQY  +T+PD++F  N 
Subjt:  ------------------------------------------------------------------NGEALPNPKQYRSIVEALQYCIITKPDLSFEAN-

Query:  ---------------------------------------------------CPNDLCNTSGYFIFHGSNLVSWSSAKQKVVSRSSVEFEYRGLSNAAAEL
                                                           CP+D  +TSGY +F GSNL+SWSS+KQ++VS+SS E EYRGL +  AEL
Subjt:  ---------------------------------------------------CPNDLCNTSGYFIFHGSNLVSWSSAKQKVVSRSSVEFEYRGLSNAAAEL

Query:  VWIQSLLSELGLCFP-SPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIVRFVPSEDQLANILTKPLPTARFPSLHTKLTV
        VWIQSLL E  LC P SPP+L CDN +A +LA+NP+FHSRSKHI++D HFIREKV+R++L + +VPS DQLA+I TK LP  +F +L +KLTV
Subjt:  VWIQSLLSELGLCFP-SPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIVRFVPSEDQLANILTKPLPTARFPSLHTKLTV

A5BMF5 Uncharacterized protein8.7e-15031.53Show/hide
Query:  PNPNLFTPNPYPTLPQPLVVKLNVSNFLLWKNQLLNVVLANGLYGFLDGSIPAPPKF-------------------------------------------
        PN + F  +  P+L Q   V+L+ SN+LLW+ Q+LN+++ANGL   + G I AP +F                                           
Subjt:  PNPNLFTPNPYPTLPQPLVVKLNVSNFLLWKNQLLNVVLANGLYGFLDGSIPAPPKF-------------------------------------------

Query:  --------------------------DAPTLEDVRSLLLAYEARLERQTTVEQLNLAQANLSSHNIHHSNRRS-----------FSKPQPQFNHFS----
                                  +  +LE++ S+LL +E RLE+Q T E+ NL QAN+++ NI   N+++            ++ Q QFNH +    
Subjt:  --------------------------DAPTLEDVRSLLLAYEARLERQTTVEQLNLAQANLSSHNIHHSNRRS-----------FSKPQPQFNHFS----

Query:  KSSFTSSNQQSPFSPSILGKPQPNTSWTSK-------PNNSKPQCQICGNFGHTALICHHRTNLTYQTPPPQPQALLHTSQPAVLATPTTFSDTSSVNQA
        +     +N    F          N S++S+        +N KPQCQ+CG +GH A+ C+HR + TY       Q  ++    A++ATP+T          
Subjt:  KSSFTSSNQQSPFSPSILGKPQPNTSWTSK-------PNNSKPQCQICGNFGHTALICHHRTNLTYQTPPPQPQALLHTSQPAVLATPTTFSDTSSVNQA

Query:  DFTHPDESWFPDFGATHHMTSDISSLCNPMAYNGGDQITIGNGKQISISHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFP
             DESW+ D GATHH+T +++ L +   + G D++ +GNG +++IS+IG S ISS+S+ + L+N+LH P +T  L+SV++LC DN   V+F+++ F 
Subjt:  DFTHPDESWFPDFGATHHMTSDISSLCNPMAYNGGDQITIGNGKQISISHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFP

Query:  VKDLQTKTILLWGKLEDGLYKLSSSF----------NTVVSSSS-KSPTTFLSSVQVLPN----WHLQLGH-------------------LAASTLKQ--
        VKD  +K  LL G L  GLYKLSSS           N +   +S  +    +SS   L N    WH +LGH                   L+ S   Q  
Subjt:  VKDLQTKTILLWGKLEDGLYKLSSSF----------NTVVSSSS-KSPTTFLSSVQVLPN----WHLQLGH-------------------LAASTLKQ--

Query:  ------------VLSSCG----VSFVDDFTRFTWMYMLKSKDETFQCFLSFKKLVEVQF-----------------------------------------
                    V+ + G    V FVDD TRF+W+Y+L  KD+    FL FK ++E QF                                         
Subjt:  ------------VLSSCG----VSFVDDFTRFTWMYMLKSKDETFQCFLSFKKLVEVQF-----------------------------------------

Query:  --------------------GLPLSFWSFAFQTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSN
                             +PL +W FAFQ+A+Y+INRLP++VL+  SPY  LY+CLPNYS  LRV+ C C+PFLRPFN HK  +RS +C FIG+SS 
Subjt:  --------------------GLPLSFWSFAFQTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSN

Query:  HKGYLCLDPTTDRLFVSRHVFNEAFFPCVNSDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTIL
        HKGYLCL+ +  ++ +SRHV                              AP             P   P+S+   +P  P         +  R  N I 
Subjt:  HKGYLCLDPTTDRLFVSRHVFNEAFFPCVNSDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTIL

Query:  MSSLHSSPISSISPASLMALILSLTPTPSNRLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGSNHPIFGELF
           +  SP  +I             P       ++  WQ+A                                                           
Subjt:  MSSLHSSPISSISPASLMALILSLTPTPSNRLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGSNHPIFGELF

Query:  PSSIERKSEKNCGEKGLWIKITWAKLFPFYACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETF
                                          +H ++  L+ N TW+LVP      ++ C WVY++K K DG+VERYK R +A+ F QT   DYFETF
Subjt:  PSSIERKSEKNCGEKGLWIKITWAKLFPFYACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETF

Query:  SLVIKLTTIQILLSLVVRFGSVIKQLDVHNVFLNGDLHED----------VYMVQPPGFIDKDKSNHVCKLQK--------------------TLYNGEA
        S V+K TTI+++LSL +     I+QLDVHN FLNGDL E           + ++     +   ++ ++C L +                    ++ +G  
Subjt:  SLVIKLTTIQILLSLVVRFGSVIKQLDVHNVFLNGDLHED----------VYMVQPPGFIDKDKSNHVCKLQK--------------------TLYNGEA

Query:  LPNPKQYRSIVEALQYCIITKPDLSFEAN--CPNDLCNTSGYFIFHGSNLVSWSSAKQKVVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLL
        L +P  YRS+V ALQYC IT+PD+++  N  C      TS +       L    ++KQKVVSRSS E EYRGL+NAAA+L WIQSLL EL +    PP+L
Subjt:  LPNPKQYRSIVEALQYCIITKPDLSFEAN--CPNDLCNTSGYFIFHGSNLVSWSSAKQKVVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLL

Query:  LCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIVRFVPSEDQLANILTKPLPTARFPSLHTKLTVQSR
          DN++ TYLA+NP+ HSR+KH++IDYHF+RE+V+ K L VRF+PS+DQ+ +ILTK L T RF  L +KLTV SR
Subjt:  LCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIVRFVPSEDQLANILTKPLPTARFPSLHTKLTVQSR

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.6e-3422.96Show/hide
Query:  LPLSFWSFAFQTAVYVINRLPTTVL--HSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSNHKGYLCLDPTTDRLFVSRH
        L  SFW  A  TA Y+INR+P+  L   SK+PY M +N  P Y   LRVF    +  ++     K   +S + +F+G+  N  G+   D   ++  V+R 
Subjt:  LPLSFWSFAFQTAVYVINRLPTTVL--HSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSNHKGYLCLDPTTDRLFVSRH

Query:  VFNE----------AFFPCVNSDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTILMSSLHSSPI
        V  +           F      D K    K+ P D        F   S     ++    S  S +   P+              ++C+ I    L  S  
Subjt:  VFNE----------AFFPCVNSDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTILMSSLHSSPI

Query:  SSISPASLMALILSLTPTPSNRLERLDAWQEAKKDLESRQRRDATTT--------SAPRKYD-----VRASRRYRLVQQESFRPAPINSPLSQGSNHPIF
        S+                 S + +R D   E+K      + R++ T           P K D      R S R +   Q S+     +      + H IF
Subjt:  SSISPASLMALILSLTPTPSNRLERLDAWQEAKKDLESRQRRDATTT--------SAPRKYD-----VRASRRYRLVQQESFRPAPINSPLSQGSNHPIF

Query:  GELFPSS---IERKSEKNCGEKGLWIKITWAKLFPFYACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQG
         ++ P+S   I+ + +K+  E+ +  ++   K+                  NNTW +        +V  +WV+ +K    G+  RYK R +AR F Q   
Subjt:  GELFPSS---IERKSEKNCGEKGLWIKITWAKLFPFYACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQG

Query:  IDYFETFSLVIKLTTIQILLSLVVRFGSVIKQLDVHNVFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQKTLY---------------------------
        IDY ETF+ V ++++ + +LSLV+++   + Q+DV   FLNG L E++YM  P G      S++VCKL K +Y                           
Subjt:  IDYFETFSLVIKLTTIQILLSLVVRFGSVIKQLDVHNVFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQKTLY---------------------------

Query:  -----------------------------------------------------------------------------------------NGEALPNPKQY
                                                                                                 N  + P P + 
Subjt:  -----------------------------------------------------------------------------------------NGEALPNPKQY

Query:  ---------------RSIVEALQYCII-TKPDLSFEANCPN------------------------------------------------------DLCNT
                       RS++  L Y ++ T+PDL+   N  +                                                      D  +T
Subjt:  ---------------RSIVEALQYCII-TKPDLSFEANCPN------------------------------------------------------DLCNT

Query:  SGY-FIFHGSNLVSWSSAKQKVVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQ
        +GY F     NL+ W++ +Q  V+ SS E EY  L  A  E +W++ LL+ + +   +P  +  DN     +A+NP  H R+KHI I YHF RE+V    
Subjt:  SGY-FIFHGSNLVSWSSAKQKVVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQ

Query:  LIVRFVPSEDQLANILTKPLPTARFPSLHTKL
        + + ++P+E+QLA+I TKPLP ARF  L  KL
Subjt:  LIVRFVPSEDQLANILTKPLPTARFPSLHTKL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.3e-3520.77Show/hide
Query:  PDESWFPDFGATHHMTSDISSLCNPMAYNGGDQITIGNGKQISISHIGSSLI-SSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKD
        P+  W  D  A+HH T      C  +A + G  + +GN     I+ IG   I +++   ++L++V H P +   L+S   L +D       Y S+F  + 
Subjt:  PDESWFPDFGATHHMTSDISSLCNPMAYNGGDQITIGNGKQISISHIGSSLI-SSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKD

Query:  ---LQTKTILLWGKLEDGLYKLSSSFNTVVSSSSKSPTTFLSSVQVLPNWHLQLGHLAASTLKQVLSS--------------------------------
            +   ++  G     LY+     N  +     +      SV +   WH ++GH++   L Q+L+                                 
Subjt:  ---LQTKTILLWGKLEDGLYKLSSSFNTVVSSSSKSPTTFLSSVQVLPNWHLQLGHLAASTLKQVLSS--------------------------------

Query:  --------------CG-------------VSFVDDFTRFTWMYMLKSKDETFQCFLSFKKLVEVQFG---------------------------------
                      CG             V+F+DD +R  W+Y+LK+KD+ FQ F  F  LVE + G                                 
Subjt:  --------------CG-------------VSFVDDFTRFTWMYMLKSKDETFQCFLSFKKLVEVQFG---------------------------------

Query:  ------------------------------LPLSFWSFAFQTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTE
                                      LP SFW  A QTA Y+INR P+  L  + P  +  N   +YS  L+VF C  F  +      KL  +S  
Subjt:  ------------------------------LPLSFWSFAFQTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTE

Query:  CLFIGHSSNHKGYLCLDPTTDRLFVSRHVFNEAFFPCVNSDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDL
        C+FIG+     GY   DP   ++  SR V            F+   ++                                +A+  +     G  P     
Subjt:  CLFIGHSSNHKGYLCLDPTTDRLFVSRHVFNEAFFPCVNSDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDL

Query:  VHRQCNTILMSSLHSSPISSISPASLMALILSLTPTPSNRLERLDAWQEAKKDLE-----SRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSP
             N + + S  ++P S+ S    ++        P   +E+ +   E  +++E       Q +    +  PR      SRRY   +            
Subjt:  VHRQCNTILMSSLHSSPISSISPASLMALILSLTPTPSNRLERLDAWQEAKKDLE-----SRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSP

Query:  LSQGSNHPIFGELFPSSIERKSEKNCGEKGLWIKITWAKLFPFYACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIAR
        L +  +HP              EKN                     + +  +  +L +N T+ LV      + + CKWV+++K   D  + RYK R + +
Subjt:  LSQGSNHPIFGELFPSSIERKSEKNCGEKGLWIKITWAKLFPFYACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIAR

Query:  DFDQTQGIDYFETFSLVIKLTTIQILLSLVVRFGSVIKQLDVHNVFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQKTLYNGEALP--------------
         F+Q +GID+ E FS V+K+T+I+ +LSL       ++QLDV   FL+GDL E++YM QP GF    K + VCKL K+LY  +  P              
Subjt:  DFDQTQGIDYFETFSLVIKLTTIQILLSLVVRFGSVIKQLDVHNVFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQKTLYNGEALP--------------

Query:  ---------------------------------------------------------NPKQ---------------------------------------
                                                                  P Q                                       
Subjt:  ---------------------------------------------------------NPKQ---------------------------------------

Query:  -----------------------------YRSIVEALQYCII-TKPDLSF------------------------------------------------EA
                                     Y S V +L Y ++ T+PD++                                                 +A
Subjt:  -----------------------------YRSIVEALQYCII-TKPDLSF------------------------------------------------EA

Query:  NCPNDLCN---TSGYFIFHGSNLVSWSSAKQKVVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKHIKIDY
        +   D+ N   ++GY        +SW S  QK V+ S+ E EY   +    E++W++  L ELGL      ++ CD+ +A  L+ N ++H+R+KHI + Y
Subjt:  NCPNDLCN---TSGYFIFHGSNLVSWSSAKQKVVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKHIKIDY

Query:  HFIREKVMRKQLIVRFVPSEDQLANILTKPLPTARF
        H+IRE V  + L V  + + +  A++LTK +P  +F
Subjt:  HFIREKVMRKQLIVRFVPSEDQLANILTKPLPTARF

P92520 Uncharacterized mitochondrial protein AtMg008201.1e-1138.1Show/hide
Query:  CEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVIKLTTIQILLSL
        C+ +  +   L +N TWILVP      ++ CKWV++ KL  DG+++R K R +A+ F Q +GI + ET+S V++  TI+ +L++
Subjt:  CEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVIKLTTIQILLSL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.3e-12028.48Show/hide
Query:  PTLEDVRSLLLAYEARLERQTTVEQLNLAQANLSSHNIHHSNRRSFSKPQPQFNHFSKSSFTSSNQQSPFSPSILGKPQPNTSWTSKPNNSKP---QCQI
        PTL ++   LL +E+++   ++   + +    +S  N   +N  +      ++++ +     ++N   P+        Q +T++    N SKP   +CQI
Subjt:  PTLEDVRSLLLAYEARLERQTTVEQLNLAQANLSSHNIHHSNRRSFSKPQPQFNHFSKSSFTSSNQQSPFSPSILGKPQPNTSWTSKPNNSKP---QCQI

Query:  CGNFGHTALIC----HHRTNLTYQTPPP-----QPQALLHTSQPAVLATPTTFSDTSSVNQADFTHPDESWFPDFGATHHMTSDISSLCNPMAYNGGDQI
        CG  GH+A  C    H  +++  Q PP      QP+A L    P                     +   +W  D GATHH+TSD ++L     Y GGD +
Subjt:  CGNFGHTALIC----HHRTNLTYQTPPP-----QPQALLHTSQPAVLATPTTFSDTSSVNQADFTHPDESWFPDFGATHHMTSDISSLCNPMAYNGGDQI

Query:  TIGNGKQISISHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKDLQTKTILLWGKLEDGLYKLSSSFNTVVSSSSKSPT
         + +G  I ISH GS+ +S+ S+P+ L N+L+ P+I K L+SV +LC  N   V+F+ + F VKDL T   LL GK +D LY+       + SS   S  
Subjt:  TIGNGKQISISHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKDLQTKTILLWGKLEDGLYKLSSSFNTVVSSSSKSPT

Query:  TFLSSVQVLPNWHLQLGHLAASTLKQVLS--------------SCG--------------------------------------------VSFVDDFTRF
           SS     +WH +LGH A S L  V+S              SC                                             V FVD FTR+
Subjt:  TFLSSVQVLPNWHLQLGHLAASTLKQVLS--------------SCG--------------------------------------------VSFVDDFTRF

Query:  TWMYMLKSKDETFQCFLSFKKLVEVQF-------------------------------------------------------------GLPLSFWSFAFQ
        TW+Y LK K +  + F++FK L+E +F                                                              +P ++W +AF 
Subjt:  TWMYMLKSKDETFQCFLSFKKLVEVQF-------------------------------------------------------------GLPLSFWSFAFQ

Query:  TAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSNHKGYLCLDPTTDRLFVSRHV-FNEAFFPCVNS
         AVY+INRLPT +L  +SP+  L+   PNY   LRVF CAC+P+LRP+N HKL  +S +C+F+G+S     YLCL   T RL++SRHV F+E  FP  N 
Subjt:  TAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSNHKGYLCLDPTTDRLFVSRHV-FNEAFFPCVNS

Query:  DFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTILMSSLHSSPISSISPASLMALILSLTPTPSNR
              ++    + S        +P+    VL +P+ S    ++  PS+P     NS     +  ++ L SS  SS  SS  P +        T  P+  
Subjt:  DFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTILMSSLHSSPISSISPASLMALILSLTPTPSNR

Query:  LERLDAWQEAKKDLESRQRRD--ATTTSAPRKYDVRASRRYRLVQQESFRPAPIN------SPLSQGSNHPIFGELFPSSIERKSE----KNCGEKGLWI
          +  + Q   ++  + +     A + S P +    +          S  P P +       PL+Q  N+     L   S+  +++    K   +  L +
Subjt:  LERLDAWQEAKKDLESRQRRD--ATTTSAPRKYDVRASRRYRLVQQESFRPAPIN------SPLSQGSNHPIFGELFPSSIERKSE----KNCGEKGLWI

Query:  KITWAKLFPFYACEKLHN-KYRNL--------VQNNTWILVPNSFAY-KLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVIKLTTI
         +  A+  P  A + L + ++RN         + N+TW LVP   ++  +V C+W++  K   DGS+ RYK R +A+ ++Q  G+DY ETFS VIK T+I
Subjt:  KITWAKLFPFYACEKLHN-KYRNL--------VQNNTWILVPNSFAY-KLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVIKLTTI

Query:  QILLSLVVRFGSVIKQLDVHNVFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQK----------------------------------------------
        +I+L + V     I+QLDV+N FL G L +DVYM QPPGFIDKD+ N+VCKL+K                                              
Subjt:  QILLSLVVRFGSVIKQLDVHNVFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQK----------------------------------------------

Query:  ------------------------------------------------------------------------------------TLYNGEALPNPKQYRS
                                                                                            +LY+G  L +P +YR 
Subjt:  ------------------------------------------------------------------------------------TLYNGEALPNPKQYRS

Query:  IVEALQYCIITKPDLSFEAN--------------------------CPN--------------------------DLCNTSGYFIFHGSNLVSWSSAKQK
        IV +LQY   T+PD+S+  N                           PN                          D  +T+GY ++ G + +SWSS KQK
Subjt:  IVEALQYCIITKPDLSFEAN--------------------------CPN--------------------------DLCNTSGYFIFHGSNLVSWSSAKQK

Query:  VVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIVRFVPSEDQLANILTKPLP
         V RSS E EYR ++N ++E+ WI SLL+ELG+    PP++ CDN+ ATYL +NP+FHSR KHI IDYHFIR +V    L V  V + DQLA+ LTKPL 
Subjt:  VVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIVRFVPSEDQLANILTKPLP

Query:  TARFPSLHTKLTV
           F +  +K+ V
Subjt:  TARFPSLHTKLTV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.7e-11327.29Show/hide
Query:  PTLEDVRSLLLAYEARLERQTTVEQLNLAQANLSSHNIHHSNRRSFSKPQPQFNHFSKSSFTSSNQQS-PFSPSILGKPQPNTSWTSKPNNSKPQCQICG
        P+L ++   L+  E++L    + E + +  AN+ +H   ++NR        Q N     ++ ++N +S  + PS  G    N     +P     +CQIC 
Subjt:  PTLEDVRSLLLAYEARLERQTTVEQLNLAQANLSSHNIHHSNRRSFSKPQPQFNHFSKSSFTSSNQQS-PFSPSILGKPQPNTSWTSKPNNSKPQCQICG

Query:  NFGHTALICHHRTNLTYQTPPPQPQALLHTSQPAVLATPTTFSDTSSVNQADFTHPDESWFPDFGATHHMTSDISSLCNPMAYNGGDQITIGNGKQISIS
          GH+A  C            PQ      T+      +P T     +    +  +   +W  D GATHH+TSD ++L     Y GGD + I +G  I I+
Subjt:  NFGHTALICHHRTNLTYQTPPPQPQALLHTSQPAVLATPTTFSDTSSVNQADFTHPDESWFPDFGATHHMTSDISSLCNPMAYNGGDQITIGNGKQISIS

Query:  HIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKDLQTKTILLWGKLEDGLYKLSSSFNTVVSSSSKSPTTFLSSVQVLPN
        H GS+ + + S+ + L  VL+ P+I K L+SV +LC  N+  V+F+ + F VKDL T   LL GK +D LY+       + SS + S      S     +
Subjt:  HIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKDLQTKTILLWGKLEDGLYKLSSSFNTVVSSSSKSPTTFLSSVQVLPN

Query:  WHLQLGHLAASTLKQVLS--------------SCG--------------------------------------------VSFVDDFTRFTWMYMLKSKDE
        WH +LGH + + L  V+S              SC                                             V FVD FTR+TW+Y LK K +
Subjt:  WHLQLGHLAASTLKQVLS--------------SCG--------------------------------------------VSFVDDFTRFTWMYMLKSKDE

Query:  TFQCFLSFKKLVEVQF-------------------------------------------------------------GLPLSFWSFAFQTAVYVINRLPT
            F+ FK LVE +F                                                              +P ++W +AF  AVY+INRLPT
Subjt:  TFQCFLSFKKLVEVQF-------------------------------------------------------------GLPLSFWSFAFQTAVYVINRLPT

Query:  TVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSNHKGYLCLDPTTDRLFVSRHV-FNEAFFPCVNSDFKFGSIK---
         +L  +SP+  L+   PNY   L+VF CAC+P+LRP+N HKL  +S +C F+G+S     YLCL   T RL+ SRHV F+E  FP   ++F   + +   
Subjt:  TVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRPFNAHKLLFRSTECLFIGHSSNHKGYLCLDPTTDRLFVSRHV-FNEAFFPCVNSDFKFGSIK---

Query:  --SSP--PDHSFFSPAPFHIP-----------------------SVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTILMSSLHSSP-ISS
          S+P  P H+     P  +P                       +  +S    P+ S +S SS+ P+AP    P  +   H+  N+   S + ++P  +S
Subjt:  --SSP--PDHSFFSPAPFHIP-----------------------SVSISVLKSPNISPASASSATPSAPPGFSPNSSDLVHRQCNTILMSSLHSSP-ISS

Query:  ISPASLMALILSLTPTPSNRLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGSNHPIFGELFPSSIERKSEKN
         SP S        +P P + +        +    E      ++T++ P    + A    ++  Q       + +    G   P     + +S+   SE  
Subjt:  ISPASLMALILSLTPTPSNRLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGSNHPIFGELFPSSIERKSEKN

Query:  CGEKGLWIKITWAKLFPFYACEKLHNKYRNLVQNNTWILV-PNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVIKLTTIQ
           + +     W         + + ++    + N+TW LV P   +  +V C+W++  K   DGS+ RYK R +A+ ++Q  G+DY ETFS VIK T+I+
Subjt:  CGEKGLWIKITWAKLFPFYACEKLHNKYRNLVQNNTWILV-PNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVIKLTTIQ

Query:  ILLSLVVRFGSVIKQLDVHNVFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQK-----------------------------------------------
        I+L + V     I+QLDV+N FL G L ++VYM QPPGF+DKD+ ++VC+L+K                                               
Subjt:  ILLSLVVRFGSVIKQLDVHNVFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQK-----------------------------------------------

Query:  -----------------------------------------------------------------------------------TLYNGEALPNPKQYRSI
                                                                                           TL++G  LP+P +YR I
Subjt:  -----------------------------------------------------------------------------------TLYNGEALPNPKQYRSI

Query:  VEALQYCIITKPDLSFEAN----------------------------------------------------CPNDLCNTSGYFIFHGSNLVSWSSAKQKV
        V +LQY   T+PDLS+  N                                                      +D  +T+GY ++ G + +SWSS KQK 
Subjt:  VEALQYCIITKPDLSFEAN----------------------------------------------------CPNDLCNTSGYFIFHGSNLVSWSSAKQKV

Query:  VSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIVRFVPSEDQLANILTKPLPT
        V RSS E EYR ++N ++EL WI SLL+ELG+    PP++ CDN+ ATYL +NP+FHSR KHI +DYHFIR +V    L V  V + DQLA+ LTKPL  
Subjt:  VSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIVRFVPSEDQLANILTKPLPT

Query:  ARFPSLHTKLTV
          F +   K+ V
Subjt:  ARFPSLHTKLTV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.3e-3825.71Show/hide
Query:  CEKLHNKYRNLVQNNTW---ILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVIKLTTIQILLSLVVRFGSVIKQLDV
        C  + ++   +   +TW    L PN    K + CKWVY+IK   DG++ERYK R +A+ + Q +GID+ ETFS V KLT+++++L++   +   + QLD+
Subjt:  CEKLHNKYRNLVQNNTW---ILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVIKLTTIQILLSLVVRFGSVIKQLDV

Query:  HNVFLNGDLHEDVYMVQPPGFI----DKDKSNHVCKLQKTLY----------------------------------------------------------
         N FLNGDL E++YM  PPG+     D    N VC L+K++Y                                                          
Subjt:  HNVFLNGDLHEDVYMVQPPGFI----DKDKSNHVCKLQKTLY----------------------------------------------------------

Query:  ------------------------------------------------------------------------NGEALPNPKQYRSIVEALQYCIITKPDL
                                                                                +G    + K YR ++  L Y  IT+ D+
Subjt:  ------------------------------------------------------------------------NGEALPNPKQYRSIVEALQYCIITKPDL

Query:  SFEAN----------------------------------------------------CPNDLCNTSGYFIFHGSNLVSWSSAKQKVVSRSSVEFEYRGLS
        SF  N                                                    C +   +T+GY +F G++L+SW S KQ+VVS+SS E EYR LS
Subjt:  SFEAN----------------------------------------------------CPNDLCNTSGYFIFHGSNLVSWSSAKQKVVSRSSVEFEYRGLS

Query:  NAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREK
         A  E++W+     EL L    P LL CDN  A ++A+N +FH R+KHI+ D H +RE+
Subjt:  NAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREK

ATMG00810.1 DNA/RNA polymerases superfamily protein4.6e-1030.53Show/hide
Query:  PNPKQYRSIVEALQYCIITKPDLSFEAN----------------------------------------------------CPNDLCNTSGYFIFHGSNLV
        P+P  +RSIV ALQY  +T+PD+S+  N                                                    C +   +T+G+  F G N++
Subjt:  PNPKQYRSIVEALQYCIITKPDLSFEAN----------------------------------------------------CPNDLCNTSGYFIFHGSNLV

Query:  SWSSAKQKVVSRSSVEFEYRGLSNAAAELVW
        SWS+ +Q  VSRSS E EYR L+  AAEL W
Subjt:  SWSSAKQKVVSRSSVEFEYRGLSNAAAELVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)7.5e-1338.1Show/hide
Query:  CEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVIKLTTIQILLSL
        C+ +  +   L +N TWILVP      ++ CKWV++ KL  DG+++R K R +A+ F Q +GI + ET+S V++  TI+ +L++
Subjt:  CEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVIKLTTIQILLSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGTTCGACCGGCAAATCCTAATCGAAATCATAATCAACCCAAATGCCCAACCTTCTCCTCAGCCTCCCACTAGTTATTTCCCTCCATTCAATCCAAATGTCCAACC
TCCCGCTGGATATTTTCCTTATCAACAGCCTTTCCCTGTTAACCCTCAACCCTTCCCGCAACAGCAGTTCAATCTGAGGCATCTGTTTCAAGCCCCTCAACCACCAAATT
TTCCTCCTAATCCAAATTTATTTACACCTAATCCTTACCCTACTCTACCTCAACCTCTAGTTGTGAAGCTCAACGTCAGCAACTTCTTATTATGGAAAAACCAGTTGCTC
AATGTTGTTCTTGCAAACGGACTGTATGGCTTTCTCGATGGTTCGATTCCAGCGCCTCCCAAATTTGATGCTCCTACTTTAGAAGATGTTAGAAGCCTCCTTTTGGCATA
TGAAGCTCGGTTAGAAAGGCAGACCACTGTTGAACAACTCAACCTAGCTCAAGCAAACCTTAGTAGTCATAATATTCATCATTCTAATCGCCGGTCTTTTTCTAAACCTC
AACCACAATTTAATCACTTTTCCAAGTCCTCCTTTACCTCATCCAACCAACAATCCCCTTTTTCTCCTAGCATACTTGGTAAACCTCAACCTAATACTTCTTGGACCTCC
AAACCAAATAATTCCAAACCTCAGTGTCAAATTTGTGGAAATTTTGGGCACACTGCTCTTATATGCCATCATAGAACAAACTTAACCTATCAAACTCCTCCACCACAACC
TCAAGCCTTATTACATACATCTCAGCCTGCTGTTCTAGCAACTCCCACCACTTTTTCTGATACATCTTCTGTTAATCAAGCTGATTTTACTCATCCTGATGAGTCTTGGT
TTCCTGATTTTGGTGCAACACACCATATGACTTCGGACATCTCATCTCTTTGCAATCCAATGGCATACAATGGTGGTGACCAAATCACAATCGGAAATGGTAAGCAAATA
TCTATATCTCATATTGGTTCATCTTTAATTTCTTCTCTCTCAAAGCCTATTTTGCTTCAAAATGTTTTGCATACTCCTTCGATTACAAAGAAACTGTTAAGTGTATCAAA
ATTATGTAAGGATAATAAGGCTTATGTTCAATTCTACTCTTCTTTCTTTCCTGTCAAAGACCTTCAAACCAAGACCATTCTACTCTGGGGCAAGCTTGAAGATGGTCTCT
ACAAATTGTCCTCCTCTTTCAATACGGTTGTGTCTTCGAGTTCCAAGTCCCCTACAACTTTTTTGTCTTCGGTTCAAGTCCTTCCTAATTGGCATCTTCAGCTGGGCCAC
CTTGCGGCTTCTACTCTGAAGCAAGTTTTGTCTTCATGTGGTGTTTCATTTGTTGATGATTTCACACGTTTCACTTGGATGTATATGTTAAAATCTAAAGATGAAACGTT
CCAATGCTTTCTTTCTTTTAAGAAGCTTGTTGAGGTTCAATTTGGTTTGCCCCTATCTTTTTGGAGCTTTGCTTTTCAAACAGCAGTCTACGTTATCAATAGATTACCAA
CTACCGTTCTTCATTCAAAATCTCCTTATACTATGCTTTATAATTGTTTACCTAATTACTCTTTGCCGCTTAGAGTGTTTAGTTGTGCTTGTTTCCCATTTCTTCGACCT
TTCAATGCTCATAAGCTTCTGTTTCGCTCGACCGAATGCTTGTTCATTGGCCATAGTTCAAACCACAAAGGCTATTTGTGTCTTGATCCAACCACAGATCGTTTATTTGT
GTCAAGACATGTTTTTAATGAGGCTTTCTTTCCTTGTGTTAATAGCGATTTTAAGTTTGGTTCTATAAAGTCTTCTCCACCCGATCACTCTTTCTTCAGTCCTGCTCCTT
TTCATATTCCTTCTGTGTCTATCTCTGTGCTCAAATCTCCTAATATTTCACCTGCTTCCGCTTCCTCTGCTACACCATCTGCTCCACCTGGCTTTTCTCCCAATTCCTCT
GATTTGGTTCACCGTCAATGTAACACCATTCTCATGTCTTCTCTTCACTCATCTCCTATTTCATCCATCTCTCCTGCCTCTTTGATGGCTCTAATTCTATCCCTTACTCC
AACCCCTTCTAATAGACTAGAGCGTCTCGATGCTTGGCAGGAAGCGAAGAAAGATCTAGAATCGCGACAACGTCGCGACGCTACAACAACTTCTGCGCCAAGAAAGTATG
ACGTTAGAGCGTCGCGACGCTACCGACTTGTTCAGCAAGAAAGCTTTCGTCCAGCACCTATAAATAGCCCCCTCTCCCAAGGTTCAAATCATCCCATTTTTGGGGAGCTC
TTCCCTAGCTCAATAGAGAGAAAAAGTGAGAAGAATTGTGGAGAAAAAGGCCTATGGATCAAGATCACTTGGGCTAAATTATTCCCATTTTACGCTTGCGAAAAATTGCA
CAACAAGTATAGGAATTTAGTTCAAAACAACACATGGATTCTTGTGCCAAATTCTTTTGCCTATAAACTTGTAAGTTGTAAATGGGTATATCGGATCAAGCTCAAACTAG
ATGGTTCTGTTGAGAGGTATAAGACACGTTTTATAGCTCGTGATTTTGATCAAACTCAAGGTATAGACTACTTTGAAACTTTTAGCCTTGTGATCAAACTAACGACCATT
CAAATTTTACTGTCTCTTGTTGTGCGGTTTGGCTCGGTTATCAAACAATTAGATGTCCACAATGTTTTTCTCAATGGTGACCTCCATGAGGATGTGTACATGGTCCAACC
GCCTGGTTTCATTGACAAGGATAAATCGAATCATGTTTGTAAACTTCAAAAGACCCTGTATAATGGTGAAGCTCTGCCCAATCCAAAACAGTATCGAAGCATCGTCGAGG
CCCTTCAGTATTGTATAATTACCAAGCCTGATTTGAGCTTTGAGGCAAATTGTCCAAATGATCTCTGCAACACGAGTGGTTACTTTATTTTCCATGGTTCCAACTTGGTA
TCTTGGTCATCTGCTAAGCAAAAGGTTGTCTCTAGGTCCAGTGTTGAGTTCGAATATCGTGGCTTGTCCAATGCTGCTGCTGAACTTGTTTGGATTCAGTCCCTTTTATC
TGAACTTGGTTTATGTTTTCCTTCACCTCCGTTGCTCCTTTGTGATAATATTAACGCAACTTACCTTGCTTCCAATCCAATCTTTCATAGTCGATCCAAGCACATAAAAA
TCGACTATCATTTCATCCGAGAGAAGGTTATGAGAAAACAACTGATTGTTCGTTTCGTTCCATCAGAGGACCAACTAGCAAACATATTAACCAAGCCTCTGCCTACTGCT
CGGTTTCCGAGCTTACACACCAAGCTCACAGTCCAATCGAGATCTGGGTTTTTGGCGAGGACAGCGAACGACAGACGTCGATGGCGTGATCGGAGAAGAGAGAGTAGCGG
GGTTAGATCTGAGGAAGTGGTGGCAGCGCAACTGATCGGAGAAGAAAGAGTTGCTCGACGACGACATTTTTCGTGCAGAATGCTCGATGGCGCAGCAACGCTCTTGACGG
CATCAGCGACTTTTCCGGCGAGACAGCGGTGGCTCCAACGAGGTCTGCCGTCTCCGGCTGCGGCGGTCGGCGGTGACTGCGACGGCCGGTGGGAGTTGTCACCGGCAACT
GGAGTTGAGGAAGAAGAAGACTTGAAGGAGAAGATGAAGACTTGTGGAGAAGATGAAGAAGGAGGAGAGAGAAAGAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGGTTCGACCGGCAAATCCTAATCGAAATCATAATCAACCCAAATGCCCAACCTTCTCCTCAGCCTCCCACTAGTTATTTCCCTCCATTCAATCCAAATGTCCAACC
TCCCGCTGGATATTTTCCTTATCAACAGCCTTTCCCTGTTAACCCTCAACCCTTCCCGCAACAGCAGTTCAATCTGAGGCATCTGTTTCAAGCCCCTCAACCACCAAATT
TTCCTCCTAATCCAAATTTATTTACACCTAATCCTTACCCTACTCTACCTCAACCTCTAGTTGTGAAGCTCAACGTCAGCAACTTCTTATTATGGAAAAACCAGTTGCTC
AATGTTGTTCTTGCAAACGGACTGTATGGCTTTCTCGATGGTTCGATTCCAGCGCCTCCCAAATTTGATGCTCCTACTTTAGAAGATGTTAGAAGCCTCCTTTTGGCATA
TGAAGCTCGGTTAGAAAGGCAGACCACTGTTGAACAACTCAACCTAGCTCAAGCAAACCTTAGTAGTCATAATATTCATCATTCTAATCGCCGGTCTTTTTCTAAACCTC
AACCACAATTTAATCACTTTTCCAAGTCCTCCTTTACCTCATCCAACCAACAATCCCCTTTTTCTCCTAGCATACTTGGTAAACCTCAACCTAATACTTCTTGGACCTCC
AAACCAAATAATTCCAAACCTCAGTGTCAAATTTGTGGAAATTTTGGGCACACTGCTCTTATATGCCATCATAGAACAAACTTAACCTATCAAACTCCTCCACCACAACC
TCAAGCCTTATTACATACATCTCAGCCTGCTGTTCTAGCAACTCCCACCACTTTTTCTGATACATCTTCTGTTAATCAAGCTGATTTTACTCATCCTGATGAGTCTTGGT
TTCCTGATTTTGGTGCAACACACCATATGACTTCGGACATCTCATCTCTTTGCAATCCAATGGCATACAATGGTGGTGACCAAATCACAATCGGAAATGGTAAGCAAATA
TCTATATCTCATATTGGTTCATCTTTAATTTCTTCTCTCTCAAAGCCTATTTTGCTTCAAAATGTTTTGCATACTCCTTCGATTACAAAGAAACTGTTAAGTGTATCAAA
ATTATGTAAGGATAATAAGGCTTATGTTCAATTCTACTCTTCTTTCTTTCCTGTCAAAGACCTTCAAACCAAGACCATTCTACTCTGGGGCAAGCTTGAAGATGGTCTCT
ACAAATTGTCCTCCTCTTTCAATACGGTTGTGTCTTCGAGTTCCAAGTCCCCTACAACTTTTTTGTCTTCGGTTCAAGTCCTTCCTAATTGGCATCTTCAGCTGGGCCAC
CTTGCGGCTTCTACTCTGAAGCAAGTTTTGTCTTCATGTGGTGTTTCATTTGTTGATGATTTCACACGTTTCACTTGGATGTATATGTTAAAATCTAAAGATGAAACGTT
CCAATGCTTTCTTTCTTTTAAGAAGCTTGTTGAGGTTCAATTTGGTTTGCCCCTATCTTTTTGGAGCTTTGCTTTTCAAACAGCAGTCTACGTTATCAATAGATTACCAA
CTACCGTTCTTCATTCAAAATCTCCTTATACTATGCTTTATAATTGTTTACCTAATTACTCTTTGCCGCTTAGAGTGTTTAGTTGTGCTTGTTTCCCATTTCTTCGACCT
TTCAATGCTCATAAGCTTCTGTTTCGCTCGACCGAATGCTTGTTCATTGGCCATAGTTCAAACCACAAAGGCTATTTGTGTCTTGATCCAACCACAGATCGTTTATTTGT
GTCAAGACATGTTTTTAATGAGGCTTTCTTTCCTTGTGTTAATAGCGATTTTAAGTTTGGTTCTATAAAGTCTTCTCCACCCGATCACTCTTTCTTCAGTCCTGCTCCTT
TTCATATTCCTTCTGTGTCTATCTCTGTGCTCAAATCTCCTAATATTTCACCTGCTTCCGCTTCCTCTGCTACACCATCTGCTCCACCTGGCTTTTCTCCCAATTCCTCT
GATTTGGTTCACCGTCAATGTAACACCATTCTCATGTCTTCTCTTCACTCATCTCCTATTTCATCCATCTCTCCTGCCTCTTTGATGGCTCTAATTCTATCCCTTACTCC
AACCCCTTCTAATAGACTAGAGCGTCTCGATGCTTGGCAGGAAGCGAAGAAAGATCTAGAATCGCGACAACGTCGCGACGCTACAACAACTTCTGCGCCAAGAAAGTATG
ACGTTAGAGCGTCGCGACGCTACCGACTTGTTCAGCAAGAAAGCTTTCGTCCAGCACCTATAAATAGCCCCCTCTCCCAAGGTTCAAATCATCCCATTTTTGGGGAGCTC
TTCCCTAGCTCAATAGAGAGAAAAAGTGAGAAGAATTGTGGAGAAAAAGGCCTATGGATCAAGATCACTTGGGCTAAATTATTCCCATTTTACGCTTGCGAAAAATTGCA
CAACAAGTATAGGAATTTAGTTCAAAACAACACATGGATTCTTGTGCCAAATTCTTTTGCCTATAAACTTGTAAGTTGTAAATGGGTATATCGGATCAAGCTCAAACTAG
ATGGTTCTGTTGAGAGGTATAAGACACGTTTTATAGCTCGTGATTTTGATCAAACTCAAGGTATAGACTACTTTGAAACTTTTAGCCTTGTGATCAAACTAACGACCATT
CAAATTTTACTGTCTCTTGTTGTGCGGTTTGGCTCGGTTATCAAACAATTAGATGTCCACAATGTTTTTCTCAATGGTGACCTCCATGAGGATGTGTACATGGTCCAACC
GCCTGGTTTCATTGACAAGGATAAATCGAATCATGTTTGTAAACTTCAAAAGACCCTGTATAATGGTGAAGCTCTGCCCAATCCAAAACAGTATCGAAGCATCGTCGAGG
CCCTTCAGTATTGTATAATTACCAAGCCTGATTTGAGCTTTGAGGCAAATTGTCCAAATGATCTCTGCAACACGAGTGGTTACTTTATTTTCCATGGTTCCAACTTGGTA
TCTTGGTCATCTGCTAAGCAAAAGGTTGTCTCTAGGTCCAGTGTTGAGTTCGAATATCGTGGCTTGTCCAATGCTGCTGCTGAACTTGTTTGGATTCAGTCCCTTTTATC
TGAACTTGGTTTATGTTTTCCTTCACCTCCGTTGCTCCTTTGTGATAATATTAACGCAACTTACCTTGCTTCCAATCCAATCTTTCATAGTCGATCCAAGCACATAAAAA
TCGACTATCATTTCATCCGAGAGAAGGTTATGAGAAAACAACTGATTGTTCGTTTCGTTCCATCAGAGGACCAACTAGCAAACATATTAACCAAGCCTCTGCCTACTGCT
CGGTTTCCGAGCTTACACACCAAGCTCACAGTCCAATCGAGATCTGGGTTTTTGGCGAGGACAGCGAACGACAGACGTCGATGGCGTGATCGGAGAAGAGAGAGTAGCGG
GGTTAGATCTGAGGAAGTGGTGGCAGCGCAACTGATCGGAGAAGAAAGAGTTGCTCGACGACGACATTTTTCGTGCAGAATGCTCGATGGCGCAGCAACGCTCTTGACGG
CATCAGCGACTTTTCCGGCGAGACAGCGGTGGCTCCAACGAGGTCTGCCGTCTCCGGCTGCGGCGGTCGGCGGTGACTGCGACGGCCGGTGGGAGTTGTCACCGGCAACT
GGAGTTGAGGAAGAAGAAGACTTGAAGGAGAAGATGAAGACTTGTGGAGAAGATGAAGAAGGAGGAGAGAGAAAGAATTAA
Protein sequenceShow/hide protein sequence
MWFDRQILIEIIINPNAQPSPQPPTSYFPPFNPNVQPPAGYFPYQQPFPVNPQPFPQQQFNLRHLFQAPQPPNFPPNPNLFTPNPYPTLPQPLVVKLNVSNFLLWKNQLL
NVVLANGLYGFLDGSIPAPPKFDAPTLEDVRSLLLAYEARLERQTTVEQLNLAQANLSSHNIHHSNRRSFSKPQPQFNHFSKSSFTSSNQQSPFSPSILGKPQPNTSWTS
KPNNSKPQCQICGNFGHTALICHHRTNLTYQTPPPQPQALLHTSQPAVLATPTTFSDTSSVNQADFTHPDESWFPDFGATHHMTSDISSLCNPMAYNGGDQITIGNGKQI
SISHIGSSLISSLSKPILLQNVLHTPSITKKLLSVSKLCKDNKAYVQFYSSFFPVKDLQTKTILLWGKLEDGLYKLSSSFNTVVSSSSKSPTTFLSSVQVLPNWHLQLGH
LAASTLKQVLSSCGVSFVDDFTRFTWMYMLKSKDETFQCFLSFKKLVEVQFGLPLSFWSFAFQTAVYVINRLPTTVLHSKSPYTMLYNCLPNYSLPLRVFSCACFPFLRP
FNAHKLLFRSTECLFIGHSSNHKGYLCLDPTTDRLFVSRHVFNEAFFPCVNSDFKFGSIKSSPPDHSFFSPAPFHIPSVSISVLKSPNISPASASSATPSAPPGFSPNSS
DLVHRQCNTILMSSLHSSPISSISPASLMALILSLTPTPSNRLERLDAWQEAKKDLESRQRRDATTTSAPRKYDVRASRRYRLVQQESFRPAPINSPLSQGSNHPIFGEL
FPSSIERKSEKNCGEKGLWIKITWAKLFPFYACEKLHNKYRNLVQNNTWILVPNSFAYKLVSCKWVYRIKLKLDGSVERYKTRFIARDFDQTQGIDYFETFSLVIKLTTI
QILLSLVVRFGSVIKQLDVHNVFLNGDLHEDVYMVQPPGFIDKDKSNHVCKLQKTLYNGEALPNPKQYRSIVEALQYCIITKPDLSFEANCPNDLCNTSGYFIFHGSNLV
SWSSAKQKVVSRSSVEFEYRGLSNAAAELVWIQSLLSELGLCFPSPPLLLCDNINATYLASNPIFHSRSKHIKIDYHFIREKVMRKQLIVRFVPSEDQLANILTKPLPTA
RFPSLHTKLTVQSRSGFLARTANDRRRWRDRRRESSGVRSEEVVAAQLIGEERVARRRHFSCRMLDGAATLLTASATFPARQRWLQRGLPSPAAAVGGDCDGRWELSPAT
GVEEEEDLKEKMKTCGEDEEGGERKN