CuGenDBv2

Lag0026146 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0026146
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr10:30578236..30586745
RNA-Seq Expression: Lag0026146
Synteny: Lag0026146
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
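
The genome location in this record uses a `chromosome:start..end` notation. A minimal sketch of splitting such a string into its parts and computing the span follows. It assumes the coordinates are 1-based and inclusive, which the record does not state; `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end) with integer coordinates."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end


def span_length(start, end):
    # Assumption: 1-based inclusive coordinates, so both endpoints are counted.
    return end - start + 1


chrom, start, end = parse_location("chr10:30578236..30586745")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus spans 8510 bp on chr10.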


Homology
GenBank top hits (e value, % identity, alignment)
XP_019198426.1 PREDICTED: uncharacterized protein LOC109192308 [Ipomoea nil] (e value: 1.2e-135; % identity: 42.7)
Query:  QANGQAEVSNMVTYCSKDSMMVLFPNSRMWIDAASEGSIMNKSPKEVRDIIVNLAESERQSSIKQDGPIASLAKE-VKNFEVKLDKLI--ELVHSIADRL
        Q+  QA  S M    S D ++    N+       +  SI      ++  +  ++++ E Q+S K         KE V    ++  K +  E   S+ +  
Subjt:  QANGQAEVSNMVTYCSKDSMMVLFPNSRMWIDAASEGSIMNKSPKEVRDIIVNLAESERQSSIKQDGPIASLAKE-VKNFEVKLDKLI--ELVHSIADRL

Query:  KVREDCSSCTTISDSSDYHPEGQ--QSDNSLENLIKTIADT-TPSFQQDVRNSF---EINVPLLKDMERIPRYARFMKELCNTKPETKERGRIEVSKTVS
        + +    S +T ++ S    E    Q D+SL   +  +A +     +  +  +F   EIN+PLL+ +++IP+YA+F+KELC  K + K +  I + K VS
Subjt:  KVREDCSSCTTISDSSDYHPEGQ--QSDNSLENLIKTIADT-TPSFQQDVRNSF---EINVPLLKDMERIPRYARFMKELCNTKPETKERGRIEVSKTVS

Query:  TLLQSNMPEKFGDPGLLYIPCVIGSKSIEYAMLDLSASFNVMPYSVYHDLKLNDLRTTGNVFQLADRSYMHPLGVVEDVIVQ-----------------R
         ++ +++P+K  DPG+  +PC+IG+K I+ AMLDL AS NVMP+ +Y  L +N L+ T  V QLADRS +HP+GVVEDV+VQ                  
Subjt:  TLLQSNMPEKFGDPGLLYIPCVIGSKSIEYAMLDLSASFNVMPYSVYHDLKLNDLRTTGNVFQLADRSYMHPLGVVEDVIVQ-----------------R

Query:  FSPSVASILLGRTFVKSARTMINLHRDVLSIEYQEEIVKVNIPQVAKYPDVSASLYHVEIVDSVVHDV----VVSDMINMGLEHDTQDVDIDETGELVPD
               ILLGR F+K+A+T +++ + +LS+E+  EI K NI    KYPD   SLY+V+++DS+V DV       ++ ++  E D  D  +D + E++  
Subjt:  FSPSVASILLGRTFVKSARTMINLHRDVLSIEYQEEIVKVNIPQVAKYPDVSASLYHVEIVDSVVHDV----VVSDMINMGLEHDTQDVDIDETGELVPD

Query:  SSPDLISFDLGISVTSGEMLPSVVQAPKLELKALPEHLKYVYLGEGNRLPVIISKKLTVGQEERLVNMLREHKKAIGWTLAYIIGIDPATCMHRIQLEED
            L S +   S+T  ++LPSV++AP LELK LP+HLKYVYLGEG +LP+IIS KL   QE++LV++L+EHK A+GWTLA I  I P  CMHRI LEED
Subjt:  SSPDLISFDLGISVTSGEMLPSVVQAPKLELKALPEHLKYVYLGEGNRLPVIISKKLTVGQEERLVNMLREHKKAIGWTLAYIIGIDPATCMHRIQLEED

Query:  AKPSREPQRRMNPILREVVKKEIDKLQAACIIYSISDSKWASIKSLLHWKIKRKQLLPVPLERMPLEGCPLDCVMP----------QHFMADHGIVLGHV
        A+P REPQR +NP L++VV KEI KL  A   Y I+ ++    K+             +P       G    C+M           + FM D   V GHV
Subjt:  AKPSREPQRRMNPILREVVKKEIDKLQAACIIYSISDSKWASIKSLLHWKIKRKQLLPVPLERMPLEGCPLDCVMP----------QHFMADHGIVLGHV

Query:  ISERGIEVDKAKIDVIANLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDA
        IS +GIEVDKAK+D+I NLPYPTNVREVRSFLGH GFYR+FIKDFS+ A+P++ LLQK+VTFEFG ECQ AFD LK +LTSAP+I P  W+L FEIMCDA
Subjt:  ISERGIEVDKAKIDVIANLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDA

Query:  SNYAV
        SNYAV
Subjt:  SNYAV

XP_031116495.1 uncharacterized protein LOC116020153 [Ipomoea triloba] (e value: 3.9e-131; % identity: 47.99)
Query:  QQDVRNSF---EINVPLLKDMERIPRYARFMKELCNTKPETKERGRIEVSKTVSTLLQSNMPEKFGDPGLLYIPCVIGSKSIEYAMLDLSASFNVMPYSV
        Q+++  +F   E+N+PLL  +++IPRYA+F+KELC  K + K    + + + +S +LQ  +P K  DPG+  IPC IG+ ++E AMLDL AS NVMP S+
Subjt:  QQDVRNSF---EINVPLLKDMERIPRYARFMKELCNTKPETKERGRIEVSKTVSTLLQSNMPEKFGDPGLLYIPCVIGSKSIEYAMLDLSASFNVMPYSV

Query:  YHDLKLNDLRTTGNVFQLADRSYMHPLGVVEDVIVQ-----------------RFSPSVASILLGRTFVKSARTMINLHRDVLSIEYQEEIVKVNIPQVA
        Y  L +  L+ TG + QLADRS ++P+GV+EDV+VQ                   SP+ + ILLGR F+K+++T I++H   L++E+   ++K NI    
Subjt:  YHDLKLNDLRTTGNVFQLADRSYMHPLGVVEDVIVQ-----------------RFSPSVASILLGRTFVKSARTMINLHRDVLSIEYQEEIVKVNIPQVA

Query:  KYPDVSASLYHVEIVDSVVHDVVVSDMINMGLEHDTQDVDIDETGELVPDSSPDLISFD---LGISVTSGEMLPSVVQAPKLELKALPEHLKYVYLGEGN
        KYPD  +S+   +++  +VHD          L  + QD+ + E   L P      I ++   + +  +S ++LPSV+QAPK+ELK LP+HLKY +LG+G 
Subjt:  KYPDVSASLYHVEIVDSVVHDVVVSDMINMGLEHDTQDVDIDETGELVPDSSPDLISFD---LGISVTSGEMLPSVVQAPKLELKALPEHLKYVYLGEGN

Query:  RLPVIISKKLTVGQEERLVNMLREHKKAIGWTLAYIIGIDPATCMHRIQLEEDAKPSREPQRRMNPILREVVKKEIDKLQAACIIYSISDSKWASIKSLL
         LPV+IS KL+  +EE+LV  LRE+K+AIGWT+A I G+ P+TCMHRI LE+D KPS++PQRR+NP + EVVKKEI KL  A IIY ISDSKW S   + 
Subjt:  RLPVIISKKLTVGQEERLVNMLREHKKAIGWTLAYIIGIDPATCMHRIQLEEDAKPSREPQRRMNPILREVVKKEIDKLQAACIIYSISDSKWASIKSLL

Query:  HWKIKRKQLLPVPLERMPLEGCPLDCVMPQHFMADHGIVLGHVISERGIEVDKAKIDVIANLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQ
           + +K  + V +E    E  P                    +S +GIEVD+AK+DVI +LPYPT+VREVRSFLGHAGFYRRFIKDFSK + P+  LLQ
Subjt:  HWKIKRKQLLPVPLERMPLEGCPLDCVMPQHFMADHGIVLGHVISERGIEVDKAKIDVIANLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQ

Query:  KDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV
        KDVTFEF   C+ AFD LK MLTSAPII  P WDL FE+MCDASNYAV
Subjt:  KDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV

XP_031116558.1 uncharacterized protein LOC116020218 [Ipomoea triloba] (e value: 2.1e-132; % identity: 46.27)
Query:  LKVREDCSSCTTISDSSDYHPEGQQSDNSLENLIKTIAD--TTPSFQQDVRNS---------------FEINVPLLKDMERIPRYARFMKELCNTKPETK
        L+  ++      ++ S+D   E    + +      +++D   TP F Q +  S                E+N+PLL  ++++PRYA+F+KELC TK + K
Subjt:  LKVREDCSSCTTISDSSDYHPEGQQSDNSLENLIKTIAD--TTPSFQQDVRNS---------------FEINVPLLKDMERIPRYARFMKELCNTKPETK

Query:  -ERG-RIEVSKTVSTLLQSNMPEKFGDPGLLYIPCVIGSKSIEYAMLDLSASFNVMPYSVYHDLKLNDLRTTGNVFQLADRSYMHPLGVVEDVIVQ----
         +RG ++ + K VS ++QS +PEK  DPG+  IPC+IG   +E AMLDL AS NVMPYSVY  LKL  L  T  V QLADRS  +P GV+EDV++Q    
Subjt:  -ERG-RIEVSKTVSTLLQSNMPEKFGDPGLLYIPCVIGSKSIEYAMLDLSASFNVMPYSVYHDLKLNDLRTTGNVFQLADRSYMHPLGVVEDVIVQ----

Query:  -----------RFSPSVASILLGRTFVKSARTMINLHRDVLSIEYQEEIVKVNIPQVAKYPDVSASLYHVEIVDSVVHDVVVSDMINMGLEHDTQDVDID
                     +   A ILLGR F+K+A+T I++H   L++E+   IV  NI    K P    S Y ++I DS+V +V          E D +D    
Subjt:  -----------RFSPSVASILLGRTFVKSARTMINLHRDVLSIEYQEEIVKVNIPQVAKYPDVSASLYHVEIVDSVVHDVVVSDMINMGLEHDTQDVDID

Query:  ETGELVPDSSPDLISFDLGISVTSGEMLPSVVQAPKLELKALPEHLKYVYLGEGNRLPVIISKKLTVGQEERLVNMLREHKKAIGWTLAYIIGIDPATCM
           E +   + + +S+ + + V+S ++LPS+VQAP  ELK LPEHLKY +LGE   LPVIIS KL+  +EE+LV++L+EHK AI WT+A I GI P+TCM
Subjt:  ETGELVPDSSPDLISFDLGISVTSGEMLPSVVQAPKLELKALPEHLKYVYLGEGNRLPVIISKKLTVGQEERLVNMLREHKKAIGWTLAYIIGIDPATCM

Query:  HRIQLEEDAKPSREPQRRMNPILREVVKKEIDKLQAACIIYSISDSKWASIKSLLHWKIKRKQLLPVPLERMPLEGCPLDCVMPQHFMADHGIVLGHVIS
        HRI LEE A+PSR+PQRR+NP + EVVK+++ KL    IIY ISDSKW S   ++  K     +     E +P                    VLGHVIS
Subjt:  HRIQLEEDAKPSREPQRRMNPILREVVKKEIDKLQAACIIYSISDSKWASIKSLLHWKIKRKQLLPVPLERMPLEGCPLDCVMPQHFMADHGIVLGHVIS

Query:  ERGIEVDKAKIDVIANLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASN
         RGIEVDKAK+D+I +LPYPTNVREVRSFLGHAGFYRRFIKDFSK ALP+  LLQK+ +FEF  EC+  FD LK++LTSAP+I PP W+  FEIMCDAS+
Subjt:  ERGIEVDKAKIDVIANLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASN

Query:  YAV
        +AV
Subjt:  YAV

XP_038973755.1 uncharacterized protein LOC120105408 [Phoenix dactylifera] (e value: 1.7e-129; % identity: 46.9)
Query:  EINVPLLKDMERIPRYARFMKELCNTKPETKERG--RIEVSKTVSTLLQSNMPEKFGDPGLLYIPCVIGSKSIEYAMLDLSASFNVMPYSVYHDLKLNDL
        E+N+PLL  ++++PRYA+F+KELC  K + K +G   + V + +S ++Q  +P K  DPG+  IPC+IG+   E AM+DL AS NVM YS+Y  LK   L
Subjt:  EINVPLLKDMERIPRYARFMKELCNTKPETKERG--RIEVSKTVSTLLQSNMPEKFGDPGLLYIPCVIGSKSIEYAMLDLSASFNVMPYSVYHDLKLNDL

Query:  RTTGNVFQLADRSYMHPLGVVEDVIVQ---------------RFSPSVASILLGRTFVKSARTMINLHRDVLSIEYQEEIVKVNIPQVAKYPDVSASLYH
          TG V QLADRS  +P GVVEDV+VQ                     A ILLGR F+K+++T I++H   L++E+  EI+K NI    KYP     +Y 
Subjt:  RTTGNVFQLADRSYMHPLGVVEDVIVQ---------------RFSPSVASILLGRTFVKSARTMINLHRDVLSIEYQEEIVKVNIPQVAKYPDVSASLYH

Query:  VEIVDSVVHDVVVSD-------MINMGLEHDTQDVDID-ETGELVP--DSSPDL-----ISFDLGISVTSGEMLPSVVQAPKLELKALPEHLKYVYLGEG
        ++++DS+  +V   D        I+  LE + +++ +  +  E+V   ++ P L     +S+ + + V++G  LPSV+QAP  +LK LP HLKYV+LG+ 
Subjt:  VEIVDSVVHDVVVSD-------MINMGLEHDTQDVDID-ETGELVP--DSSPDL-----ISFDLGISVTSGEMLPSVVQAPKLELKALPEHLKYVYLGEG

Query:  NRLPVIISKKLTVGQEERLVNMLREHKKAIGWTLAYIIGIDPATCMHRIQLEEDAKPSREPQRRMNPILREVVKKEIDKLQAACIIYSISDSKWASIKSL
          LPVIIS KL+  QEE+LV +L+EH+ AIGWT+A I GI P TCMHRI LEE AKPSR+PQRR+NP + +VVKKEI KL    +IY ISDS W S   +
Subjt:  NRLPVIISKKLTVGQEERLVNMLREHKKAIGWTLAYIIGIDPATCMHRIQLEEDAKPSREPQRRMNPILREVVKKEIDKLQAACIIYSISDSKWASIKSL

Query:  LHWKI-------KRKQLLPVPLERMPLE---------GCPLDCVMPQHFMADHGIVLGHVISERGIEVDKAKIDVIANLPYPTNVREVRSFLGHAGFYRR
        +  K        +  +L+P    ++ +           CP      +       IVLGHV+S RGIEVD+AK+D+I +LP PT VREVRSFLGHAGFYRR
Subjt:  LHWKI-------KRKQLLPVPLERMPLE---------GCPLDCVMPQHFMADHGIVLGHVISERGIEVDKAKIDVIANLPYPTNVREVRSFLGHAGFYRR

Query:  FIKDFSKTALPMTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV
        FIKDFSK ALP+  LLQK++ FEF   C+ AFD LK++LTSAP+I PP W++ FEIMCDAS+YAV
Subjt:  FIKDFSKTALPMTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV

XP_038978293.1 uncharacterized protein LOC120108689 [Phoenix dactylifera] (e value: 1.7e-129; % identity: 46.9)
Query:  EINVPLLKDMERIPRYARFMKELCNTKPETKERG--RIEVSKTVSTLLQSNMPEKFGDPGLLYIPCVIGSKSIEYAMLDLSASFNVMPYSVYHDLKLNDL
        E+N+PLL  ++++PRYA+F+KELC  K + K +G   + V + +S ++Q  +P K  DPG+  IPC+IG+   E AM+DL AS NVM YS+Y  LK   L
Subjt:  EINVPLLKDMERIPRYARFMKELCNTKPETKERG--RIEVSKTVSTLLQSNMPEKFGDPGLLYIPCVIGSKSIEYAMLDLSASFNVMPYSVYHDLKLNDL

Query:  RTTGNVFQLADRSYMHPLGVVEDVIVQ---------------RFSPSVASILLGRTFVKSARTMINLHRDVLSIEYQEEIVKVNIPQVAKYPDVSASLYH
          TG V QLADRS  +P GVVEDV+VQ                     A ILLGR F+K+++T I++H   L++E+  EI+K NI    KYP     +Y 
Subjt:  RTTGNVFQLADRSYMHPLGVVEDVIVQ---------------RFSPSVASILLGRTFVKSARTMINLHRDVLSIEYQEEIVKVNIPQVAKYPDVSASLYH

Query:  VEIVDSVVHDVVVSD-------MINMGLEHDTQDVDID-ETGELVP--DSSPDL-----ISFDLGISVTSGEMLPSVVQAPKLELKALPEHLKYVYLGEG
        ++++DS+  +V   D        I+  LE + +++ +  +  E+V   ++ P L     +S+ + + V++G  LPSV+QAP  +LK LP HLKYV+LG+ 
Subjt:  VEIVDSVVHDVVVSD-------MINMGLEHDTQDVDID-ETGELVP--DSSPDL-----ISFDLGISVTSGEMLPSVVQAPKLELKALPEHLKYVYLGEG

Query:  NRLPVIISKKLTVGQEERLVNMLREHKKAIGWTLAYIIGIDPATCMHRIQLEEDAKPSREPQRRMNPILREVVKKEIDKLQAACIIYSISDSKWASIKSL
          LPVIIS KL+  QEE+LV +L+EH+ AIGWT+A I GI P TCMHRI LEE AKPSR+PQRR+NP + +VVKKEI KL    +IY ISDS W S   +
Subjt:  NRLPVIISKKLTVGQEERLVNMLREHKKAIGWTLAYIIGIDPATCMHRIQLEEDAKPSREPQRRMNPILREVVKKEIDKLQAACIIYSISDSKWASIKSL

Query:  LHWKI-------KRKQLLPVPLERMPLE---------GCPLDCVMPQHFMADHGIVLGHVISERGIEVDKAKIDVIANLPYPTNVREVRSFLGHAGFYRR
        +  K        +  +L+P    ++ +           CP      +       IVLGHV+S RGIEVD+AK+D+I +LP PT VREVRSFLGHAGFYRR
Subjt:  LHWKI-------KRKQLLPVPLERMPLE---------GCPLDCVMPQHFMADHGIVLGHVISERGIEVDKAKIDVIANLPYPTNVREVRSFLGHAGFYRR

Query:  FIKDFSKTALPMTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV
        FIKDFSK ALP+  LLQK++ FEF   C+ AFD LK++LTSAP+I PP W++ FEIMCDAS+YAV
Subjt:  FIKDFSKTALPMTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV

TrEMBL top hits (e value, % identity, alignment)
A0A1U8N082 uncharacterized protein LOC107943235 (e value: 5.4e-126; % identity: 37.87)
Query:  LFPNSRMWIDAASEGSIMNKSPKEVRDIIVNLAESERQSSIKQDGPIASLAKEVKNFEVKLDKLIELVHS-IADRLKVREDCSSCTTISDSSD-------
        L P     +DAAS G+++N +P++ RD+I  +A + +Q     + P           E K+D+L  +++S IA++ K    C  C T   ++D       
Subjt:  LFPNSRMWIDAASEGSIMNKSPKEVRDIIVNLAESERQSSIKQDGPIASLAKEVKNFEVKLDKLIELVHS-IADRLKVREDCSSCTTISDSSD-------

Query:  ------------------YH---------------------------------------PEGQQSDNSL-------------ENLIKTIADTTPSFQQDV
                          +H                                       P  +Q+ N++              NL + I    P   + V
Subjt:  ------------------YH---------------------------------------PEGQQSDNSL-------------ENLIKTIADTTPSFQQDV

Query:  RN--------------------------------SFEINVPLLKDMERIPRYARFMKELCNTKPETKERGRIEVSKTVSTLLQSNMPEKFGDPGLLYIPC
        R+                                + EIN+PLL  +++IPRYA+F+KELC  K +     R+ V + VS +LQ  MP K  D G+  IPC
Subjt:  RN--------------------------------SFEINVPLLKDMERIPRYARFMKELCNTKPETKERGRIEVSKTVSTLLQSNMPEKFGDPGLLYIPC

Query:  VIGSKSIEYAMLDLSASFNVMPYSVYHDLKLNDLRTTGNVFQLADRSYMHPLGVVEDVIV-----------------QRFSPSVASILLGRTFVKSARTM
         IG   I+ AM DL AS NVMPYS+Y  L    L  TG + QLADRS MHP GV+EDV+V                 +  +P  + +LLGR F+ +A T 
Subjt:  VIGSKSIEYAMLDLSASFNVMPYSVYHDLKLNDLRTTGNVFQLADRSYMHPLGVVEDVIV-----------------QRFSPSVASILLGRTFVKSARTM

Query:  INLHRDVLSIEYQEEIVKVNIPQVAKYPDVSASLYHVEIVDSVVHDVVVSDMINMGLEHDTQDVD-IDETGELVPDSSPDLISFDLGISVTSGEMLPSVV
        I++    L++E+  EIVK N+     +P    S+  ++I+DS+V +   S      +  D  ++D I+   EL+   SP+             ++LPSV+
Subjt:  INLHRDVLSIEYQEEIVKVNIPQVAKYPDVSASLYHVEIVDSVVHDVVVSDMINMGLEHDTQDVD-IDETGELVPDSSPDLISFDLGISVTSGEMLPSVV

Query:  QAPKLELKALPEHLKYVYLGEGNRLPVIISKKLTVGQEERLVNMLREHKKAIGWTLAYIIGIDPATCMHRIQLEEDAKPSREPQRRMNPILREVVKKEID
        Q P+LELK LPEHLKY +LG+GN LPVIIS +L+  +EE LV++L+ HK+AIGWT+A + GI P TC HRI LEED KP RE QRR+NP + EVVK EI 
Subjt:  QAPKLELKALPEHLKYVYLGEGNRLPVIISKKLTVGQEERLVNMLREHKKAIGWTLAYIIGIDPATCMHRIQLEEDAKPSREPQRRMNPILREVVKKEID

Query:  KLQAACIIYSISDSKWASIKSLLHWKIKRKQLLPVPLERMPLEGCPLDCVMPQHFMADHGIVLGHVISERGIEVDKAKIDVIANLPYPTNVREVRSFLGH
        KL  A +IY ISDS+W S                 P++ +P +      V  +    D G++LGHV+S +GIEVDKAKID+I +LPYP+ VRE+RSFLGH
Subjt:  KLQAACIIYSISDSKWASIKSLLHWKIKRKQLLPVPLERMPLEGCPLDCVMPQHFMADHGIVLGHVISERGIEVDKAKIDVIANLPYPTNVREVRSFLGH

Query:  AGFYRRFIKDFSKTALPMTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV
        AGFYRRFIK+FSK A P+  LLQKD  F+FG +C+ AFD LKK L SAPI+ PP W   FEIMCDAS  +V
Subjt:  AGFYRRFIKDFSKTALPMTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV

A0A6P6TC26 uncharacterized protein LOC113699861 (e value: 1.2e-125; % identity: 47.37)
Query:  EINVPLLKDMERIPRYARFMKELCNTKPETKERGRIEVSKTVSTLLQSNMPEKFGDPGLLYIPCVIGSKSIEYAMLDLSASFNVMPYSVYHDLKLNDLRT
        EIN+PLL  ++++PRYA+F+K+LC  + + +   RI V + VS +LQ  +P K GDPG+  IPC IG+ SI  AMLDL AS NVMP ++Y  L L  L+ 
Subjt:  EINVPLLKDMERIPRYARFMKELCNTKPETKERGRIEVSKTVSTLLQSNMPEKFGDPGLLYIPCVIGSKSIEYAMLDLSASFNVMPYSVYHDLKLNDLRT

Query:  TGNVFQLADRSYMHPLGVVEDVIVQ-----------------RFSPSVASILLGRTFVKSARTMINLHRDVLSIEYQEEIVKVNIPQVAKYPDVSASLYH
        TG + QLADR+  +P GV+EDV+VQ                   SP+ + ILLGR F+ +ART I++ +  L++E+  EIV  +I +  KYP  S +++ 
Subjt:  TGNVFQLADRSYMHPLGVVEDVIVQ-----------------RFSPSVASILLGRTFVKSARTMINLHRDVLSIEYQEEIVKVNIPQVAKYPDVSASLYH

Query:  VEIVDSVVHDVVVSDMINMGLEHDTQDVDIDETGELVPD--------------SSPDLISFDLG---ISVTSGEMLPSVVQAPKLELKALPEHLKYVYLG
        V I+D VV +V   D  +      T+ +D++   E+  D               SP  + +D+    +     ++LPS+VQAP++ELK LPEHLKY +LG
Subjt:  VEIVDSVVHDVVVSDMINMGLEHDTQDVDIDETGELVPD--------------SSPDLISFDLG---ISVTSGEMLPSVVQAPKLELKALPEHLKYVYLG

Query:  EGNRLPVIISKKLTVGQEERLVNMLREHKKAIGWTLAYIIGIDPATCMHRIQLEEDAKPSREPQRRMNPILREVVKKEIDKLQAACIIYSISDSKWASIK
        E   LPVIIS KL+  +E++L+ +LREHK+AIGWT+A I GI P+ CMHRI+LEEDA+P R+PQRR+NPI+ EVVKKE+ KL    II+SISDS W S  
Subjt:  EGNRLPVIISKKLTVGQEERLVNMLREHKKAIGWTLAYIIGIDPATCMHRIQLEEDAKPSREPQRRMNPILREVVKKEIDKLQAACIIYSISDSKWASIK

Query:  SLLHWKIKRKQLLPVPLERMPLEGCPLDCVMPQHFMADHGIVLGHVISERGIEVDKAKIDVIANLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTN
                            P++  P    +      +H   L  +    GIEVDKAKIDVI+ LPYP  VREVRSFLGHAGFYRRFIKDFSK   P+  
Subjt:  SLLHWKIKRKQLLPVPLERMPLEGCPLDCVMPQHFMADHGIVLGHVISERGIEVDKAKIDVIANLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTN

Query:  LLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV
        LLQKDV+FEF  EC+ AF+ LKK+LTS P+I PP W+L FEIMCDAS+YAV
Subjt:  LLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV

A0A6P6X561 uncharacterized protein LOC113739015 (e value: 1.7e-124; % identity: 46.75)
Query:  EINVPLLKDMERIPRYARFMKELCNTKPETKERGRIEVSKTVSTLLQSNMPEKFGDPGLLYIPCVIGSKSIEYAMLDLSASFNVMPYSVYHDLKLNDLRT
        EIN+PLL  ++++PRYA+F+K+LC  + + +   RI V + VS +LQ  +P K GDPG+  IPC IG+ SI  AMLDL AS NVMP ++Y  L L  L+ 
Subjt:  EINVPLLKDMERIPRYARFMKELCNTKPETKERGRIEVSKTVSTLLQSNMPEKFGDPGLLYIPCVIGSKSIEYAMLDLSASFNVMPYSVYHDLKLNDLRT

Query:  TGNVFQLADRSYMHPLGVVEDVIVQ-----------------RFSPSVASILLGRTFVKSARTMINLHRDVLSIEYQEEIVKVNIPQVAKYPDVSASLYH
        TG + QLADR+  +P GV+EDV+VQ                   SP+ + ILLGR F+ +ART I++ +  L++E+  EIV  +I +  KYP  S +++ 
Subjt:  TGNVFQLADRSYMHPLGVVEDVIVQ-----------------RFSPSVASILLGRTFVKSARTMINLHRDVLSIEYQEEIVKVNIPQVAKYPDVSASLYH

Query:  VEIVDSVVHDVVV---SDMINMGLEHDTQDVDIDETGELVPD--------------SSPDLISFDLG---ISVTSGEMLPSVVQAPKLELKALPEHLKYV
        V I+D +V +V      D + + +   T+ +D++   E+  D               SP  + +D+    +     ++LPS+VQAP++ELK LPEHLKY 
Subjt:  VEIVDSVVHDVVV---SDMINMGLEHDTQDVDIDETGELVPD--------------SSPDLISFDLG---ISVTSGEMLPSVVQAPKLELKALPEHLKYV

Query:  YLGEGNRLPVIISKKLTVGQEERLVNMLREHKKAIGWTLAYIIGIDPATCMHRIQLEEDAKPSREPQRRMNPILREVVKKEIDKLQAACIIYSISDSKWA
        +LGE   LPVIIS KL+  +E++L+ +LREHK+AIGWT+A I GI P+ CMHRI+LEEDA+P R+PQRR+NPI+ EVVKKE+ KL    II+SISDS W 
Subjt:  YLGEGNRLPVIISKKLTVGQEERLVNMLREHKKAIGWTLAYIIGIDPATCMHRIQLEEDAKPSREPQRRMNPILREVVKKEIDKLQAACIIYSISDSKWA

Query:  SIKSLLHWKIKRKQLLPVPLERMPLEGCPLDCVMPQHFMADHGIVLGHVISERGIEVDKAKIDVIANLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALP
        S                      P++  P    +      +H   L  +    GIEVDKAKIDVI+ LPYP  VREVRSFLGHAGFYRRFIKDFSK   P
Subjt:  SIKSLLHWKIKRKQLLPVPLERMPLEGCPLDCVMPQHFMADHGIVLGHVISERGIEVDKAKIDVIANLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALP

Query:  MTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV
        +  LLQKDV+FEF  +C+ AF+ LKK+LTS P+I PP W+L FEIMCDAS+YAV
Subjt:  MTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV

A0A6P8D4X0 Reverse transcriptase (e value: 2.9e-124; % identity: 40.12)
Query:  QQDVRNSF---EINVPLLKDMERIPRYARFMKELCNTKPETKERGRIEVSKTVSTLLQSNMPEKFGDPGLLYIPCVIGSKSIEYAMLDLSASFNVMPYSV
        +Q++  +F   E+N+PLL  ++++PRYA+F+KELC  K +  +  +I V +  S ++Q  +P+K  DPG+  IPC IG K IE AMLDL AS NVMP S+
Subjt:  QQDVRNSF---EINVPLLKDMERIPRYARFMKELCNTKPETKERGRIEVSKTVSTLLQSNMPEKFGDPGLLYIPCVIGSKSIEYAMLDLSASFNVMPYSV

Query:  YHDLKLNDLRTTGNVFQLADRSYMHPLGVVEDVIVQ-----------------RFSPSVASILLGRTFVKSARTMINLHRDVLSIEYQEEIVKVNIPQVA
        Y  L L  L+ T  + QLADRS  +P G++EDV+++                 + + + + ILLGR F+K+ART I++H   LS+E+ +E +  NI    
Subjt:  YHDLKLNDLRTTGNVFQLADRSYMHPLGVVEDVIVQ-----------------RFSPSVASILLGRTFVKSARTMINLHRDVLSIEYQEEIVKVNIPQVA

Query:  KYPDVSASLYHVEIVDSVVHDVV----VSDMINMGLEHDTQ-DVDIDETG---------ELVPDSSPDLISFDLGISVTSGEMLPSVVQAPKLELKALPE
        ++PD   S+  V+++DS V DV     V + ++  ++ +   D   +E G         + + D S    S  L +  ++  +LPS+VQAPKLELK LPE
Subjt:  KYPDVSASLYHVEIVDSVVHDVV----VSDMINMGLEHDTQ-DVDIDETG---------ELVPDSSPDLISFDLGISVTSGEMLPSVVQAPKLELKALPE

Query:  HLKYVYLGEGNRLPVIISKKLTVGQEERLVNMLREHKKAIGWTLAYIIGIDPATCMHRIQLEEDAKPSREPQRRMNPILREVVKKEIDKLQAACIIYSIS
        +LKYVYLGE   LPVIISK+LT  QEE+L+ +L+E++ AIGWTLA I GI P+ CMHRI LE+DA+P R+PQR++NP ++EVV KEI KL    IIY IS
Subjt:  HLKYVYLGEGNRLPVIISKKLTVGQEERLVNMLREHKKAIGWTLAYIIGIDPATCMHRIQLEEDAKPSREPQRRMNPILREVVKKEIDKLQAACIIYSIS

Query:  DSKWASIKSLL-----------------------HWKI----------KRKQLLPVP-----LERMP-------LEG--------------------CPL
        DS+W S   ++                        W++           RK   P+P     LER+        L+G                    CP 
Subjt:  DSKWASIKSLL-----------------------HWKI----------KRKQLLPVP-----LERMP-------LEG--------------------CPL

Query:  D--------------------CVM-------------------------------------------------PQHFMADHGIVLGHVISERGIEVDKAK
                             C+M                                                   HFM  HGIVLGHVIS RGIEVDK+K
Subjt:  D--------------------CVM-------------------------------------------------PQHFMADHGIVLGHVISERGIEVDKAK

Query:  IDVIANLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV
        +D+I NLPYP+N+R +RSFLGHAGFYRRFIKDFSK A P+ +LLQKD  F FG  C+ AFD LK++LTSAPII PP W L FEIM DAS+YA+
Subjt:  IDVIANLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV

A0A6P8DJV3 Reverse transcriptase (e value: 2.9e-124; % identity: 40.12)
Query:  QQDVRNSF---EINVPLLKDMERIPRYARFMKELCNTKPETKERGRIEVSKTVSTLLQSNMPEKFGDPGLLYIPCVIGSKSIEYAMLDLSASFNVMPYSV
        +Q++  +F   E+N+PLL  ++++PRYA+F+KELC  K +  +  +I V +  S ++Q  +P+K  DPG+  IPC IG K IE AMLDL AS NVMP S+
Subjt:  QQDVRNSF---EINVPLLKDMERIPRYARFMKELCNTKPETKERGRIEVSKTVSTLLQSNMPEKFGDPGLLYIPCVIGSKSIEYAMLDLSASFNVMPYSV

Query:  YHDLKLNDLRTTGNVFQLADRSYMHPLGVVEDVIVQ-----------------RFSPSVASILLGRTFVKSARTMINLHRDVLSIEYQEEIVKVNIPQVA
        Y  L L  L+ T  + QLADRS  +P G++EDV+++                 + + + + ILLGR F+K+ART I++H   LS+E+ +E +  NI    
Subjt:  YHDLKLNDLRTTGNVFQLADRSYMHPLGVVEDVIVQ-----------------RFSPSVASILLGRTFVKSARTMINLHRDVLSIEYQEEIVKVNIPQVA

Query:  KYPDVSASLYHVEIVDSVVHDVV----VSDMINMGLEHDTQ-DVDIDETG---------ELVPDSSPDLISFDLGISVTSGEMLPSVVQAPKLELKALPE
        ++PD   S+  V+++DS V DV     V + ++  ++ +   D   +E G         + + D S    S  L +  ++  +LPS+VQAPKLELK LPE
Subjt:  KYPDVSASLYHVEIVDSVVHDVV----VSDMINMGLEHDTQ-DVDIDETG---------ELVPDSSPDLISFDLGISVTSGEMLPSVVQAPKLELKALPE

Query:  HLKYVYLGEGNRLPVIISKKLTVGQEERLVNMLREHKKAIGWTLAYIIGIDPATCMHRIQLEEDAKPSREPQRRMNPILREVVKKEIDKLQAACIIYSIS
        +LKYVYLGE   LPVIISK+LT  QEE+L+ +L+E++ AIGWTLA I GI P+ CMHRI LE+DA+P R+PQR++NP ++EVV KEI KL    IIY IS
Subjt:  HLKYVYLGEGNRLPVIISKKLTVGQEERLVNMLREHKKAIGWTLAYIIGIDPATCMHRIQLEEDAKPSREPQRRMNPILREVVKKEIDKLQAACIIYSIS

Query:  DSKWASIKSLL-----------------------HWKI----------KRKQLLPVP-----LERMP-------LEG--------------------CPL
        DS+W S   ++                        W++           RK   P+P     LER+        L+G                    CP 
Subjt:  DSKWASIKSLL-----------------------HWKI----------KRKQLLPVP-----LERMP-------LEG--------------------CPL

Query:  D--------------------CVM-------------------------------------------------PQHFMADHGIVLGHVISERGIEVDKAK
                             C+M                                                   HFM  HGIVLGHVIS RGIEVDK+K
Subjt:  D--------------------CVM-------------------------------------------------PQHFMADHGIVLGHVISERGIEVDKAK

Query:  IDVIANLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV
        +D+I NLPYP+N+R +RSFLGHAGFYRRFIKDFSK A P+ +LLQKD  F FG  C+ AFD LK++LTSAPII PP W L FEIM DAS+YA+
Subjt:  IDVIANLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV

SwissProt top hits (e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 4.0e-17; % identity: 30.69)
Query:  QRRMNPILREVVKKEIDKLQAACIIYSISDSKWASIKSLLHWKIKRKQLLPVPLERMPLEGCPLDCVMPQHFMADHGIVLGHVISERGIEVDKAKIDVIA
        QR MN ILR ++ K         I++S S  +      L+  K+ +  L      ++ L+ C         F+      LGHV++  GI+ +  KI+ I 
Subjt:  QRRMNPILREVVKKEIDKLQAACIIYSISDSKWASIKSLLHWKIKRKQLLPVPLERMPLEGCPLDCVMPQHFMADHGIVLGHVISERGIEVDKAKIDVIA

Query:  NLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQKDVTFE-FGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV
          P PT  +E+++FLG  G+YR+FI +F+  A PMT  L+K++  +    E  +AF  LK +++  PI+  P +   F +  DAS+ A+
Subjt:  NLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQKDVTFE-FGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV

P10394 Retrovirus-related Pol polyprotein from transposon 412 (e value: 1.8e-14; % identity: 39.81)
Query:  LGHVISERGIEVDKAKIDVIANLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEI
        LGH  +++GI  D  K DVI N P P +    R F+    +YRRFIK+F+  +  +T L +K+V FE+  ECQ AF  LK  L +  ++  P +   F I
Subjt:  LGHVISERGIEVDKAKIDVIANLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEI

Query:  MCDASNYA
          DAS  A
Subjt:  MCDASNYA

P20825 Retrovirus-related Pol polyprotein from transposon 297 (e value: 3.3e-16; % identity: 30.69)
Query:  QRRMNPILREVVKKEIDKLQAACIIYSISDSKWASIKSLLHWKIKRKQLLPVPLERMPLEGCPLDCVMPQHFMADHGIVLGHVISERGIEVDKAKIDVIA
        QR MN ILR ++ K         II+S S ++  +   L+  K+    L      ++ L+ C         F+      LGH+++  GI+ +  K+  I 
Subjt:  QRRMNPILREVVKKEIDKLQAACIIYSISDSKWASIKSLLHWKIKRKQLLPVPLERMPLEGCPLDCVMPQHFMADHGIVLGHVISERGIEVDKAKIDVIA

Query:  NLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQKDVTFEF-GAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV
        + P PT  +E+R+FLG  G+YR+FI +++  A PMT+ L+K    +    E   AF+ LK ++   PI+  P ++  F +  DASN A+
Subjt:  NLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQKDVTFEF-GAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV

P92523 Uncharacterized mitochondrial protein AtMg00860 (e value: 4.5e-13; % identity: 34.26)
Query:  HVISERGIEVDKAKIDVIANLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMC
        H+IS  G+  D AK++ +   P P N  E+R FLG  G+YRRF+K++ K   P+T LL+K+ + ++      AF  LK  +T+ P++  P   L F    
Subjt:  HVISERGIEVDKAKIDVIANLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMC

Query:  DASNYAVY
           N++ +
Subjt:  DASNYAVY

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus (e value: 1.3e-15; % identity: 29.65)
Query:  QRRMNPILREVVKKEIDKLQAACIIYSISDSKWASIKSLLHWKIKRKQLLPVPLERMPLEGCPLDCVMPQHFMADHGIVLGHVISERGIEVDKAKIDVIA
        QR ++ ILRE + K        C +Y I D    S     HWK  R  L  +    + +       +   HF+      LG++++  GI+ D  K+  I+
Subjt:  QRRMNPILREVVKKEIDKLQAACIIYSISDSKWASIKSLLHWKIKRKQLLPVPLERMPLEGCPLDCVMPQHFMADHGIVLGHVISERGIEVDKAKIDVIA

Query:  NLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQ-----------KDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV
         +P PT+V+E++ FLG   +YR+FI+D++K A P+TNL +             V          +F+ LK +L S+ I+  P +   F +  DASN+A+
Subjt:  NLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQ-----------KDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV

Arabidopsis top hits (e value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein (e value: 3.2e-14; % identity: 34.26)
Query:  HVISERGIEVDKAKIDVIANLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMC
        H+IS  G+  D AK++ +   P P N  E+R FLG  G+YRRF+K++ K   P+T LL+K+ + ++      AF  LK  +T+ P++  P   L F    
Subjt:  HVISERGIEVDKAKIDVIANLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMC

Query:  DASNYAVY
           N++ +
Subjt:  DASNYAVY
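
The % identity values listed with each hit are reported by the database. As a rough illustration of how such a figure relates to an alignment, the sketch below counts identical positions in a BLAST-style midline, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. It uses only the final eight-column fragment of the P10394 alignment in this record, so its result is not the whole-alignment figure reported for that hit; `midline_identity` is a hypothetical helper name.

```python
def midline_identity(query, midline):
    """Return (identities, positives, columns) for one aligned segment."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same number of columns")
    # A letter in the midline means the query and subject residues are identical.
    identities = sum(1 for ch in midline if ch.isalpha())
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Final fragment of the P10394 alignment in this record.
query = "MCDASNYA"
midline = "  DAS  A"
ident, pos, cols = midline_identity(query, midline)
print(f"{ident}/{cols} identical ({100 * ident / cols:.1f}%), {pos}/{cols} positive")
```

Summing identities and columns over every segment of a hit, rather than one fragment, would give a figure comparable to the reported % identity.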


Sequences
CDS sequence
ATGGAGGATTTGACGGATCAACATTTTGAGGATTATGGAGAGATTCGTATACATGATTTAGCAGTTTGTACTTTAGAAAGGAAAAATGAAAAAGAAGTGTCTAGGTTAGA
GAGGGTTTTTGAGTCATTAGATTTGAATGACAGGAAGGCTCTTCCTATTAAGCCATCCCAAATTGAGGCTCCTACTTTAGATTTGAAACCTTTACCAGATCATGTAAAGT
ATGCATATCTTGGGGAAGGTGAAACGTTGCCTATTATTGTTGCATCTGATTTAGTGTTGGAGGATGAAGAGGCCTTAGTTAAGTTGATGCAGCAATACAACAAGGCAATA
GATTGTAGGAAGGCTTTCGAGACTTTAAAGGTTGCTTTGATTTCAACACCCATTCTTTGTGCACCAAACTGGAGTTTTCCGTTCGAGGTGATGTGTGATGCCAGTTATGT
GGCAGTAGATGCAATGCTGGGGAAAAGCAGGGAGCTTGACTTAGAGATAAAGGACAAGAAGGGATCAGCAAATGTTATTGCAGATTATTTGTCTCGTCTTGATCCATCAT
CATCTTTGCTCGAGCAATCTGTCATCTCAGATTCATTTCCAGATGAACAACTCTTTGTTGTTGATATAAAGGTAGTGCACAGTGATGAAGCAAAGGAAATCCTGGAGCAA
TGTCACTCTTCCCGTATGGAGGTCATTTCAACGGTCAAAGGACAGCTATTAGGGTTTTACAATGTGGATTTTTCTGGGCTTCTTTATTTAAGGTTGCGTACTGTTAACTA
CGTGTCCAAGTGGGTGGAGGTCATTGCATGTCGTCATAATGATGCTAAAATGGTGCCAAGGTTTCTTCAGTCGCACATTTTTGCGCGGTTTGGGATACCTAGGGCTCTTG
TGAGCGATGACGGTACACACTTTCTGAATAATGTTTTAGCTAAGCTTTTAGCTAAATATGGAGTTAAGCATAGGATAGCTACCCCTTATCACCCACAAGCAAATGGTCAG
GCTGAAGTTAGTAATATGGTCACCTATTGTTCCAAGGATTCTATGATGGTATTGTTCCCAAATAGTAGGATGTGGATCGACGCAGCTAGTGAAGGCTCTATAATGAACAA
GTCACCAAAGGAAGTGAGAGATATCATAGTTAACCTTGCAGAGAGTGAACGCCAGAGTAGCATTAAACAAGATGGCCCTATAGCGTCCTTAGCTAAAGAGGTAAAGAATT
TTGAGGTCAAGTTAGACAAGCTAATTGAACTGGTTCATTCTATTGCAGATAGACTTAAGGTGAGGGAAGACTGTAGCAGTTGTACAACCATTAGTGATTCATCAGACTAC
CACCCCGAGGGTCAGCAGTCAGATAATTCCTTAGAGAATCTCATTAAAACTATAGCTGATACCACTCCATCTTTTCAGCAGGATGTGCGAAATAGCTTCGAGATTAACGT
ACCCCTGCTAAAAGACATGGAACGTATTCCTAGGTACGCGCGGTTTATGAAAGAGTTGTGTAATACCAAACCAGAGACGAAAGAGCGAGGAAGAATCGAGGTAAGTAAGA
CCGTTTCAACACTCTTACAAAGTAACATGCCAGAGAAGTTTGGAGATCCAGGTTTGCTTTACATACCTTGTGTAATAGGGAGTAAGAGTATAGAATATGCCATGCTTGAT
CTGAGTGCATCTTTCAATGTCATGCCTTACTCCGTCTATCATGATCTTAAATTGAATGACTTACGAACTACTGGTAATGTGTTTCAGTTAGCTGATAGATCTTATATGCA
CCCTTTAGGGGTTGTAGAGGATGTTATAGTTCAGAGGTTTTCCCCTAGTGTTGCATCTATTTTGTTGGGGAGAACGTTTGTGAAATCAGCTAGGACCATGATAAATCTTC
ATAGGGATGTATTGTCTATAGAGTATCAGGAGGAGATTGTTAAGGTCAATATTCCACAAGTAGCTAAATACCCTGATGTTTCTGCTTCCCTTTACCATGTGGAGATAGTT
GACTCTGTAGTACATGATGTCGTAGTGAGTGACATGATTAATATGGGTTTGGAGCACGACACCCAAGATGTTGACATTGATGAAACAGGGGAGTTGGTTCCTGATTCTTC
CCCTGATCTTATTTCCTTTGATCTTGGTATATCTGTTACTTCTGGTGAAATGTTGCCCTCTGTTGTGCAAGCACCCAAGTTAGAGCTGAAGGCGTTACCAGAGCATTTAA
AGTATGTCTATCTAGGGGAAGGTAATAGATTACCAGTTATTATTTCTAAAAAGTTAACTGTAGGTCAAGAGGAACGACTGGTAAACATGCTGAGAGAGCATAAGAAAGCC
ATTGGGTGGACCTTAGCTTACATCATTGGGATCGACCCAGCCACTTGCATGCATAGGATTCAGTTGGAGGAGGACGCAAAACCGTCAAGGGAGCCTCAGAGGCGCATGAA
TCCAATTCTGAGAGAAGTGGTAAAGAAAGAGATCGATAAGCTTCAAGCTGCATGTATTATCTACTCCATTTCTGATAGTAAATGGGCTTCTATCAAATCCCTATTGCACT
GGAAGATCAAGAGAAAACAACTTTTACCTGTCCCTTTGGAACGTATGCCTTTAGAAGGATGCCCTTTGGATTGTGTAATGCCCCAGCATTTCATGGCTGACCATGGAATT
GTTTTAGGACATGTGATTTCTGAGCGAGGCATAGAGGTTGATAAGGCTAAAATTGATGTTATAGCTAACCTACCCTACCCTACGAACGTTCGGGAGGTTAGATCTTTTCT
TGGCCATGCAGGTTTCTATCGCAGGTTTATTAAAGACTTCAGCAAGACTGCATTGCCCATGACGAACTTATTGCAGAAGGACGTCACCTTCGAGTTTGGGGCAGAGTGCC
AGGCAGCATTTGACCTTCTGAAGAAGATGCTGACTAGTGCACCGATTATCCATCCTCCTAGGTGGGACCTTACCTTCGAAATCATGTGTGATGCAAGCAACTACGCTGTT
TATGGTTTTAGTTGA
mRNA sequence
ATGGAGGATTTGACGGATCAACATTTTGAGGATTATGGAGAGATTCGTATACATGATTTAGCAGTTTGTACTTTAGAAAGGAAAAATGAAAAAGAAGTGTCTAGGTTAGA
GAGGGTTTTTGAGTCATTAGATTTGAATGACAGGAAGGCTCTTCCTATTAAGCCATCCCAAATTGAGGCTCCTACTTTAGATTTGAAACCTTTACCAGATCATGTAAAGT
ATGCATATCTTGGGGAAGGTGAAACGTTGCCTATTATTGTTGCATCTGATTTAGTGTTGGAGGATGAAGAGGCCTTAGTTAAGTTGATGCAGCAATACAACAAGGCAATA
GATTGTAGGAAGGCTTTCGAGACTTTAAAGGTTGCTTTGATTTCAACACCCATTCTTTGTGCACCAAACTGGAGTTTTCCGTTCGAGGTGATGTGTGATGCCAGTTATGT
GGCAGTAGATGCAATGCTGGGGAAAAGCAGGGAGCTTGACTTAGAGATAAAGGACAAGAAGGGATCAGCAAATGTTATTGCAGATTATTTGTCTCGTCTTGATCCATCAT
CATCTTTGCTCGAGCAATCTGTCATCTCAGATTCATTTCCAGATGAACAACTCTTTGTTGTTGATATAAAGGTAGTGCACAGTGATGAAGCAAAGGAAATCCTGGAGCAA
TGTCACTCTTCCCGTATGGAGGTCATTTCAACGGTCAAAGGACAGCTATTAGGGTTTTACAATGTGGATTTTTCTGGGCTTCTTTATTTAAGGTTGCGTACTGTTAACTA
CGTGTCCAAGTGGGTGGAGGTCATTGCATGTCGTCATAATGATGCTAAAATGGTGCCAAGGTTTCTTCAGTCGCACATTTTTGCGCGGTTTGGGATACCTAGGGCTCTTG
TGAGCGATGACGGTACACACTTTCTGAATAATGTTTTAGCTAAGCTTTTAGCTAAATATGGAGTTAAGCATAGGATAGCTACCCCTTATCACCCACAAGCAAATGGTCAG
GCTGAAGTTAGTAATATGGTCACCTATTGTTCCAAGGATTCTATGATGGTATTGTTCCCAAATAGTAGGATGTGGATCGACGCAGCTAGTGAAGGCTCTATAATGAACAA
GTCACCAAAGGAAGTGAGAGATATCATAGTTAACCTTGCAGAGAGTGAACGCCAGAGTAGCATTAAACAAGATGGCCCTATAGCGTCCTTAGCTAAAGAGGTAAAGAATT
TTGAGGTCAAGTTAGACAAGCTAATTGAACTGGTTCATTCTATTGCAGATAGACTTAAGGTGAGGGAAGACTGTAGCAGTTGTACAACCATTAGTGATTCATCAGACTAC
CACCCCGAGGGTCAGCAGTCAGATAATTCCTTAGAGAATCTCATTAAAACTATAGCTGATACCACTCCATCTTTTCAGCAGGATGTGCGAAATAGCTTCGAGATTAACGT
ACCCCTGCTAAAAGACATGGAACGTATTCCTAGGTACGCGCGGTTTATGAAAGAGTTGTGTAATACCAAACCAGAGACGAAAGAGCGAGGAAGAATCGAGGTAAGTAAGA
CCGTTTCAACACTCTTACAAAGTAACATGCCAGAGAAGTTTGGAGATCCAGGTTTGCTTTACATACCTTGTGTAATAGGGAGTAAGAGTATAGAATATGCCATGCTTGAT
CTGAGTGCATCTTTCAATGTCATGCCTTACTCCGTCTATCATGATCTTAAATTGAATGACTTACGAACTACTGGTAATGTGTTTCAGTTAGCTGATAGATCTTATATGCA
CCCTTTAGGGGTTGTAGAGGATGTTATAGTTCAGAGGTTTTCCCCTAGTGTTGCATCTATTTTGTTGGGGAGAACGTTTGTGAAATCAGCTAGGACCATGATAAATCTTC
ATAGGGATGTATTGTCTATAGAGTATCAGGAGGAGATTGTTAAGGTCAATATTCCACAAGTAGCTAAATACCCTGATGTTTCTGCTTCCCTTTACCATGTGGAGATAGTT
GACTCTGTAGTACATGATGTCGTAGTGAGTGACATGATTAATATGGGTTTGGAGCACGACACCCAAGATGTTGACATTGATGAAACAGGGGAGTTGGTTCCTGATTCTTC
CCCTGATCTTATTTCCTTTGATCTTGGTATATCTGTTACTTCTGGTGAAATGTTGCCCTCTGTTGTGCAAGCACCCAAGTTAGAGCTGAAGGCGTTACCAGAGCATTTAA
AGTATGTCTATCTAGGGGAAGGTAATAGATTACCAGTTATTATTTCTAAAAAGTTAACTGTAGGTCAAGAGGAACGACTGGTAAACATGCTGAGAGAGCATAAGAAAGCC
ATTGGGTGGACCTTAGCTTACATCATTGGGATCGACCCAGCCACTTGCATGCATAGGATTCAGTTGGAGGAGGACGCAAAACCGTCAAGGGAGCCTCAGAGGCGCATGAA
TCCAATTCTGAGAGAAGTGGTAAAGAAAGAGATCGATAAGCTTCAAGCTGCATGTATTATCTACTCCATTTCTGATAGTAAATGGGCTTCTATCAAATCCCTATTGCACT
GGAAGATCAAGAGAAAACAACTTTTACCTGTCCCTTTGGAACGTATGCCTTTAGAAGGATGCCCTTTGGATTGTGTAATGCCCCAGCATTTCATGGCTGACCATGGAATT
GTTTTAGGACATGTGATTTCTGAGCGAGGCATAGAGGTTGATAAGGCTAAAATTGATGTTATAGCTAACCTACCCTACCCTACGAACGTTCGGGAGGTTAGATCTTTTCT
TGGCCATGCAGGTTTCTATCGCAGGTTTATTAAAGACTTCAGCAAGACTGCATTGCCCATGACGAACTTATTGCAGAAGGACGTCACCTTCGAGTTTGGGGCAGAGTGCC
AGGCAGCATTTGACCTTCTGAAGAAGATGCTGACTAGTGCACCGATTATCCATCCTCCTAGGTGGGACCTTACCTTCGAAATCATGTGTGATGCAAGCAACTACGCTGTT
TATGGTTTTAGTTGA
Protein sequence
MEDLTDQHFEDYGEIRIHDLAVCTLERKNEKEVSRLERVFESLDLNDRKALPIKPSQIEAPTLDLKPLPDHVKYAYLGEGETLPIIVASDLVLEDEEALVKLMQQYNKAI
DCRKAFETLKVALISTPILCAPNWSFPFEVMCDASYVAVDAMLGKSRELDLEIKDKKGSANVIADYLSRLDPSSSLLEQSVISDSFPDEQLFVVDIKVVHSDEAKEILEQ
CHSSRMEVISTVKGQLLGFYNVDFSGLLYLRLRTVNYVSKWVEVIACRHNDAKMVPRFLQSHIFARFGIPRALVSDDGTHFLNNVLAKLLAKYGVKHRIATPYHPQANGQ
AEVSNMVTYCSKDSMMVLFPNSRMWIDAASEGSIMNKSPKEVRDIIVNLAESERQSSIKQDGPIASLAKEVKNFEVKLDKLIELVHSIADRLKVREDCSSCTTISDSSDY
HPEGQQSDNSLENLIKTIADTTPSFQQDVRNSFEINVPLLKDMERIPRYARFMKELCNTKPETKERGRIEVSKTVSTLLQSNMPEKFGDPGLLYIPCVIGSKSIEYAMLD
LSASFNVMPYSVYHDLKLNDLRTTGNVFQLADRSYMHPLGVVEDVIVQRFSPSVASILLGRTFVKSARTMINLHRDVLSIEYQEEIVKVNIPQVAKYPDVSASLYHVEIV
DSVVHDVVVSDMINMGLEHDTQDVDIDETGELVPDSSPDLISFDLGISVTSGEMLPSVVQAPKLELKALPEHLKYVYLGEGNRLPVIISKKLTVGQEERLVNMLREHKKA
IGWTLAYIIGIDPATCMHRIQLEEDAKPSREPQRRMNPILREVVKKEIDKLQAACIIYSISDSKWASIKSLLHWKIKRKQLLPVPLERMPLEGCPLDCVMPQHFMADHGI
VLGHVISERGIEVDKAKIDVIANLPYPTNVREVRSFLGHAGFYRRFIKDFSKTALPMTNLLQKDVTFEFGAECQAAFDLLKKMLTSAPIIHPPRWDLTFEIMCDASNYAV
YGFS
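
As a consistency check between the CDS and protein sequences in this record, the sketch below translates the first ten codons and the last five codons of the CDS and compares them with the two ends of the protein sequence. Only those two end fragments are copied from the record. The use of the standard genetic code (NCBI translation table 1) is an assumption, since the record does not say which table was applied, and `translate` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard genetic code); '*' is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string codon by codon."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


cds_start = "ATGGAGGATTTGACGGATCAACATTTTGAG"  # first 30 nt of the CDS
cds_end = "TATGGTTTTAGTTGA"                    # last 15 nt of the CDS

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MEDLTDQHFE` and `YGFS*`, which agree with the first ten and last four residues of the protein sequence, followed by a stop codon.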