; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0025448 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0025448
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr05:8540921..8545503
RNA-Seq ExpressionPI0025448
SyntenyPI0025448
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026071.1 uncharacterized protein E6C27_scaffold581G00620 [Cucumis melo var. makuwa]2.8e-14048.21Show/hide
Query:  VDEQCFVFMPSNVDRRVLWRRLVETTSRWSIPSVVMGDFNAIRVPSKACGGSPVTSEMEEFDLAIREADLVEPSVQGNWFTWTSKDPGSVERSR------
        V+  C     SN++RR+LW RLVE TS WS P VVM DFNAIRV S+A  GSP+  EME+F+LAIR+ADLVEPSVQGNWFTWTSK  GS    R      
Subjt:  VDEQCFVFMPSNVDRRVLWRRLVETTSRWSIPSVVMGDFNAIRVPSKACGGSPVTSEMEEFDLAIREADLVEPSVQGNWFTWTSKDPGSVERSR------

Query:  -DAGLET--------------------------------------------------EAFWSA-------------IRQEKAFLCQK-SRGIRVTSHEGL
         D  L T                                                     WS              +   K  L ++  R I+  S E  
Subjt:  -DAGLET--------------------------------------------------EAFWSA-------------IRQEKAFLCQK-SRGIRVTSHEGL

Query:  V----------QMAVNFFRNSLGSQGVGYKELTPLLEEVVQFRWSEECCSALLAPIRRDEIRRTCYLPP-------------GVNATTITLIPKHCEAER
        +          +MAVN+FRNSLGSQ +GY+EL+P+++++VQF+WSEECC AL  PI R+E+RR  +                G+NAT ITLIPKH  AER
Subjt:  V----------QMAVNFFRNSLGSQGVGYKELTPLLEEVVQFRWSEECCSALLAPIRRDEIRRTCYLPP-------------GVNATTITLIPKHCEAER

Query:  MEDYRPISCCNLVYKCISKILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMV------------------GDYHGSSGMPRCAMKVDLQKAYDSA
        +ED+ PISC N++YKCISKILADR+RVWL SFI+ NQS F+ GRSI++NILLC E++                  G +HG  G+          +  D  
Subjt:  MEDYRPISCCNLVYKCISKILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMV------------------GDYHGSSGMPRCAMKVDLQKAYDSA

Query:  LPCVLLVGLWRVLRPLIMVNDSVEGYHHRCEKVCLTHLTFADDLMIFCYADTASISFVRETFCQFGKLSGLIANLGKSSMFVAGVDSETATMLADSMGFA
         P + ++ +  + R L  +  S + +HHRCEKV LTHLTFADDLMIFC AD  SI F+R+   +FG+LSGL AN  KSS+FVAGV++E A+ LA  MGFA
Subjt:  LPCVLLVGLWRVLRPLIMVNDSVEGYHHRCEKVCLTHLTFADDLMIFCYADTASISFVRETFCQFGKLSGLIANLGKSSMFVAGVDSETATMLADSMGFA

Query:  LGTLPVRYLGIPLLSGRLQSSDCAPLIQRITSRTQSWSTRVLSFAA---CAFGVAKS-YVFWASVFVLPAEVHHEVDRLLMSYLLRGKEDGRRGAKVAWV
         G LPVRYLG+PLL+GRL+S+DCAPLIQRITS+ +SW+ RVLSFA        V +S  V+WASVFVLPA VH+EVD++L SYL RGKE+GR G KVAWV
Subjt:  LGTLPVRYLGIPLLSGRLQSSDCAPLIQRITSRTQSWSTRVLSFAA---CAFGVAKS-YVFWASVFVLPAEVHHEVDRLLMSYLLRGKEDGRRGAKVAWV

Query:  DVCRPFEEGGGIVR
        DVC PFEEGG  +R
Subjt:  DVCRPFEEGGGIVR

KAA0046851.1 uncharacterized protein E6C27_scaffold19358G00020 [Cucumis melo var. makuwa]3.3e-14142.54Show/hide
Query:  VDEQCFVFMPSNVDRRVLWRRLVETTSRWSIPSVVMGDFNAIRVPSKACGGSPVTSEMEEFDLAIREADLVEPSVQGNWFTWTSKDPGS-----------
        V+  C     S+ +RR LWR L E TS WS   VVMGDFNAIRV S+A GGSP+  EMEEFDLAIR+ADLVEPSVQGNWFTWTSK  GS           
Subjt:  VDEQCFVFMPSNVDRRVLWRRLVETTSRWSIPSVVMGDFNAIRVPSKACGGSPVTSEMEEFDLAIREADLVEPSVQGNWFTWTSKDPGS-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------VER-------SRDAGLETEAFWSAIRQEKAFLCQKSR----------------------------------GIRVTSHEGLVQMAVNF
                    VER       SR A L TE FW+A+R E+A L QKS+                                  G RV+SH+G+ QMAVN+
Subjt:  ------------VER-------SRDAGLETEAFWSAIRQEKAFLCQKSR----------------------------------GIRVTSHEGLVQMAVNF

Query:  FRNSLGSQGVGYKELTPLLEEVVQFRWSEECCSALLAPIRRDEIRR-----------------------------------------TCYLPPGVNATTI
        F NSLGSQ +GY+EL+P+++++VQF+WSEECC AL  PI R+E+RR                                         TCYLP GVNAT I
Subjt:  FRNSLGSQGVGYKELTPLLEEVVQFRWSEECCSALLAPIRRDEIRR-----------------------------------------TCYLPPGVNATTI

Query:  TLIPKHCEAERMEDYRPISCCNLVYKCISKILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMVGDYHGSSGMPRCAMKVDLQKAYDSALPCVLLV
        TLIPKH  AER+ED+RPISCCN++YKCISKILADR+R+WL SFI+ NQS F+ GRSI++NILLC E+VG YH +SG PRC +KVDLQKAYDS     L  
Subjt:  TLIPKHCEAERMEDYRPISCCNLVYKCISKILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMVGDYHGSSGMPRCAMKVDLQKAYDSALPCVLLV

Query:  GLWRVLRPL----------------IMVNDSVEG--------------------------------------YHHRCEKVCLTHLTFADDLMIFCYADTA
         L  +  PL                IM+N S+EG                                      +HHRCEKV LTHLTFADDLMIFC AD  
Subjt:  GLWRVLRPL----------------IMVNDSVEG--------------------------------------YHHRCEKVCLTHLTFADDLMIFCYADTA

Query:  SISFVRETFCQFGKLSGLIANLGKSSMFVAGVDSETATMLADSMGFALGTLPVRYLGIPLLSGRLQSSDCAPLIQRITSRTQSWSTRVLSFAA---CAFG
        SISF+RE   +FG+ SGL AN  KSS+FV GV++E A+ LA  +G +  + P            L+S DCAPLIQRITSR +SW+ RVLSFA        
Subjt:  SISFVRETFCQFGKLSGLIANLGKSSMFVAGVDSETATMLADSMGFALGTLPVRYLGIPLLSGRLQSSDCAPLIQRITSRTQSWSTRVLSFAA---CAFG

Query:  VAKS-YVFWASVFVLPAEVHHEVDRLLMSYLLRGKEDGRRGAKVAWVDVCRPFEEGG
        V +S  V+WASVFVLPA VH+EVD++L SYL RGKE+GR G KVAWVDVC PFEEGG
Subjt:  VAKS-YVFWASVFVLPAEVHHEVDRLLMSYLLRGKEDGRRGAKVAWVDVCRPFEEGG

KAA0059752.1 F17F8.5 [Cucumis melo var. makuwa]4.0e-13452.57Show/hide
Query:  SRDAGLETEAFWSAIRQEKAFLCQKSR----------------------------------GIRVTSHEGLVQMAVNFFRNSLGSQGVGYKELTPLLEEV
        S  AGL TE FW+A+R EKA L QKSR                                  G RV+SH+G+ QMAVN+F NSLGSQ +GY+EL+P+++++
Subjt:  SRDAGLETEAFWSAIRQEKAFLCQKSR----------------------------------GIRVTSHEGLVQMAVNFFRNSLGSQGVGYKELTPLLEEV

Query:  VQFRWSEECCSALLAPIRRDEIRR-----------------------------------------TCYLPPGVNATTITLIPKHCEAERMEDYRPISCCN
        VQF+WSEECC AL  PI R+E+RR                                         TCYLP GVNAT ITLIPKH  AER+ED+R ISCCN
Subjt:  VQFRWSEECCSALLAPIRRDEIRR-----------------------------------------TCYLPPGVNATTITLIPKHCEAERMEDYRPISCCN

Query:  LVYKCISKILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMVGDYHGSSGMPRCAMKVDLQKAYDSA-------------------------LPCV
        ++YKCISKILADR+RVWL SFI+ NQS FV GRSI++NILLC E+VG YH +SG PRC +KVD QKAYDS                          L   
Subjt:  LVYKCISKILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMVGDYHGSSGMPRCAMKVDLQKAYDSA-------------------------LPCV

Query:  LLVGLWRVLRPLIMVNDSVEGYHHRCEKVCLTHLTFADDLMIFCYADTASISFVRETFCQFGKLSGLIANLGKSSMFVAGVDSETATMLADSMGFALGTL
        L V +  VL  ++        +HHRCEKV LT+LTFADDLMIFC AD  SI F+RE   +FG+LSGL AN  KSS+FVAGV++E A+ LA  MGF  G L
Subjt:  LLVGLWRVLRPLIMVNDSVEGYHHRCEKVCLTHLTFADDLMIFCYADTASISFVRETFCQFGKLSGLIANLGKSSMFVAGVDSETATMLADSMGFALGTL

Query:  PVRYLGIPLLSGRLQSSDCAPLIQRITSRTQSWSTRVLSFAA----CAFGVAKSYVFWASVFVLPAEVHHEVDRLLMSYLLRGKEDGRRGAKVAWVDVCR
        P+RYLG+PLL+GRL+S+D APLIQRITSR +SW+ RVLSFA       F +    V+WASVFVLPA VHHEVD++L+SYL RGKE+GR G KVAWVDVC 
Subjt:  PVRYLGIPLLSGRLQSSDCAPLIQRITSRTQSWSTRVLSFAA----CAFGVAKSYVFWASVFVLPAEVHHEVDRLLMSYLLRGKEDGRRGAKVAWVDVCR

Query:  PFEEGG
        PFEE G
Subjt:  PFEEGG

KAA0062888.1 non-LTR retroelement reverse transcriptase-like protein [Cucumis melo var. makuwa]3.3e-14144.9Show/hide
Query:  VDEQCFVFMPSNVDRRVLWRRLVETTSRWSIPSVVMGDFNAIRVPSKACGGSPVTSEMEEFDLAIREADLVEPSVQGNWFTWTSK---------------
        V+  C     SN++RR+LWRRLVE TS WS P VVMGDFNAIRV  +A GGSP+  EME+FDLA R+ADLVEPSVQGNWFTWTSK               
Subjt:  VDEQCFVFMPSNVDRRVLWRRLVETTSRWSIPSVVMGDFNAIRVPSKACGGSPVTSEMEEFDLAIREADLVEPSVQGNWFTWTSK---------------

Query:  -------------------DPGSVE------------------------------------------------------------------RSRDAGLET
                           DP  +E                                                                   SR  GL T
Subjt:  -------------------DPGSVE------------------------------------------------------------------RSRDAGLET

Query:  EAFWSAIRQEKAFLCQKSR----------------------------------GIRVTSHEGLVQMAVNFFRNSLGSQGVGYKELTPLLEEVVQFRWSEE
        EAFW+A+R E+A L QKSR                                  G RV+SH+G+VQ+AVN+FRNSLGSQ +GY+EL P+++++VQFRWSEE
Subjt:  EAFWSAIRQEKAFLCQKSR----------------------------------GIRVTSHEGLVQMAVNFFRNSLGSQGVGYKELTPLLEEVVQFRWSEE

Query:  CCSALLAPIRRDEIRR-----------------------------------------TCYLPPGVNATTITLIPKHCEAERMEDYRPISCCNLVYKCISK
        CC AL  PI R+E+RR                                         TCYLP GVNAT ITLIPK   AE+ME++RPISCCN++YKCISK
Subjt:  CCSALLAPIRRDEIRR-----------------------------------------TCYLPPGVNATTITLIPKHCEAERMEDYRPISCCNLVYKCISK

Query:  ILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMVGDYHGSSGMPRCAMKVDLQKAYDSA-------------LPCVLLVGLWRVLRPLIMVNDSVE
        ILADR+RVWL SFI  NQS F+ GRSI+DNILLC E+V  YH +SG PRC +KVDLQKAYDS              L   L V +  VL  ++       
Subjt:  ILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMVGDYHGSSGMPRCAMKVDLQKAYDSA-------------LPCVLLVGLWRVLRPLIMVNDSVE

Query:  GYHHRCEKVCLTHLTFADDLMIFCYADTASISFVRETFCQFGKLSGLIANLGKSSMFVAGVDSETATMLADSMGFALGTLPVRYLGIPLLSGRLQSSDCA
         +HHRCEKV LTHLTFADDLMIFC AD  S+S                              +E A+ LA SMGF LG LPVRYLG+PLL+GRL+S+DCA
Subjt:  GYHHRCEKVCLTHLTFADDLMIFCYADTASISFVRETFCQFGKLSGLIANLGKSSMFVAGVDSETATMLADSMGFALGTLPVRYLGIPLLSGRLQSSDCA

Query:  PLIQRITSRTQSWSTRVLSFAA---CAFGVAKS-YVFWASVFVLPAEVHHEVDRLLMSYLLRGKEDGRRGAKVAWVD
        PLIQRITSR +SW+  VLSFA        V +S  V+W SVF+LP  VH+EVD +L SYL RGKE+GR G KVAWV+
Subjt:  PLIQRITSRTQSWSTRVLSFAA---CAFGVAKS-YVFWASVFVLPAEVHHEVDRLLMSYLLRGKEDGRRGAKVAWVD

TYK28099.1 uncharacterized protein E5676_scaffold1467G00020 [Cucumis melo var. makuwa]2.8e-14048.21Show/hide
Query:  VDEQCFVFMPSNVDRRVLWRRLVETTSRWSIPSVVMGDFNAIRVPSKACGGSPVTSEMEEFDLAIREADLVEPSVQGNWFTWTSKDPGSVERSR------
        V+  C     SN++RR+LW RLVE TS WS P VVM DFNAIRV S+A  GSP+  EME+F+LAIR+ADLVEPSVQGNWFTWTSK  GS    R      
Subjt:  VDEQCFVFMPSNVDRRVLWRRLVETTSRWSIPSVVMGDFNAIRVPSKACGGSPVTSEMEEFDLAIREADLVEPSVQGNWFTWTSKDPGSVERSR------

Query:  -DAGLET--------------------------------------------------EAFWSA-------------IRQEKAFLCQK-SRGIRVTSHEGL
         D  L T                                                     WS              +   K  L ++  R I+  S E  
Subjt:  -DAGLET--------------------------------------------------EAFWSA-------------IRQEKAFLCQK-SRGIRVTSHEGL

Query:  V----------QMAVNFFRNSLGSQGVGYKELTPLLEEVVQFRWSEECCSALLAPIRRDEIRRTCYLPP-------------GVNATTITLIPKHCEAER
        +          +MAVN+FRNSLGSQ +GY+EL+P+++++VQF+WSEECC AL  PI R+E+RR  +                G+NAT ITLIPKH  AER
Subjt:  V----------QMAVNFFRNSLGSQGVGYKELTPLLEEVVQFRWSEECCSALLAPIRRDEIRRTCYLPP-------------GVNATTITLIPKHCEAER

Query:  MEDYRPISCCNLVYKCISKILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMV------------------GDYHGSSGMPRCAMKVDLQKAYDSA
        +ED+ PISC N++YKCISKILADR+RVWL SFI+ NQS F+ GRSI++NILLC E++                  G +HG  G+          +  D  
Subjt:  MEDYRPISCCNLVYKCISKILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMV------------------GDYHGSSGMPRCAMKVDLQKAYDSA

Query:  LPCVLLVGLWRVLRPLIMVNDSVEGYHHRCEKVCLTHLTFADDLMIFCYADTASISFVRETFCQFGKLSGLIANLGKSSMFVAGVDSETATMLADSMGFA
         P + ++ +  + R L  +  S + +HHRCEKV LTHLTFADDLMIFC AD  SI F+R+   +FG+LSGL AN  KSS+FVAGV++E A+ LA  MGFA
Subjt:  LPCVLLVGLWRVLRPLIMVNDSVEGYHHRCEKVCLTHLTFADDLMIFCYADTASISFVRETFCQFGKLSGLIANLGKSSMFVAGVDSETATMLADSMGFA

Query:  LGTLPVRYLGIPLLSGRLQSSDCAPLIQRITSRTQSWSTRVLSFAA---CAFGVAKS-YVFWASVFVLPAEVHHEVDRLLMSYLLRGKEDGRRGAKVAWV
         G LPVRYLG+PLL+GRL+S+DCAPLIQRITS+ +SW+ RVLSFA        V +S  V+WASVFVLPA VH+EVD++L SYL RGKE+GR G KVAWV
Subjt:  LGTLPVRYLGIPLLSGRLQSSDCAPLIQRITSRTQSWSTRVLSFAA---CAFGVAKS-YVFWASVFVLPAEVHHEVDRLLMSYLLRGKEDGRRGAKVAWV

Query:  DVCRPFEEGGGIVR
        DVC PFEEGG  +R
Subjt:  DVCRPFEEGGGIVR

TrEMBL top hitse value%identityAlignment
A0A5A7SPE5 Reverse transcriptase domain-containing protein1.4e-14048.21Show/hide
Query:  VDEQCFVFMPSNVDRRVLWRRLVETTSRWSIPSVVMGDFNAIRVPSKACGGSPVTSEMEEFDLAIREADLVEPSVQGNWFTWTSKDPGSVERSR------
        V+  C     SN++RR+LW RLVE TS WS P VVM DFNAIRV S+A  GSP+  EME+F+LAIR+ADLVEPSVQGNWFTWTSK  GS    R      
Subjt:  VDEQCFVFMPSNVDRRVLWRRLVETTSRWSIPSVVMGDFNAIRVPSKACGGSPVTSEMEEFDLAIREADLVEPSVQGNWFTWTSKDPGSVERSR------

Query:  -DAGLET--------------------------------------------------EAFWSA-------------IRQEKAFLCQK-SRGIRVTSHEGL
         D  L T                                                     WS              +   K  L ++  R I+  S E  
Subjt:  -DAGLET--------------------------------------------------EAFWSA-------------IRQEKAFLCQK-SRGIRVTSHEGL

Query:  V----------QMAVNFFRNSLGSQGVGYKELTPLLEEVVQFRWSEECCSALLAPIRRDEIRRTCYLPP-------------GVNATTITLIPKHCEAER
        +          +MAVN+FRNSLGSQ +GY+EL+P+++++VQF+WSEECC AL  PI R+E+RR  +                G+NAT ITLIPKH  AER
Subjt:  V----------QMAVNFFRNSLGSQGVGYKELTPLLEEVVQFRWSEECCSALLAPIRRDEIRRTCYLPP-------------GVNATTITLIPKHCEAER

Query:  MEDYRPISCCNLVYKCISKILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMV------------------GDYHGSSGMPRCAMKVDLQKAYDSA
        +ED+ PISC N++YKCISKILADR+RVWL SFI+ NQS F+ GRSI++NILLC E++                  G +HG  G+          +  D  
Subjt:  MEDYRPISCCNLVYKCISKILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMV------------------GDYHGSSGMPRCAMKVDLQKAYDSA

Query:  LPCVLLVGLWRVLRPLIMVNDSVEGYHHRCEKVCLTHLTFADDLMIFCYADTASISFVRETFCQFGKLSGLIANLGKSSMFVAGVDSETATMLADSMGFA
         P + ++ +  + R L  +  S + +HHRCEKV LTHLTFADDLMIFC AD  SI F+R+   +FG+LSGL AN  KSS+FVAGV++E A+ LA  MGFA
Subjt:  LPCVLLVGLWRVLRPLIMVNDSVEGYHHRCEKVCLTHLTFADDLMIFCYADTASISFVRETFCQFGKLSGLIANLGKSSMFVAGVDSETATMLADSMGFA

Query:  LGTLPVRYLGIPLLSGRLQSSDCAPLIQRITSRTQSWSTRVLSFAA---CAFGVAKS-YVFWASVFVLPAEVHHEVDRLLMSYLLRGKEDGRRGAKVAWV
         G LPVRYLG+PLL+GRL+S+DCAPLIQRITS+ +SW+ RVLSFA        V +S  V+WASVFVLPA VH+EVD++L SYL RGKE+GR G KVAWV
Subjt:  LGTLPVRYLGIPLLSGRLQSSDCAPLIQRITSRTQSWSTRVLSFAA---CAFGVAKS-YVFWASVFVLPAEVHHEVDRLLMSYLLRGKEDGRRGAKVAWV

Query:  DVCRPFEEGGGIVR
        DVC PFEEGG  +R
Subjt:  DVCRPFEEGGGIVR

A0A5A7TZS0 Reverse transcriptase domain-containing protein1.6e-14142.54Show/hide
Query:  VDEQCFVFMPSNVDRRVLWRRLVETTSRWSIPSVVMGDFNAIRVPSKACGGSPVTSEMEEFDLAIREADLVEPSVQGNWFTWTSKDPGS-----------
        V+  C     S+ +RR LWR L E TS WS   VVMGDFNAIRV S+A GGSP+  EMEEFDLAIR+ADLVEPSVQGNWFTWTSK  GS           
Subjt:  VDEQCFVFMPSNVDRRVLWRRLVETTSRWSIPSVVMGDFNAIRVPSKACGGSPVTSEMEEFDLAIREADLVEPSVQGNWFTWTSKDPGS-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------VER-------SRDAGLETEAFWSAIRQEKAFLCQKSR----------------------------------GIRVTSHEGLVQMAVNF
                    VER       SR A L TE FW+A+R E+A L QKS+                                  G RV+SH+G+ QMAVN+
Subjt:  ------------VER-------SRDAGLETEAFWSAIRQEKAFLCQKSR----------------------------------GIRVTSHEGLVQMAVNF

Query:  FRNSLGSQGVGYKELTPLLEEVVQFRWSEECCSALLAPIRRDEIRR-----------------------------------------TCYLPPGVNATTI
        F NSLGSQ +GY+EL+P+++++VQF+WSEECC AL  PI R+E+RR                                         TCYLP GVNAT I
Subjt:  FRNSLGSQGVGYKELTPLLEEVVQFRWSEECCSALLAPIRRDEIRR-----------------------------------------TCYLPPGVNATTI

Query:  TLIPKHCEAERMEDYRPISCCNLVYKCISKILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMVGDYHGSSGMPRCAMKVDLQKAYDSALPCVLLV
        TLIPKH  AER+ED+RPISCCN++YKCISKILADR+R+WL SFI+ NQS F+ GRSI++NILLC E+VG YH +SG PRC +KVDLQKAYDS     L  
Subjt:  TLIPKHCEAERMEDYRPISCCNLVYKCISKILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMVGDYHGSSGMPRCAMKVDLQKAYDSALPCVLLV

Query:  GLWRVLRPL----------------IMVNDSVEG--------------------------------------YHHRCEKVCLTHLTFADDLMIFCYADTA
         L  +  PL                IM+N S+EG                                      +HHRCEKV LTHLTFADDLMIFC AD  
Subjt:  GLWRVLRPL----------------IMVNDSVEG--------------------------------------YHHRCEKVCLTHLTFADDLMIFCYADTA

Query:  SISFVRETFCQFGKLSGLIANLGKSSMFVAGVDSETATMLADSMGFALGTLPVRYLGIPLLSGRLQSSDCAPLIQRITSRTQSWSTRVLSFAA---CAFG
        SISF+RE   +FG+ SGL AN  KSS+FV GV++E A+ LA  +G +  + P            L+S DCAPLIQRITSR +SW+ RVLSFA        
Subjt:  SISFVRETFCQFGKLSGLIANLGKSSMFVAGVDSETATMLADSMGFALGTLPVRYLGIPLLSGRLQSSDCAPLIQRITSRTQSWSTRVLSFAA---CAFG

Query:  VAKS-YVFWASVFVLPAEVHHEVDRLLMSYLLRGKEDGRRGAKVAWVDVCRPFEEGG
        V +S  V+WASVFVLPA VH+EVD++L SYL RGKE+GR G KVAWVDVC PFEEGG
Subjt:  VAKS-YVFWASVFVLPAEVHHEVDRLLMSYLLRGKEDGRRGAKVAWVDVCRPFEEGG

A0A5A7UV01 F17F8.51.9e-13452.57Show/hide
Query:  SRDAGLETEAFWSAIRQEKAFLCQKSR----------------------------------GIRVTSHEGLVQMAVNFFRNSLGSQGVGYKELTPLLEEV
        S  AGL TE FW+A+R EKA L QKSR                                  G RV+SH+G+ QMAVN+F NSLGSQ +GY+EL+P+++++
Subjt:  SRDAGLETEAFWSAIRQEKAFLCQKSR----------------------------------GIRVTSHEGLVQMAVNFFRNSLGSQGVGYKELTPLLEEV

Query:  VQFRWSEECCSALLAPIRRDEIRR-----------------------------------------TCYLPPGVNATTITLIPKHCEAERMEDYRPISCCN
        VQF+WSEECC AL  PI R+E+RR                                         TCYLP GVNAT ITLIPKH  AER+ED+R ISCCN
Subjt:  VQFRWSEECCSALLAPIRRDEIRR-----------------------------------------TCYLPPGVNATTITLIPKHCEAERMEDYRPISCCN

Query:  LVYKCISKILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMVGDYHGSSGMPRCAMKVDLQKAYDSA-------------------------LPCV
        ++YKCISKILADR+RVWL SFI+ NQS FV GRSI++NILLC E+VG YH +SG PRC +KVD QKAYDS                          L   
Subjt:  LVYKCISKILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMVGDYHGSSGMPRCAMKVDLQKAYDSA-------------------------LPCV

Query:  LLVGLWRVLRPLIMVNDSVEGYHHRCEKVCLTHLTFADDLMIFCYADTASISFVRETFCQFGKLSGLIANLGKSSMFVAGVDSETATMLADSMGFALGTL
        L V +  VL  ++        +HHRCEKV LT+LTFADDLMIFC AD  SI F+RE   +FG+LSGL AN  KSS+FVAGV++E A+ LA  MGF  G L
Subjt:  LLVGLWRVLRPLIMVNDSVEGYHHRCEKVCLTHLTFADDLMIFCYADTASISFVRETFCQFGKLSGLIANLGKSSMFVAGVDSETATMLADSMGFALGTL

Query:  PVRYLGIPLLSGRLQSSDCAPLIQRITSRTQSWSTRVLSFAA----CAFGVAKSYVFWASVFVLPAEVHHEVDRLLMSYLLRGKEDGRRGAKVAWVDVCR
        P+RYLG+PLL+GRL+S+D APLIQRITSR +SW+ RVLSFA       F +    V+WASVFVLPA VHHEVD++L+SYL RGKE+GR G KVAWVDVC 
Subjt:  PVRYLGIPLLSGRLQSSDCAPLIQRITSRTQSWSTRVLSFAA----CAFGVAKSYVFWASVFVLPAEVHHEVDRLLMSYLLRGKEDGRRGAKVAWVDVCR

Query:  PFEEGG
        PFEE G
Subjt:  PFEEGG

A0A5A7V5J2 Non-LTR retroelement reverse transcriptase-like protein1.6e-14144.9Show/hide
Query:  VDEQCFVFMPSNVDRRVLWRRLVETTSRWSIPSVVMGDFNAIRVPSKACGGSPVTSEMEEFDLAIREADLVEPSVQGNWFTWTSK---------------
        V+  C     SN++RR+LWRRLVE TS WS P VVMGDFNAIRV  +A GGSP+  EME+FDLA R+ADLVEPSVQGNWFTWTSK               
Subjt:  VDEQCFVFMPSNVDRRVLWRRLVETTSRWSIPSVVMGDFNAIRVPSKACGGSPVTSEMEEFDLAIREADLVEPSVQGNWFTWTSK---------------

Query:  -------------------DPGSVE------------------------------------------------------------------RSRDAGLET
                           DP  +E                                                                   SR  GL T
Subjt:  -------------------DPGSVE------------------------------------------------------------------RSRDAGLET

Query:  EAFWSAIRQEKAFLCQKSR----------------------------------GIRVTSHEGLVQMAVNFFRNSLGSQGVGYKELTPLLEEVVQFRWSEE
        EAFW+A+R E+A L QKSR                                  G RV+SH+G+VQ+AVN+FRNSLGSQ +GY+EL P+++++VQFRWSEE
Subjt:  EAFWSAIRQEKAFLCQKSR----------------------------------GIRVTSHEGLVQMAVNFFRNSLGSQGVGYKELTPLLEEVVQFRWSEE

Query:  CCSALLAPIRRDEIRR-----------------------------------------TCYLPPGVNATTITLIPKHCEAERMEDYRPISCCNLVYKCISK
        CC AL  PI R+E+RR                                         TCYLP GVNAT ITLIPK   AE+ME++RPISCCN++YKCISK
Subjt:  CCSALLAPIRRDEIRR-----------------------------------------TCYLPPGVNATTITLIPKHCEAERMEDYRPISCCNLVYKCISK

Query:  ILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMVGDYHGSSGMPRCAMKVDLQKAYDSA-------------LPCVLLVGLWRVLRPLIMVNDSVE
        ILADR+RVWL SFI  NQS F+ GRSI+DNILLC E+V  YH +SG PRC +KVDLQKAYDS              L   L V +  VL  ++       
Subjt:  ILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMVGDYHGSSGMPRCAMKVDLQKAYDSA-------------LPCVLLVGLWRVLRPLIMVNDSVE

Query:  GYHHRCEKVCLTHLTFADDLMIFCYADTASISFVRETFCQFGKLSGLIANLGKSSMFVAGVDSETATMLADSMGFALGTLPVRYLGIPLLSGRLQSSDCA
         +HHRCEKV LTHLTFADDLMIFC AD  S+S                              +E A+ LA SMGF LG LPVRYLG+PLL+GRL+S+DCA
Subjt:  GYHHRCEKVCLTHLTFADDLMIFCYADTASISFVRETFCQFGKLSGLIANLGKSSMFVAGVDSETATMLADSMGFALGTLPVRYLGIPLLSGRLQSSDCA

Query:  PLIQRITSRTQSWSTRVLSFAA---CAFGVAKS-YVFWASVFVLPAEVHHEVDRLLMSYLLRGKEDGRRGAKVAWVD
        PLIQRITSR +SW+  VLSFA        V +S  V+W SVF+LP  VH+EVD +L SYL RGKE+GR G KVAWV+
Subjt:  PLIQRITSRTQSWSTRVLSFAA---CAFGVAKS-YVFWASVFVLPAEVHHEVDRLLMSYLLRGKEDGRRGAKVAWVD

A0A5D3DXE4 Reverse transcriptase domain-containing protein1.4e-14048.21Show/hide
Query:  VDEQCFVFMPSNVDRRVLWRRLVETTSRWSIPSVVMGDFNAIRVPSKACGGSPVTSEMEEFDLAIREADLVEPSVQGNWFTWTSKDPGSVERSR------
        V+  C     SN++RR+LW RLVE TS WS P VVM DFNAIRV S+A  GSP+  EME+F+LAIR+ADLVEPSVQGNWFTWTSK  GS    R      
Subjt:  VDEQCFVFMPSNVDRRVLWRRLVETTSRWSIPSVVMGDFNAIRVPSKACGGSPVTSEMEEFDLAIREADLVEPSVQGNWFTWTSKDPGSVERSR------

Query:  -DAGLET--------------------------------------------------EAFWSA-------------IRQEKAFLCQK-SRGIRVTSHEGL
         D  L T                                                     WS              +   K  L ++  R I+  S E  
Subjt:  -DAGLET--------------------------------------------------EAFWSA-------------IRQEKAFLCQK-SRGIRVTSHEGL

Query:  V----------QMAVNFFRNSLGSQGVGYKELTPLLEEVVQFRWSEECCSALLAPIRRDEIRRTCYLPP-------------GVNATTITLIPKHCEAER
        +          +MAVN+FRNSLGSQ +GY+EL+P+++++VQF+WSEECC AL  PI R+E+RR  +                G+NAT ITLIPKH  AER
Subjt:  V----------QMAVNFFRNSLGSQGVGYKELTPLLEEVVQFRWSEECCSALLAPIRRDEIRRTCYLPP-------------GVNATTITLIPKHCEAER

Query:  MEDYRPISCCNLVYKCISKILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMV------------------GDYHGSSGMPRCAMKVDLQKAYDSA
        +ED+ PISC N++YKCISKILADR+RVWL SFI+ NQS F+ GRSI++NILLC E++                  G +HG  G+          +  D  
Subjt:  MEDYRPISCCNLVYKCISKILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMV------------------GDYHGSSGMPRCAMKVDLQKAYDSA

Query:  LPCVLLVGLWRVLRPLIMVNDSVEGYHHRCEKVCLTHLTFADDLMIFCYADTASISFVRETFCQFGKLSGLIANLGKSSMFVAGVDSETATMLADSMGFA
         P + ++ +  + R L  +  S + +HHRCEKV LTHLTFADDLMIFC AD  SI F+R+   +FG+LSGL AN  KSS+FVAGV++E A+ LA  MGFA
Subjt:  LPCVLLVGLWRVLRPLIMVNDSVEGYHHRCEKVCLTHLTFADDLMIFCYADTASISFVRETFCQFGKLSGLIANLGKSSMFVAGVDSETATMLADSMGFA

Query:  LGTLPVRYLGIPLLSGRLQSSDCAPLIQRITSRTQSWSTRVLSFAA---CAFGVAKS-YVFWASVFVLPAEVHHEVDRLLMSYLLRGKEDGRRGAKVAWV
         G LPVRYLG+PLL+GRL+S+DCAPLIQRITS+ +SW+ RVLSFA        V +S  V+WASVFVLPA VH+EVD++L SYL RGKE+GR G KVAWV
Subjt:  LGTLPVRYLGIPLLSGRLQSSDCAPLIQRITSRTQSWSTRVLSFAA---CAFGVAKS-YVFWASVFVLPAEVHHEVDRLLMSYLLRGKEDGRRGAKVAWV

Query:  DVCRPFEEGGGIVR
        DVC PFEEGG  +R
Subjt:  DVCRPFEEGGGIVR

SwissProt top hitse value%identityAlignment
P11369 LINE-1 retrotransposable element ORF2 protein2.1e-0531.37Show/hide
Query:  LPPGVNATTITLIPK-HCEAERMEDYRPISCCNLVYKCISKILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMVGDYHGSSGMPRCAMKVDLQKA
        LP      TITLIPK   +  ++E++RPIS  N+  K ++KILA+RI+  + + I+ +Q  F+ G     NI     ++   +         + +D +KA
Subjt:  LPPGVNATTITLIPK-HCEAERMEDYRPISCCNLVYKCISKILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMVGDYHGSSGMPRCAMKVDLQKA

Query:  YD
        +D
Subjt:  YD

P14381 Transposon TX1 uncharacterized 149 kDa protein3.0e-0727.72Show/hide
Query:  LPPGVNATTITLIPKHCEAERMEDYRPISCCNLVYKCISKILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMVGDYHGSSGMPRCAMKVDLQKAY
        LP       ++L+PK  +   ++++RP+S  +  YK ++K ++ R++  L+  I+ +QS  V GR+I DN+ L  +++  +   +G+    + +D +KA+
Subjt:  LPPGVNATTITLIPKHCEAERMEDYRPISCCNLVYKCISKILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMVGDYHGSSGMPRCAMKVDLQKAY

Query:  D
        D
Subjt:  D

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein4.8e-0547.83Show/hide
Query:  EIRRTCYLPPGVNATTITLIPKHCEAERMEDYRPISCCNLVYKCIS
        E  RT +L    NAT ITLIPK    +++  +RP+SCC +VYK I+
Subjt:  EIRRTCYLPPGVNATTITLIPKHCEAERMEDYRPISCCNLVYKCIS

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein7.0e-2042.22Show/hide
Query:  VAGVDSETATMLADSMGFALGTLPVRYLGIPLLSGRLQSSDCAPLIQRITSRTQSWSTRVLSFAA---CAFGVAKSYV-FWASVFVLPAEVHHEVDRLLM
        +AGV       +  S  FA G LPVRYLG+PLL+ ++ +SD  PL+++I  R   W+ R LSFA        V  S   FW S F LP+    E+D +  
Subjt:  VAGVDSETATMLADSMGFALGTLPVRYLGIPLLSGRLQSSDCAPLIQRITSRTQSWSTRVLSFAA---CAFGVAKSYV-FWASVFVLPAEVHHEVDRLLM

Query:  SYLLRGKEDGRRGAKVAWVDVCRPFEEGGGIVRFL
        S+L  G E   + AKVAW DVC P +EGG  +R L
Subjt:  SYLLRGKEDGRRGAKVAWVDVCRPFEEGGGIVRFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTGGGGTTGCTCTTGTAGCTATAGCAGCAGCGGTGTGGGTCAGACTTGAGGTTACTCCTAGTGTGGTGGATGAGCAGTGTTTTGTGTTTATGCCTAGTAATGTGGA
CCGTCGTGTGCTTTGGCGTAGGTTAGTTGAGACCACTTCTAGATGGTCGATTCCAAGTGTGGTTATGGGTGATTTTAATGCAATTCGGGTGCCCTCTAAAGCGTGTGGTG
GAAGTCCAGTTACTAGTGAAATGGAGGAGTTTGATCTTGCTATTCGTGAAGCTGACTTGGTTGAGCCGTCTGTTCAGGGTAACTGGTTCACTTGGACGAGTAAAGATCCT
GGGTCTGTGGAGAGGAGTCGTGATGCTGGTCTTGAGACTGAGGCTTTTTGGTCGGCTATCCGTCAAGAAAAAGCCTTTCTCTGCCAGAAATCACGGGGAATTCGGGTGAC
TTCTCATGAGGGGTTGGTGCAAATGGCTGTCAACTTTTTTCGTAACAGTCTTGGGTCCCAAGGGGTTGGCTATAAGGAGCTCACTCCTTTGTTGGAGGAGGTGGTTCAGT
TTAGGTGGTCGGAGGAGTGTTGTAGTGCGTTACTGGCTCCTATTAGGCGTGATGAGATCAGGAGGACGTGTTATCTGCCTCCTGGGGTAAATGCTACTACGATCACCCTC
ATTCCCAAGCATTGTGAGGCTGAGCGTATGGAGGATTATAGGCCTATTTCATGTTGTAATTTGGTCTATAAGTGCATTTCTAAGATTTTGGCTGATAGAATTCGTGTGTG
GCTTTCGTCTTTTATCAATGGTAATCAGTCCACCTTTGTTGTGGGGAGGAGTATTGTGGATAACATTCTGCTTTGTCATGAGATGGTAGGGGATTACCATGGTTCTTCTG
GGATGCCTAGGTGTGCGATGAAGGTGGACCTTCAGAAGGCTTACGATTCGGCACTCCCTTGCGTTTTGTTAGTTGGATTATGGCGTGTGTTACGTCCCCTTATTATGGTT
AATGACTCTGTCGAAGGGTATCATCATCGTTGTGAGAAGGTTTGTTTGACCCATCTAACCTTCGCTGATGATCTTATGATATTTTGTTATGCTGATACTGCTTCTATTAG
CTTTGTGCGCGAGACTTTCTGTCAGTTTGGGAAATTATCGGGGTTGATTGCTAATCTAGGGAAGAGCTCTATGTTTGTGGCGGGAGTTGATAGTGAGACTGCTACTATGT
TGGCGGATAGTATGGGGTTTGCTCTGGGTACTCTGCCTGTTCGTTACCTGGGTATTCCCTTACTCTCTGGTCGGCTACAGTCTTCAGATTGTGCTCCGCTCATCCAACGT
ATTACTAGTCGAACTCAGTCTTGGTCGACTAGAGTATTGTCCTTCGCTGCTTGTGCGTTCGGTGTTGCGAAGTCTTACGTCTTTTGGGCCAGTGTGTTTGTTTTGCCGGC
TGAGGTGCATCATGAGGTTGATAGGCTTTTGATGTCGTATCTGTTGAGGGGTAAGGAGGATGGGAGAAGAGGTGCGAAGGTTGCTTGGGTGGACGTGTGTCGCCCGTTTG
AGGAAGGGGGTGGGATAGTTCGATTCCTCGTTCGATTCCTCCTAATCTCATCACGCCTAACAAGTGTCAAAGAAATGCAACACAGCATCACAAAAATCCTCACCCACCAC
ATTCCAAGCACTTTTGAAGAACCCCGTCGAAAAACCATTAGGACCAGATTATACAACAACAAGGAAGACCTCACAGCCCGAGGTCAAGGCACCAAAGCATTGAACGCACA
AAACCACCAAACACAACAAATAGAGAAACGAAACAAGAAACCAGGGAAAAAAACCCCCACAATCGGGGAATCAAACCAAAAGGAAAATCAGACGACAAAAGGAAAAATCC
AAACAACAAAATGGAAAGCACTAGATAAGATCATGAACATCTTCCTGCCAAGAGACAGCACGAGCACGAACTGTAGACCGGATACATTGAATAAGAATATCGGGCTCACG
AACCGCACCACCATGGAGACGACGATTCCGCTCATTCCAGATAAAGTAGATAGAAGCACACCAGAGCACCCGTCAAAGCTTCCTCATCACACCCTGCCCAATACCCTTAC
GACTGACCCAAGACAACTCAACCTCCCACCCCCCAACCCTATGAGAGGACGCCATATCCCAAATAATACGATTCCATATATCATACCCAAAAAGACAAGAAAAAAACAGA
TGATCCCGAGATTCAACACCCCTAGCTTAAGACACGATCGAGGAATCAAACTATCCCACCTACTCAACCTATCCCGAGTGCCCAACCTATCTCTAATAACCAACCAAGCA
CAAAAAGAATGTCTAGGAATATTCCCCCACCCCACAGCAGACCAACCCAACAAACCTTTTCACTCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTTGGGGTTGCTCTTGTAGCTATAGCAGCAGCGGTGTGGGTCAGACTTGAGGTTACTCCTAGTGTGGTGGATGAGCAGTGTTTTGTGTTTATGCCTAGTAATGTGGA
CCGTCGTGTGCTTTGGCGTAGGTTAGTTGAGACCACTTCTAGATGGTCGATTCCAAGTGTGGTTATGGGTGATTTTAATGCAATTCGGGTGCCCTCTAAAGCGTGTGGTG
GAAGTCCAGTTACTAGTGAAATGGAGGAGTTTGATCTTGCTATTCGTGAAGCTGACTTGGTTGAGCCGTCTGTTCAGGGTAACTGGTTCACTTGGACGAGTAAAGATCCT
GGGTCTGTGGAGAGGAGTCGTGATGCTGGTCTTGAGACTGAGGCTTTTTGGTCGGCTATCCGTCAAGAAAAAGCCTTTCTCTGCCAGAAATCACGGGGAATTCGGGTGAC
TTCTCATGAGGGGTTGGTGCAAATGGCTGTCAACTTTTTTCGTAACAGTCTTGGGTCCCAAGGGGTTGGCTATAAGGAGCTCACTCCTTTGTTGGAGGAGGTGGTTCAGT
TTAGGTGGTCGGAGGAGTGTTGTAGTGCGTTACTGGCTCCTATTAGGCGTGATGAGATCAGGAGGACGTGTTATCTGCCTCCTGGGGTAAATGCTACTACGATCACCCTC
ATTCCCAAGCATTGTGAGGCTGAGCGTATGGAGGATTATAGGCCTATTTCATGTTGTAATTTGGTCTATAAGTGCATTTCTAAGATTTTGGCTGATAGAATTCGTGTGTG
GCTTTCGTCTTTTATCAATGGTAATCAGTCCACCTTTGTTGTGGGGAGGAGTATTGTGGATAACATTCTGCTTTGTCATGAGATGGTAGGGGATTACCATGGTTCTTCTG
GGATGCCTAGGTGTGCGATGAAGGTGGACCTTCAGAAGGCTTACGATTCGGCACTCCCTTGCGTTTTGTTAGTTGGATTATGGCGTGTGTTACGTCCCCTTATTATGGTT
AATGACTCTGTCGAAGGGTATCATCATCGTTGTGAGAAGGTTTGTTTGACCCATCTAACCTTCGCTGATGATCTTATGATATTTTGTTATGCTGATACTGCTTCTATTAG
CTTTGTGCGCGAGACTTTCTGTCAGTTTGGGAAATTATCGGGGTTGATTGCTAATCTAGGGAAGAGCTCTATGTTTGTGGCGGGAGTTGATAGTGAGACTGCTACTATGT
TGGCGGATAGTATGGGGTTTGCTCTGGGTACTCTGCCTGTTCGTTACCTGGGTATTCCCTTACTCTCTGGTCGGCTACAGTCTTCAGATTGTGCTCCGCTCATCCAACGT
ATTACTAGTCGAACTCAGTCTTGGTCGACTAGAGTATTGTCCTTCGCTGCTTGTGCGTTCGGTGTTGCGAAGTCTTACGTCTTTTGGGCCAGTGTGTTTGTTTTGCCGGC
TGAGGTGCATCATGAGGTTGATAGGCTTTTGATGTCGTATCTGTTGAGGGGTAAGGAGGATGGGAGAAGAGGTGCGAAGGTTGCTTGGGTGGACGTGTGTCGCCCGTTTG
AGGAAGGGGGTGGGATAGTTCGATTCCTCGTTCGATTCCTCCTAATCTCATCACGCCTAACAAGTGTCAAAGAAATGCAACACAGCATCACAAAAATCCTCACCCACCAC
ATTCCAAGCACTTTTGAAGAACCCCGTCGAAAAACCATTAGGACCAGATTATACAACAACAAGGAAGACCTCACAGCCCGAGGTCAAGGCACCAAAGCATTGAACGCACA
AAACCACCAAACACAACAAATAGAGAAACGAAACAAGAAACCAGGGAAAAAAACCCCCACAATCGGGGAATCAAACCAAAAGGAAAATCAGACGACAAAAGGAAAAATCC
AAACAACAAAATGGAAAGCACTAGATAAGATCATGAACATCTTCCTGCCAAGAGACAGCACGAGCACGAACTGTAGACCGGATACATTGAATAAGAATATCGGGCTCACG
AACCGCACCACCATGGAGACGACGATTCCGCTCATTCCAGATAAAGTAGATAGAAGCACACCAGAGCACCCGTCAAAGCTTCCTCATCACACCCTGCCCAATACCCTTAC
GACTGACCCAAGACAACTCAACCTCCCACCCCCCAACCCTATGAGAGGACGCCATATCCCAAATAATACGATTCCATATATCATACCCAAAAAGACAAGAAAAAAACAGA
TGATCCCGAGATTCAACACCCCTAGCTTAAGACACGATCGAGGAATCAAACTATCCCACCTACTCAACCTATCCCGAGTGCCCAACCTATCTCTAATAACCAACCAAGCA
CAAAAAGAATGTCTAGGAATATTCCCCCACCCCACAGCAGACCAACCCAACAAACCTTTTCACTCCTAG
Protein sequenceShow/hide protein sequence
MLGVALVAIAAAVWVRLEVTPSVVDEQCFVFMPSNVDRRVLWRRLVETTSRWSIPSVVMGDFNAIRVPSKACGGSPVTSEMEEFDLAIREADLVEPSVQGNWFTWTSKDP
GSVERSRDAGLETEAFWSAIRQEKAFLCQKSRGIRVTSHEGLVQMAVNFFRNSLGSQGVGYKELTPLLEEVVQFRWSEECCSALLAPIRRDEIRRTCYLPPGVNATTITL
IPKHCEAERMEDYRPISCCNLVYKCISKILADRIRVWLSSFINGNQSTFVVGRSIVDNILLCHEMVGDYHGSSGMPRCAMKVDLQKAYDSALPCVLLVGLWRVLRPLIMV
NDSVEGYHHRCEKVCLTHLTFADDLMIFCYADTASISFVRETFCQFGKLSGLIANLGKSSMFVAGVDSETATMLADSMGFALGTLPVRYLGIPLLSGRLQSSDCAPLIQR
ITSRTQSWSTRVLSFAACAFGVAKSYVFWASVFVLPAEVHHEVDRLLMSYLLRGKEDGRRGAKVAWVDVCRPFEEGGGIVRFLVRFLLISSRLTSVKEMQHSITKILTHH
IPSTFEEPRRKTIRTRLYNNKEDLTARGQGTKALNAQNHQTQQIEKRNKKPGKKTPTIGESNQKENQTTKGKIQTTKWKALDKIMNIFLPRDSTSTNCRPDTLNKNIGLT
NRTTMETTIPLIPDKVDRSTPEHPSKLPHHTLPNTLTTDPRQLNLPPPNPMRGRHIPNNTIPYIIPKKTRKKQMIPRFNTPSLRHDRGIKLSHLLNLSRVPNLSLITNQA
QKECLGIFPHPTADQPNKPFHS