CuGenDBv2

PI0018657 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0018657
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr06:12429857..12431888
RNA-Seq Expression: PI0018657
Synteny: PI0018657
Gene Ontology terms: NA
InterPro domains:
    IPR025558 - Domain of unknown function DUF4283
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
    IPR040256 - Uncharacterized protein At4g02000-like
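The genome location field above follows a `chromosome:start..end` pattern. As a minimal sketch, the field can be split into its parts and the locus span computed as follows. The names `GenomeLocation` and `parse_location` are hypothetical helpers introduced here for illustration, and the code assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'chrom:start..end' location string."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


# Matches strings such as 'chr06:12429857..12431888'.
_LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string into chromosome, start, and end."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in: {text!r}")
    return GenomeLocation(match["chrom"], start, end)


if __name__ == "__main__":
    loc = parse_location("chr06:12429857..12431888")
    print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the locus spans 2032 bp on chr06.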


Homology
GenBank top hits (e value, % identity, alignment)
KAA0026071.1 uncharacterized protein E6C27_scaffold581G00620 [Cucumis melo var. makuwa] | e value: 6.3e-122 | % identity: 43.77
Query:  GSPVMGPVSSGLPVEKYISGLLNRPHLINQKGMGQVD-------------GKVSTDSGLHCRPNDSLGPIENSVVGPQLFQKVDSRNIDSWASLFGSSSG
        G  V+GP  +GL VE   S +L    +  +   GQ D             G V++ S L  +  +    +EN +       K+ + +  +WASLFG+SS 
Subjt:  GSPVMGPVSSGLPVEKYISGLLNRPHLINQKGMGQVD-------------GKVSTDSGLHCRPNDSLGPIENSVVGPQLFQKVDSRNIDSWASLFGSSSG

Query:  NGLPYTPPSLVGSKLVVVPSEEIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEIPTITMLENGLICFQFRRPNSVERILSRGHGILVGNLVF
          L YTPP  +G K+VV+P EE+I QG+++WENSLVGQL+DA LPYAVIQRL                                                
Subjt:  NGLPYTPPSLVGSKLVVVPSEEIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEIPTITMLENGLICFQFRRPNSVERILSRGHGILVGNLVF

Query:  NSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVEVEGGADLPSEVTVNLRGVECSVPVTYEWKPRMCNSCHSFGHSAGKCP
                                      KP+SLD ATKKRRRLS+ARVCVE+EGG+++ +E+TV+LRGV+ +V V YEWKPR CN C +FGHS+ KC 
Subjt:  NSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVEVEGGADLPSEVTVNLRGVECSVPVTYEWKPRMCNSCHSFGHSAGKCP

Query:  QKETS----------------------------QLEEGEIQGSPSRQVSPTNNGGGKKKDFTTVTRKKRVLVSVRDKGKGKSMQAVQ-NSFGSLSDLSEG
        +   S                            Q+E+GEI+ SP+R  S    G GK  +FT VTRKK  LVS+RD  +GKSM+ +  NSFGSL ++ + 
Subjt:  QKETS----------------------------QLEEGEIQGSPSRQVSPTNNGGGKKKDFTTVTRKKRVLVSVRDKGKGKSMQAVQ-NSFGSLSDLSEG

Query:  ENWALALRVSTPPPLQIVGGDGGMPRLSPNGGPMGISMDETYMISWCSWNIRGLNDPVKCRAVSDFLRVSSVGFCCLLETRISEGNFCSVSGRFGDAWSY
        + WAL++   + PPLQ+   D G   LS      GIS                                S VGFCCLLETR+ EGNF SVS RF ++W Y
Subjt:  ENWALALRVSTPPPLQIVGGDGGMPRLSPNGGPMGISMDETYMISWCSWNIRGLNDPVKCRAVSDFLRVSSVGFCCLLETRISEGNFCSVSGRFGDAWSY

Query:  SCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFVSGVIVDLHSGVTVEVLCVYASNSNMDRRVLWRRLVEITSSWSSPGVVMGDFNAISVHSEACGRSPVT
        SCSYS SGVGRI VMWKK+RF F+  V+DEQFV+G + DL SGV VEV CVYASNSN++RR+LW RLVEITS+WSSPGVVM DFNAI VHSEA   SP+ 
Subjt:  SCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFVSGVIVDLHSGVTVEVLCVYASNSNMDRRVLWRRLVEITSSWSSPGVVMGDFNAISVHSEACGRSPVT

Query:  RDMEEFDLAIREADLVEPAVQGNWFT
         +ME+F+LAIR+ADLVEP+VQGNWFT
Subjt:  RDMEEFDLAIREADLVEPAVQGNWFT

KAA0046247.1 uncharacterized protein E6C27_scaffold284G00450 [Cucumis melo var. makuwa] | e value: 1.5e-136 | % identity: 45.45
Query:  GKNMVATKVGLQEAKGVAARGTRTD--GTLEPTGSG--ARVGSPVMG----PVSSGLPVEKYISGLLNRPHLINQKGMGQVDGKVSTDSGLHCRPNDSLG
        G +++   V  +  +G    G R D  G+  P       R GS V G     +SSG  +E       N             D   +  SG  C  N+   
Subjt:  GKNMVATKVGLQEAKGVAARGTRTD--GTLEPTGSG--ARVGSPVMG----PVSSGLPVEKYISGLLNRPHLINQKGMGQVDGKVSTDSGLHCRPNDSLG

Query:  PIENSVVGPQLFQKVDSRNIDSWASLFGSSSGNGLPYTPPSLVGSKLVVVPSEEIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEIPTITML
         + N V      Q +DS++  +WASLFG+SS   L YT P ++G K+VV P E++I QG+++WENSLVGQL+D+ LPY VIQ L+EKIWGKIE+P IT+L
Subjt:  PIENSVVGPQLFQKVDSRNIDSWASLFGSSSGNGLPYTPPSLVGSKLVVVPSEEIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEIPTITML

Query:  ENGLICFQFRRPNSVERILSRG---------------HGILVGNLVFNSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVE
        EN LICFQFRR  SVE ILSRG                GI+    VFNSVPVWI+LG++PMELWTE+G+AV+ASA+ KP+SLDLATK+RRRLS+ARVCVE
Subjt:  ENGLICFQFRRPNSVERILSRG---------------HGILVGNLVFNSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVE

Query:  VEGGADLPSEVTVNLRGVECSVPVTYEWKPRMCNSCHSFGHSAGKCPQKETS----------------------------QLEEGEIQGSPSRQVSPTNN
        +E G+++P+E+TV+LRGV+ +V V YEWKPR CN C +FGHS   C +   S                            QLEEGEI+ SP+R  S    
Subjt:  VEGGADLPSEVTVNLRGVECSVPVTYEWKPRMCNSCHSFGHSAGKCPQKETS----------------------------QLEEGEIQGSPSRQVSPTNN

Query:  GGGKKKDFTTVTRKKRVLVSVRDKGKGKSMQAVQNSFGSLSDLSEGENWALALRVSTPPPLQIVGGDGGMPRLSPNGGPMGISMDETYMISWCSWNIRGL
        G GK  +FT VTR+K  LVSVRD+GK  S                            PPPLQ+   D G   L  NG                       
Subjt:  GGGKKKDFTTVTRKKRVLVSVRDKGKGKSMQAVQNSFGSLSDLSEGENWALALRVSTPPPLQIVGGDGGMPRLSPNGGPMGISMDETYMISWCSWNIRGL

Query:  NDPVKCRAVSDFLRVSSVGFCCLLETRISEGNFCSVSGRFGDAWSYSCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFVSGVIVDLHSGVTVEVLCVYAS
           +K +AV DFL  SSVGFCCLLETR+ EGNF SVS RF ++W YSCSYS SGVGRI VMWKK+RF F+ +V+DEQF++                    
Subjt:  NDPVKCRAVSDFLRVSSVGFCCLLETRISEGNFCSVSGRFGDAWSYSCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFVSGVIVDLHSGVTVEVLCVYAS

Query:  NSNMDRRVLWRRLVEITSSWSSPGVVMGDFNAISVHSEACGRSPVTRDMEEFDLAIREADLVEPAVQGNWF
                     VEITS+WSSPGVVMGDFNAI V+SEA G SP+  +ME+FDLAIR+ DLVEP VQGN F
Subjt:  NSNMDRRVLWRRLVEITSSWSSPGVVMGDFNAISVHSEACGRSPVTRDMEEFDLAIREADLVEPAVQGNWF

KAA0062888.1 non-LTR retroelement reverse transcriptase-like protein [Cucumis melo var. makuwa] | e value: 1.4e-134 | % identity: 54.34
Query:  IPTITMLENGLICFQFRRPNSVERILSRGH---------------GILVGNLVFNSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLS
        +PTIT+LEN LICFQFRRPNSVE ILSRG                GI+  + VFNSVPVWI+LGRIPMELWTE+ +A++AS + KP++LDLATK+  RLS
Subjt:  IPTITMLENGLICFQFRRPNSVERILSRGH---------------GILVGNLVFNSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLS

Query:  FARVCVEVEGGADLPSEVTVNLRGVECSVPVTYEWKPRMCNSCHSFGHSAGKCPQKETS---------------------------------QLEEGEIQ
        +ARVCV++EG  ++ +E+TVNLRGV+ +V V YEWKP+ CN C + GHS GKCP+   S                                 QLEEGEI+
Subjt:  FARVCVEVEGGADLPSEVTVNLRGVECSVPVTYEWKPRMCNSCHSFGHSAGKCPQKETS---------------------------------QLEEGEIQ

Query:  GSPSRQVSPTNNGGGKKKDFTTVTRKKRVLVSVRDKGKGKSMQAVQNSFGSLSDLSEGENWALALRVSTPPPLQIVGGDGGMPRLSPNGGPMGISMDETY
         SP+R  S      GK+ +FT VTRKK  LVSVRD+GK   + A+ NSFGSL ++ + + WAL +   +PPPLQ+  G G +  L               
Subjt:  GSPSRQVSPTNNGGGKKKDFTTVTRKKRVLVSVRDKGKGKSMQAVQNSFGSLSDLSEGENWALALRVSTPPPLQIVGGDGGMPRLSPNGGPMGISMDETY

Query:  MISWCSWNIRGLNDPVKCRAVSDFLRVSSVGFCCLLETRISEGNFCSVSGRFGDAWSYSCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFVSGVIVDLHS
                        K +AVSDFL  SSVGFCCLLETR+ E NF  VS RFG++W YSCSYS SGVGRI V+WKK+RF F+  VVDEQFV+G + DL S
Subjt:  MISWCSWNIRGLNDPVKCRAVSDFLRVSSVGFCCLLETRISEGNFCSVSGRFGDAWSYSCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFVSGVIVDLHS

Query:  GVTVEVLCVYASNSNMDRRVLWRRLVEITSSWSSPGVVMGDFNAISVHSEACGRSPVTRDMEEFDLAIREADLVEPAVQGNWFT
        GV VEV CVYASNSN++RR+LWRRLVEITS WSSP VVMGDFNAI VH EA G SP+  +ME+FDLA R+ADLVEP+VQGNWFT
Subjt:  GVTVEVLCVYASNSNMDRRVLWRRLVEITSSWSSPGVVMGDFNAISVHSEACGRSPVTRDMEEFDLAIREADLVEPAVQGNWFT

TYK18951.1 uncharacterized protein E5676_scaffold418G00380 [Cucumis melo var. makuwa] | e value: 2.3e-132 | % identity: 44.49
Query:  GKNMVATKVGLQEAKGVAARGTRTD--GTLEPTGSG--ARVGSPVMG----PVSSGLPVEKYISGLLNRPHLINQKGMGQVDGKVSTDSGLHCRPNDSLG
        G +++   V  +  +G    G R D  G+  P       R GS V G     +SSG  +E       N             D   +  SG  C  N+   
Subjt:  GKNMVATKVGLQEAKGVAARGTRTD--GTLEPTGSG--ARVGSPVMG----PVSSGLPVEKYISGLLNRPHLINQKGMGQVDGKVSTDSGLHCRPNDSLG

Query:  PIENSVVGPQLFQKVDSRNIDSWASLFGSSSGNGLPYTPPSLVGSKLVVVPSEEIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEIPTITML
         + N V      Q +DS++  +WASLFG+SS   L YT P ++G K+VV P EE+I QG+++WENSLVGQL+D+ LPY VIQ L+EKIWGKIE+P IT+L
Subjt:  PIENSVVGPQLFQKVDSRNIDSWASLFGSSSGNGLPYTPPSLVGSKLVVVPSEEIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEIPTITML

Query:  ENGLICFQFRRPNSVERILSRG---------------HGILVGNLVFNSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVE
        EN LICFQFRR  SVE ILSRG                GI+    VFNSVPVWI+LG++PMELWTE+G+AV+ASA+ KP+SLDLATK+RRRLS+ARVCVE
Subjt:  ENGLICFQFRRPNSVERILSRG---------------HGILVGNLVFNSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVE

Query:  VEGGADLPSEVTVNLRGVECSVPVTYEWKPRMCNSCHSFGHSAGKCPQKETS----------------------------QLEEGEIQGSPSRQVSPTNN
        +E G+++P+E+TV+LRGV+ +V V YEWKPR CN C +FGHS   C +   S                            QLEEGEI+ SP+R  S    
Subjt:  VEGGADLPSEVTVNLRGVECSVPVTYEWKPRMCNSCHSFGHSAGKCPQKETS----------------------------QLEEGEIQGSPSRQVSPTNN

Query:  GGGKKKDFTTVTRKKRVLVSVRDKGKGKSMQAVQNSFGSLSDLSEGENWALALRVSTPPPLQIVGGDGGMPRLSPNGGPMGISMDETYMISWCSWNIRGL
        G GK  +FT VTR+K  LVSVRD+GK  S                            PPPLQ+   D G   L  NG                       
Subjt:  GGGKKKDFTTVTRKKRVLVSVRDKGKGKSMQAVQNSFGSLSDLSEGENWALALRVSTPPPLQIVGGDGGMPRLSPNGGPMGISMDETYMISWCSWNIRGL

Query:  NDPVKCRAVSDFLRVSSVGFCCLLETRISEGNFCSVSGRFGDAWSYSCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFVSGVIVDLHSGVTVEVLCVYAS
           +K +AV DFL  SSVGFCCLLETR+ EGNF SVS RF ++W YSCSYS SGVGRI VMWKK+RF F+ +V+DEQF++                    
Subjt:  NDPVKCRAVSDFLRVSSVGFCCLLETRISEGNFCSVSGRFGDAWSYSCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFVSGVIVDLHSGVTVEVLCVYAS

Query:  NSNMDRRVLWRRLVEITSSWSSPGVVMGDFNAISVHSEACGRSPVTRDMEEFDLAIREADLVEPAVQGNWFT
                               GVVMGDFNAI V+SEA G SP+  +ME+FDLAIR+ DLVEP VQGNWFT
Subjt:  NSNMDRRVLWRRLVEITSSWSSPGVVMGDFNAISVHSEACGRSPVTRDMEEFDLAIREADLVEPAVQGNWFT

TYK28099.1 uncharacterized protein E5676_scaffold1467G00020 [Cucumis melo var. makuwa] | e value: 5.3e-121 | % identity: 46.79
Query:  KVDSRNIDSWASLFGSSSGNGLPYTPPSLVGSKLVVVPSEEIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEIPTITMLENGLICFQFRRPN
        K+ + +  +WASLFG+SS   L YTPP  +G K+VV+P EE+I QG+++WENSLVGQL+DA LPYAVIQRL                             
Subjt:  KVDSRNIDSWASLFGSSSGNGLPYTPPSLVGSKLVVVPSEEIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEIPTITMLENGLICFQFRRPN

Query:  SVERILSRGHGILVGNLVFNSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVEVEGGADLPSEVTVNLRGVECSVPVTYEW
                                                         KP+SLD ATKKRRRLS+ARVCVE+EGG+++ +E+TV+LRGV+ +V V YEW
Subjt:  SVERILSRGHGILVGNLVFNSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVEVEGGADLPSEVTVNLRGVECSVPVTYEW

Query:  KPRMCNSCHSFGHSAGKCPQKETS----------------------------QLEEGEIQGSPSRQVSPTNNGGGKKKDFTTVTRKKRVLVSVRDKGKGK
        KPR CN C +FGHS+ KC +   S                            Q+E+GEI+ SP+R  S    G GK  +FT VTRKK  LVS+RD  +GK
Subjt:  KPRMCNSCHSFGHSAGKCPQKETS----------------------------QLEEGEIQGSPSRQVSPTNNGGGKKKDFTTVTRKKRVLVSVRDKGKGK

Query:  SMQAVQ-NSFGSLSDLSEGENWALALRVSTPPPLQIVGGDGGMPRLSPNGGPMGISMDETYMISWCSWNIRGLNDPVKCRAVSDFLRVSSVGFCCLLETR
        SM+ +  NSFGSL ++ + + WAL++   + PPLQ+   D G   LS      GIS                                S VGFCCLLETR
Subjt:  SMQAVQ-NSFGSLSDLSEGENWALALRVSTPPPLQIVGGDGGMPRLSPNGGPMGISMDETYMISWCSWNIRGLNDPVKCRAVSDFLRVSSVGFCCLLETR

Query:  ISEGNFCSVSGRFGDAWSYSCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFVSGVIVDLHSGVTVEVLCVYASNSNMDRRVLWRRLVEITSSWSSPGVVM
        + EGNF SVS RF ++W YSCSYS SGVGRI VMWKK+RF F+  V+DEQFV+G + DL SGV VEV CVYASNSN++RR+LW RLVEITS+WSSPGVVM
Subjt:  ISEGNFCSVSGRFGDAWSYSCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFVSGVIVDLHSGVTVEVLCVYASNSNMDRRVLWRRLVEITSSWSSPGVVM

Query:  GDFNAISVHSEACGRSPVTRDMEEFDLAIREADLVEPAVQGNWFT
         DFNAI VHSEA   SP+  +ME+F+LAIR+ADLVEP+VQGNWFT
Subjt:  GDFNAISVHSEACGRSPVTRDMEEFDLAIREADLVEPAVQGNWFT

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SPE5 Reverse transcriptase domain-containing protein | e value: 3.0e-122 | % identity: 43.77
Query:  GSPVMGPVSSGLPVEKYISGLLNRPHLINQKGMGQVD-------------GKVSTDSGLHCRPNDSLGPIENSVVGPQLFQKVDSRNIDSWASLFGSSSG
        G  V+GP  +GL VE   S +L    +  +   GQ D             G V++ S L  +  +    +EN +       K+ + +  +WASLFG+SS 
Subjt:  GSPVMGPVSSGLPVEKYISGLLNRPHLINQKGMGQVD-------------GKVSTDSGLHCRPNDSLGPIENSVVGPQLFQKVDSRNIDSWASLFGSSSG

Query:  NGLPYTPPSLVGSKLVVVPSEEIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEIPTITMLENGLICFQFRRPNSVERILSRGHGILVGNLVF
          L YTPP  +G K+VV+P EE+I QG+++WENSLVGQL+DA LPYAVIQRL                                                
Subjt:  NGLPYTPPSLVGSKLVVVPSEEIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEIPTITMLENGLICFQFRRPNSVERILSRGHGILVGNLVF

Query:  NSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVEVEGGADLPSEVTVNLRGVECSVPVTYEWKPRMCNSCHSFGHSAGKCP
                                      KP+SLD ATKKRRRLS+ARVCVE+EGG+++ +E+TV+LRGV+ +V V YEWKPR CN C +FGHS+ KC 
Subjt:  NSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVEVEGGADLPSEVTVNLRGVECSVPVTYEWKPRMCNSCHSFGHSAGKCP

Query:  QKETS----------------------------QLEEGEIQGSPSRQVSPTNNGGGKKKDFTTVTRKKRVLVSVRDKGKGKSMQAVQ-NSFGSLSDLSEG
        +   S                            Q+E+GEI+ SP+R  S    G GK  +FT VTRKK  LVS+RD  +GKSM+ +  NSFGSL ++ + 
Subjt:  QKETS----------------------------QLEEGEIQGSPSRQVSPTNNGGGKKKDFTTVTRKKRVLVSVRDKGKGKSMQAVQ-NSFGSLSDLSEG

Query:  ENWALALRVSTPPPLQIVGGDGGMPRLSPNGGPMGISMDETYMISWCSWNIRGLNDPVKCRAVSDFLRVSSVGFCCLLETRISEGNFCSVSGRFGDAWSY
        + WAL++   + PPLQ+   D G   LS      GIS                                S VGFCCLLETR+ EGNF SVS RF ++W Y
Subjt:  ENWALALRVSTPPPLQIVGGDGGMPRLSPNGGPMGISMDETYMISWCSWNIRGLNDPVKCRAVSDFLRVSSVGFCCLLETRISEGNFCSVSGRFGDAWSY

Query:  SCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFVSGVIVDLHSGVTVEVLCVYASNSNMDRRVLWRRLVEITSSWSSPGVVMGDFNAISVHSEACGRSPVT
        SCSYS SGVGRI VMWKK+RF F+  V+DEQFV+G + DL SGV VEV CVYASNSN++RR+LW RLVEITS+WSSPGVVM DFNAI VHSEA   SP+ 
Subjt:  SCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFVSGVIVDLHSGVTVEVLCVYASNSNMDRRVLWRRLVEITSSWSSPGVVMGDFNAISVHSEACGRSPVT

Query:  RDMEEFDLAIREADLVEPAVQGNWFT
         +ME+F+LAIR+ADLVEP+VQGNWFT
Subjt:  RDMEEFDLAIREADLVEPAVQGNWFT

A0A5A7TWG5 Reverse transcriptase domain-containing protein | e value: 7.5e-137 | % identity: 45.45
Query:  GKNMVATKVGLQEAKGVAARGTRTD--GTLEPTGSG--ARVGSPVMG----PVSSGLPVEKYISGLLNRPHLINQKGMGQVDGKVSTDSGLHCRPNDSLG
        G +++   V  +  +G    G R D  G+  P       R GS V G     +SSG  +E       N             D   +  SG  C  N+   
Subjt:  GKNMVATKVGLQEAKGVAARGTRTD--GTLEPTGSG--ARVGSPVMG----PVSSGLPVEKYISGLLNRPHLINQKGMGQVDGKVSTDSGLHCRPNDSLG

Query:  PIENSVVGPQLFQKVDSRNIDSWASLFGSSSGNGLPYTPPSLVGSKLVVVPSEEIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEIPTITML
         + N V      Q +DS++  +WASLFG+SS   L YT P ++G K+VV P E++I QG+++WENSLVGQL+D+ LPY VIQ L+EKIWGKIE+P IT+L
Subjt:  PIENSVVGPQLFQKVDSRNIDSWASLFGSSSGNGLPYTPPSLVGSKLVVVPSEEIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEIPTITML

Query:  ENGLICFQFRRPNSVERILSRG---------------HGILVGNLVFNSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVE
        EN LICFQFRR  SVE ILSRG                GI+    VFNSVPVWI+LG++PMELWTE+G+AV+ASA+ KP+SLDLATK+RRRLS+ARVCVE
Subjt:  ENGLICFQFRRPNSVERILSRG---------------HGILVGNLVFNSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVE

Query:  VEGGADLPSEVTVNLRGVECSVPVTYEWKPRMCNSCHSFGHSAGKCPQKETS----------------------------QLEEGEIQGSPSRQVSPTNN
        +E G+++P+E+TV+LRGV+ +V V YEWKPR CN C +FGHS   C +   S                            QLEEGEI+ SP+R  S    
Subjt:  VEGGADLPSEVTVNLRGVECSVPVTYEWKPRMCNSCHSFGHSAGKCPQKETS----------------------------QLEEGEIQGSPSRQVSPTNN

Query:  GGGKKKDFTTVTRKKRVLVSVRDKGKGKSMQAVQNSFGSLSDLSEGENWALALRVSTPPPLQIVGGDGGMPRLSPNGGPMGISMDETYMISWCSWNIRGL
        G GK  +FT VTR+K  LVSVRD+GK  S                            PPPLQ+   D G   L  NG                       
Subjt:  GGGKKKDFTTVTRKKRVLVSVRDKGKGKSMQAVQNSFGSLSDLSEGENWALALRVSTPPPLQIVGGDGGMPRLSPNGGPMGISMDETYMISWCSWNIRGL

Query:  NDPVKCRAVSDFLRVSSVGFCCLLETRISEGNFCSVSGRFGDAWSYSCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFVSGVIVDLHSGVTVEVLCVYAS
           +K +AV DFL  SSVGFCCLLETR+ EGNF SVS RF ++W YSCSYS SGVGRI VMWKK+RF F+ +V+DEQF++                    
Subjt:  NDPVKCRAVSDFLRVSSVGFCCLLETRISEGNFCSVSGRFGDAWSYSCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFVSGVIVDLHSGVTVEVLCVYAS

Query:  NSNMDRRVLWRRLVEITSSWSSPGVVMGDFNAISVHSEACGRSPVTRDMEEFDLAIREADLVEPAVQGNWF
                     VEITS+WSSPGVVMGDFNAI V+SEA G SP+  +ME+FDLAIR+ DLVEP VQGN F
Subjt:  NSNMDRRVLWRRLVEITSSWSSPGVVMGDFNAISVHSEACGRSPVTRDMEEFDLAIREADLVEPAVQGNWF

A0A5A7V5J2 Non-LTR retroelement reverse transcriptase-like protein | e value: 7.0e-135 | % identity: 54.34
Query:  IPTITMLENGLICFQFRRPNSVERILSRGH---------------GILVGNLVFNSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLS
        +PTIT+LEN LICFQFRRPNSVE ILSRG                GI+  + VFNSVPVWI+LGRIPMELWTE+ +A++AS + KP++LDLATK+  RLS
Subjt:  IPTITMLENGLICFQFRRPNSVERILSRGH---------------GILVGNLVFNSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLS

Query:  FARVCVEVEGGADLPSEVTVNLRGVECSVPVTYEWKPRMCNSCHSFGHSAGKCPQKETS---------------------------------QLEEGEIQ
        +ARVCV++EG  ++ +E+TVNLRGV+ +V V YEWKP+ CN C + GHS GKCP+   S                                 QLEEGEI+
Subjt:  FARVCVEVEGGADLPSEVTVNLRGVECSVPVTYEWKPRMCNSCHSFGHSAGKCPQKETS---------------------------------QLEEGEIQ

Query:  GSPSRQVSPTNNGGGKKKDFTTVTRKKRVLVSVRDKGKGKSMQAVQNSFGSLSDLSEGENWALALRVSTPPPLQIVGGDGGMPRLSPNGGPMGISMDETY
         SP+R  S      GK+ +FT VTRKK  LVSVRD+GK   + A+ NSFGSL ++ + + WAL +   +PPPLQ+  G G +  L               
Subjt:  GSPSRQVSPTNNGGGKKKDFTTVTRKKRVLVSVRDKGKGKSMQAVQNSFGSLSDLSEGENWALALRVSTPPPLQIVGGDGGMPRLSPNGGPMGISMDETY

Query:  MISWCSWNIRGLNDPVKCRAVSDFLRVSSVGFCCLLETRISEGNFCSVSGRFGDAWSYSCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFVSGVIVDLHS
                        K +AVSDFL  SSVGFCCLLETR+ E NF  VS RFG++W YSCSYS SGVGRI V+WKK+RF F+  VVDEQFV+G + DL S
Subjt:  MISWCSWNIRGLNDPVKCRAVSDFLRVSSVGFCCLLETRISEGNFCSVSGRFGDAWSYSCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFVSGVIVDLHS

Query:  GVTVEVLCVYASNSNMDRRVLWRRLVEITSSWSSPGVVMGDFNAISVHSEACGRSPVTRDMEEFDLAIREADLVEPAVQGNWFT
        GV VEV CVYASNSN++RR+LWRRLVEITS WSSP VVMGDFNAI VH EA G SP+  +ME+FDLA R+ADLVEP+VQGNWFT
Subjt:  GVTVEVLCVYASNSNMDRRVLWRRLVEITSSWSSPGVVMGDFNAISVHSEACGRSPVTRDMEEFDLAIREADLVEPAVQGNWFT

A0A5D3D5X6 Reverse transcriptase domain-containing protein | e value: 1.1e-132 | % identity: 44.49
Query:  GKNMVATKVGLQEAKGVAARGTRTD--GTLEPTGSG--ARVGSPVMG----PVSSGLPVEKYISGLLNRPHLINQKGMGQVDGKVSTDSGLHCRPNDSLG
        G +++   V  +  +G    G R D  G+  P       R GS V G     +SSG  +E       N             D   +  SG  C  N+   
Subjt:  GKNMVATKVGLQEAKGVAARGTRTD--GTLEPTGSG--ARVGSPVMG----PVSSGLPVEKYISGLLNRPHLINQKGMGQVDGKVSTDSGLHCRPNDSLG

Query:  PIENSVVGPQLFQKVDSRNIDSWASLFGSSSGNGLPYTPPSLVGSKLVVVPSEEIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEIPTITML
         + N V      Q +DS++  +WASLFG+SS   L YT P ++G K+VV P EE+I QG+++WENSLVGQL+D+ LPY VIQ L+EKIWGKIE+P IT+L
Subjt:  PIENSVVGPQLFQKVDSRNIDSWASLFGSSSGNGLPYTPPSLVGSKLVVVPSEEIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEIPTITML

Query:  ENGLICFQFRRPNSVERILSRG---------------HGILVGNLVFNSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVE
        EN LICFQFRR  SVE ILSRG                GI+    VFNSVPVWI+LG++PMELWTE+G+AV+ASA+ KP+SLDLATK+RRRLS+ARVCVE
Subjt:  ENGLICFQFRRPNSVERILSRG---------------HGILVGNLVFNSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVE

Query:  VEGGADLPSEVTVNLRGVECSVPVTYEWKPRMCNSCHSFGHSAGKCPQKETS----------------------------QLEEGEIQGSPSRQVSPTNN
        +E G+++P+E+TV+LRGV+ +V V YEWKPR CN C +FGHS   C +   S                            QLEEGEI+ SP+R  S    
Subjt:  VEGGADLPSEVTVNLRGVECSVPVTYEWKPRMCNSCHSFGHSAGKCPQKETS----------------------------QLEEGEIQGSPSRQVSPTNN

Query:  GGGKKKDFTTVTRKKRVLVSVRDKGKGKSMQAVQNSFGSLSDLSEGENWALALRVSTPPPLQIVGGDGGMPRLSPNGGPMGISMDETYMISWCSWNIRGL
        G GK  +FT VTR+K  LVSVRD+GK  S                            PPPLQ+   D G   L  NG                       
Subjt:  GGGKKKDFTTVTRKKRVLVSVRDKGKGKSMQAVQNSFGSLSDLSEGENWALALRVSTPPPLQIVGGDGGMPRLSPNGGPMGISMDETYMISWCSWNIRGL

Query:  NDPVKCRAVSDFLRVSSVGFCCLLETRISEGNFCSVSGRFGDAWSYSCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFVSGVIVDLHSGVTVEVLCVYAS
           +K +AV DFL  SSVGFCCLLETR+ EGNF SVS RF ++W YSCSYS SGVGRI VMWKK+RF F+ +V+DEQF++                    
Subjt:  NDPVKCRAVSDFLRVSSVGFCCLLETRISEGNFCSVSGRFGDAWSYSCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFVSGVIVDLHSGVTVEVLCVYAS

Query:  NSNMDRRVLWRRLVEITSSWSSPGVVMGDFNAISVHSEACGRSPVTRDMEEFDLAIREADLVEPAVQGNWFT
                               GVVMGDFNAI V+SEA G SP+  +ME+FDLAIR+ DLVEP VQGNWFT
Subjt:  NSNMDRRVLWRRLVEITSSWSSPGVVMGDFNAISVHSEACGRSPVTRDMEEFDLAIREADLVEPAVQGNWFT

A0A5D3DXE4 Reverse transcriptase domain-containing protein | e value: 2.6e-121 | % identity: 46.79
Query:  KVDSRNIDSWASLFGSSSGNGLPYTPPSLVGSKLVVVPSEEIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEIPTITMLENGLICFQFRRPN
        K+ + +  +WASLFG+SS   L YTPP  +G K+VV+P EE+I QG+++WENSLVGQL+DA LPYAVIQRL                             
Subjt:  KVDSRNIDSWASLFGSSSGNGLPYTPPSLVGSKLVVVPSEEIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEIPTITMLENGLICFQFRRPN

Query:  SVERILSRGHGILVGNLVFNSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVEVEGGADLPSEVTVNLRGVECSVPVTYEW
                                                         KP+SLD ATKKRRRLS+ARVCVE+EGG+++ +E+TV+LRGV+ +V V YEW
Subjt:  SVERILSRGHGILVGNLVFNSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVEVEGGADLPSEVTVNLRGVECSVPVTYEW

Query:  KPRMCNSCHSFGHSAGKCPQKETS----------------------------QLEEGEIQGSPSRQVSPTNNGGGKKKDFTTVTRKKRVLVSVRDKGKGK
        KPR CN C +FGHS+ KC +   S                            Q+E+GEI+ SP+R  S    G GK  +FT VTRKK  LVS+RD  +GK
Subjt:  KPRMCNSCHSFGHSAGKCPQKETS----------------------------QLEEGEIQGSPSRQVSPTNNGGGKKKDFTTVTRKKRVLVSVRDKGKGK

Query:  SMQAVQ-NSFGSLSDLSEGENWALALRVSTPPPLQIVGGDGGMPRLSPNGGPMGISMDETYMISWCSWNIRGLNDPVKCRAVSDFLRVSSVGFCCLLETR
        SM+ +  NSFGSL ++ + + WAL++   + PPLQ+   D G   LS      GIS                                S VGFCCLLETR
Subjt:  SMQAVQ-NSFGSLSDLSEGENWALALRVSTPPPLQIVGGDGGMPRLSPNGGPMGISMDETYMISWCSWNIRGLNDPVKCRAVSDFLRVSSVGFCCLLETR

Query:  ISEGNFCSVSGRFGDAWSYSCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFVSGVIVDLHSGVTVEVLCVYASNSNMDRRVLWRRLVEITSSWSSPGVVM
        + EGNF SVS RF ++W YSCSYS SGVGRI VMWKK+RF F+  V+DEQFV+G + DL SGV VEV CVYASNSN++RR+LW RLVEITS+WSSPGVVM
Subjt:  ISEGNFCSVSGRFGDAWSYSCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFVSGVIVDLHSGVTVEVLCVYASNSNMDRRVLWRRLVEITSSWSSPGVVM

Query:  GDFNAISVHSEACGRSPVTRDMEEFDLAIREADLVEPAVQGNWFT
         DFNAI VHSEA   SP+  +ME+F+LAIR+ADLVEP+VQGNWFT
Subjt:  GDFNAISVHSEACGRSPVTRDMEEFDLAIREADLVEPAVQGNWFT

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein | e value: 1.7e-05 | % identity: 21.97
Query:  NIRGLNDPVKCRAVSDFLRVSSVGFCCLLETRIS--EGNFCSVSGRFGDAWSYSCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFVSGVIVDLHSGVTVE
        N+ GLN P+K   ++ +++      CC+ ET ++  + +   + G +   +  +    ++GV     +   D+ DF P+ + ++   G  + +   +  E
Subjt:  NIRGLNDPVKCRAVSDFLRVSSVGFCCLLETRIS--EGNFCSVSGRFGDAWSYSCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFVSGVIVDLHSGVTVE

Query:  ---VLCVYASNSNMDRRVLWRRLVEITSSWSSPGVVMGDFNAISVHSEACGRSPVTRDMEEFDLAIREADLVE
           +L +YA N+    R + + L ++     S  ++MGDFN      +   R  V +D +E + A+ + DL++
Subjt:  ---VLCVYASNSNMDRRVLWRRLVEITSSWSSPGVVMGDFNAISVHSEACGRSPVTRDMEEFDLAIREADLVE

P11369 LINE-1 retrotransposable element ORF2 protein | e value: 1.7e-05 | % identity: 22.95
Query:  SWCSWNIRGLNDPVKCRAVSDFLRVSSVGFCCLLETRISE--GNFCSVSGRFGDAWSYSCSYSRSGVGRIS--VMWKKDRFDFTPSVVDEQ------FVS
        S  S NI GLN P+K   ++D+L      FCCL ET + E   ++  V G       +   +  +G+ + +   +   D+ DF P V+ +        + 
Subjt:  SWCSWNIRGLNDPVKCRAVSDFLRVSSVGFCCLLETRISE--GNFCSVSGRFGDAWSYSCSYSRSGVGRIS--VMWKKDRFDFTPSVVDEQ------FVS

Query:  GVIVDLHSGVTVEVLCVYASNSNMDRRVLWRRLVEITSSWSSPGVVMGDFNAISVHSEACGRSPVTRDMEEFDLAIREADLVE
        G I+       + +L +YA N+      +   LV++ +  +   +++GDFN      +   +  + RD  +    +++ DL +
Subjt:  GVIVDLHSGVTVEVLCVYASNSNMDRRVLWRRLVEITSSWSSPGVVMGDFNAISVHSEACGRSPVTRDMEEFDLAIREADLVE

Arabidopsis top hits (e value, % identity, alignment)
AT2G01050.1 zinc ion binding;nucleic acid binding | e value: 1.4e-15 | % identity: 22.38
Query:  IDSWASLFGSSSGNGLPYTPPSLVGSKL-----------------VVVPSEEIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEIPTITMLEN
        ++SWA+    S+G G+   P  ++  +                  V+   EE++     +W+  ++ +++ + +P +V+ R + ++W    + T+  L  
Subjt:  IDSWASLFGSSSGNGLPYTPPSLVGSKL-----------------VVVPSEEIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEIPTITMLEN

Query:  GLICFQFRRPNSVERILSRGHGILVGNLV---------------FNSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVEVE
             +F         L+ G   ++GN +                 + PVW++L  IP   +    +  IA  + +PL +D+ T    +  FARVC+EV 
Subjt:  GLICFQFRRPNSVERILSRGHGILVGNLV---------------FNSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVEVE

Query:  GGADLPSEVTVNLRGVECSVPVTYEWKPRMCNSCHSFGHSAGKCPQKETSQLEEGEIQGSPSRQVSPTNNGGGKKKDFTTVTRKKR
            L   V +N         V YE   ++C+SC  +GH    CP+    ++  G  +    R V P    G     FT V R  R
Subjt:  GGADLPSEVTVNLRGVECSVPVTYEWKPRMCNSCHSFGHSAGKCPQKETSQLEEGEIQGSPSRQVSPTNNGGGKKKDFTTVTRKKR

AT2G07760.1 Zinc knuckle (CCHC-type) family protein | e value: 1.2e-09 | % identity: 30.97
Query:  NSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVEVEGGADLPSEV-TVNLRGVECSVPVTYEWKPRMCNSCHSFGHSAGKC
        +++PVW+ L  IP  L++  GI+ IAS +  P++          +S A + VEVE     P  +  V+ +G    V V Y W P  C  C   GH A +C
Subjt:  NSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVEVEGGADLPSEV-TVNLRGVECSVPVTYEWKPRMCNSCHSFGHSAGKC

Query:  PQKETSQLEEGEI
         +   +  +  EI
Subjt:  PQKETSQLEEGEI

AT5G32613.1 Zinc knuckle (CCHC-type) family protein | e value: 4.2e-07 | % identity: 29.31
Query:  WIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVEVEGGADLPSEVTV-NLRGVECSVPVTYEWKPRMCNSCHSFGHSAGKCPQKET
        W  L  +P +L++  GI+VIAS I +PL  + +      +   +V V    G  LP  + V +++G    V VTY   P  C +C  +GH   +C +   
Subjt:  WIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVEVEGGADLPSEVTV-NLRGVECSVPVTYEWKPRMCNSCHSFGHSAGKCPQKET

Query:  SQLEEGEIQGSPSRQV
         +L   +   S S++V
Subjt:  SQLEEGEIQGSPSRQV


Sequences
CDS sequence
ATGGTGATTAAAAGACCAATATGGCGTGGAATTAACCTGAAGGGCCTCGGAAAAGGAGTTTTATCCGGCAAAAACATGGTAGCGACAAAGGTTGGGTTGCAAGAGGCTAA
AGGGGTGGCGGCGCGAGGTACACGTACGGATGGTACCCTAGAACCGACCGGTTCAGGGGCCCGGGTGGGTTCACCGGTTATGGGTCCGGTTTCATCTGGGCTTCCAGTTG
AGAAATATATAAGTGGGCTTCTTAATAGGCCTCATTTAATTAATCAGAAGGGGATGGGTCAGGTAGATGGAAAGGTTTCTACAGATTCTGGGTTGCATTGTAGGCCAAAT
GATAGTTTGGGTCCAATTGAGAATTCGGTGGTTGGGCCTCAATTATTTCAGAAGGTTGATTCAAGGAACATTGATTCGTGGGCATCCCTTTTTGGTTCTTCTTCAGGAAA
TGGCCTTCCGTATACTCCACCATCTTTGGTTGGATCGAAATTAGTGGTTGTTCCTTCGGAAGAGATTATTGCGCAAGGTGTTCGGATGTGGGAAAACTCTTTAGTGGGCC
AACTTGTTGACGCTACGTTGCCATATGCAGTGATTCAACGGCTTATCGAGAAAATTTGGGGGAAAATCGAAATACCAACCATTACGATGTTAGAGAATGGGCTTATTTGC
TTTCAATTTCGTCGTCCCAATTCGGTAGAGAGGATTCTATCCCGTGGCCATGGCATCTTGGTGGGAAACCTAGTTTTTAATTCTGTTCCTGTGTGGATCAAATTGGGTCG
TATTCCCATGGAGTTGTGGACTGAGTCAGGTATTGCAGTCATTGCTAGTGCTATTGATAAACCTCTTTCTTTAGATTTGGCCACTAAGAAGAGACGTAGACTGTCGTTTG
CTAGGGTGTGTGTTGAAGTAGAAGGGGGTGCTGATTTGCCTTCTGAGGTCACAGTTAATTTGAGGGGTGTGGAATGCAGTGTACCGGTTACTTATGAGTGGAAACCCCGT
ATGTGTAATTCATGTCATTCGTTTGGTCATTCTGCTGGCAAGTGCCCTCAGAAGGAGACGTCTCAGTTAGAGGAAGGTGAGATTCAAGGTTCTCCGAGTAGGCAGGTGTC
GCCGACTAATAATGGTGGGGGTAAGAAAAAGGATTTTACTACTGTGACTCGTAAAAAGAGAGTATTGGTTTCAGTGAGAGACAAAGGGAAAGGGAAGAGTATGCAGGCTG
TGCAGAACTCTTTTGGTAGTCTTTCTGATTTGAGTGAGGGGGAAAATTGGGCGTTGGCCTTACGGGTTAGTACGCCTCCTCCCTTACAGATAGTGGGTGGTGATGGTGGC
ATGCCTAGGTTGAGTCCGAATGGTGGTCCTATGGGGATAAGTATGGATGAGACTTACATGATTAGTTGGTGTTCGTGGAATATACGAGGTCTTAATGACCCGGTGAAGTG
CAGGGCGGTGAGTGATTTCTTGAGGGTTTCCTCTGTTGGATTTTGTTGTCTTCTGGAGACAAGAATTAGTGAGGGAAATTTTTGTTCTGTTTCTGGGAGGTTTGGGGACG
CTTGGAGTTACTCTTGTAGCTATAGCAGAAGTGGTGTGGGTCGGATTTCGGTGATGTGGAAAAAGGATAGGTTTGATTTTACTCCTAGCGTGGTGGATGAGCAGTTTGTT
TCAGGTGTGATTGTTGATTTGCATTCTGGTGTGACTGTGGAGGTGTTATGTGTTTATGCCTCTAATAGTAATATGGACCGTCGTGTGCTTTGGCGTCGGTTAGTTGAGAT
CACTTCTAGTTGGTCGAGTCCAGGTGTGGTTATGGGAGATTTTAATGCAATTTCAGTGCACTCTGAAGCTTGTGGTAGGAGTCCGGTTACTAGAGATATGGAGGAATTTG
ATCTTGCTATTCGCGAGGCTGACTTGGTTGAGCCAGCTGTTCAGGGAAACTGGTTCACTTAG
mRNA sequence
ATGGTGATTAAAAGACCAATATGGCGTGGAATTAACCTGAAGGGCCTCGGAAAAGGAGTTTTATCCGGCAAAAACATGGTAGCGACAAAGGTTGGGTTGCAAGAGGCTAA
AGGGGTGGCGGCGCGAGGTACACGTACGGATGGTACCCTAGAACCGACCGGTTCAGGGGCCCGGGTGGGTTCACCGGTTATGGGTCCGGTTTCATCTGGGCTTCCAGTTG
AGAAATATATAAGTGGGCTTCTTAATAGGCCTCATTTAATTAATCAGAAGGGGATGGGTCAGGTAGATGGAAAGGTTTCTACAGATTCTGGGTTGCATTGTAGGCCAAAT
GATAGTTTGGGTCCAATTGAGAATTCGGTGGTTGGGCCTCAATTATTTCAGAAGGTTGATTCAAGGAACATTGATTCGTGGGCATCCCTTTTTGGTTCTTCTTCAGGAAA
TGGCCTTCCGTATACTCCACCATCTTTGGTTGGATCGAAATTAGTGGTTGTTCCTTCGGAAGAGATTATTGCGCAAGGTGTTCGGATGTGGGAAAACTCTTTAGTGGGCC
AACTTGTTGACGCTACGTTGCCATATGCAGTGATTCAACGGCTTATCGAGAAAATTTGGGGGAAAATCGAAATACCAACCATTACGATGTTAGAGAATGGGCTTATTTGC
TTTCAATTTCGTCGTCCCAATTCGGTAGAGAGGATTCTATCCCGTGGCCATGGCATCTTGGTGGGAAACCTAGTTTTTAATTCTGTTCCTGTGTGGATCAAATTGGGTCG
TATTCCCATGGAGTTGTGGACTGAGTCAGGTATTGCAGTCATTGCTAGTGCTATTGATAAACCTCTTTCTTTAGATTTGGCCACTAAGAAGAGACGTAGACTGTCGTTTG
CTAGGGTGTGTGTTGAAGTAGAAGGGGGTGCTGATTTGCCTTCTGAGGTCACAGTTAATTTGAGGGGTGTGGAATGCAGTGTACCGGTTACTTATGAGTGGAAACCCCGT
ATGTGTAATTCATGTCATTCGTTTGGTCATTCTGCTGGCAAGTGCCCTCAGAAGGAGACGTCTCAGTTAGAGGAAGGTGAGATTCAAGGTTCTCCGAGTAGGCAGGTGTC
GCCGACTAATAATGGTGGGGGTAAGAAAAAGGATTTTACTACTGTGACTCGTAAAAAGAGAGTATTGGTTTCAGTGAGAGACAAAGGGAAAGGGAAGAGTATGCAGGCTG
TGCAGAACTCTTTTGGTAGTCTTTCTGATTTGAGTGAGGGGGAAAATTGGGCGTTGGCCTTACGGGTTAGTACGCCTCCTCCCTTACAGATAGTGGGTGGTGATGGTGGC
ATGCCTAGGTTGAGTCCGAATGGTGGTCCTATGGGGATAAGTATGGATGAGACTTACATGATTAGTTGGTGTTCGTGGAATATACGAGGTCTTAATGACCCGGTGAAGTG
CAGGGCGGTGAGTGATTTCTTGAGGGTTTCCTCTGTTGGATTTTGTTGTCTTCTGGAGACAAGAATTAGTGAGGGAAATTTTTGTTCTGTTTCTGGGAGGTTTGGGGACG
CTTGGAGTTACTCTTGTAGCTATAGCAGAAGTGGTGTGGGTCGGATTTCGGTGATGTGGAAAAAGGATAGGTTTGATTTTACTCCTAGCGTGGTGGATGAGCAGTTTGTT
TCAGGTGTGATTGTTGATTTGCATTCTGGTGTGACTGTGGAGGTGTTATGTGTTTATGCCTCTAATAGTAATATGGACCGTCGTGTGCTTTGGCGTCGGTTAGTTGAGAT
CACTTCTAGTTGGTCGAGTCCAGGTGTGGTTATGGGAGATTTTAATGCAATTTCAGTGCACTCTGAAGCTTGTGGTAGGAGTCCGGTTACTAGAGATATGGAGGAATTTG
ATCTTGCTATTCGCGAGGCTGACTTGGTTGAGCCAGCTGTTCAGGGAAACTGGTTCACTTAG
Protein sequence
MVIKRPIWRGINLKGLGKGVLSGKNMVATKVGLQEAKGVAARGTRTDGTLEPTGSGARVGSPVMGPVSSGLPVEKYISGLLNRPHLINQKGMGQVDGKVSTDSGLHCRPN
DSLGPIENSVVGPQLFQKVDSRNIDSWASLFGSSSGNGLPYTPPSLVGSKLVVVPSEEIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEIPTITMLENGLIC
FQFRRPNSVERILSRGHGILVGNLVFNSVPVWIKLGRIPMELWTESGIAVIASAIDKPLSLDLATKKRRRLSFARVCVEVEGGADLPSEVTVNLRGVECSVPVTYEWKPR
MCNSCHSFGHSAGKCPQKETSQLEEGEIQGSPSRQVSPTNNGGGKKKDFTTVTRKKRVLVSVRDKGKGKSMQAVQNSFGSLSDLSEGENWALALRVSTPPPLQIVGGDGG
MPRLSPNGGPMGISMDETYMISWCSWNIRGLNDPVKCRAVSDFLRVSSVGFCCLLETRISEGNFCSVSGRFGDAWSYSCSYSRSGVGRISVMWKKDRFDFTPSVVDEQFV
SGVIVDLHSGVTVEVLCVYASNSNMDRRVLWRRLVEITSSWSSPGVVMGDFNAISVHSEACGRSPVTRDMEEFDLAIREADLVEPAVQGNWFT