; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022829 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022829
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionzinc finger BED domain-containing protein RICESLEEPER 2-like
Genome locationchr7:38945686..38951801
RNA-Seq ExpressionLag0022829
SyntenyLag0022829
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR003656 - Zinc finger, BED-type
IPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ACX85638.1 putative transposase [Cucumis melo]2.6e-12043.22Show/hide
Query:  TPNLGTSSKVGLLRKRKPTKPASDACEHFIRVEGCDPEYPRAACKYCGATYACDSKRNGTTNMKRHLEKCKMYTSKSDDYVEGERNSE------------
        T N   SS V  L KRKP KP S   EHFI+VEGCDP+YPRAACK+CGA+YACDSKRNGTTN+KRHLEKCKMY +  +D VEGE +SE            
Subjt:  TPNLGTSSKVGLLRKRKPTKPASDACEHFIRVEGCDPEYPRAACKYCGATYACDSKRNGTTNMKRHLEKCKMYTSKSDDYVEGERNSE------------

Query:  -----------------------------------------------MELAQKNSKFLSN----------------------------------------
                                                        ++  K  K L N                                        
Subjt:  -----------------------------------------------MELAQKNSKFLSN----------------------------------------

Query:  --------CNRKGDTIGRAIEKCLQSWGIDRLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKY
                 N KGDTIGRAIEKCL+ WGIDRLFT+TVDNASSNDVA+ Y VKKFKGRN LVLDGEF+H+RC  HILNLIVSDAL+DLHVSIIRIRN VKY
Subjt:  --------CNRKGDTIGRAIEKCLQSWGIDRLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKY

Query:  VRSSPARLQTLKDFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTFLDVTLKFSVSMS--
        VRSSPARLQ  KDFAKEDK+STK+CL+MDV TRWNSTFTMLDGAIK QKTFERLEEHD  YLPK +IP  EDWDNAKVFV+FLKTF +VT+KFS SMS  
Subjt:  VRSSPARLQTLKDFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTFLDVTLKFSVSMS--

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------IISQVAR
                                                                                                     IISQVAR
Subjt:  ---------------------------------------------------------------------------------------------IISQVAR

Query:  DIYSIPISTVQYESTFSTGGRVLDCFWCSLTPQTAEALICAQNWIQSKPLDDMSEEIDGAEEID
        DIYSIPISTV  ES FSTGGRVLD F  SLTPQTAEALICAQNWIQSKPLDDM+EEIDGAEEID
Subjt:  DIYSIPISTVQYESTFSTGGRVLDCFWCSLTPQTAEALICAQNWIQSKPLDDMSEEIDGAEEID

KAA0026183.1 putative transposase [Cucumis melo var. makuwa]6.2e-13051.83Show/hide
Query:  TPNLGTSSKVGLLRKRKPTKPASDACEHFIRVEGCDPEYPRAACKYCGATYACDSKRNGTTNMKRHLEKCKMYTSKSDDYVEGERNSE------------
        T N   SS V  L KRKP KP S   EHFI+VEGCDP+YPRAACK+CGA+YACDSKRNGTTN+KRHLEKCKMY +  +D VEGE +SE            
Subjt:  TPNLGTSSKVGLLRKRKPTKPASDACEHFIRVEGCDPEYPRAACKYCGATYACDSKRNGTTNMKRHLEKCKMYTSKSDDYVEGERNSE------------

Query:  -----------------------------------------------MELAQKNSKFLSN----------------------------------------
                                                        ++  K  K L N                                        
Subjt:  -----------------------------------------------MELAQKNSKFLSN----------------------------------------

Query:  --------CNRKGDTIGRAIEKCLQSWGIDRLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKY
                 N KGDTIGRAIEKCL+ WGIDRLFT+TVDNASSNDVA+ Y VKKFKGRN LVLDGEF+H+RC  HILNLIVSDAL+DLHVSIIRIRN VKY
Subjt:  --------CNRKGDTIGRAIEKCLQSWGIDRLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKY

Query:  VRSSPARLQTLKDFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTF--------------
        VRSSPARLQ  KDFAKEDK+STK+CL+MDV TRWNSTFTMLDGAIK QKTFERLEEHD  YLPK +IP  EDWDNAKVFV+FLKTF              
Subjt:  VRSSPARLQTLKDFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTF--------------

Query:  ------------------------------------------------------------LDVTLKFSVSMS---IISQVARDIYSIPISTVQYESTFST
                                                                    LD+   + V+ S   IISQVARDIYSIPISTV  ES FST
Subjt:  ------------------------------------------------------------LDVTLKFSVSMS---IISQVARDIYSIPISTVQYESTFST

Query:  GGRVLDCFWCSLTPQTAEALICAQNWIQSKPLDDMSEEIDGAEEID
        GGRVLD F  SLTPQTAEALICAQNWIQSKPLDDM+EEIDGAEEID
Subjt:  GGRVLDCFWCSLTPQTAEALICAQNWIQSKPLDDMSEEIDGAEEID

KAA0060372.1 putative transposase [Cucumis melo var. makuwa]8.7e-12458.54Show/hide
Query:  TPNLGTSSKVGLLRKRKPTKPASDACEHFIRVEGCDPEYPRAACKYCGATYACDSKRNGTTNMKRHLEKCKMYTSKSDDYVEGERNSEMELAQKNSKFLS
        T N   SS V  L KRKP KP S   EHFI+VEGCDP+YPRAACK+CGA+YACDSKR                                           
Subjt:  TPNLGTSSKVGLLRKRKPTKPASDACEHFIRVEGCDPEYPRAACKYCGATYACDSKRNGTTNMKRHLEKCKMYTSKSDDYVEGERNSEMELAQKNSKFLS

Query:  NCNRKGDTIGRAIEKCLQSWGIDRLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKYVRSSPAR
        N N KGDTIGRAIEKCL+ WGIDRLFT+T+DNASSNDV + Y VKKFKGRN LVLDGEF+H+RC  HILNLIVSDAL+DLHVSIIRIRNVVKYVRSSPAR
Subjt:  NCNRKGDTIGRAIEKCLQSWGIDRLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKYVRSSPAR

Query:  LQTLKDFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTF---------------------
        LQ  KDFAKEDK+STK+CL+MDV TRWNSTFTMLDGAIK QKTFERLEEHD  YLPK +IP  EDWDNAKVFV+FLKTF                     
Subjt:  LQTLKDFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTF---------------------

Query:  -----------------------------------------------------LDVTLKFSVSMS---IISQVARDIYSIPISTVQYESTFSTGGRVLDC
                                                             LD+   + V+ S   IISQVARDIYSIPISTV  ES FSTGGRVLD 
Subjt:  -----------------------------------------------------LDVTLKFSVSMS---IISQVARDIYSIPISTVQYESTFSTGGRVLDC

Query:  FWCSLTPQTAEALICAQNWIQSKPLDDMSEEIDGAEEID
        F  SLTPQTAEALICAQNWIQSKPLDDM+EEIDGAEEID
Subjt:  FWCSLTPQTAEALICAQNWIQSKPLDDMSEEIDGAEEID

TYK06161.1 putative transposase [Cucumis melo var. makuwa]3.4e-12851.28Show/hide
Query:  TPNLGTSSKVGLLRKRKPTKPASDACEHFIRVEGCDPEYPRAACKYCGATYACDSKRNGTTNMKRHLEKCKMYTSKSDDYVEGERNSE------------
        T N   SS V  L KRKP KP S   EHFI+VEGCDP+YPRAACK+CG +YACDSKRNGTTN+KRHLEKCKMY +  +D VEGE +SE            
Subjt:  TPNLGTSSKVGLLRKRKPTKPASDACEHFIRVEGCDPEYPRAACKYCGATYACDSKRNGTTNMKRHLEKCKMYTSKSDDYVEGERNSE------------

Query:  -----------------------------------------------MELAQKNSKFLSN----------------------------------------
                                                        ++  K  K L N                                        
Subjt:  -----------------------------------------------MELAQKNSKFLSN----------------------------------------

Query:  --------CNRKGDTIGRAIEKCLQSWGIDRLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKY
                 N KGDTIGRAIEKCL+ WGIDRLFT+TVDNASSNDVA+ Y VKKFKGRN LVLDGEF+H+RC  HILNLIVSDAL+DLHVSIIRIRN VKY
Subjt:  --------CNRKGDTIGRAIEKCLQSWGIDRLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKY

Query:  VRSSPARLQTLKDFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTF--------------
        VRSSPARLQ  KDFAKEDK+STK+CL+MDV TRWNSTFTMLDGAIK QKTFERLEEHD  YLPK +IP  EDWDNAKVFV+FLKTF              
Subjt:  VRSSPARLQTLKDFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTF--------------

Query:  ------------------------------------------------------------LDVTLKFSVSMS---IISQVARDIYSIPISTVQYESTFST
                                                                    LD+   + V+ S   IISQVARDI+SIPISTV  ES FST
Subjt:  ------------------------------------------------------------LDVTLKFSVSMS---IISQVARDIYSIPISTVQYESTFST

Query:  GGRVLDCFWCSLTPQTAEALICAQNWIQSKPLDDMSEEIDGAEEID
        GGRVLD F  SLTPQTAEALICAQNWIQ KPLDDM+EEIDGAEEID
Subjt:  GGRVLDCFWCSLTPQTAEALICAQNWIQSKPLDDMSEEIDGAEEID

TYK30761.1 putative transposase [Cucumis melo var. makuwa]5.8e-12851.47Show/hide
Query:  TPNLGTSSKVGLLRKRKPTKPASDACEHFIRVEGCDPEYPRAACKYCGATYACDSKRNGTTNMKRHLEKCKMYTSKSDDYVEGERNSE------------
        T N   SS V  L KRK  KP   A EHFI+VEGCDP+YPRAACK+C A+YACDSKRNGTTN+KRHLEKCKMY +  +D VEGE +SE            
Subjt:  TPNLGTSSKVGLLRKRKPTKPASDACEHFIRVEGCDPEYPRAACKYCGATYACDSKRNGTTNMKRHLEKCKMYTSKSDDYVEGERNSE------------

Query:  -----------------------------------------------MELAQKNSKFLSN----------------------------------------
                                                        ++  K  K L N                                        
Subjt:  -----------------------------------------------MELAQKNSKFLSN----------------------------------------

Query:  --------CNRKGDTIGRAIEKCLQSWGIDRLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKY
                 N KGDTIGRAIEKCL+ WGIDRLFT+TVDNASSNDVA+ Y VKKFKGRN LVLDGEF+H+RC  HILNLIVSDAL+DLHVSIIRIRN VKY
Subjt:  --------CNRKGDTIGRAIEKCLQSWGIDRLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKY

Query:  VRSSPARLQTLKDFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTF--------------
        VRSSPARLQ  KDFAKEDK+STK+CL+MDV TRWNSTFTMLDGAIK QKTFERLEEHD  YLPK +IP  EDWDNAKVFV+FLKTF              
Subjt:  VRSSPARLQTLKDFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTF--------------

Query:  ------------------------------------------------------------LDVTLKFSVSMS---IISQVARDIYSIPISTVQYESTFST
                                                                    LD+   + V+ S   IISQVARDIYSIPISTV  ES FST
Subjt:  ------------------------------------------------------------LDVTLKFSVSMS---IISQVARDIYSIPISTVQYESTFST

Query:  GGRVLDCFWCSLTPQTAEALICAQNWIQSKPLDDMSEEIDGAEEID
        GGRVLD F  SLTPQTAEALICAQNWIQSKPLDDM+EEIDGAEEID
Subjt:  GGRVLDCFWCSLTPQTAEALICAQNWIQSKPLDDMSEEIDGAEEID

TrEMBL top hitse value%identityAlignment
A0A5A7SNJ1 Putative transposase3.0e-13051.83Show/hide
Query:  TPNLGTSSKVGLLRKRKPTKPASDACEHFIRVEGCDPEYPRAACKYCGATYACDSKRNGTTNMKRHLEKCKMYTSKSDDYVEGERNSE------------
        T N   SS V  L KRKP KP S   EHFI+VEGCDP+YPRAACK+CGA+YACDSKRNGTTN+KRHLEKCKMY +  +D VEGE +SE            
Subjt:  TPNLGTSSKVGLLRKRKPTKPASDACEHFIRVEGCDPEYPRAACKYCGATYACDSKRNGTTNMKRHLEKCKMYTSKSDDYVEGERNSE------------

Query:  -----------------------------------------------MELAQKNSKFLSN----------------------------------------
                                                        ++  K  K L N                                        
Subjt:  -----------------------------------------------MELAQKNSKFLSN----------------------------------------

Query:  --------CNRKGDTIGRAIEKCLQSWGIDRLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKY
                 N KGDTIGRAIEKCL+ WGIDRLFT+TVDNASSNDVA+ Y VKKFKGRN LVLDGEF+H+RC  HILNLIVSDAL+DLHVSIIRIRN VKY
Subjt:  --------CNRKGDTIGRAIEKCLQSWGIDRLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKY

Query:  VRSSPARLQTLKDFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTF--------------
        VRSSPARLQ  KDFAKEDK+STK+CL+MDV TRWNSTFTMLDGAIK QKTFERLEEHD  YLPK +IP  EDWDNAKVFV+FLKTF              
Subjt:  VRSSPARLQTLKDFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTF--------------

Query:  ------------------------------------------------------------LDVTLKFSVSMS---IISQVARDIYSIPISTVQYESTFST
                                                                    LD+   + V+ S   IISQVARDIYSIPISTV  ES FST
Subjt:  ------------------------------------------------------------LDVTLKFSVSMS---IISQVARDIYSIPISTVQYESTFST

Query:  GGRVLDCFWCSLTPQTAEALICAQNWIQSKPLDDMSEEIDGAEEID
        GGRVLD F  SLTPQTAEALICAQNWIQSKPLDDM+EEIDGAEEID
Subjt:  GGRVLDCFWCSLTPQTAEALICAQNWIQSKPLDDMSEEIDGAEEID

A0A5A7UWZ3 Putative transposase4.2e-12458.54Show/hide
Query:  TPNLGTSSKVGLLRKRKPTKPASDACEHFIRVEGCDPEYPRAACKYCGATYACDSKRNGTTNMKRHLEKCKMYTSKSDDYVEGERNSEMELAQKNSKFLS
        T N   SS V  L KRKP KP S   EHFI+VEGCDP+YPRAACK+CGA+YACDSKR                                           
Subjt:  TPNLGTSSKVGLLRKRKPTKPASDACEHFIRVEGCDPEYPRAACKYCGATYACDSKRNGTTNMKRHLEKCKMYTSKSDDYVEGERNSEMELAQKNSKFLS

Query:  NCNRKGDTIGRAIEKCLQSWGIDRLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKYVRSSPAR
        N N KGDTIGRAIEKCL+ WGIDRLFT+T+DNASSNDV + Y VKKFKGRN LVLDGEF+H+RC  HILNLIVSDAL+DLHVSIIRIRNVVKYVRSSPAR
Subjt:  NCNRKGDTIGRAIEKCLQSWGIDRLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKYVRSSPAR

Query:  LQTLKDFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTF---------------------
        LQ  KDFAKEDK+STK+CL+MDV TRWNSTFTMLDGAIK QKTFERLEEHD  YLPK +IP  EDWDNAKVFV+FLKTF                     
Subjt:  LQTLKDFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTF---------------------

Query:  -----------------------------------------------------LDVTLKFSVSMS---IISQVARDIYSIPISTVQYESTFSTGGRVLDC
                                                             LD+   + V+ S   IISQVARDIYSIPISTV  ES FSTGGRVLD 
Subjt:  -----------------------------------------------------LDVTLKFSVSMS---IISQVARDIYSIPISTVQYESTFSTGGRVLDC

Query:  FWCSLTPQTAEALICAQNWIQSKPLDDMSEEIDGAEEID
        F  SLTPQTAEALICAQNWIQSKPLDDM+EEIDGAEEID
Subjt:  FWCSLTPQTAEALICAQNWIQSKPLDDMSEEIDGAEEID

A0A5D3C2L4 Putative transposase1.7e-12851.28Show/hide
Query:  TPNLGTSSKVGLLRKRKPTKPASDACEHFIRVEGCDPEYPRAACKYCGATYACDSKRNGTTNMKRHLEKCKMYTSKSDDYVEGERNSE------------
        T N   SS V  L KRKP KP S   EHFI+VEGCDP+YPRAACK+CG +YACDSKRNGTTN+KRHLEKCKMY +  +D VEGE +SE            
Subjt:  TPNLGTSSKVGLLRKRKPTKPASDACEHFIRVEGCDPEYPRAACKYCGATYACDSKRNGTTNMKRHLEKCKMYTSKSDDYVEGERNSE------------

Query:  -----------------------------------------------MELAQKNSKFLSN----------------------------------------
                                                        ++  K  K L N                                        
Subjt:  -----------------------------------------------MELAQKNSKFLSN----------------------------------------

Query:  --------CNRKGDTIGRAIEKCLQSWGIDRLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKY
                 N KGDTIGRAIEKCL+ WGIDRLFT+TVDNASSNDVA+ Y VKKFKGRN LVLDGEF+H+RC  HILNLIVSDAL+DLHVSIIRIRN VKY
Subjt:  --------CNRKGDTIGRAIEKCLQSWGIDRLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKY

Query:  VRSSPARLQTLKDFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTF--------------
        VRSSPARLQ  KDFAKEDK+STK+CL+MDV TRWNSTFTMLDGAIK QKTFERLEEHD  YLPK +IP  EDWDNAKVFV+FLKTF              
Subjt:  VRSSPARLQTLKDFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTF--------------

Query:  ------------------------------------------------------------LDVTLKFSVSMS---IISQVARDIYSIPISTVQYESTFST
                                                                    LD+   + V+ S   IISQVARDI+SIPISTV  ES FST
Subjt:  ------------------------------------------------------------LDVTLKFSVSMS---IISQVARDIYSIPISTVQYESTFST

Query:  GGRVLDCFWCSLTPQTAEALICAQNWIQSKPLDDMSEEIDGAEEID
        GGRVLD F  SLTPQTAEALICAQNWIQ KPLDDM+EEIDGAEEID
Subjt:  GGRVLDCFWCSLTPQTAEALICAQNWIQSKPLDDMSEEIDGAEEID

A0A5D3E590 Putative transposase2.8e-12851.47Show/hide
Query:  TPNLGTSSKVGLLRKRKPTKPASDACEHFIRVEGCDPEYPRAACKYCGATYACDSKRNGTTNMKRHLEKCKMYTSKSDDYVEGERNSE------------
        T N   SS V  L KRK  KP   A EHFI+VEGCDP+YPRAACK+C A+YACDSKRNGTTN+KRHLEKCKMY +  +D VEGE +SE            
Subjt:  TPNLGTSSKVGLLRKRKPTKPASDACEHFIRVEGCDPEYPRAACKYCGATYACDSKRNGTTNMKRHLEKCKMYTSKSDDYVEGERNSE------------

Query:  -----------------------------------------------MELAQKNSKFLSN----------------------------------------
                                                        ++  K  K L N                                        
Subjt:  -----------------------------------------------MELAQKNSKFLSN----------------------------------------

Query:  --------CNRKGDTIGRAIEKCLQSWGIDRLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKY
                 N KGDTIGRAIEKCL+ WGIDRLFT+TVDNASSNDVA+ Y VKKFKGRN LVLDGEF+H+RC  HILNLIVSDAL+DLHVSIIRIRN VKY
Subjt:  --------CNRKGDTIGRAIEKCLQSWGIDRLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKY

Query:  VRSSPARLQTLKDFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTF--------------
        VRSSPARLQ  KDFAKEDK+STK+CL+MDV TRWNSTFTMLDGAIK QKTFERLEEHD  YLPK +IP  EDWDNAKVFV+FLKTF              
Subjt:  VRSSPARLQTLKDFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTF--------------

Query:  ------------------------------------------------------------LDVTLKFSVSMS---IISQVARDIYSIPISTVQYESTFST
                                                                    LD+   + V+ S   IISQVARDIYSIPISTV  ES FST
Subjt:  ------------------------------------------------------------LDVTLKFSVSMS---IISQVARDIYSIPISTVQYESTFST

Query:  GGRVLDCFWCSLTPQTAEALICAQNWIQSKPLDDMSEEIDGAEEID
        GGRVLD F  SLTPQTAEALICAQNWIQSKPLDDM+EEIDGAEEID
Subjt:  GGRVLDCFWCSLTPQTAEALICAQNWIQSKPLDDMSEEIDGAEEID

D0UIX2 Putative transposase1.3e-12043.22Show/hide
Query:  TPNLGTSSKVGLLRKRKPTKPASDACEHFIRVEGCDPEYPRAACKYCGATYACDSKRNGTTNMKRHLEKCKMYTSKSDDYVEGERNSE------------
        T N   SS V  L KRKP KP S   EHFI+VEGCDP+YPRAACK+CGA+YACDSKRNGTTN+KRHLEKCKMY +  +D VEGE +SE            
Subjt:  TPNLGTSSKVGLLRKRKPTKPASDACEHFIRVEGCDPEYPRAACKYCGATYACDSKRNGTTNMKRHLEKCKMYTSKSDDYVEGERNSE------------

Query:  -----------------------------------------------MELAQKNSKFLSN----------------------------------------
                                                        ++  K  K L N                                        
Subjt:  -----------------------------------------------MELAQKNSKFLSN----------------------------------------

Query:  --------CNRKGDTIGRAIEKCLQSWGIDRLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKY
                 N KGDTIGRAIEKCL+ WGIDRLFT+TVDNASSNDVA+ Y VKKFKGRN LVLDGEF+H+RC  HILNLIVSDAL+DLHVSIIRIRN VKY
Subjt:  --------CNRKGDTIGRAIEKCLQSWGIDRLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKY

Query:  VRSSPARLQTLKDFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTFLDVTLKFSVSMS--
        VRSSPARLQ  KDFAKEDK+STK+CL+MDV TRWNSTFTMLDGAIK QKTFERLEEHD  YLPK +IP  EDWDNAKVFV+FLKTF +VT+KFS SMS  
Subjt:  VRSSPARLQTLKDFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTFLDVTLKFSVSMS--

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------IISQVAR
                                                                                                     IISQVAR
Subjt:  ---------------------------------------------------------------------------------------------IISQVAR

Query:  DIYSIPISTVQYESTFSTGGRVLDCFWCSLTPQTAEALICAQNWIQSKPLDDMSEEIDGAEEID
        DIYSIPISTV  ES FSTGGRVLD F  SLTPQTAEALICAQNWIQSKPLDDM+EEIDGAEEID
Subjt:  DIYSIPISTVQYESTFSTGGRVLDCFWCSLTPQTAEALICAQNWIQSKPLDDMSEEIDGAEEID

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.3e-2532.17Show/hide
Query:  RYKARLVVKGYHQKEGVDYDETFSPVVKKLIVRIVLSLATQYNWDIRQLDVKNDFLHGDLKEKVYMQQPQGFVCN-------------------------
        RYKARLV +G+ QK  +DY+ETF+PV +    R +LSL  QYN  + Q+DVK  FL+G LKE++YM+ PQG  CN                         
Subjt:  RYKARLVVKGYHQKEGVDYDETFSPVVKKLIVRIVLSLATQYNWDIRQLDVKNDFLHGDLKEKVYMQQPQGFVCN-------------------------

Query:  ---------------------EAGSLT---YLLLYLDDNVLTSNDASYVGHLMHWLKSQFDMADICSLSYFMGLEIKRITFGIYVTQTKYTKDLLLKFGM
                             + G++    Y+LLY+DD V+ + D + + +   +L  +F M D+  + +F+G+ I+     IY++Q+ Y K +L KF M
Subjt:  ---------------------EAGSLT---YLLLYLDDNVLTSNDASYVGHLMHWLKSQFDMADICSLSYFMGLEIKRITFGIYVTQTKYTKDLLLKFGM

Query:  VEAKVCSTRCASGSLSSSDDTLCSMEDATT
              ST   S     + + L S ED  T
Subjt:  VEAKVCSTRCASGSLSSSDDTLCSMEDATT

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-2736.2Show/hide
Query:  DCDSCVARYKARLVVKGYHQKEGVDYDETFSPVVKKLIVRIVLSLATQYNWDIRQLDVKNDFLHGDLKEKVYMQQPQGF--------VCNEAGSL-----
        D D  + RYKARLVVKG+ QK+G+D+DE FSPVVK   +R +LSLA   + ++ QLDVK  FLHGDL+E++YM+QP+GF        VC    SL     
Subjt:  DCDSCVARYKARLVVKGYHQKEGVDYDETFSPVVKKLIVRIVLSLATQYNWDIRQLDVKNDFLHGDLKEKVYMQQPQGF--------VCNEAGSL-----

Query:  --------------------TY-----------------LLLYLDDNVLTSNDASYVGHLMHWLKSQFDMADICSLSYFMGLEI--KRITFGIYVTQTKY
                            TY                 LLLY+DD ++   D   +  L   L   FDM D+      +G++I  +R +  ++++Q KY
Subjt:  --------------------TY-----------------LLLYLDDNVLTSNDASYVGHLMHWLKSQFDMADICSLSYFMGLEI--KRITFGIYVTQTKY

Query:  TKDLLLKFGMVEAKVCSTRCA
         + +L +F M  AK  ST  A
Subjt:  TKDLLLKFGMVEAKVCSTRCA

Q6AVI0 Zinc finger BED domain-containing protein RICESLEEPER 22.4e-2330.19Show/hide
Query:  YTSKSDDYVEGERNSEMELAQKNSKFLS-NCNRKGDTIGRAIEKCLQSWGI-DRLFTITVDN-ASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHIL
        Y S +  +++    SE ++ ++   F+  +     + +  AI   L  W + D+LFTIT+DN  SS+D+           +N L+L G+   +RC  HIL
Subjt:  YTSKSDDYVEGERNSEMELAQKNSKFLS-NCNRKGDTIGRAIEKCLQSWGI-DRLFTITVDN-ASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHIL

Query:  NLIVSDALQDLHVSIIRIRNVVKYVRSSPARLQTLKDFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNA
        N +  D +  +H  I  IR  +K++++SP+R +   + A + +I +   L +DV T+WN+T+ ML  A+ +++ F  LE  D  Y    E P  EDW   
Subjt:  NLIVSDALQDLHVSIIRIRNVVKYVRSSPARLQTLKDFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNA

Query:  KVFVEFLKTFLD
        +    +LK   D
Subjt:  KVFVEFLKTFLD

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.3e-3134.65Show/hide
Query:  DSCVARYKARLVVKGYHQKEGVDYDETFSPVVKKLIVRIVLSLATQYNWDIRQLDVKNDFLHGDLKEKVYMQQPQG------------------------
        D  + RYKARLV KGY+Q+ G+DY ETFSPV+K   +RIVL +A   +W IRQLDV N FL G L + VYM QP G                        
Subjt:  DSCVARYKARLVVKGYHQKEGVDYDETFSPVVKKLIVRIVLSLATQYNWDIRQLDVKNDFLHGDLKEKVYMQQPQG------------------------

Query:  -------------------------FVCNEAGSLTYLLLYLDDNVLTSNDASYVGHLMHWLKSQFDMADICSLSYFMGLEIKRITFGIYVTQTKYTKDLL
                                 FV     S+ Y+L+Y+DD ++T ND + + + +  L  +F + D   L YF+G+E KR+  G++++Q +Y  DLL
Subjt:  -------------------------FVCNEAGSLTYLLLYLDDNVLTSNDASYVGHLMHWLKSQFDMADICSLSYFMGLEIKRITFGIYVTQTKYTKDLL

Query:  LKFGMVEAKVCSTRCA-SGSLSSSDDTLCSMEDATTYKSKV-YLDLIKLTSPSL
         +  M+ AK  +T  A S  LS    T   + D T Y+  V  L  +  T P +
Subjt:  LKFGMVEAKVCSTRCA-SGSLSSSDDTLCSMEDATTYKSKV-YLDLIKLTSPSL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-3135.04Show/hide
Query:  DSCVARYKARLVVKGYHQKEGVDYDETFSPVVKKLIVRIVLSLATQYNWDIRQLDVKNDFLHGDLKEKVYMQQPQG------------------------
        D  + RYKARLV KGY+Q+ G+DY ETFSPV+K   +RIVL +A   +W IRQLDV N FL G L ++VYM QP G                        
Subjt:  DSCVARYKARLVVKGYHQKEGVDYDETFSPVVKKLIVRIVLSLATQYNWDIRQLDVKNDFLHGDLKEKVYMQQPQG------------------------

Query:  -------------------------FVCNEAGSLTYLLLYLDDNVLTSNDASYVGHLMHWLKSQFDMADICSLSYFMGLEIKRITFGIYVTQTKYTKDLL
                                 FV     S+ Y+L+Y+DD ++T ND   + H +  L  +F + +   L YF+G+E KR+  G++++Q +YT DLL
Subjt:  -------------------------FVCNEAGSLTYLLLYLDDNVLTSNDASYVGHLMHWLKSQFDMADICSLSYFMGLEIKRITFGIYVTQTKYTKDLL

Query:  LKFGMVEAKVCSTRCA-SGSLSSSDDTLCSMEDATTYKSKV-YLDLIKLTSPSL
         +  M+ AK  +T  A S  L+    T   + D T Y+  V  L  +  T P L
Subjt:  LKFGMVEAKVCSTRCA-SGSLSSSDDTLCSMEDATTYKSKV-YLDLIKLTSPSL

Arabidopsis top hitse value%identityAlignment
AT3G42170.1 BED zinc finger ;hAT family dimerisation domain7.9e-1425.57Show/hide
Query:  DTIGRAIEKCLQSWGID-RLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKYVRSSPARLQTLK
        + +  A+  C+  WG++ +LF +T ++ +SN  A+     +   +N  +LDG+ +   C       +  D L+     I  IR+ VK+V++S +  +   
Subjt:  DTIGRAIEKCLQSWGID-RLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKYVRSSPARLQTLK

Query:  DFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTFLD
        +  ++ ++ ++  LS+D  T+WN+T+ ML  A + ++ F  L+  D  Y    + P  EDW + +    FLK   +
Subjt:  DFAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTFLD

AT3G42170.1 BED zinc finger ;hAT family dimerisation domain4.2e-0730.1Show/hide
Query:  QKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTFLDVTLKFSVSMSIISQVARDIYSIPISTVQYESTFSTGGRVLDCFWCSLTPQTAEALICAQ
        + T + L+    +YL +  +P ++++D            LD   +  +    +S++ARDI SIP+S   ++  F    R +D +  SL P+T EALICA+
Subjt:  QKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTFLDVTLKFSVSMSIISQVARDIYSIPISTVQYESTFSTGGRVLDCFWCSLTPQTAEALICAQ

Query:  NWI
         W+
Subjt:  NWI

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.1e-2832.18Show/hide
Query:  EVVLDCDSCVARYKARLVVKGYHQKEGVDYDETFSPVVKKLIVRIVLSLATQYNWDIRQLDVKNDFLHGDLKEKVYMQQPQGFVCNEAGSL---------
        ++  + D  + RYKARLV KGY Q+EG+D+ ETFSPV K   V+++L+++  YN+ + QLD+ N FL+GDL E++YM+ P G+   +  SL         
Subjt:  EVVLDCDSCVARYKARLVVKGYHQKEGVDYDETFSPVVKKLIVRIVLSLATQYNWDIRQLDVKNDFLHGDLKEKVYMQQPQGFVCNEAGSL---------

Query:  ---------------------------------TY-----------LLLYLDDNVLTSNDASYVGHLMHWLKSQFDMADICSLSYFMGLEIKRITFGIYV
                                         TY           +L+Y+DD ++ SN+ + V  L   LKS F + D+  L YF+GLEI R   GI +
Subjt:  ---------------------------------TY-----------LLLYLDDNVLTSNDASYVGHLMHWLKSQFDMADICSLSYFMGLEIKRITFGIYV

Query:  TQTKYTKDLLLKFGMVEAKVCSTRCASGSLSSSDDTLCSMEDATTYKSKV----YLDLIKL
         Q KY  DLL + G++  K  S      S++ S  +     DA  Y+  +    YL + +L
Subjt:  TQTKYTKDLLLKFGMVEAKVCSTRCASGSLSSSDDTLCSMEDATTYKSKV----YLDLIKL

ATMG00810.1 DNA/RNA polymerases superfamily protein1.5e-0935.59Show/hide
Query:  YLLLYLDDNVLTSNDASYVGHLMHWLKSQFDMADICSLSYFMGLEIKRITFGIYVTQTKYTKDLLLKFGMVEAKVCSTRCASGSLSSSDDTLCSMEDATT
        YLLLY+DD +LT +  + +  L+  L S F M D+  + YF+G++IK    G++++QTKY + +L   GM++ K  ST      L+SS  T     D + 
Subjt:  YLLLYLDDNVLTSNDASYVGHLMHWLKSQFDMADICSLSYFMGLEIKRITFGIYVTQTKYTKDLLLKFGMVEAKVCSTRCASGSLSSSDDTLCSMEDATT

Query:  YKSKV-YLDLIKLTSPSL
        ++S V  L  + LT P +
Subjt:  YKSKV-YLDLIKLTSPSL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.2e-0748.98Show/hide
Query:  LDCDSCVARYKARLVVKGYHQKEGVDYDETFSPVVKKLIVRIVLSLATQ
        L  D  + R KARLV KG+HQ+EG+ + ET+SPVV+   +R +L++A Q
Subjt:  LDCDSCVARYKARLVVKGYHQKEGVDYDETFSPVVKKLIVRIVLSLATQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTTCATCTTTTACAAATGAGATTTCTGAACATAGTCATACCCCTAATCTAGGTACGAGTAGTAAAGTAGGATTGTTAAGAAAAAGGAAACCTACCAAACCAGCTTC
AGATGCCTGTGAGCATTTTATTAGAGTAGAGGGATGTGATCCTGAATATCCTAGAGCTGCTTGTAAATATTGTGGAGCTACATATGCATGTGACTCTAAGAGAAATGGGA
CAACAAATATGAAAAGACACTTAGAGAAATGTAAGATGTATACAAGTAAGTCAGATGATTATGTTGAAGGAGAGAGGAATTCTGAAATGGAACTTGCACAAAAGAATTCT
AAATTTTTGTCAAATTGTAATCGTAAAGGAGACACAATAGGAAGAGCAATTGAAAAATGCTTGCAAAGTTGGGGGATTGATAGACTTTTTACCATCACAGTCGATAATGC
TAGCTCAAATGATGTAGCATTGACATATTTTGTTAAAAAGTTTAAAGGAAGAAATGAGTTAGTGTTGGATGGTGAATTTCTCCATTTGCGATGTTCTACTCATATTCTTA
ATTTGATTGTTAGTGATGCTTTACAAGATTTACACGTGTCTATCATTCGCATTAGAAATGTTGTGAAGTATGTTAGATCATCTCCTGCAAGATTACAAACACTTAAGGAT
TTTGCTAAAGAAGATAAAATTTCGACCAAAAGTTGTCTTAGTATGGATGTTGCAACACGATGGAATTCAACTTTCACTATGTTGGATGGAGCAATTAAATTTCAAAAGAC
TTTTGAAAGATTAGAAGAACACGACCAAAGATATTTGCCGAAAGGTGAGATACCTATTATTGAAGATTGGGATAATGCTAAAGTGTTTGTGGAGTTTCTGAAGACTTTTT
TAGATGTTACTTTAAAGTTCTCAGTGTCTATGTCTATTATCAGCCAAGTAGCCAGGGACATCTATAGCATTCCTATATCTACTGTGCAATATGAATCAACTTTTAGCACT
GGAGGACGGGTATTAGATTGTTTTTGGTGTTCACTAACTCCTCAAACTGCAGAGGCCCTCATTTGTGCTCAAAATTGGATTCAATCTAAACCTCTCGATGACATGAGTGA
AGAAATTGATGGAGCTGAAGAAATTGATGGAGCAATGGAGTTGGACTTGTACCGAGAATCAGGAATCAGTATGTATGATTCAAAGATCACTTATTCACATGTACTTTTAG
CTGCTTTATTGAAAGCCTCAAGTTTTCTTCGCGAGGTTGACTACGACCAATTTGCGCCGATCGAGGTGGTTCTCGATTGCGATTCCTGTGTCGCTCGCTATAAAGCTCGA
CTTGTTGTTAAGGGCTACCATCAAAAGGAGGGGGTTGATTATGATGAGACTTTTAGTCCAGTGGTTAAGAAACTTATTGTTCGTATTGTGTTATCCTTGGCTACTCAATA
TAATTGGGATATTCGCCAACTTGATGTTAAGAATGATTTCTTGCATGGTGATCTCAAAGAAAAGGTTTATATGCAACAACCTCAAGGTTTTGTTTGTAATGAGGCTGGCT
CTCTTACATATCTTCTTCTTTACCTTGATGATAATGTCTTGACTAGTAATGATGCTTCATATGTTGGTCATTTGATGCACTGGCTGAAGTCCCAATTTGATATGGCTGAT
ATCTGTAGTCTTTCATACTTTATGGGGCTTGAAATCAAACGCATTACTTTTGGTATTTATGTTACTCAAACTAAATATACCAAGGACTTGTTACTTAAGTTTGGAATGGT
TGAGGCTAAAGTTTGCTCTACTCGTTGTGCTAGTGGTTCTTTGTCTAGTTCTGATGATACTCTATGCTCTATGGAAGATGCTACAACATACAAAAGTAAAGTTTACCTTG
ATCTAATTAAGCTAACTAGTCCAAGCTTAAAAAATTCAGAAGAAAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGACTTCATCTTTTACAAATGAGATTTCTGAACATAGTCATACCCCTAATCTAGGTACGAGTAGTAAAGTAGGATTGTTAAGAAAAAGGAAACCTACCAAACCAGCTTC
AGATGCCTGTGAGCATTTTATTAGAGTAGAGGGATGTGATCCTGAATATCCTAGAGCTGCTTGTAAATATTGTGGAGCTACATATGCATGTGACTCTAAGAGAAATGGGA
CAACAAATATGAAAAGACACTTAGAGAAATGTAAGATGTATACAAGTAAGTCAGATGATTATGTTGAAGGAGAGAGGAATTCTGAAATGGAACTTGCACAAAAGAATTCT
AAATTTTTGTCAAATTGTAATCGTAAAGGAGACACAATAGGAAGAGCAATTGAAAAATGCTTGCAAAGTTGGGGGATTGATAGACTTTTTACCATCACAGTCGATAATGC
TAGCTCAAATGATGTAGCATTGACATATTTTGTTAAAAAGTTTAAAGGAAGAAATGAGTTAGTGTTGGATGGTGAATTTCTCCATTTGCGATGTTCTACTCATATTCTTA
ATTTGATTGTTAGTGATGCTTTACAAGATTTACACGTGTCTATCATTCGCATTAGAAATGTTGTGAAGTATGTTAGATCATCTCCTGCAAGATTACAAACACTTAAGGAT
TTTGCTAAAGAAGATAAAATTTCGACCAAAAGTTGTCTTAGTATGGATGTTGCAACACGATGGAATTCAACTTTCACTATGTTGGATGGAGCAATTAAATTTCAAAAGAC
TTTTGAAAGATTAGAAGAACACGACCAAAGATATTTGCCGAAAGGTGAGATACCTATTATTGAAGATTGGGATAATGCTAAAGTGTTTGTGGAGTTTCTGAAGACTTTTT
TAGATGTTACTTTAAAGTTCTCAGTGTCTATGTCTATTATCAGCCAAGTAGCCAGGGACATCTATAGCATTCCTATATCTACTGTGCAATATGAATCAACTTTTAGCACT
GGAGGACGGGTATTAGATTGTTTTTGGTGTTCACTAACTCCTCAAACTGCAGAGGCCCTCATTTGTGCTCAAAATTGGATTCAATCTAAACCTCTCGATGACATGAGTGA
AGAAATTGATGGAGCTGAAGAAATTGATGGAGCAATGGAGTTGGACTTGTACCGAGAATCAGGAATCAGTATGTATGATTCAAAGATCACTTATTCACATGTACTTTTAG
CTGCTTTATTGAAAGCCTCAAGTTTTCTTCGCGAGGTTGACTACGACCAATTTGCGCCGATCGAGGTGGTTCTCGATTGCGATTCCTGTGTCGCTCGCTATAAAGCTCGA
CTTGTTGTTAAGGGCTACCATCAAAAGGAGGGGGTTGATTATGATGAGACTTTTAGTCCAGTGGTTAAGAAACTTATTGTTCGTATTGTGTTATCCTTGGCTACTCAATA
TAATTGGGATATTCGCCAACTTGATGTTAAGAATGATTTCTTGCATGGTGATCTCAAAGAAAAGGTTTATATGCAACAACCTCAAGGTTTTGTTTGTAATGAGGCTGGCT
CTCTTACATATCTTCTTCTTTACCTTGATGATAATGTCTTGACTAGTAATGATGCTTCATATGTTGGTCATTTGATGCACTGGCTGAAGTCCCAATTTGATATGGCTGAT
ATCTGTAGTCTTTCATACTTTATGGGGCTTGAAATCAAACGCATTACTTTTGGTATTTATGTTACTCAAACTAAATATACCAAGGACTTGTTACTTAAGTTTGGAATGGT
TGAGGCTAAAGTTTGCTCTACTCGTTGTGCTAGTGGTTCTTTGTCTAGTTCTGATGATACTCTATGCTCTATGGAAGATGCTACAACATACAAAAGTAAAGTTTACCTTG
ATCTAATTAAGCTAACTAGTCCAAGCTTAAAAAATTCAGAAGAAAAATAG
Protein sequenceShow/hide protein sequence
MTSSFTNEISEHSHTPNLGTSSKVGLLRKRKPTKPASDACEHFIRVEGCDPEYPRAACKYCGATYACDSKRNGTTNMKRHLEKCKMYTSKSDDYVEGERNSEMELAQKNS
KFLSNCNRKGDTIGRAIEKCLQSWGIDRLFTITVDNASSNDVALTYFVKKFKGRNELVLDGEFLHLRCSTHILNLIVSDALQDLHVSIIRIRNVVKYVRSSPARLQTLKD
FAKEDKISTKSCLSMDVATRWNSTFTMLDGAIKFQKTFERLEEHDQRYLPKGEIPIIEDWDNAKVFVEFLKTFLDVTLKFSVSMSIISQVARDIYSIPISTVQYESTFST
GGRVLDCFWCSLTPQTAEALICAQNWIQSKPLDDMSEEIDGAEEIDGAMELDLYRESGISMYDSKITYSHVLLAALLKASSFLREVDYDQFAPIEVVLDCDSCVARYKAR
LVVKGYHQKEGVDYDETFSPVVKKLIVRIVLSLATQYNWDIRQLDVKNDFLHGDLKEKVYMQQPQGFVCNEAGSLTYLLLYLDDNVLTSNDASYVGHLMHWLKSQFDMAD
ICSLSYFMGLEIKRITFGIYVTQTKYTKDLLLKFGMVEAKVCSTRCASGSLSSSDDTLCSMEDATTYKSKVYLDLIKLTSPSLKNSEEK