; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G05990 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G05990
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionSGL domain-containing protein
Genome locationClcChr07:10073809..10078821
RNA-Seq ExpressionClc07G05990
SyntenyClc07G05990
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR011042 - Six-bladed beta-propeller, TolB-like
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600824.1 hypothetical protein SDJN03_06057, partial [Cucurbita argyrosperma subsp. sororia]3.0e-16191.28Show/hide
Query:  MAPINRSLILLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLLAA
        MAP NRS ILLFIL QLFLSQTLARKPH+IDFRSPNLYPEG+VWD SAQHFVV S+HHRTLVSVSDAGVAETLIHDP+LPENVSILGLAIDSVNNRLLAA
Subjt:  MAPINRSLILLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLLAA

Query:  VHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSSSGLNGAV
        VHAAPPLP FNALAAYDLRSRRRISLTPLPSDGTSS RPVANAVA DFKGNAF+TNSAGNFIWKV+ D SA++FSKS SYSS+PATPNEVYSSSGLNGAV
Subjt:  VHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSSSGLNGAV

Query:  YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVLNG
        YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNK+LKGADGIAARRDGVVLVV Y+KLWFLKSEDSWGEGV+YDEIDLDEEKFAT+VT GNEGRVYVLNG
Subjt:  YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVLNG

Query:  YVNEWLNGNLGREMFGIEEMR
        YVNE LNGNLGRE FGIEEMR
Subjt:  YVNEWLNGNLGREMFGIEEMR

XP_008454779.1 PREDICTED: uncharacterized protein LOC103495097 [Cucumis melo]6.4e-16493.5Show/hide
Query:  MAPINRSLILLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLLAA
        MAPINRS ILLFILLQLFLSQTLARKPH+IDFRSPNLYPEGLVWD SAQHFVV SLHHRTLVSVSDAGVAETLI DP+LPENVSILGL IDSVN+RLLA 
Subjt:  MAPINRSLILLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLLAA

Query:  VHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSSSGLNGAV
        VHAAPPLP FNALAAYDLRSR RISLTPL SDGTSS RPVANAVAVDFKGNAF+TNSAGNFIWKVDKD SASIFSKSASYSSYPATPNEVYSSSGLNGAV
Subjt:  VHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSSSGLNGAV

Query:  YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVLNG
        YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAAR+DGVVLVV Y+KLWFLKSEDSWGEGVVYDEIDLDEEKFATAVT GNEGRVYVLNG
Subjt:  YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVLNG

Query:  YVNEWLNGNLGREMFGIEEMRQA
        YVNE LNGNLGREMFGIEEMR A
Subjt:  YVNEWLNGNLGREMFGIEEMRQA

XP_022943057.1 uncharacterized protein LOC111447908 [Cucurbita moschata]1.1e-16091.59Show/hide
Query:  MAPINRSLILLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLLAA
        MAP N S ILLFIL QLFLSQTLARKPH+IDFRSPNLYPEG+VWD SAQHFVV SLHHRTLVSVSDAGVAETLIHDP+LPENVSILGLAIDSVNNRLLAA
Subjt:  MAPINRSLILLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLLAA

Query:  VHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSSSGLNGAV
        VHAAPPLP FNALAAYDLRSRRRISLTPLPSDGTSS RPVANAVA DFKGNAF+TNSAGNFIWKV+ D SA++FSKS SYSS+PATPNEVYSSSGLNGAV
Subjt:  VHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSSSGLNGAV

Query:  YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVLNG
        YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNK+LKGADGIAARRDGVVLVV Y+KLWFLKSEDSWGEGVVYDEIDLDEEKFAT+VT GNEGRVYVLNG
Subjt:  YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVLNG

Query:  YVNEWLNGNLGREMFGIEEMR
        YVNE LNGNLGRE FGIEEMR
Subjt:  YVNEWLNGNLGREMFGIEEMR

XP_023517186.1 uncharacterized protein LOC111781020 [Cucurbita pepo subsp. pepo]2.3e-16191.9Show/hide
Query:  MAPINRSLILLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLLAA
        MAP N SLILLFIL QLFLSQTLARKPH+IDFRSPNLYPEG+VWD SAQHFVV SLHHRTLVSVSDAGVAETLIHDP+LPENVSILGLAIDSVNNRLLAA
Subjt:  MAPINRSLILLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLLAA

Query:  VHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSSSGLNGAV
        VHAAPPLP FNALAAYDLRSRRRISLTPLPSDGTSS RPVANAVA DFKGNAF+TNSAGNFIWKV+ D SA++FSKS SYSS+PATPNEVYSSSGLNGAV
Subjt:  VHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSSSGLNGAV

Query:  YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVLNG
        YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNK+LKGADGIAARRDGVVLVV Y+KLWFLKSEDSWGEGVVYDEIDLDEEKFAT VT GNEGRVYVLNG
Subjt:  YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVLNG

Query:  YVNEWLNGNLGREMFGIEEMR
        YVNE LNGNLGRE FGIEEMR
Subjt:  YVNEWLNGNLGREMFGIEEMR

XP_038893355.1 uncharacterized protein LOC120082174 [Benincasa hispida]9.9e-16594.39Show/hide
Query:  MAPINRSLILLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLLAA
        MA INRS IL+FILLQLFLSQTLARKPH+IDFRSPNLYPEGLVWD SAQHFVV SLHHRTLVSVSDAGVAETLIHDP LPENVSILGLAIDSVNNRLLAA
Subjt:  MAPINRSLILLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLLAA

Query:  VHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSSSGLNGAV
        VHAAPPLP FNALAAYDLRSRRRISLT LPS+GTSS RPVANAVAVDFKGNAFVTNSAGNFIW+VDK  SASIFSKSASY+SYPATPNEVYSSSGLNGAV
Subjt:  VHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSSSGLNGAV

Query:  YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVLNG
        YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAARRDGVVLVVCY+KLWFLKSEDSWGEGVVYDEIDLDEEKFATAVT GNEGRVYVLNG
Subjt:  YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVLNG

Query:  YVNEWLNGNLGREMFGIEEMR
        YVNE LNGNLGREMFGIEEMR
Subjt:  YVNEWLNGNLGREMFGIEEMR

TrEMBL top hitse value%identityAlignment
A0A1S3BYY0 uncharacterized protein LOC1034950973.1e-16493.5Show/hide
Query:  MAPINRSLILLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLLAA
        MAPINRS ILLFILLQLFLSQTLARKPH+IDFRSPNLYPEGLVWD SAQHFVV SLHHRTLVSVSDAGVAETLI DP+LPENVSILGL IDSVN+RLLA 
Subjt:  MAPINRSLILLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLLAA

Query:  VHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSSSGLNGAV
        VHAAPPLP FNALAAYDLRSR RISLTPL SDGTSS RPVANAVAVDFKGNAF+TNSAGNFIWKVDKD SASIFSKSASYSSYPATPNEVYSSSGLNGAV
Subjt:  VHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSSSGLNGAV

Query:  YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVLNG
        YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAAR+DGVVLVV Y+KLWFLKSEDSWGEGVVYDEIDLDEEKFATAVT GNEGRVYVLNG
Subjt:  YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVLNG

Query:  YVNEWLNGNLGREMFGIEEMRQA
        YVNE LNGNLGREMFGIEEMR A
Subjt:  YVNEWLNGNLGREMFGIEEMRQA

A0A5D3DYZ8 SGL domain-containing protein3.1e-16493.5Show/hide
Query:  MAPINRSLILLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLLAA
        MAPINRS ILLFILLQLFLSQTLARKPH+IDFRSPNLYPEGLVWD SAQHFVV SLHHRTLVSVSDAGVAETLI DP+LPENVSILGL IDSVN+RLLA 
Subjt:  MAPINRSLILLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLLAA

Query:  VHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSSSGLNGAV
        VHAAPPLP FNALAAYDLRSR RISLTPL SDGTSS RPVANAVAVDFKGNAF+TNSAGNFIWKVDKD SASIFSKSASYSSYPATPNEVYSSSGLNGAV
Subjt:  VHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSSSGLNGAV

Query:  YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVLNG
        YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAAR+DGVVLVV Y+KLWFLKSEDSWGEGVVYDEIDLDEEKFATAVT GNEGRVYVLNG
Subjt:  YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVLNG

Query:  YVNEWLNGNLGREMFGIEEMRQA
        YVNE LNGNLGREMFGIEEMR A
Subjt:  YVNEWLNGNLGREMFGIEEMRQA

A0A6J1FRV4 uncharacterized protein LOC1114476068.5e-14685.45Show/hide
Query:  MAPINRSLI--LLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLL
        MAPI  S I    F   QLFLS TLARKPH IDFRSPNLYPEGLVWD SAQHF+V SLHHRTLVSVSDAGVAE LIHDP+LPENVSILGLAIDS+NNRLL
Subjt:  MAPINRSLI--LLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLL

Query:  AAVHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSSSGLNG
        AAVHAA PLP FNALAAYDLRSRRRISLT LPS   S+ RPVAN +AVDFKGNAFVTNSA NFIWKVDK  SASIFSKSASYSS+P T NEVYSSSGLNG
Subjt:  AAVHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSSSGLNG

Query:  AVYVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVL
        AVYVSKGYLLVVQSNTGKMYKVDADDGTARLVLLN++LKGADGIA RRDGVVLVV Y+KLWFLKS+DSWGEGVVYD IDLDEEKFATAV  G EGRVYVL
Subjt:  AVYVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVL

Query:  NGYVNEWLNGNLGREMFGIEEMR
        NGYV E LNGNLGRE FGIEEMR
Subjt:  NGYVNEWLNGNLGREMFGIEEMR

A0A6J1FRY6 uncharacterized protein LOC1114479085.5e-16191.59Show/hide
Query:  MAPINRSLILLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLLAA
        MAP N S ILLFIL QLFLSQTLARKPH+IDFRSPNLYPEG+VWD SAQHFVV SLHHRTLVSVSDAGVAETLIHDP+LPENVSILGLAIDSVNNRLLAA
Subjt:  MAPINRSLILLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLLAA

Query:  VHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSSSGLNGAV
        VHAAPPLP FNALAAYDLRSRRRISLTPLPSDGTSS RPVANAVA DFKGNAF+TNSAGNFIWKV+ D SA++FSKS SYSS+PATPNEVYSSSGLNGAV
Subjt:  VHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSSSGLNGAV

Query:  YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVLNG
        YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNK+LKGADGIAARRDGVVLVV Y+KLWFLKSEDSWGEGVVYDEIDLDEEKFAT+VT GNEGRVYVLNG
Subjt:  YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVLNG

Query:  YVNEWLNGNLGREMFGIEEMR
        YVNE LNGNLGRE FGIEEMR
Subjt:  YVNEWLNGNLGREMFGIEEMR

A0A6J1IQR4 uncharacterized protein LOC1114775332.8e-15789.41Show/hide
Query:  MAPINRSLILLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLLAA
        MAPI+RS IL F L Q FLSQTLARKPH+IDFRSPNLYPEG+VWD SAQHFVV SLHHRTLVSVSDAGVAETLIHDP+LPENVSILGLAIDSVNNRLLAA
Subjt:  MAPINRSLILLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLLAA

Query:  VHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSSSGLNGAV
        VHAAPPLP FNALA YDLRSRRRISLT LPSDGTSS RPVANAVA DFKGNAF+TNSAGNFIWKV+ D SA++FSKS SYSS+PATPNEV+SSSGLNGAV
Subjt:  VHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSSSGLNGAV

Query:  YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVLNG
        YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNK+L+GADGIAARRDGVVLVV Y+KLWFLKSEDSWGEGVVYDEIDLDEEKFAT VT GNEGRVYVLNG
Subjt:  YVSKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVLNG

Query:  YVNEWLNGNLGREMFGIEEMR
        YVNE LNGN GRE FGIEEMR
Subjt:  YVNEWLNGNLGREMFGIEEMR

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.63.8e-1825.08Show/hide
Query:  VPKTTFRTRYGHYKFLVMPFGLTYAPAAFMDLMKRIFHPYLDQFVIVFIDDI-----------------------------LGKCEFWLEQVVFLGHVVS
        V KT F T++GHY++L MPFGL  APA F   M  I  P L++  +V++DDI                             L KCEF  ++  FLGHV++
Subjt:  VPKTTFRTRYGHYKFLVMPFGLTYAPAAFMDLMKRIFHPYLDQFVIVFIDDI-----------------------------LGKCEFWLEQVVFLGHVVS

Query:  ATRVSVDPQKTEVVVKW------------------------------------------------------ERLKTIME---------------------
           +  +P+K E + K+                                                      ++LK ++                      
Subjt:  ATRVSVDPQKTEVVVKW------------------------------------------------------ERLKTIME---------------------

Query:  ---------------------LRPHENNYPTHELELALKIW-----RHYLFGKKCYIYTDHKSLEYIFDQKELNMRQRRWLELIKDYDCTIEYHLGQANV
                             L  HE NY T E EL   +W     RHYL G+   I +DH+ L +++  K+ N +  RW   + ++D  I+Y  G+ N 
Subjt:  ---------------------LRPHENNYPTHELELALKIW-----RHYLFGKKCYIYTDHKSLEYIFDQKELNMRQRRWLELIKDYDCTIEYHLGQANV

Query:  VADALSR
        VADALSR
Subjt:  VADALSR

P20825 Retrovirus-related Pol polyprotein from transposon 2975.0e-1825.16Show/hide
Query:  ESDVPKTTFRTRYGHYKFLVMPFGLTYAPAAFMDLMKRIFHPYLDQFVIVFIDDI-----------------------------LGKCEFWLEQVVFLGH
        E  + KT F T+ GHY++L MPFGL  APA F   M  I  P L++  +V++DDI                             L KCEF  ++  FLGH
Subjt:  ESDVPKTTFRTRYGHYKFLVMPFGLTYAPAAFMDLMKRIFHPYLDQFVIVFIDDI-----------------------------LGKCEFWLEQVVFLGH

Query:  VVS------------------------------------------------------ATRVSVDPQKTEVVVKWERLKTIM-------------------
        +V+                                                        R  +D QK E +  +E+LK ++                   
Subjt:  VVS------------------------------------------------------ATRVSVDPQKTEVVVKWERLKTIM-------------------

Query:  -----------------------ELRPHENNYPTHELELALKIW-----RHYLFGKKCYIYTDHKSLEYIFDQKELNMRQRRWLELIKDYDCTIEYHLGQ
                                L  HE NY   E EL   +W     RHYL G++  I +DH+ L ++ + KE   +  RW   + +Y   I+Y  G+
Subjt:  -----------------------ELRPHENNYPTHELELALKIW-----RHYLFGKKCYIYTDHKSLEYIFDQKELNMRQRRWLELIKDYDCTIEYHLGQ

Query:  ANVVADALSR
         N VADALSR
Subjt:  ANVVADALSR

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein7.0e-1225.12Show/hide
Query:  LRPHENNYPTHELEL-----ALKIWRHYLFGKKCYIYTDHKSLEYIFDQKELNMRQRRWLELIKDYDCTIEYHLGQANVVADALSR--------------
        L   + NYP  ELEL     AL  +R+ L GK   + TDH SL  + ++ E   R +RWL+ +  YD T+EY  G  NVVADA+SR              
Subjt:  LRPHENNYPTHELEL-----ALKIWRHYLFGKKCYIYTDHKSLEYIFDQKELNMRQRRWLELIKDYDCTIEYHLGQANVVADALSR--------------

Query:  ----KSGGGISSLSVMKVTLLKEF-QSGVTTLGVTGDGA------LLTHFQLRPKLVDKVINKQ--------LKDPEIRKLKDEV---------------
            KS      L    +  +KE  Q  VT   ++   +      L   F+    L D++I  Q         ++  +R   D                 
Subjt:  ----KSGGGISSLSVMKVTLLKEF-QSGVTTLGVTGDGA------LLTHFQLRPKLVDKVINKQ--------LKDPEIRKLKDEV---------------

Query:  -------KAQQRI--------DFEQVKPEQQKLAGLLNPLPVTEWKWEYVTMDFLFGLLKTSAEVDGIL-------------------------------
               K Q  I          + +K  + +L GLL PLP+ E +W  ++MDF+ GL  TS  ++ IL                               
Subjt:  -------KAQQRI--------DFEQVKPEQQKLAGLLNPLPVTEWKWEYVTMDFLFGLLKTSAEVDGIL-------------------------------

Query:  --------------------------------------------TDGQSERTIQTLEDMLWACALQVKGSWNEYLSLMEFAYNNNYQSSIGMVSYEALYG
                                                    TDGQSERTIQTL  +L A       +W+ YL  +EF YN+    ++G   +E   G
Subjt:  --------------------------------------------TDGQSERTIQTLEDMLWACALQVKGSWNEYLSLMEFAYNNNYQSSIGMVSYEALYG

Query:  RMCRTP
         +  TP
Subjt:  RMCRTP

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus9.7e-1423.48Show/hide
Query:  YVRESDVPKTTFRTRYGHYKFLVMPFGLTYAPAAFMDLMKRIFHPYLDQFVIVFIDDI-----------------------------LGKCEFWLEQVVF
        +++ESD+PKT F T  G Y+FL +PFGL  APA F  ++  I   ++ +   V+IDDI                             L K  F   QV F
Subjt:  YVRESDVPKTTFRTRYGHYKFLVMPFGLTYAPAAFMDLMKRIFHPYLDQFVIVFIDDI-----------------------------LGKCEFWLEQVVF

Query:  LGHVVSATRVSVDPQKTEVVVK----------------------------------------------------------------WERLKTIM------
        LG++V+A  +  DP+K   + +                                                                +  LK+I+      
Subjt:  LGHVVSATRVSVDPQKTEVVVK----------------------------------------------------------------WERLKTIM------

Query:  ----------------------------------------ELRPHENNYPTHELELALKIW-----RHYLFGK-KCYIYTDHKSLEYIFDQKELNMRQRR
                                                 L   E NY T E E+   IW     R YL+G     +YTDH+ L +    +  N + +R
Subjt:  ----------------------------------------ELRPHENNYPTHELELALKIW-----RHYLFGK-KCYIYTDHKSLEYIFDQKELNMRQRR

Query:  WLELIKDYDCTIEYHLGQANVVADALSR
        W   I++Y+C + Y  G++NVVADALSR
Subjt:  WLELIKDYDCTIEYHLGQANVVADALSR

Q99315 Transposon Ty3-G Gag-Pol polyprotein2.4e-1225.37Show/hide
Query:  LRPHENNYPTHELEL-----ALKIWRHYLFGKKCYIYTDHKSLEYIFDQKELNMRQRRWLELIKDYDCTIEYHLGQANVVADALSR--------------
        L   + NYP  ELEL     AL  +R+ L GK   + TDH SL  + ++ E   R +RWL+ +  YD T+EY  G  NVVADA+SR              
Subjt:  LRPHENNYPTHELEL-----ALKIWRHYLFGKKCYIYTDHKSLEYIFDQKELNMRQRRWLELIKDYDCTIEYHLGQANVVADALSR--------------

Query:  ----KSGGGISSLSVMKVTLLKEF-QSGVTTLGVTGDGA------LLTHFQLRPKLVDKVINKQ--------LKDPEIRKLKDEV---------------
            KS      L    +  +KE  Q  VT   ++   +      L   F+    L D++I  Q         ++  +R   D                 
Subjt:  ----KSGGGISSLSVMKVTLLKEF-QSGVTTLGVTGDGA------LLTHFQLRPKLVDKVINKQ--------LKDPEIRKLKDEV---------------

Query:  -------KAQQRI--------DFEQVKPEQQKLAGLLNPLPVTEWKWEYVTMDFLFGLLKTSAEVDGIL-------------------------------
               K Q  I          + +K  + +L GLL PLP+ E +W  ++MDF+ GL  TS  ++ IL                               
Subjt:  -------KAQQRI--------DFEQVKPEQQKLAGLLNPLPVTEWKWEYVTMDFLFGLLKTSAEVDGIL-------------------------------

Query:  --------------------------------------------TDGQSERTIQTLEDMLWACALQVKGSWNEYLSLMEFAYNNNYQSSIGMVSYEALYG
                                                    TDGQSERTIQTL  +L A A     +W+ YL  +EF YN+    ++G   +E   G
Subjt:  --------------------------------------------TDGQSERTIQTLEDMLWACALQVKGSWNEYLSLMEFAYNNNYQSSIGMVSYEALYG

Query:  RMCRTP
         +  TP
Subjt:  RMCRTP

Arabidopsis top hitse value%identityAlignment
AT2G01410.1 NHL domain-containing protein8.5e-9861.42Show/hide
Query:  ILLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLLAAVHAAPPLP
        IL  +L    L  + A   H+I+FRSP LYPEGL WDP  QHF+V SLH RT+ SVSDAGV ETLI D +LPEN +ILGLA+DS N RLLA + + PPLP
Subjt:  ILLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLLAAVHAAPPLP

Query:  AFNALAAYDLRS-RRRISLTPLPS---DGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYP--ATPNEVYSSSGLNGAVYV
         F+ALA+YDLRS  RR+ L+PLPS   D     R VAN VAVDFKGNA+VTNSA NFIWKVD+D +ASIFSKS  ++S P  A  +  +   GLNG VY+
Subjt:  AFNALAAYDLRS-RRRISLTPLPS---DGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYP--ATPNEVYSSSGLNGAVYV

Query:  SKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIA-ARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVLNGY
        SKGYLLVVQSNTGK++KVD D G ARLVLLN +L  ADG+   RRDG V+VV  KKLW LKS+DSW EGVVYDEIDLD E F TAVT     R+YVL G 
Subjt:  SKGYLLVVQSNTGKMYKVDADDGTARLVLLNKELKGADGIA-ARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVLNGY

Query:  VNEWLNGNL-----GREMFGIEEM
        V E + G+       RE FGIEE+
Subjt:  VNEWLNGNL-----GREMFGIEEM

AT2G16760.1 Calcium-dependent phosphotriesterase superfamily protein1.4e-2030.69Show/hide
Query:  LILLFILLQLFLSQTLA-RKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSV----SDAGVAE-TLIHDPNLPENVSILGLAIDSVNNRLLAAV
        L++  + +   +S  LA    H+  ++S   + E   WD   + F+VS +    +  +    SD  + E TL+ D +L  N S LG+AID V NRLL AV
Subjt:  LILLFILLQLFLSQTLA-RKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSV----SDAGVAE-TLIHDPNLPENVSILGLAIDSVNNRLLAAV

Query:  HAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPA-TPNEVYSS-SGLNGA
         A      ++ALAAYDL + RR+ L  L   G S  +  A+ VAVD +GNA+VT++  + IWKVD      +  K  +  + P  TP   Y++   LNG 
Subjt:  HAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPA-TPNEVYSS-SGLNGA

Query:  VYVSKGYLLVVQSNTGKMYKVDADDG-----TARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGR
        VY   G+L+V+ + +G +YK+D  +G      + + +    L+  DG+       ++V        ++S D W    V             +     EGR
Subjt:  VYVSKGYLLVVQSNTGKMYKVDADDG-----TARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGR

Query:  VYV
        VY+
Subjt:  VYV

AT2G47370.1 Calcium-dependent phosphotriesterase superfamily protein5.7e-1728.48Show/hide
Query:  SLILLFILLQ------LFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHH----RTLVSVSDAG---VAETLIHDPNLPENVSILGLAIDSV
        S+ L F +L       +  S+      H+I + S     E   WD   + F+VS +        LV   D+       TL+ D +L  N S  G  ID  
Subjt:  SLILLFILLQ------LFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHH----RTLVSVSDAG---VAETLIHDPNLPENVSILGLAIDSV

Query:  NNRLLAAVHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSS
         NRLL AV        ++AL AYDL + RR+ LT L S   S     A+ VAVD +GNA+V+++ G  IW VD +       +S  ++    TP    + 
Subjt:  NNRLLAAVHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSS

Query:  SGLNGAVYVSKGYLLVVQSNTGKMYKVDADDG--TARLVLLN---KELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVT
          LNG VY  +G+L+V+ + +G +YK+D  +G  ++++ +++     L+  DG+       ++V        ++S D W    V             +  
Subjt:  SGLNGAVYVSKGYLLVVQSNTGKMYKVDADDG--TARLVLLN---KELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVT

Query:  AGNEGRVYV
           EGRVY+
Subjt:  AGNEGRVYV

AT5G28660.1 NHL domain-containing protein2.0e-0642.11Show/hide
Query:  TLIHDPNLPENVSILGLAIDSVNNRLLAAVHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVD
        TL++D +L +N S  G  ID   NRLL AV        ++AL AYDL + R + LT L S   +     A+ VAVD +GNA+V+++ G  IW VD
Subjt:  TLIHDPNLPENVSILGLAIDSVNNRLLAAVHAAPPLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCCCATTAATCGCTCTCTAATTCTCCTCTTCATCCTTCTCCAATTGTTCCTCTCGCAAACCCTAGCTCGAAAACCCCACATCATCGATTTCCGATCCCCA
AATCTCTACCCGGAGGGCCTCGTTTGGGACCCATCCGCCCAGCATTTCGTCGTCAGCTCACTCCACCACCGCACTCTCGTCTCTGTCTCCGACGCCGGCGTCGCC
GAAACCTTAATCCACGATCCCAACCTCCCCGAAAACGTCTCCATCTTAGGCCTTGCCATCGATTCAGTCAACAATCGACTCCTCGCCGCCGTCCACGCTGCCCCA
CCTCTCCCGGCATTCAACGCCCTCGCCGCCTACGATCTCCGATCCCGCCGTCGCATCTCCCTTACTCCTCTCCCCTCCGATGGAACCTCCAGTCCCCGCCCAGTC
GCGAACGCCGTTGCGGTCGACTTCAAGGGTAACGCCTTCGTCACGAACTCCGCCGGAAACTTCATCTGGAAGGTTGATAAAGACAGATCTGCCTCGATCTTCTCG
AAATCGGCGAGTTACAGTTCCTATCCCGCAACTCCGAACGAAGTTTACTCGTCGTCAGGGCTAAACGGTGCCGTTTACGTGAGCAAAGGGTACCTTCTGGTGGTG
CAATCGAACACCGGAAAGATGTACAAAGTGGACGCTGACGACGGAACGGCGAGGCTGGTTTTGCTGAACAAAGAATTGAAAGGGGCGGACGGGATAGCGGCGAGA
AGAGACGGCGTCGTTTTGGTGGTCTGTTACAAAAAGCTGTGGTTCTTGAAGAGCGAGGATAGTTGGGGGGAGGGGGTGGTTTATGACGAAATTGACCTCGATGAA
GAGAAGTTTGCTACTGCTGTAACTGCGGGGAATGAGGGGAGGGTGTATGTGCTGAATGGATATGTCAATGAGTGGTTAAATGGTAATTTGGGAAGGGAGATGTTT
GGGATTGAAGAAATGAGGCAAGCAGACGCCGGAACATCGCACACTACCGGTGACTCTAGAGAAGGTTCTGAAGGAGATTCTAGTAACCCTCAGGTAAATATGAAA
GATCAAATATTTAATAGGATAGCCCAGAGGTTGGCATCCAATGTTGAAACGGCTCAGGGAGATCCCGAGAGGAAGTATAAGATCGAGAGGTTCAAAGCCTTAGGT
GCCCAGATATTTGAGGGTACTACGAATCCTGCAGATGTCAAAATGTGGCTAAACCAGATAGAGAAATGCTTTAGGGTCATGCACTGTCCTGAAGAGAGGAAGCTA
GAGGCTGAGGAGACTCTAGACGTGGTGACTATAAAGCCTAGAAAATGGTTGAGTAAAGGGTGTGAAGCGTACCTGGCATATGTCAGAGAATCTGATGTACCTAAG
ACGACATTTAGAACGAGATATGGACATTACAAGTTTTTGGTAATGCCGTTTGGATTGACATATGCACCAGCAGCATTTATGGACCTTATGAAGAGGATATTTCAT
CCTTATTTGGATCAATTTGTTATAGTATTCATAGATGATATTCTGGGTAAGTGTGAGTTTTGGTTGGAACAAGTGGTGTTTCTAGGCCATGTGGTGTCAGCAACT
AGAGTTAGCGTAGATCCTCAAAAGACTGAAGTCGTGGTAAAGTGGGAACGACTTAAGACTATAATGGAGCTAAGACCCCACGAGAACAATTACCCTACTCATGAA
TTGGAATTAGCATTAAAGATATGGCGACATTATTTATTTGGGAAAAAGTGCTATATCTATACGGATCATAAGAGTCTTGAGTATATCTTTGATCAAAAGGAGCTT
AACATGAGGCAAAGGAGGTGGTTGGAATTGATCAAAGATTACGATTGTACTATTGAGTATCACCTAGGTCAAGCAAATGTAGTCGCTGATGCTTTAAGTCGGAAA
TCCGGGGGTGGTATAAGTTCTTTAAGTGTTATGAAGGTTACTCTACTCAAAGAATTTCAAAGTGGTGTTACTACGTTAGGTGTAACCGGTGATGGAGCATTGTTG
ACACATTTCCAACTGAGGCCTAAATTAGTTGATAAAGTAATAAATAAGCAATTGAAGGATCCTGAAATTAGGAAACTCAAAGACGAAGTAAAGGCCCAACAAAGG
ATAGATTTTGAGCAAGTTAAGCCTGAACAGCAAAAGCTAGCCGGATTGCTAAATCCACTCCCTGTAACTGAATGGAAGTGGGAGTATGTCACCATGGATTTTCTA
TTTGGGTTGCTAAAGACTTCAGCAGAAGTTGATGGAATTTTGACGGATGGACAGTCTGAAAGGACTATACAAACCTTGGAAGACATGCTTTGGGCATGTGCTTTG
CAAGTTAAAGGTAGCTGGAATGAGTATTTATCATTGATGGAGTTTGCTTACAATAATAACTACCAGTCAAGCATAGGTATGGTTTCGTATGAAGCTTTGTATGGT
AGAATGTGTAGAACTCCAGTGTGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGCCCATTAATCGCTCTCTAATTCTCCTCTTCATCCTTCTCCAATTGTTCCTCTCGCAAACCCTAGCTCGAAAACCCCACATCATCGATTTCCGATCCCCA
AATCTCTACCCGGAGGGCCTCGTTTGGGACCCATCCGCCCAGCATTTCGTCGTCAGCTCACTCCACCACCGCACTCTCGTCTCTGTCTCCGACGCCGGCGTCGCC
GAAACCTTAATCCACGATCCCAACCTCCCCGAAAACGTCTCCATCTTAGGCCTTGCCATCGATTCAGTCAACAATCGACTCCTCGCCGCCGTCCACGCTGCCCCA
CCTCTCCCGGCATTCAACGCCCTCGCCGCCTACGATCTCCGATCCCGCCGTCGCATCTCCCTTACTCCTCTCCCCTCCGATGGAACCTCCAGTCCCCGCCCAGTC
GCGAACGCCGTTGCGGTCGACTTCAAGGGTAACGCCTTCGTCACGAACTCCGCCGGAAACTTCATCTGGAAGGTTGATAAAGACAGATCTGCCTCGATCTTCTCG
AAATCGGCGAGTTACAGTTCCTATCCCGCAACTCCGAACGAAGTTTACTCGTCGTCAGGGCTAAACGGTGCCGTTTACGTGAGCAAAGGGTACCTTCTGGTGGTG
CAATCGAACACCGGAAAGATGTACAAAGTGGACGCTGACGACGGAACGGCGAGGCTGGTTTTGCTGAACAAAGAATTGAAAGGGGCGGACGGGATAGCGGCGAGA
AGAGACGGCGTCGTTTTGGTGGTCTGTTACAAAAAGCTGTGGTTCTTGAAGAGCGAGGATAGTTGGGGGGAGGGGGTGGTTTATGACGAAATTGACCTCGATGAA
GAGAAGTTTGCTACTGCTGTAACTGCGGGGAATGAGGGGAGGGTGTATGTGCTGAATGGATATGTCAATGAGTGGTTAAATGGTAATTTGGGAAGGGAGATGTTT
GGGATTGAAGAAATGAGGCAAGCAGACGCCGGAACATCGCACACTACCGGTGACTCTAGAGAAGGTTCTGAAGGAGATTCTAGTAACCCTCAGGTAAATATGAAA
GATCAAATATTTAATAGGATAGCCCAGAGGTTGGCATCCAATGTTGAAACGGCTCAGGGAGATCCCGAGAGGAAGTATAAGATCGAGAGGTTCAAAGCCTTAGGT
GCCCAGATATTTGAGGGTACTACGAATCCTGCAGATGTCAAAATGTGGCTAAACCAGATAGAGAAATGCTTTAGGGTCATGCACTGTCCTGAAGAGAGGAAGCTA
GAGGCTGAGGAGACTCTAGACGTGGTGACTATAAAGCCTAGAAAATGGTTGAGTAAAGGGTGTGAAGCGTACCTGGCATATGTCAGAGAATCTGATGTACCTAAG
ACGACATTTAGAACGAGATATGGACATTACAAGTTTTTGGTAATGCCGTTTGGATTGACATATGCACCAGCAGCATTTATGGACCTTATGAAGAGGATATTTCAT
CCTTATTTGGATCAATTTGTTATAGTATTCATAGATGATATTCTGGGTAAGTGTGAGTTTTGGTTGGAACAAGTGGTGTTTCTAGGCCATGTGGTGTCAGCAACT
AGAGTTAGCGTAGATCCTCAAAAGACTGAAGTCGTGGTAAAGTGGGAACGACTTAAGACTATAATGGAGCTAAGACCCCACGAGAACAATTACCCTACTCATGAA
TTGGAATTAGCATTAAAGATATGGCGACATTATTTATTTGGGAAAAAGTGCTATATCTATACGGATCATAAGAGTCTTGAGTATATCTTTGATCAAAAGGAGCTT
AACATGAGGCAAAGGAGGTGGTTGGAATTGATCAAAGATTACGATTGTACTATTGAGTATCACCTAGGTCAAGCAAATGTAGTCGCTGATGCTTTAAGTCGGAAA
TCCGGGGGTGGTATAAGTTCTTTAAGTGTTATGAAGGTTACTCTACTCAAAGAATTTCAAAGTGGTGTTACTACGTTAGGTGTAACCGGTGATGGAGCATTGTTG
ACACATTTCCAACTGAGGCCTAAATTAGTTGATAAAGTAATAAATAAGCAATTGAAGGATCCTGAAATTAGGAAACTCAAAGACGAAGTAAAGGCCCAACAAAGG
ATAGATTTTGAGCAAGTTAAGCCTGAACAGCAAAAGCTAGCCGGATTGCTAAATCCACTCCCTGTAACTGAATGGAAGTGGGAGTATGTCACCATGGATTTTCTA
TTTGGGTTGCTAAAGACTTCAGCAGAAGTTGATGGAATTTTGACGGATGGACAGTCTGAAAGGACTATACAAACCTTGGAAGACATGCTTTGGGCATGTGCTTTG
CAAGTTAAAGGTAGCTGGAATGAGTATTTATCATTGATGGAGTTTGCTTACAATAATAACTACCAGTCAAGCATAGGTATGGTTTCGTATGAAGCTTTGTATGGT
AGAATGTGTAGAACTCCAGTGTGTTAA
Protein sequenceShow/hide protein sequence
MAPINRSLILLFILLQLFLSQTLARKPHIIDFRSPNLYPEGLVWDPSAQHFVVSSLHHRTLVSVSDAGVAETLIHDPNLPENVSILGLAIDSVNNRLLAAVHAAP
PLPAFNALAAYDLRSRRRISLTPLPSDGTSSPRPVANAVAVDFKGNAFVTNSAGNFIWKVDKDRSASIFSKSASYSSYPATPNEVYSSSGLNGAVYVSKGYLLVV
QSNTGKMYKVDADDGTARLVLLNKELKGADGIAARRDGVVLVVCYKKLWFLKSEDSWGEGVVYDEIDLDEEKFATAVTAGNEGRVYVLNGYVNEWLNGNLGREMF
GIEEMRQADAGTSHTTGDSREGSEGDSSNPQVNMKDQIFNRIAQRLASNVETAQGDPERKYKIERFKALGAQIFEGTTNPADVKMWLNQIEKCFRVMHCPEERKL
EAEETLDVVTIKPRKWLSKGCEAYLAYVRESDVPKTTFRTRYGHYKFLVMPFGLTYAPAAFMDLMKRIFHPYLDQFVIVFIDDILGKCEFWLEQVVFLGHVVSAT
RVSVDPQKTEVVVKWERLKTIMELRPHENNYPTHELELALKIWRHYLFGKKCYIYTDHKSLEYIFDQKELNMRQRRWLELIKDYDCTIEYHLGQANVVADALSRK
SGGGISSLSVMKVTLLKEFQSGVTTLGVTGDGALLTHFQLRPKLVDKVINKQLKDPEIRKLKDEVKAQQRIDFEQVKPEQQKLAGLLNPLPVTEWKWEYVTMDFL
FGLLKTSAEVDGILTDGQSERTIQTLEDMLWACALQVKGSWNEYLSLMEFAYNNNYQSSIGMVSYEALYGRMCRTPVC