; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G15600 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G15600
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionCCHC-type domain-containing protein
Genome locationChr5:16610381..16613927
RNA-Seq ExpressionCSPI05G15600
SyntenyCSPI05G15600
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN63777.1 hypothetical protein VITISV_043745 [Vitis vinifera]8.1e-10741.6Show/hide
Query:  KILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSFMTHEIIMKEHL-EDESKKKKNIALKTISLEVD--PEDEDGLDEDDIAYFSRKYKNFI-----KRK
        KILR LP  W  KVTAIQEAKDLTKLP+EEL+GS MT+EI + + L E E+KKKK+IALK  + + +   E++   ++DD+A  +RK   ++     +RK
Subjt:  KILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSFMTHEIIMKEHL-EDESKKKKNIALKTISLEVD--PEDEDGLDEDDIAYFSRKYKNFI-----KRK

Query:  KYFKKHLSTQKESK--GEKSKKDEE---ICSECKRSGHIRTDCPLLK-SSKKSKKKAMKATWDDSSES--ESEVEEMANLGLMAHSDKDDE-HDDK----
        K+  +   +++ES   G+K K +E+   IC +CK  GHI+ +CPL    +K+ KKKAM ATW +S ES  E + +E+AN+  MA  D D++  +DK    
Subjt:  KYFKKHLSTQKESK--GEKSKKDEE---ICSECKRSGHIRTDCPLLK-SSKKSKKKAMKATWDDSSES--ESEVEEMANLGLMAHSDKDDE-HDDK----

Query:  -HVCIEKDALLDKVRFLEHDSC------EKDNLIKVLKENELSVLQELDKAK--ETIKKLTIGAQRLDKIIEVGK----SYGDKR-----------GLGY
         H C E    ++  ++ +HD C      +     + L+   + ++ +L+K +    + K+     ++ +  ++GK    S+ +K             +  
Subjt:  -HVCIEKDALLDKVRFLEHDSC------EKDNLIKVLKENELSVLQELDKAK--ETIKKLTIGAQRLDKIIEVGK----SYGDKR-----------GLGY

Query:  IDESSTPSSSKTTFVKASPIPPRKTNGTWIVVAQDTTGDRSKLIPFSKK----NGGMVTFGDNKKG-EFDNDAFIAFCEENGFSHNFLSPRTPQQNGVVE
           S TPS    ++V    I    +  TW++     +    +   F  K     G  +T   +  G EF+N  F  +C ++G +HNF +PRTPQQNGVVE
Subjt:  IDESSTPSSSKTTFVKASPIPPRKTNGTWIVVAQDTTGDRSKLIPFSKK----NGGMVTFGDNKKG-EFDNDAFIAFCEENGFSHNFLSPRTPQQNGVVE

Query:  RKNRTLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSLGKTPYELWHGKIPNNGYFKE---KPIILNNKEKLGKFDSKTDVGIFLGYSSTSKAY
        RKNRTLQE AR++LNE  LPKYFWAE VNT CYV +R+L+RP L KTPYELW  K PN  YFK    K  ILN K+ LGKFD+K+DVGIFLGYS++SK +
Subjt:  RKNRTLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSLGKTPYELWHGKIPNNGYFKE---KPIILNNKEKLGKFDSKTDVGIFLGYSSTSKAY

Query:  RVFNKKTLVTEESIHVVFDESWNNVSNESICSDD--LEKDFGDLLVNDKGKEIVRSMQDVNIIEKKEE------------GFSS--LPKVWRYALSHLKD
        RVFNK+T+V EESIHV+F ES N++       DD  LE   G L + DK ++  +S +D     KKEE            G SS  LPK W++ ++H +D
Subjt:  RVFNKKTLVTEESIHVVFDESWNNVSNESICSDD--LEKDFGDLLVNDKGKEIVRSMQDVNIIEKKEE------------GFSS--LPKVWRYALSHLKD

Query:  LILSNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAIHE
         I+ NP  GV+TRSSL N+ +NLAF+ QIEP++ KDA  DE W++A+ E
Subjt:  LILSNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAIHE

KAA0046182.1 zf-CCHC domain-containing protein/UBN2 domain-containing protein [Cucumis melo var. makuwa]2.4e-11169.52Show/hide
Query:  RKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSFMTHEIIMKEHLEDESKKKKNIALKTISLEVDPEDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHL
        RKIL  LPKTW+AKVTAIQEAKDLTKLPLEELIGS MTHEIIM+EHLEDESKKKK+IAL TISLE+  EDED LDEDDI YFSRKYKNFIKRKK FKK+L
Subjt:  RKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSFMTHEIIMKEHLEDESKKKKNIALKTISLEVDPEDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHL

Query:  STQKESKGEKSKKDEEICSECKRSGHIRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHD----------------------
        STQK SKGEKSKKDE IC ECK+  HIRTDCP LKSSKKSK+KAMKATWDDSSESESEVEE ANLGLM  SDK+DEHD                      
Subjt:  STQKESKGEKSKKDEEICSECKRSGHIRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHD----------------------

Query:  ------------------------------------------------DKHV--CIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETI
                                                        DKH+  C EKDALLDKVRFLEHDSCEKDNLIKVLKENEL+VLQ+LDKAKETI
Subjt:  ------------------------------------------------DKHV--CIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETI

Query:  KKLTIGAQRLDKIIEVGKSYGDKRGLGYIDESSTPSSSKTTFVKASPIPPR
        KKLTIGAQRLDKIIEVGKSYGDKR LGYIDESST  SSKTTFVKASPI P+
Subjt:  KKLTIGAQRLDKIIEVGKSYGDKRGLGYIDESSTPSSSKTTFVKASPIPPR

KAA0051650.1 zf-CCHC domain-containing protein/UBN2 domain-containing protein [Cucumis melo var. makuwa]2.8e-11577.6Show/hide
Query:  NQVKESKNSMLGLGKVYTTFKNVRKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSFMTHEIIMKEHLEDESKKKKNIALKTISLEVDPEDEDGLDED
        NQ+KESK +MLGLGKVYTTF+N RKILRSLPKTW AKVTAIQEAKDLTKLP EELIGS MTHEIIMK HLEDESKK K++ALKTI LEVDP+DED LDE+
Subjt:  NQVKESKNSMLGLGKVYTTFKNVRKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSFMTHEIIMKEHLEDESKKKKNIALKTISLEVDPEDEDGLDED

Query:  DIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEEICSECKRSGHIRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEH
        DIAYFSRKYKNFIKRKKYFKKH+S+QKESK EKSKKDE                           KAMKATWDDSS SESEVE+M +LGLMAHS+K+DEH
Subjt:  DIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEEICSECKRSGHIRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEH

Query:  DD-----KHV--CIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGDKRGLGYIDESSTPSSSKTTF
        DD     KHV  C EK+ALLDKVRFLEHD CEKDNLIKVLKENEL+VLQ+LDKAKETIKKLTI AQRL +IIEVGKSYGDKRGLGYIDE STPSSSKTTF
Subjt:  DD-----KHV--CIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGDKRGLGYIDESSTPSSSKTTF

Query:  VKASPIPP
        VKASPI P
Subjt:  VKASPIPP

XP_024024455.1 uncharacterized protein LOC112092461 [Morus notabilis]1.9e-10832.88Show/hide
Query:  DGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNEGAINCLYCTLSKDEVNRISMCSSAQEIWNTLEITHEGTNQVKESKNSML-------------GLGK
        +GP+VP K  ++   PK   EYDE++ +K S N   +N LYC L ++E NRI+ C SAQ IW+TLE+ HEGTNQVKE+K + L              +  
Subjt:  DGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNEGAINCLYCTLSKDEVNRISMCSSAQEIWNTLEITHEGTNQVKESKNSML-------------GLGK

Query:  VYTTFKN-----------------VRKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSFMTHEIIMKEHLEDESKKKKNIALK-TISLEVDPEDE---
        +YT F +                 V KI+RSLPK+W  K T ++E K L+ + L++LIGS MTHE+   +  E++ KKKK IALK +I  EV+  +E   
Subjt:  VYTTFKN-----------------VRKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSFMTHEIIMKEHLEDESKKKKNIALK-TISLEVDPEDE---

Query:  -DGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEEICSECKRSGHIRTDCPLLKSSKKSK-----KKAMKATWDDSSESES---EVEEM
         D LD++ I+ F++KYK F    KY +   S + E +G+K K+DE IC +CK+ GH RT+CPL +S+KKS      KK ++ATWD+    ES   E EE+
Subjt:  -DGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEEICSECKRSGHIRTDCPLLKSSKKSK-----KKAMKATWDDSSESES---EVEEM

Query:  ANLGLMAHSDKDDEHDDKHVCIEKDALL---------------DKVRFLEHDSCEKDNLIKVLKE--------------NELSVLQELDKAKETIKKLTI
        AN+  MA  D +   + K + IE+++++               ++   L H +      I++LK+                +S+ +++D   + I K T 
Subjt:  ANLGLMAHSDKDDEHDDKHVCIEKDALL---------------DKVRFLEHDSCEKDNLIKVLKE--------------NELSVLQELDKAKETIKKLTI

Query:  GAQRLDKIIEVGKSYGDKRGLGYIDESSTPSSSKTTFVKASPIPPRKTNGTWIV---VAQDTTGDRSKLIPFSKKNGGMVTFGDNKK-------------
        G +  D+++   +   +K G+GY          KT         PR+    W +    ++  TGD S L  F++KNGG VTFGDN K             
Subjt:  GAQRLDKIIEVGKSYGDKRGLGYIDESSTPSSSKTTFVKASPIPPRKTNGTWIV---VAQDTTGDRSKLIPFSKKNGGMVTFGDNKK-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------GEFDNDAFIAFCEENGFSHNFLSPRTPQQNGVVERKNRTLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSLGKTPYELWH
                     GEFDNDAF   C ENG+ HNF +PRTPQQNGVVERKNR LQE ARS+LNE GLPKYFWAE VNT CY+ NRVL+RP + KTPYELW 
Subjt:  -------------GEFDNDAFIAFCEENGFSHNFLSPRTPQQNGVVERKNRTLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSLGKTPYELWH

Query:  GKIPNNGYFKE---KPIILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVTEESIHVVFDESWNNVSNESICSDD--LEKDFGDLLVND-----
        G+ PN GYF+    K  ILN K+ LGKFD+K+DVGIFLGYS+TSKAYRVFNK+TLV EES+HVVFDE+ N    +S+  DD  LE    ++ +ND     
Subjt:  GKIPNNGYFKE---KPIILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVTEESIHVVFDESWNNVSNESICSDD--LEKDFGDLLVND-----

Query:  -KGKE----IVRSMQDVNIIEKKEEGFSS-LPKVWRYALSHLKDLILSNPEQ
         K KE         Q+  + + K++  S+ LPK WRY+ SH KD IL+  E+
Subjt:  -KGKE----IVRSMQDVNIIEKKEEGFSS-LPKVWRYALSHLKDLILSNPEQ

XP_031741720.1 uncharacterized protein LOC116403915 [Cucumis sativus]1.3e-18972.52Show/hide
Query:  MANLLANGIVEGQSTSRPPYFD----------------------------GPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNEGAINCLYCTLSKDEVNR
        MANLLANGIVEGQSTSRPPYFD                            GPYVPMKNVDNVDTPKLEEEYDENEMKKCSFN  AINCLYC LSKDE NR
Subjt:  MANLLANGIVEGQSTSRPPYFD----------------------------GPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNEGAINCLYCTLSKDEVNR

Query:  ISMCSSAQEIWNTLEITHEGTNQVKESK------------------------------NSMLGLGKVYTTFKNVRKILRSLPKTWEAKVTAIQEAKDLTK
        ISMCSSAQEIWNTLEITHEGTNQVKESK                              N++ GLGKVYTT +NVRKILRSLPKTWEAKVTAIQEAKDLTK
Subjt:  ISMCSSAQEIWNTLEITHEGTNQVKESK------------------------------NSMLGLGKVYTTFKNVRKILRSLPKTWEAKVTAIQEAKDLTK

Query:  LPLEELIGSFMTHEIIMKEHLEDESKKKKNIALKTISLEVDPEDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEEICSECKRSGH
        LPLEELIGS MTHEIIMKEHLEDESKKKK+IALKTISLEVDPEDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDE IC ECKRSGH
Subjt:  LPLEELIGSFMTHEIIMKEHLEDESKKKKNIALKTISLEVDPEDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEEICSECKRSGH

Query:  IRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHD------------------------------------------------
        IRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHD                                                
Subjt:  IRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHD------------------------------------------------

Query:  -----------------------DKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGDKRGLG
                               DKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGDKRGLG
Subjt:  -----------------------DKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGDKRGLG

Query:  YIDESSTPSSSKTTFVKASPIPPR
        YIDESSTPSSSKTTFVKASPI P+
Subjt:  YIDESSTPSSSKTTFVKASPIPPR

TrEMBL top hitse value%identityAlignment
A0A2N9GUN3 Uncharacterized protein2.0e-12738.87Show/hide
Query:  IVEGQSTSRPPYF----------------------------DGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNEGAINCLYCTLSKDEVNRISMCSSAQ
        ++EGQST RPP F                            +GP++P K V+     KLE E++E +++    N  A++ LYC L   E NR+S C SA+
Subjt:  IVEGQSTSRPPYF----------------------------DGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNEGAINCLYCTLSKDEVNRISMCSSAQ

Query:  EIWNTLEITHEGTNQVKESK------------------------------NSMLGLGKVYTTFKNVRKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIG
        EIW+ LE+T+EGTNQVKESK                              NS+  LGK+YT  +NVRKILRSLPK WEAK+TAI EA+DL  L LEEL G
Subjt:  EIWNTLEITHEGTNQVKESK------------------------------NSMLGLGKVYTTFKNVRKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIG

Query:  SFMTHEIIMKEHLEDES-KKKKNIALKTISLEVDPEDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEEICSECKRSGHIRTDCPL
        S MT+E+ M   +E+E  K KKN ALK+   + D  +E+  +E++IA  +R +K F+K+KK F +    + E+KGE SK +   C +CK+ GH + +CP 
Subjt:  SFMTHEIIMKEHLEDES-KKKKNIALKTISLEVDPEDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEEICSECKRSGHIRTDCPL

Query:  LKSSK-KSKKKAMKATWDDSSESESE----VEEMANLGLMAHSDKDDEHDDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETI
        +   K K KKKA+K TWDDS ES+S+      E+ANL L+ + ++ +  +D+H      A    + F + +S  +D  +                     
Subjt:  LKSSK-KSKKKAMKATWDDSSESESE----VEEMANLGLMAHSDKDDEHDDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETI

Query:  KKLTIGAQRLDKIIEVGKSYGDKRGLGYIDESSTPSSSKTTFVKASPIPPRKTNGTWIVVAQDTTGDRSKLIPFSKKNGGMVTFGDNKKGEFDNDAFIAF
                          ++GD+  L       + S+ K  F+ +               ++  TGD++K    + K+GG V FGDN KG+         
Subjt:  KKLTIGAQRLDKIIEVGKSYGDKRGLGYIDESSTPSSSKTTFVKASPIPPRKTNGTWIVVAQDTTGDRSKLIPFSKKNGGMVTFGDNKKGEFDNDAFIAF

Query:  CEENGFSHNFLSPRTPQQNGVVERKNRTLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSLGKTPYELWHGKIPNNGYFKE---KPIILNNKEK
                  ++PRTPQQNGVVERKNR+LQE +R++LNE+ LP+YFWAE VNT CYV NR ++R +L KTPYELW+ + PN GYFK    K  +LN+++ 
Subjt:  CEENGFSHNFLSPRTPQQNGVVERKNRTLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSLGKTPYELWHGKIPNNGYFKE---KPIILNNKEK

Query:  LGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVTEESIHVVFDES-----WNNVSNESICSDDLEKDFGDLLVNDKGKEIVRSMQDVNIIEKK---EEGFS
        LGKFD+K+D GIFLGYS+ SKAYRVFNK+T+V +ES+HVVFDE+      NN  +E I  ++       + +++K K+ V   +D    E+K        
Subjt:  LGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVTEESIHVVFDES-----WNNVSNESICSDDLEKDFGDLLVNDKGKEIVRSMQDVNIIEKK---EEGFS

Query:  SLPKVWRYALSHLKDLILSNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAIHE
         LPK W    +H K+LI+   E GV TRS L ++ +N+AF+SQIEP++  +A  DE WILA+ E
Subjt:  SLPKVWRYALSHLKDLILSNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAIHE

A0A5A7TRZ7 Zf-CCHC domain-containing protein/UBN2 domain-containing protein1.2e-11169.52Show/hide
Query:  RKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSFMTHEIIMKEHLEDESKKKKNIALKTISLEVDPEDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHL
        RKIL  LPKTW+AKVTAIQEAKDLTKLPLEELIGS MTHEIIM+EHLEDESKKKK+IAL TISLE+  EDED LDEDDI YFSRKYKNFIKRKK FKK+L
Subjt:  RKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSFMTHEIIMKEHLEDESKKKKNIALKTISLEVDPEDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHL

Query:  STQKESKGEKSKKDEEICSECKRSGHIRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHD----------------------
        STQK SKGEKSKKDE IC ECK+  HIRTDCP LKSSKKSK+KAMKATWDDSSESESEVEE ANLGLM  SDK+DEHD                      
Subjt:  STQKESKGEKSKKDEEICSECKRSGHIRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHD----------------------

Query:  ------------------------------------------------DKHV--CIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETI
                                                        DKH+  C EKDALLDKVRFLEHDSCEKDNLIKVLKENEL+VLQ+LDKAKETI
Subjt:  ------------------------------------------------DKHV--CIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETI

Query:  KKLTIGAQRLDKIIEVGKSYGDKRGLGYIDESSTPSSSKTTFVKASPIPPR
        KKLTIGAQRLDKIIEVGKSYGDKR LGYIDESST  SSKTTFVKASPI P+
Subjt:  KKLTIGAQRLDKIIEVGKSYGDKRGLGYIDESSTPSSSKTTFVKASPIPPR

A0A5A7U923 Zf-CCHC domain-containing protein/UBN2 domain-containing protein1.3e-11577.6Show/hide
Query:  NQVKESKNSMLGLGKVYTTFKNVRKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSFMTHEIIMKEHLEDESKKKKNIALKTISLEVDPEDEDGLDED
        NQ+KESK +MLGLGKVYTTF+N RKILRSLPKTW AKVTAIQEAKDLTKLP EELIGS MTHEIIMK HLEDESKK K++ALKTI LEVDP+DED LDE+
Subjt:  NQVKESKNSMLGLGKVYTTFKNVRKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSFMTHEIIMKEHLEDESKKKKNIALKTISLEVDPEDEDGLDED

Query:  DIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEEICSECKRSGHIRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEH
        DIAYFSRKYKNFIKRKKYFKKH+S+QKESK EKSKKDE                           KAMKATWDDSS SESEVE+M +LGLMAHS+K+DEH
Subjt:  DIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEEICSECKRSGHIRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEH

Query:  DD-----KHV--CIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGDKRGLGYIDESSTPSSSKTTF
        DD     KHV  C EK+ALLDKVRFLEHD CEKDNLIKVLKENEL+VLQ+LDKAKETIKKLTI AQRL +IIEVGKSYGDKRGLGYIDE STPSSSKTTF
Subjt:  DD-----KHV--CIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGDKRGLGYIDESSTPSSSKTTF

Query:  VKASPIPP
        VKASPI P
Subjt:  VKASPIPP

A5BS59 Uncharacterized protein3.9e-10741.6Show/hide
Query:  KILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSFMTHEIIMKEHL-EDESKKKKNIALKTISLEVD--PEDEDGLDEDDIAYFSRKYKNFI-----KRK
        KILR LP  W  KVTAIQEAKDLTKLP+EEL+GS MT+EI + + L E E+KKKK+IALK  + + +   E++   ++DD+A  +RK   ++     +RK
Subjt:  KILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSFMTHEIIMKEHL-EDESKKKKNIALKTISLEVD--PEDEDGLDEDDIAYFSRKYKNFI-----KRK

Query:  KYFKKHLSTQKESK--GEKSKKDEE---ICSECKRSGHIRTDCPLLK-SSKKSKKKAMKATWDDSSES--ESEVEEMANLGLMAHSDKDDE-HDDK----
        K+  +   +++ES   G+K K +E+   IC +CK  GHI+ +CPL    +K+ KKKAM ATW +S ES  E + +E+AN+  MA  D D++  +DK    
Subjt:  KYFKKHLSTQKESK--GEKSKKDEE---ICSECKRSGHIRTDCPLLK-SSKKSKKKAMKATWDDSSES--ESEVEEMANLGLMAHSDKDDE-HDDK----

Query:  -HVCIEKDALLDKVRFLEHDSC------EKDNLIKVLKENELSVLQELDKAK--ETIKKLTIGAQRLDKIIEVGK----SYGDKR-----------GLGY
         H C E    ++  ++ +HD C      +     + L+   + ++ +L+K +    + K+     ++ +  ++GK    S+ +K             +  
Subjt:  -HVCIEKDALLDKVRFLEHDSC------EKDNLIKVLKENELSVLQELDKAK--ETIKKLTIGAQRLDKIIEVGK----SYGDKR-----------GLGY

Query:  IDESSTPSSSKTTFVKASPIPPRKTNGTWIVVAQDTTGDRSKLIPFSKK----NGGMVTFGDNKKG-EFDNDAFIAFCEENGFSHNFLSPRTPQQNGVVE
           S TPS    ++V    I    +  TW++     +    +   F  K     G  +T   +  G EF+N  F  +C ++G +HNF +PRTPQQNGVVE
Subjt:  IDESSTPSSSKTTFVKASPIPPRKTNGTWIVVAQDTTGDRSKLIPFSKK----NGGMVTFGDNKKG-EFDNDAFIAFCEENGFSHNFLSPRTPQQNGVVE

Query:  RKNRTLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSLGKTPYELWHGKIPNNGYFKE---KPIILNNKEKLGKFDSKTDVGIFLGYSSTSKAY
        RKNRTLQE AR++LNE  LPKYFWAE VNT CYV +R+L+RP L KTPYELW  K PN  YFK    K  ILN K+ LGKFD+K+DVGIFLGYS++SK +
Subjt:  RKNRTLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSLGKTPYELWHGKIPNNGYFKE---KPIILNNKEKLGKFDSKTDVGIFLGYSSTSKAY

Query:  RVFNKKTLVTEESIHVVFDESWNNVSNESICSDD--LEKDFGDLLVNDKGKEIVRSMQDVNIIEKKEE------------GFSS--LPKVWRYALSHLKD
        RVFNK+T+V EESIHV+F ES N++       DD  LE   G L + DK ++  +S +D     KKEE            G SS  LPK W++ ++H +D
Subjt:  RVFNKKTLVTEESIHVVFDESWNNVSNESICSDD--LEKDFGDLLVNDKGKEIVRSMQDVNIIEKKEE------------GFSS--LPKVWRYALSHLKD

Query:  LILSNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAIHE
         I+ NP  GV+TRSSL N+ +NLAF+ QIEP++ KDA  DE W++A+ E
Subjt:  LILSNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAIHE

A5C8K0 Uncharacterized protein1.1e-10432.64Show/hide
Query:  DGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNEGAINCLYCTLSKDEVNRISMCSSAQEIWNTLEITHEGTNQVKESK---------------------
        DGP  P K VD V  PK ++E++E + +    N  A+  L C + ++E NRI  C SA+EIW  LEITHEGTNQVKESK                     
Subjt:  DGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNEGAINCLYCTLSKDEVNRISMCSSAQEIWNTLEITHEGTNQVKESK---------------------

Query:  ---------NSMLGLGKVYTTFKNVRKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSFMTHEI-IMKEHLEDESKKKKNIALK-TISLEVDPEDEDG
                 N +  LG+V    + V KILRSLP  W  KVTAIQEAKDLTKLP+EEL+GS MT+EI + K+  E E KKKKNIALK T   E D E+E  
Subjt:  ---------NSMLGLGKVYTTFKNVRKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSFMTHEI-IMKEHLEDESKKKKNIALK-TISLEVDPEDEDG

Query:  LDE-DDIAYFSRKYKNFIKRKKYFKKHLSTQK-------ESKGEKSKKDEE---ICSECKRSGHIRTDCPLLK-SSKKSKKKAMKATWDDSSES--ESEV
         +E DD+A  +RK   +++ +++  K  ++++        S G+K K +E+   IC +CK  GHI+ DCPL K  +K+  KKAM ATW +S ES  E + 
Subjt:  LDE-DDIAYFSRKYKNFIKRKKYFKKHLSTQK-------ESKGEKSKKDEE---ICSECKRSGHIRTDCPLLK-SSKKSKKKAMKATWDDSSES--ESEV

Query:  EEMANLGLMAHSDKDD-----EHDDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGD---K
        +E+AN+  MA  D D+       +D HV  E     +     E  S +  +L K ++E E    +EL++ KE      I    L+K  E+ ++  +   K
Subjt:  EEMANLGLMAHSDKDD-----EHDDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENELSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGD---K

Query:  RGLGYIDESSTPSSSKTTFVKASPIPPRKTNGTWIV---VAQDTTGDRSKLIPFSKKNGGMVTFGDNKKG------------------------------
        +        S  S  + +F              W +    ++  TGD SK    +K+ GG VTFGDN KG                              
Subjt:  RGLGYIDESSTPSSSKTTFVKASPIPPRKTNGTWIV---VAQDTTGDRSKLIPFSKKNGGMVTFGDNKKG------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------------------EFDN
                                                                                                        EF+N
Subjt:  ------------------------------------------------------------------------------------------------EFDN

Query:  DAFIAFCEENGFSHNFLSPRTPQQNGVVERKNRTLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSLGKTPYELWHGKIPNNGYFKE---KPII
          F  +C + G +HNFL+PRT QQNGVVERKNRTLQE AR++LNE  LPKYFWAE +NT CYV NR+L+RP L KTPYELW  K PN  YFK    K  I
Subjt:  DAFIAFCEENGFSHNFLSPRTPQQNGVVERKNRTLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSLGKTPYELWHGKIPNNGYFKE---KPII

Query:  LNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVTEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVRSMQDVNIIEKKEEGFSSL
        LN K+ LGKFD+K+DVGIFLGYS++SKA+RVFNK+T+V EESIH  +   W N                 L   D  K++ R  +     +K       L
Subjt:  LNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVTEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVRSMQDVNIIEKKEEGFSSL

Query:  PKVWRYALSHLKDLILSNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAIHE
            ++ ++H +D I+ NP  GV+TRSSL N+ +NLAF+SQIEP++ KDA  DE W++A+ +
Subjt:  PKVWRYALSHLKDLILSNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAIHE

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.6e-1936.36Show/hide
Query:  EFDNDAFIAFCEENGFSHNFLSPRTPQQNGVVERKNRTLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSL--GKTPYELWHGKIPNNGYFK--
        E+ ++    FC + G S++   P TPQ NGV ER  RT+ E AR++++   L K FW E V T  Y+ NR+  R  +   KTPYE+WH K P   + +  
Subjt:  EFDNDAFIAFCEENGFSHNFLSPRTPQQNGVVERKNRTLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSL--GKTPYELWHGKIPNNGYFK--

Query:  EKPIILNNKEKLGKFDSKTDVGIFLGYSSTS-KAYRVFNKKTLVTEESIHVVFDESWNNVSNESI
           + ++ K K GKFD K+   IF+GY     K +   N+K +V  +   VV DE+ N V++ ++
Subjt:  EKPIILNNKEKLGKFDSKTDVGIFLGYSSTS-KAYRVFNKKTLVTEESIHVVFDESWNNVSNESI

P0C2I3 Transposon Ty1-DR6 Gag-Pol polyprotein7.8e-0423.08Show/hide
Query:  PSSSKTTFVKASPIPPRKTNGTWIVVAQDTTGDR-----SKLIPFSKK--NGGMVTFGDNKKGEFDNDAFIAFCEENGFSHNFLSPRTPQQNGVVERKNR
        P+S+ + F+         T   W+    D   D      + ++ F K      ++    ++  E+ N     F E+NG +  + +    + +GV ER NR
Subjt:  PSSSKTTFVKASPIPPRKTNGTWIVVAQDTTGDR-----SKLIPFSKK--NGGMVTFGDNKKGEFDNDAFIAFCEENGFSHNFLSPRTPQQNGVVERKNR

Query:  TLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSLGKTPYELWHGKIPNNGYFK----EKPIILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVF
        TL +  R+ L   GLP Y W   +     V N  L  P   K+  +  H  +            +P+I+N+     K   +   G  L  S  S  Y ++
Subjt:  TLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSLGKTPYELWHGKIPNNGYFK----EKPIILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVF

Query:  ---NKKTLVTEESIHVVFDES
            KKT+ T   + +   ES
Subjt:  ---NKKTLVTEESIHVVFDES

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.3e-1836.13Show/hide
Query:  GEFDNDAFIAFCEENGFSHNFLSPRTPQQNGVVERKNRTLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSLGKTPYELWHGKIPNNGYFKE--
        GE+ +  F  +C  +G  H    P TPQ NGV ER NRT+ E  RS+L    LPK FW E V T CY+ NR    P   + P  +W  K  +  + K   
Subjt:  GEFDNDAFIAFCEENGFSHNFLSPRTPQQNGVVERKNRTLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSLGKTPYELWHGKIPNNGYFKE--

Query:  -KPIILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVTEESIHVVFDES
         +      KE+  K D K+   IF+GY      YR+++        S  VVF ES
Subjt:  -KPIILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVTEESIHVVFDES

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.1e-1026.92Show/hide
Query:  YIDESSTPSSSKTTFVKASPIPPRKTNGTWIVVAQDTTGDRSKLIPF-----SKKNGGMVTFGDNKKGEFDNDAFIAFCEENGFSHNFLSPRTPQQNGVV
        Y D  S+P  S   +          T  TW+   +  +  +   I F     ++    + TF  +  GEF   A   +  ++G SH    P TP+ NG+ 
Subjt:  YIDESSTPSSSKTTFVKASPIPPRKTNGTWIVVAQDTTGDRSKLIPF-----SKKNGGMVTFGDNKKGEFDNDAFIAFCEENGFSHNFLSPRTPQQNGVV

Query:  ERKNRTLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSLGKTPYELWHGKIPNNGYFKE---------KPIILNNKEKLGKFDSKTDVGIFLGY
        ERK+R + E   ++L+   +PK +W        Y+ NR+       ++P++   G  PN    +          +P    N+ KL   D K+   +FLGY
Subjt:  ERKNRTLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSLGKTPYELWHGKIPNNGYFKE---------KPIILNNKEKLGKFDSKTDVGIFLGY

Query:  SSTSKAYRVFNKKTLVTEESIHVVFDESWNNVSN
        S T  AY   + +T     S HV FDE+    SN
Subjt:  SSTSKAYRVFNKKTLVTEESIHVVFDESWNNVSN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.6e-0930.2Show/hide
Query:  FCEENGFSHNFLSPRTPQQNGVVERKNRTLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSLGKTPYELWHGKIPNNGYFKEKP--------II
        +  ++G SH    P TP+ NG+ ERK+R + E   ++L+   +PK +W    +   Y+ NR+       ++P++   G+ PN  Y K K         + 
Subjt:  FCEENGFSHNFLSPRTPQQNGVVERKNRTLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSLGKTPYELWHGKIPNNGYFKEKP--------II

Query:  LNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVTEESIHVVFDE
          N+ KL   + K+    F+GYS T  AY   +  T     S HV FDE
Subjt:  LNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVTEESIHVVFDE

Arabidopsis top hitse value%identityAlignment
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.6e-0436.07Show/hide
Query:  NRTLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSLGKTPYELWHGKIPNNGYFK
        NRT+ E  RS+L E GLPK F A+  NT  ++ N+          P E+W   +P   Y +
Subjt:  NRTLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSLGKTPYELWHGKIPNNGYFK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAACCTATTGGCAAATGGGATTGTTGAAGGTCAATCTACTTCTAGACCTCCTTACTTTGATGGTCCTTATGTACCCATGAAAAATGTTGATAATGTTGATACGCC
TAAATTAGAAGAAGAGTATGATGAAAATGAAATGAAAAAGTGTTCATTTAATGAAGGCGCTATTAATTGTTTGTATTGTACCTTGAGTAAAGATGAAGTTAATAGAATAT
CCATGTGTTCTTCCGCTCAAGAAATTTGGAATACTCTTGAAATTACTCATGAAGGAACAAATCAAGTTAAAGAGTCTAAAAATAGCATGCTTGGTCTTGGTAAAGTTTAT
ACAACTTTCAAAAATGTTAGAAAAATTCTAAGATCTCTACCTAAGACTTGGGAAGCTAAGGTAACGGCAATTCAAGAAGCAAAGGATCTCACCAAACTTCCACTAGAGGA
GCTTATTGGCTCATTTATGACTCATGAGATCATCATGAAGGAGCACTTGGAGGATGAGTCGAAAAAGAAGAAGAACATTGCATTAAAGACTATCTCCTTGGAAGTTGATC
CTGAAGATGAGGATGGCCTTGATGAAGATGACATTGCTTATTTCTCACGTAAGTACAAAAATTTCATAAAAAGAAAGAAATATTTCAAGAAACACCTATCAACCCAAAAA
GAGTCAAAAGGTGAGAAAAGCAAAAAGGATGAGGAGATTTGTTCTGAATGCAAAAGATCGGGTCATATAAGAACGGATTGTCCTCTCCTCAAATCATCTAAGAAATCCAA
GAAGAAGGCAATGAAGGCTACATGGGATGATAGTAGTGAAAGTGAAAGTGAAGTTGAAGAAATGGCAAATCTTGGTCTCATGGCTCATAGTGACAAAGATGATGAACATG
ATGATAAGCATGTTTGTATTGAGAAAGATGCTTTGCTTGATAAAGTTAGATTTCTTGAGCATGATAGTTGTGAAAAAGATAACTTGATTAAAGTGCTTAAAGAAAATGAA
CTAAGTGTTTTACAAGAACTTGATAAAGCTAAAGAAACTATTAAAAAGTTGACAATAGGTGCTCAAAGATTGGACAAAATTATTGAAGTAGGAAAATCTTATGGTGATAA
GAGAGGTTTAGGCTATATTGATGAATCATCTACTCCATCAAGTTCTAAAACTACATTCGTTAAAGCATCACCTATTCCTCCAAGAAAAACAAATGGTACTTGGATAGTGG
TTGCTCAAGACACGACAGGAGATCGATCCAAGCTTATCCCTTTCTCCAAAAAGAATGGAGGCATGGTAACCTTTGGTGACAACAAGAAAGGAGAATTTGATAATGATGCT
TTTATAGCTTTTTGTGAAGAAAATGGTTTTTCCCATAATTTTCTCTCTCCAAGAACTCCTCAACAAAATGGTGTGGTTGAAAGGAAAAATCGTACTTTGCAAGAATTTGC
TAGATCAATATTGAATGAGTATGGTTTACCTAAATATTTTTGGGCGGAAGTCGTTAACACCGATTGCTATGTTTCAAATAGAGTTTTAGTTAGACCCTCTTTAGGTAAAA
CTCCTTATGAACTTTGGCATGGAAAAATTCCAAATAATGGGTATTTCAAAGAAAAACCCATTATTTTGAATAACAAAGAAAAACTTGGAAAATTTGATTCTAAGACGGAT
GTTGGTATTTTTCTAGGATATTCCTCTACTAGTAAAGCTTATAGAGTTTTCAATAAGAAAACTTTAGTTACTGAGGAATCTATTCATGTTGTATTTGATGAATCTTGGAA
TAATGTTTCTAATGAGTCTATTTGTAGTGATGATTTAGAAAAAGATTTTGGAGATTTACTTGTTAATGACAAAGGCAAAGAAATTGTTCGAAGTATGCAAGATGTGAACA
TCATAGAAAAGAAAGAAGAGGGTTTTTCATCCTTGCCTAAAGTGTGGAGATATGCTCTATCCCATCTCAAGGATTTAATTCTTAGCAATCCCGAACAAGGTGTCAAAACT
CGTTCTTCTCTTAATTTATTTAGTAATCTTGCTTTTGTGTCTCAAATTGAACCTAGAAGTTTTAAAGATGCCGAATGTGATGAGTTTTGGATTTTAGCTATACATGAAAG
CTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAAACCTATTGGCAAATGGGATTGTTGAAGGTCAATCTACTTCTAGACCTCCTTACTTTGATGGTCCTTATGTACCCATGAAAAATGTTGATAATGTTGATACGCC
TAAATTAGAAGAAGAGTATGATGAAAATGAAATGAAAAAGTGTTCATTTAATGAAGGCGCTATTAATTGTTTGTATTGTACCTTGAGTAAAGATGAAGTTAATAGAATAT
CCATGTGTTCTTCCGCTCAAGAAATTTGGAATACTCTTGAAATTACTCATGAAGGAACAAATCAAGTTAAAGAGTCTAAAAATAGCATGCTTGGTCTTGGTAAAGTTTAT
ACAACTTTCAAAAATGTTAGAAAAATTCTAAGATCTCTACCTAAGACTTGGGAAGCTAAGGTAACGGCAATTCAAGAAGCAAAGGATCTCACCAAACTTCCACTAGAGGA
GCTTATTGGCTCATTTATGACTCATGAGATCATCATGAAGGAGCACTTGGAGGATGAGTCGAAAAAGAAGAAGAACATTGCATTAAAGACTATCTCCTTGGAAGTTGATC
CTGAAGATGAGGATGGCCTTGATGAAGATGACATTGCTTATTTCTCACGTAAGTACAAAAATTTCATAAAAAGAAAGAAATATTTCAAGAAACACCTATCAACCCAAAAA
GAGTCAAAAGGTGAGAAAAGCAAAAAGGATGAGGAGATTTGTTCTGAATGCAAAAGATCGGGTCATATAAGAACGGATTGTCCTCTCCTCAAATCATCTAAGAAATCCAA
GAAGAAGGCAATGAAGGCTACATGGGATGATAGTAGTGAAAGTGAAAGTGAAGTTGAAGAAATGGCAAATCTTGGTCTCATGGCTCATAGTGACAAAGATGATGAACATG
ATGATAAGCATGTTTGTATTGAGAAAGATGCTTTGCTTGATAAAGTTAGATTTCTTGAGCATGATAGTTGTGAAAAAGATAACTTGATTAAAGTGCTTAAAGAAAATGAA
CTAAGTGTTTTACAAGAACTTGATAAAGCTAAAGAAACTATTAAAAAGTTGACAATAGGTGCTCAAAGATTGGACAAAATTATTGAAGTAGGAAAATCTTATGGTGATAA
GAGAGGTTTAGGCTATATTGATGAATCATCTACTCCATCAAGTTCTAAAACTACATTCGTTAAAGCATCACCTATTCCTCCAAGAAAAACAAATGGTACTTGGATAGTGG
TTGCTCAAGACACGACAGGAGATCGATCCAAGCTTATCCCTTTCTCCAAAAAGAATGGAGGCATGGTAACCTTTGGTGACAACAAGAAAGGAGAATTTGATAATGATGCT
TTTATAGCTTTTTGTGAAGAAAATGGTTTTTCCCATAATTTTCTCTCTCCAAGAACTCCTCAACAAAATGGTGTGGTTGAAAGGAAAAATCGTACTTTGCAAGAATTTGC
TAGATCAATATTGAATGAGTATGGTTTACCTAAATATTTTTGGGCGGAAGTCGTTAACACCGATTGCTATGTTTCAAATAGAGTTTTAGTTAGACCCTCTTTAGGTAAAA
CTCCTTATGAACTTTGGCATGGAAAAATTCCAAATAATGGGTATTTCAAAGAAAAACCCATTATTTTGAATAACAAAGAAAAACTTGGAAAATTTGATTCTAAGACGGAT
GTTGGTATTTTTCTAGGATATTCCTCTACTAGTAAAGCTTATAGAGTTTTCAATAAGAAAACTTTAGTTACTGAGGAATCTATTCATGTTGTATTTGATGAATCTTGGAA
TAATGTTTCTAATGAGTCTATTTGTAGTGATGATTTAGAAAAAGATTTTGGAGATTTACTTGTTAATGACAAAGGCAAAGAAATTGTTCGAAGTATGCAAGATGTGAACA
TCATAGAAAAGAAAGAAGAGGGTTTTTCATCCTTGCCTAAAGTGTGGAGATATGCTCTATCCCATCTCAAGGATTTAATTCTTAGCAATCCCGAACAAGGTGTCAAAACT
CGTTCTTCTCTTAATTTATTTAGTAATCTTGCTTTTGTGTCTCAAATTGAACCTAGAAGTTTTAAAGATGCCGAATGTGATGAGTTTTGGATTTTAGCTATACATGAAAG
CTAG
Protein sequenceShow/hide protein sequence
MANLLANGIVEGQSTSRPPYFDGPYVPMKNVDNVDTPKLEEEYDENEMKKCSFNEGAINCLYCTLSKDEVNRISMCSSAQEIWNTLEITHEGTNQVKESKNSMLGLGKVY
TTFKNVRKILRSLPKTWEAKVTAIQEAKDLTKLPLEELIGSFMTHEIIMKEHLEDESKKKKNIALKTISLEVDPEDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQK
ESKGEKSKKDEEICSECKRSGHIRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKDDEHDDKHVCIEKDALLDKVRFLEHDSCEKDNLIKVLKENE
LSVLQELDKAKETIKKLTIGAQRLDKIIEVGKSYGDKRGLGYIDESSTPSSSKTTFVKASPIPPRKTNGTWIVVAQDTTGDRSKLIPFSKKNGGMVTFGDNKKGEFDNDA
FIAFCEENGFSHNFLSPRTPQQNGVVERKNRTLQEFARSILNEYGLPKYFWAEVVNTDCYVSNRVLVRPSLGKTPYELWHGKIPNNGYFKEKPIILNNKEKLGKFDSKTD
VGIFLGYSSTSKAYRVFNKKTLVTEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVRSMQDVNIIEKKEEGFSSLPKVWRYALSHLKDLILSNPEQGVKT
RSSLNLFSNLAFVSQIEPRSFKDAECDEFWILAIHES