; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032918 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032918
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr11:38874070..38887089
RNA-Seq ExpressionLag0032918
SyntenyLag0032918
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0003724 - RNA helicase activity (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR007529 - Zinc finger, HIT-type
IPR015410 - Domain of unknown function DUF1985
IPR036875 - Zinc finger, CCHC-type superfamily
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606180.1 putative ATP-dependent RNA helicase DDX59, partial [Cucurbita argyrosperma subsp. sororia]3.2e-23490.3Show/hide
Query:  MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAPPTDDQPPPVVKKNENRKR-RREPELSGNPKYDVGNSDGPMSHQDYIERRRKEANKSQPYET
        MGSRTNFYKNPSISY KDLNLSSALQNLRAYNIA GNAPPTDDQPPPV KKNENRKR RREPELSGNPKYDVG+SDGPMSHQDYIE+RRKEAN+ QPYET
Subjt:  MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAPPTDDQPPPVVKKNENRKR-RREPELSGNPKYDVGNSDGPMSHQDYIERRRKEANKSQPYET

Query:  LTEDVLGTSSLGLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSEEPS
        LTEDVLGTSS GLNLV YESDES+SSESAVKPDHQNSSLLNE++EV+ KTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILK SEE S
Subjt:  LTEDVLGTSSLGLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSEEPS

Query:  NCEVKDVALPETKYILPSPEFGEDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQVMPQNTSNPVPGDLLGLYKRCYHIGKNLSNALCNE
        NCEVKDVALPETK+ILPSPEFGEDTWDYKNHRWSK KSNLCTYECWKCQKPGHLAEDCLV+TSNQVM Q T N VP DL+GLYKRCY IGKNLS+ALCNE
Subjt:  NCEVKDVALPETKYILPSPEFGEDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQVMPQNTSNPVPGDLLGLYKRCYHIGKNLSNALCNE

Query:  CSCSFSLATCLDCSTVYCDSAGHLNEHIHSHPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASWKGAGLSIISGSICCEDH
        CSCSFSLATCLDCSTVYCD+AGHLNEHI  HPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASW GAG+SIISGSICCEDH
Subjt:  CSCSFSLATCLDCSTVYCDSAGHLNEHIHSHPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASWKGAGLSIISGSICCEDH

Query:  FAWHRMNCFNADVEDTAYIIKRNAQKGKSISIS
        FAWHRMNCFNADVED AYI+ RN +  KS S+S
Subjt:  FAWHRMNCFNADVEDTAYIIKRNAQKGKSISIS

KAG7036125.1 putative ATP-dependent RNA helicase DDX59 [Cucurbita argyrosperma subsp. argyrosperma]1.9e-23490.53Show/hide
Query:  MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAPPTDDQPPPVVKKNENRKR-RREPELSGNPKYDVGNSDGPMSHQDYIERRRKEANKSQPYET
        MGSRTNFYKNPSISY KDLNLSSALQNLRAYNIA GNAPPTDDQPPPV KKNENRKR RREPELSGNPKYDVG+SDGPMSHQDYIE+RRKEAN+ QPYET
Subjt:  MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAPPTDDQPPPVVKKNENRKR-RREPELSGNPKYDVGNSDGPMSHQDYIERRRKEANKSQPYET

Query:  LTEDVLGTSSLGLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSEEPS
        LTEDVLGTSS GLNLV YESDES+SSESAVKPDHQNSSLLNE++EV+ KTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILK SEE S
Subjt:  LTEDVLGTSSLGLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSEEPS

Query:  NCEVKDVALPETKYILPSPEFGEDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQVMPQNTSNPVPGDLLGLYKRCYHIGKNLSNALCNE
        NCEVKDVALPETK+ILPSPEFGEDTWDYKNHRWSK KSNLCTYECWKCQKPGHLAEDCLV+TSNQVM Q T N VP DL+GLYKRCY IGKNLS+ALCNE
Subjt:  NCEVKDVALPETKYILPSPEFGEDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQVMPQNTSNPVPGDLLGLYKRCYHIGKNLSNALCNE

Query:  CSCSFSLATCLDCSTVYCDSAGHLNEHIHSHPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASWKGAGLSIISGSICCEDH
        CSCSFSLATCLDCSTVYCD+AGHLNEHI  HPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASW GAGLSIISGSICCEDH
Subjt:  CSCSFSLATCLDCSTVYCDSAGHLNEHIHSHPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASWKGAGLSIISGSICCEDH

Query:  FAWHRMNCFNADVEDTAYIIKRNAQKGKSISIS
        FAWHRMNCFNADVED AYI+ RN +  KS S+S
Subjt:  FAWHRMNCFNADVEDTAYIIKRNAQKGKSISIS

XP_022996224.1 uncharacterized protein LOC111491515 isoform X1 [Cucurbita maxima]1.0e-23590.99Show/hide
Query:  MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAPPTDDQPPPVVKKNENRKR-RREPELSGNPKYDVGNSDGPMSHQDYIERRRKEANKSQPYET
        MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIA GNAPPTDDQP PV KKNENRKR RRE ELSGNPKYDVG+SDGPMSHQDYIE+RRKEAN+ QPYET
Subjt:  MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAPPTDDQPPPVVKKNENRKR-RREPELSGNPKYDVGNSDGPMSHQDYIERRRKEANKSQPYET

Query:  LTEDVLGTSSLGLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSEEPS
        L+EDVLGTSS GLNLV YESDES+SSESAVKPDHQNSSLLNE++EV+ KTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILK SEE S
Subjt:  LTEDVLGTSSLGLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSEEPS

Query:  NCEVKDVALPETKYILPSPEFGEDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQVMPQNTSNPVPGDLLGLYKRCYHIGKNLSNALCNE
        NCEVKDVALPETKYILPSPEFGEDTWDYKNHRWSK KSNLCTYECWKCQKPGHLAEDCLV+TSNQVM Q T N VP DL+GLYKRCY IGKNLSN+LCNE
Subjt:  NCEVKDVALPETKYILPSPEFGEDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQVMPQNTSNPVPGDLLGLYKRCYHIGKNLSNALCNE

Query:  CSCSFSLATCLDCSTVYCDSAGHLNEHIHSHPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASWKGAGLSIISGSICCEDH
        CSCSFSLATCLDCSTVYCDSAGHLNEHIH HPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASW GAGLSIISGSICCEDH
Subjt:  CSCSFSLATCLDCSTVYCDSAGHLNEHIHSHPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASWKGAGLSIISGSICCEDH

Query:  FAWHRMNCFNADVEDTAYIIKRNAQKGKSISIS
        FAWHRMNCFNADVED AYI+ RN + GKS S+S
Subjt:  FAWHRMNCFNADVEDTAYIIKRNAQKGKSISIS

XP_023532353.1 uncharacterized protein LOC111794552 isoform X1 [Cucurbita pepo subsp. pepo]4.2e-23490.3Show/hide
Query:  MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAPPTDDQPPPVVKKNENRKR-RREPELSGNPKYDVGNSDGPMSHQDYIERRRKEANKSQPYET
        MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIA GNAPPTDDQPPPV KKNENRKR RREPELSGNPKYDVG+SDGPMSHQDYIE+RRKEAN+ QPYET
Subjt:  MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAPPTDDQPPPVVKKNENRKR-RREPELSGNPKYDVGNSDGPMSHQDYIERRRKEANKSQPYET

Query:  LTEDVLGTSSLGLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSEEPS
        LTEDVLGTSS GLNLV YESDES+SSE+AVKPDHQNSSLLNE++EV+ KTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILK SE  S
Subjt:  LTEDVLGTSSLGLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSEEPS

Query:  NCEVKDVALPETKYILPSPEFGEDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQVMPQNTSNPVPGDLLGLYKRCYHIGKNLSNALCNE
        NCEVKDVALPETKYILPSPEFGEDTWDYKNHRWSK KSNLCTYECWKCQKPGHLA DCLV+TSNQVM Q T N VP DL+GLYKRCY IGKNLSNALCNE
Subjt:  NCEVKDVALPETKYILPSPEFGEDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQVMPQNTSNPVPGDLLGLYKRCYHIGKNLSNALCNE

Query:  CSCSFSLATCLDCSTVYCDSAGHLNEHIHSHPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASWKGAGLSIISGSICCEDH
        CSCSFSLATCLDCSTVYCD+AGHLNEHI  HPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASW GAGLSI+SGSICCEDH
Subjt:  CSCSFSLATCLDCSTVYCDSAGHLNEHIHSHPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASWKGAGLSIISGSICCEDH

Query:  FAWHRMNCFNADVEDTAYIIKRNAQKGKSISIS
        FAWHRMNCFNADVED AYI+ RN +  KS S+S
Subjt:  FAWHRMNCFNADVEDTAYIIKRNAQKGKSISIS

XP_038874506.1 uncharacterized protein LOC120067139 [Benincasa hispida]1.7e-23591.2Show/hide
Query:  MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAPPTDDQPPPVVKKNENRKRRREPELSGNPKYDVGNSDGPMSHQDYIERRRKEANKSQPYETL
        MGSRTNFYKN SISYKKDLNLSSALQNLRAYNIATGNAPPTD  PPPVVKKNENRKR REPELSG+PKYDVGNSDGPMSHQDYIERRRKEANKSQPYETL
Subjt:  MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAPPTDDQPPPVVKKNENRKRRREPELSGNPKYDVGNSDGPMSHQDYIERRRKEANKSQPYETL

Query:  TEDVLGTSSLGLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSEEPSN
        TEDVLGTSSLGLNLVEYESDES+SSE+AVKPDH NSSLLN+Y++VKSKTEQR AIAGEPVCVVCGRYGEYIC+ETNDDIC MECK KLLEILK SEE SN
Subjt:  TEDVLGTSSLGLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSEEPSN

Query:  CEVKDVALPETKYILPSPEFGEDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQVMPQNTSNPVPGDLLGLYKRCYHIGKNLSNALCNEC
        C+VKDVALPE+KYILPSPEFGEDTWDYKNH WSKKKSNLCTYECWKCQKPGHLAEDCLVK S+Q++ Q TSNPVPGDLLGLYKRCY IGKN+SNALCNEC
Subjt:  CEVKDVALPETKYILPSPEFGEDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQVMPQNTSNPVPGDLLGLYKRCYHIGKNLSNALCNEC

Query:  SCSFSLATCLDCSTVYCDSAGHLNEHIHSHPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASWKGAGLSIISGSICCEDHF
        SCSFSLATCLDCSTVYCDSAGHLNEHIH HPTHGLYYSHKLKRLVKCCKSTCRVT IKDLLVCHYCFDKAFDKFYDMYTASW  AGLSIISGSICCEDHF
Subjt:  SCSFSLATCLDCSTVYCDSAGHLNEHIHSHPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASWKGAGLSIISGSICCEDHF

Query:  AWHRMNCFNADVEDTAYIIKRNAQKGKSISIS
        AWHRMNCFNADVEDTAYI+ R A+K KS+SIS
Subjt:  AWHRMNCFNADVEDTAYIIKRNAQKGKSISIS

TrEMBL top hitse value%identityAlignment
A0A0A0KP93 CCHC-type domain-containing protein2.6e-22988.99Show/hide
Query:  MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAPPTDDQ----PPPVVKKNENRKRRREPELSGNPKYDVGNSDGPMSHQDYIERRRKEANKSQP
        MGSRTNFYKNPSISYKKDL+LSSALQNLRAYNIATGNAPPTD Q    P PVVKKNENRKR+REPEL G   YDVGNSDGPMSHQDYIERRRKEANKSQP
Subjt:  MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAPPTDDQ----PPPVVKKNENRKRRREPELSGNPKYDVGNSDGPMSHQDYIERRRKEANKSQP

Query:  YETLTEDVLGTSSLGLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSE
        YETLTEDVL  SS GLNLV+YESDES SS+ A KPD QNSSLLN+YKEV+SKTEQRFA+AGEPVCVVCGRYGEYIC+ETNDDICSMECKFKLLEILK  E
Subjt:  YETLTEDVLGTSSLGLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSE

Query:  EPSNCEVKDVALPETKYILPSPEFGEDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQVMPQNTSNPVPGDLLGLYKRCYHIGKNLSNAL
        E  NCEVKDVALPE+KYILPSPEFGEDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQ M Q TSNPVPGDLLGLYKRCY +GKNLSNAL
Subjt:  EPSNCEVKDVALPETKYILPSPEFGEDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQVMPQNTSNPVPGDLLGLYKRCYHIGKNLSNAL

Query:  CNECSCSFSLATCLDCSTVYCDSAGHLNEHIHSHPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASWKGAGLSIISGSICC
        CNECSCS+SLATCLDC+TVYCDSAGHLNEHIH+HPTHGLYYSHKLKRLVKCCKSTCRVT IKDLLVCHYCFDKAFDKFYDMYTASW  AGLSIISGSICC
Subjt:  CNECSCSFSLATCLDCSTVYCDSAGHLNEHIHSHPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASWKGAGLSIISGSICC

Query:  EDHFAWHRMNCFNADVEDTAYIIKRNAQKGKSISIS
        EDHFAWHRMNCFNADVEDTAYII R  +K K +SIS
Subjt:  EDHFAWHRMNCFNADVEDTAYIIKRNAQKGKSISIS

A0A1S3ATQ8 uncharacterized protein LOC1034826175.2e-23089.22Show/hide
Query:  MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAPPTDDQ----PPPVVKKNENRKRRREPELSGNPKYDVGNSDGPMSHQDYIERRRKEANKSQP
        MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAPPTD Q    P PV KKNE RKRRREPEL G  KYDVGNSDGPMSHQDYIERRRKEAN SQP
Subjt:  MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAPPTDDQ----PPPVVKKNENRKRRREPELSGNPKYDVGNSDGPMSHQDYIERRRKEANKSQP

Query:  YETLTEDVLGTSSLGLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSE
        YETLTEDVL  SS GLNLV+YESDES SS+ AVKPD QNSSLLN+Y+EVKSKTEQRFA+AGEPVCVVCGRYGEYIC+ETNDDICS ECKFKLLEILK  E
Subjt:  YETLTEDVLGTSSLGLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSE

Query:  EPSNCEVKDVALPETKYILPSPEFGEDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQVMPQNTSNPVPGDLLGLYKRCYHIGKNLSNAL
        E SNCEVKDVALPE+KYILPSPE GEDTWDYKNHRWSKKKS+LCTYECWKCQKPGHLAEDCLVKTSNQVM Q TSNPVPGDLLGLYKRCY +GKNLSNAL
Subjt:  EPSNCEVKDVALPETKYILPSPEFGEDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQVMPQNTSNPVPGDLLGLYKRCYHIGKNLSNAL

Query:  CNECSCSFSLATCLDCSTVYCDSAGHLNEHIHSHPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASWKGAGLSIISGSICC
        CNECSCS+SLATCLDCSTVYCDSAGHLN+HIH+HPTHGLYYSHKLKRLVKCCKSTCRVT IKDLLVCHYCFDKAFDKFYDMYTASW  AGLSIISGSICC
Subjt:  CNECSCSFSLATCLDCSTVYCDSAGHLNEHIHSHPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASWKGAGLSIISGSICC

Query:  EDHFAWHRMNCFNADVEDTAYIIKRNAQKGKSISIS
        EDHFAWHRMNCFNADVEDTAYII R  +K KS+SIS
Subjt:  EDHFAWHRMNCFNADVEDTAYIIKRNAQKGKSISIS

A0A6J1DNB9 uncharacterized protein LOC1110219918.0e-23190.05Show/hide
Query:  MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAPPTDDQPPPVVKKNENRKRRREPELSGNPKYDVGNSDGPMSHQDYIERRRKEANKSQPYETL
        MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAP T++QPPPVVKKNENRKR REPELS NPKYDVG SDGPMSHQDYIERRRKEANKSQPYE L
Subjt:  MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAPPTDDQPPPVVKKNENRKRRREPELSGNPKYDVGNSDGPMSHQDYIERRRKEANKSQPYETL

Query:  TEDVLGTSSLGLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSEEPSN
         EDVLGTSS GLNLV YESDES+SSESAVK DHQNSS LNE++E+KSK EQ FAIAGEPVCVVCGRYGEYICNETNDDICSMECK KLLEILK  EE SN
Subjt:  TEDVLGTSSLGLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSEEPSN

Query:  CEVKDVALPETKYILPSPEFGEDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQVMPQNTSNPVPGDLLGLYKRCYHIGKNLSNALCNEC
         EVKDVALPETK+ILPSPEF EDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQVM Q T N +PGDLLGLYKRCY IGKNLS  LCNEC
Subjt:  CEVKDVALPETKYILPSPEFGEDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQVMPQNTSNPVPGDLLGLYKRCYHIGKNLSNALCNEC

Query:  SCSFSLATCLDCSTVYCDSAGHLNEHIHSHPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASWKGAGLSIISGSICCEDHF
        SCSFSLATCLDCS+VYCDSAGHLNEHIH+HPTHGLYYSHKLKRLVKCCKSTC+VTDIKDLLVCHYCF+KAFDKFYDMYTA+WKG GLSIISGSICCEDHF
Subjt:  SCSFSLATCLDCSTVYCDSAGHLNEHIHSHPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASWKGAGLSIISGSICCEDHF

Query:  AWHRMNCFNADVEDTAYIIKRNAQKGKSISIS
         WHRMNCFNADVED AYII RNAQK KSISIS
Subjt:  AWHRMNCFNADVEDTAYIIKRNAQKGKSISIS

A0A6J1H0R9 uncharacterized protein LOC111459101 isoform X17.2e-23289.15Show/hide
Query:  MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAPPTDDQPPPVVKKNENRKR-RREPELSGNPKYDVGNSDGPMSHQDYIERRRKEANKSQPYET
        MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIA GNAPPTDD+PPPV KKNENRKR RREPELSGNPKYDVG+SDGPMSHQDYIE+RRKEAN+ QPYE 
Subjt:  MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAPPTDDQPPPVVKKNENRKR-RREPELSGNPKYDVGNSDGPMSHQDYIERRRKEANKSQPYET

Query:  LTEDVLGTSSLGLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSEEPS
        LTEDVLGTSS GLNLV YES ES+SSES VKPDHQNSSLLNE++EV+ KTEQRFA AGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILK SEE S
Subjt:  LTEDVLGTSSLGLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSEEPS

Query:  NCEVKDVALPETKYILPSPEFGEDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQVMPQNTSNPVPGDLLGLYKRCYHIGKNLSNALCNE
        NCEVKDVALPETKYILPSPEFGEDTWDYKNHRWSK KSNLCTYECWKCQKPGHLAEDCLV++SNQVM Q + NPVP DL+GLYKRCY IGKNL N LCNE
Subjt:  NCEVKDVALPETKYILPSPEFGEDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQVMPQNTSNPVPGDLLGLYKRCYHIGKNLSNALCNE

Query:  CSCSFSLATCLDCSTVYCDSAGHLNEHIHSHPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASWKGAGLSIISGSICCEDH
        CSCSFSLATCLDCSTVYCD+AGHLNEHI  HPTHGLYYSHKLKRLVKCCKSTCRVT+IKDLLVCHYCFDKAFDKFYDMYTASW GAGLSIISGSICCEDH
Subjt:  CSCSFSLATCLDCSTVYCDSAGHLNEHIHSHPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASWKGAGLSIISGSICCEDH

Query:  FAWHRMNCFNADVEDTAYIIKRNAQKGKSISIS
        FAWHRMNCFNADVED AYI+ RN +  KS S+S
Subjt:  FAWHRMNCFNADVEDTAYIIKRNAQKGKSISIS

A0A6J1K849 uncharacterized protein LOC111491515 isoform X14.9e-23690.99Show/hide
Query:  MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAPPTDDQPPPVVKKNENRKR-RREPELSGNPKYDVGNSDGPMSHQDYIERRRKEANKSQPYET
        MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIA GNAPPTDDQP PV KKNENRKR RRE ELSGNPKYDVG+SDGPMSHQDYIE+RRKEAN+ QPYET
Subjt:  MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAPPTDDQPPPVVKKNENRKR-RREPELSGNPKYDVGNSDGPMSHQDYIERRRKEANKSQPYET

Query:  LTEDVLGTSSLGLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSEEPS
        L+EDVLGTSS GLNLV YESDES+SSESAVKPDHQNSSLLNE++EV+ KTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILK SEE S
Subjt:  LTEDVLGTSSLGLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSEEPS

Query:  NCEVKDVALPETKYILPSPEFGEDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQVMPQNTSNPVPGDLLGLYKRCYHIGKNLSNALCNE
        NCEVKDVALPETKYILPSPEFGEDTWDYKNHRWSK KSNLCTYECWKCQKPGHLAEDCLV+TSNQVM Q T N VP DL+GLYKRCY IGKNLSN+LCNE
Subjt:  NCEVKDVALPETKYILPSPEFGEDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQVMPQNTSNPVPGDLLGLYKRCYHIGKNLSNALCNE

Query:  CSCSFSLATCLDCSTVYCDSAGHLNEHIHSHPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASWKGAGLSIISGSICCEDH
        CSCSFSLATCLDCSTVYCDSAGHLNEHIH HPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASW GAGLSIISGSICCEDH
Subjt:  CSCSFSLATCLDCSTVYCDSAGHLNEHIHSHPTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASWKGAGLSIISGSICCEDH

Query:  FAWHRMNCFNADVEDTAYIIKRNAQKGKSISIS
        FAWHRMNCFNADVED AYI+ RN + GKS S+S
Subjt:  FAWHRMNCFNADVEDTAYIIKRNAQKGKSISIS

SwissProt top hitse value%identityAlignment
Q0E2Z7 DEAD-box ATP-dependent RNA helicase 413.2e-1146.32Show/hide
Query:  VEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSEEPSNCEVKDVALP
        +E E + S+   SA +P + N   L E    +   EQR A+ GEP CV+CGRYGEYIC++T+DDICS+ECK  LL  L     P     K V LP
Subjt:  VEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSEEPSNCEVKDVALP

Q3EBD3 DEAD-box ATP-dependent RNA helicase 411.6e-1065.22Show/hide
Query:  VKSKT-EQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLL
        VK K+ +QR  ++GEP CV+C RYGEYIC+ETNDD+CS+ECK  LL
Subjt:  VKSKT-EQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLL

Q5T1V6 Probable ATP-dependent RNA helicase DDX598.5e-1250Show/hide
Query:  SSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSEEPS
        S S E   K  H +   +  +    SKT QR+A  GEP+CVVCGRYGEYIC++T++D+CS+ECK K L  +K  EE S
Subjt:  SSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSEEPS

Q66HG7 Probable ATP-dependent RNA helicase DDX592.1e-1048Show/hide
Query:  SSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSE
        S+  +  VK  H +   +  +    SKT QR+   GEPVCVVCGRYGEYIC++T++D+CS+ECK K L  +K  E
Subjt:  SSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSE

Q9DBN9 Probable ATP-dependent RNA helicase DDX592.7e-1042Show/hide
Query:  ETLTEDVLG-TSSLGLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSE
        E +T D  G  SS      +     S+  +  VK  H +   +  +    SKT QR+   GEPVCVVCGRYGEYIC++T++D+CS+ECK K L  +K  E
Subjt:  ETLTEDVLG-TSSLGLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSE

Arabidopsis top hitse value%identityAlignment
AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases1.8e-1225.28Show/hide
Query:  RRGDRKRTLSWKLRTPWKDTR-EGAKKQKVI-----PYNPLVEIPGKLDRRFQKWLDDTEVDNAPRKTAYAFRDKVWFQNLLKPCYWMSDEVIDSLFMFV
        R+  R RTLS KL     D R    KK K++     P+  + E+    + R+Q+ L   +  ++      A        ++++     S +V+D L  F 
Subjt:  RRGDRKRTLSWKLRTPWKDTR-EGAKKQKVI-----PYNPLVEIPGKLDRRFQKWLDDTEVDNAPRKTAYAFRDKVWFQNLLKPCYWMSDEVIDSLFMFV

Query:  RKKMQQRADLCRWKFVTADIVVTDFLRRSDDIAEELKKVQDPSLITYDWSTANTVIDYVLGR-HSDHDTHWSTVDAIYMPLNLGGNHWVMVCADLLVGKL
        R  +  R D    + +  D++ + F+ +   +  +  K   P     D+   + ++D ++G   S+    ++  D +YMP N    HWV +C DL   K+
Subjt:  RKKMQQRADLCRWKFVTADIVVTDFLRRSDDIAEELKKVQDPSLITYDWSTANTVIDYVLGR-HSDHDTHWSTVDAIYMPLNLGGNHWVMVCADLLVGKL

Query:  NVLDSFIALTSDATLKKELSTLATVLPVLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAV
         +LDS I L  DA L  EL  LA +LP L  +     +   + +  + + R   +PQ ++  D G+ +V
Subjt:  NVLDSFIALTSDATLKKELSTLATVLPVLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAV

AT3G02065.2 P-loop containing nucleoside triphosphate hydrolases superfamily protein1.1e-1165.22Show/hide
Query:  VKSKT-EQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLL
        VK K+ +QR  ++GEP CV+C RYGEYIC+ETNDD+CS+ECK  LL
Subjt:  VKSKT-EQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLL

AT3G02065.3 P-loop containing nucleoside triphosphate hydrolases superfamily protein1.1e-1165.22Show/hide
Query:  VKSKT-EQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLL
        VK K+ +QR  ++GEP CV+C RYGEYIC+ETNDD+CS+ECK  LL
Subjt:  VKSKT-EQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLL

AT4G08430.1 Ulp1 protease family protein1.0e-0725.78Show/hide
Query:  VDAIYMPLNLGGNHWVMVCADLLVGKLNVLDSFIALTSDATLKKELSTLATVLPVLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAVKFFE
        VD +Y  L + GNHWV +  DL   ++NV DS  +LT+D  +  +   + T++P +L      K +      + E  R + +P+  +  DC ++++K+ E
Subjt:  VDAIYMPLNLGGNHWVMVCADLLVGKLNVLDSFIALTSDATLKKELSTLATVLPVLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAVKFFE

Query:  YDVTGSEINTLNQDRINFCRRQFAVQIW
            G   + L  + +     + AV+++
Subjt:  YDVTGSEINTLNQDRINFCRRQFAVQIW

AT5G45570.1 Ulp1 protease family protein1.4e-0927.34Show/hide
Query:  VDAIYMPLNLGGNHWVMVCADLLVGKLNVLDSFIALTSDATLKKELSTLATVLPVLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAVKFFE
        VD +Y  L + GNHWV +  DL   ++NV DS  +LT+D  +  +   + T++P +L      K +      + E  R + +P+  + GDC ++++K+ E
Subjt:  VDAIYMPLNLGGNHWVMVCADLLVGKLNVLDSFIALTSDATLKKELSTLATVLPVLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAVKFFE

Query:  YDVTGSEINTLNQDRINFCRRQFAVQIW
            G   + L  + +   R + AV+++
Subjt:  YDVTGSEINTLNQDRINFCRRQFAVQIW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCAGTAGAACAAATTTCTACAAGAATCCTTCCATCTCCTACAAGAAGGACTTGAATCTCTCCTCTGCTCTTCAAAACCTAAGAGCGTATAACATTGCTACCGGCAA
CGCTCCACCGACCGATGATCAACCGCCTCCTGTCGTGAAGAAGAACGAGAATCGGAAACGCCGCCGGGAGCCAGAACTGTCAGGTAATCCAAAATATGACGTTGGAAACA
GTGACGGACCTATGTCTCATCAAGATTACATTGAAAGGAGAAGAAAAGAAGCAAATAAATCTCAACCTTATGAGACGCTTACGGAAGATGTTTTGGGAACTTCTAGCTTG
GGCTTGAACTTGGTAGAATATGAAAGTGATGAGAGCTCGTCTTCAGAAAGTGCAGTAAAGCCAGATCATCAAAATTCCAGTCTCTTGAATGAATACAAGGAAGTTAAGAG
TAAAACTGAGCAACGTTTTGCCATTGCGGGAGAACCTGTTTGTGTAGTATGTGGTAGATATGGAGAATACATATGCAATGAAACCAATGACGACATCTGCAGCATGGAGT
GCAAATTCAAGCTTTTGGAAATTCTTAAACATAGTGAGGAGCCCTCAAACTGTGAAGTGAAAGATGTTGCATTACCAGAAACTAAATATATCCTGCCTTCCCCAGAGTTC
GGGGAGGACACCTGGGATTATAAGAACCATCGGTGGTCCAAAAAGAAATCCAATCTTTGTACTTATGAATGCTGGAAATGTCAAAAGCCGGGACACCTTGCTGAAGATTG
TTTGGTGAAAACCAGTAACCAGGTAATGCCACAAAATACTTCCAATCCTGTACCCGGCGATCTCCTTGGACTATATAAAAGATGTTACCATATAGGAAAGAATTTGTCAA
ATGCGCTGTGCAATGAATGCAGCTGTTCATTTAGTTTGGCGACATGTCTTGACTGTAGTACTGTTTACTGTGACAGTGCAGGTCATCTAAACGAGCATATACACTCACAC
CCAACTCATGGACTATATTACTCGCATAAACTCAAACGTCTTGTAAAATGCTGCAAATCAACATGCAGGGTGACTGACATCAAGGATCTTCTGGTGTGTCATTACTGTTT
TGATAAAGCTTTCGACAAGTTCTATGATATGTATACTGCATCTTGGAAAGGAGCTGGACTTTCAATCATATCAGGTTCCATTTGCTGCGAAGATCACTTTGCCTGGCATC
GCATGAACTGCTTCAATGCAGATGTAGAGGACACTGCCTATATCATCAAAAGGAACGCGCAAAAAGGCAAGTCCATTTCTATTAGTAAAAGGTCAAAATTGAAACTGAAA
ACATTAACCCCCTTCCCCTTCGTCTTCTCCTTCTTCCCTTCGTCTTCTCCTCCTTCGTCTTCTCTTCTCGTGTCCAGCAGCTCCGACGACCCAACGGCTCCGGCGTTTCC
TTCTCTCTCGTTCGACTCCGGCGTGTCCAGCAACGTAACTCCGCGGCGTCTTCATCCCCGTTCTAGCAGCTCCAACTCCACGCCGTCTTCATCCCACGGATTTTTCAGGC
CGTTTCCAGCAGCTCCGAAGACTCCACCAGTCAAGAAGAGATTGACTGAAACCCAATTAGATATGTTTAGGCAAACTATATTTGGCCCTATTTTAGACAGCAACATATTG
TTTAATGGTCAGTTAATCCACCATCTACTACTTAGGGAGGTTGAGGATCCCAGGAAGGATGTAATTAGTTTCGATATATTTGGAAATAAGGTGTCGTTTGGCAAGGAAGA
ATTCGATCTAATCACCGGATTTAGACACAATAGGAGGATAGTTGATAGACATGAGTCGGGGGTTAGATTGAGGCGTCTGTACTTTAATGACAGTGTCAAAGATACCGTAA
TGGATGCTGAAAAAAGATTCTTAGACATACAGTTTCAGTCAGATGAAGATGCGGTGAAGGTAGCGCTCGCATATTTTATCGAGCTAGCAATGTTTGGGCGGGAGAGGAAA
CAAAAATTCAATTGGTCTTTATTGGGTATCGTGGACGATTGGGAGATATTCTGCAATTATGACTGGAGCAAAGTAATTTTTGAGATGACGATAAGGAGTTTGAAGAAAGC
ACTCAGTCATGCCACCCAAAGAGACGTTGTGGCCGGAGAGGCTAGTCGATTGGAAAGATATAGTCTTTACGGCTTTCCACATGCTTTTCAGGTATGGGCGTATGAGACTA
TTTCGTCTCTAACGAACCGTGTTGCGAACCGGATGAACCAGGATGCGATCCCACGGTTTTCTCGGTGGTCATGCTCTCATTCTCCTACGTACACCCAACTTAGCAGTGAG
ATATTTGGCTTGACGGAGGCAAGGGTGACAGTGCAATTGGTTCCAAGCGAAGCAGAGCTCGAACATATGCGTCGTATTGTTTTGCCGCCACAACTACAAGCCCCTGTTTT
GCCGCCACAACTAGAGGCCCCTGTTTCGCCACCACATCCAGAGGCCCCTGTTTTGTCGCCACAACCAGATGCAAACCTAGATGATCCTGTGGGGAGTGATAGAGGGTCAG
AGGAGGCTGGTTTGGATATGAGTTCACCGAAAAAGGATGTAGAAATGGTTAGGCTCGATGAACAATCGACACACGACGGTCTACCTGAAGGCGTCGGCAAGACCTGCCAA
TGTGACTGCAAGCAAGCATACGAGTCACTAGACCGACGGATGAAGGTGGTGGAGTCCGATGTAAAAGAGATGAAATCTGACTTAAAGTCGATCAAGAAGTATTTGCGCCG
GTTATCTAAGGGTCAAATGGTGGTTGATCCTACCAAGTATTTGGGTCCCGACCGTAGTGCAGCTGCATCAGGTGATGAACCATCCGAGAAAGGAAAGAACCATGTCGTGG
AGGAGGGGGGTGGTGGGGTTTCAATAGATGCGATGGTAGAGCACCATGATATGGACAAGGGTGTTGAATCAGACTCCCATGAGGTTGAAGAGATCCCGAAACCTGGAGAA
ATGGTGAAACGTCGGGGAGATCGGAAAAGAACTCTTTCTTGGAAACTTCGAACTCCGTGGAAGGATACGAGGGAAGGGGCCAAAAAACAAAAGGTCATACCATACAACCC
CTTAGTTGAGATTCCTGGGAAGCTTGATAGACGTTTCCAAAAGTGGTTGGACGACACGGAGGTGGACAATGCTCCAAGGAAGACGGCATATGCTTTTAGGGACAAAGTGT
GGTTTCAAAACCTTTTGAAACCCTGCTATTGGATGAGCGATGAGGTCATTGACTCACTTTTTATGTTCGTCCGGAAGAAAATGCAACAGCGGGCAGACTTATGTCGTTGG
AAGTTTGTCACTGCAGATATTGTTGTTACCGATTTTCTGAGGCGTAGCGACGACATAGCTGAAGAGTTGAAGAAGGTGCAAGATCCTTCGTTGATTACGTACGACTGGAG
TACGGCCAATACTGTGATAGACTACGTTTTGGGTCGACACTCGGACCACGATACACATTGGAGTACAGTTGATGCGATCTACATGCCATTGAACCTTGGGGGGAACCATT
GGGTTATGGTATGTGCTGATCTCCTAGTGGGAAAATTGAATGTCCTCGATTCATTCATAGCGTTGACATCAGATGCAACCTTGAAGAAAGAGTTGAGCACTCTAGCCACA
GTATTGCCAGTGCTACTGTTCAAGTGCGATGTCATGAAAGCGAAGCCACATCTCCCAGTTCACGAATGGGAAATACATAGAGATAGTTCAGTGCCTCAACAAACGAACGG
TGGGGATTGTGGTATGTTCGCGGTAAAGTTTTTTGAATATGATGTTACTGGAAGTGAAATAAACACTCTGAATCAAGATAGGATTAATTTTTGTAGACGTCAATTTGCTG
TTCAAATTTGGGCCAACAGGCCGATATTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCAGTAGAACAAATTTCTACAAGAATCCTTCCATCTCCTACAAGAAGGACTTGAATCTCTCCTCTGCTCTTCAAAACCTAAGAGCGTATAACATTGCTACCGGCAA
CGCTCCACCGACCGATGATCAACCGCCTCCTGTCGTGAAGAAGAACGAGAATCGGAAACGCCGCCGGGAGCCAGAACTGTCAGGTAATCCAAAATATGACGTTGGAAACA
GTGACGGACCTATGTCTCATCAAGATTACATTGAAAGGAGAAGAAAAGAAGCAAATAAATCTCAACCTTATGAGACGCTTACGGAAGATGTTTTGGGAACTTCTAGCTTG
GGCTTGAACTTGGTAGAATATGAAAGTGATGAGAGCTCGTCTTCAGAAAGTGCAGTAAAGCCAGATCATCAAAATTCCAGTCTCTTGAATGAATACAAGGAAGTTAAGAG
TAAAACTGAGCAACGTTTTGCCATTGCGGGAGAACCTGTTTGTGTAGTATGTGGTAGATATGGAGAATACATATGCAATGAAACCAATGACGACATCTGCAGCATGGAGT
GCAAATTCAAGCTTTTGGAAATTCTTAAACATAGTGAGGAGCCCTCAAACTGTGAAGTGAAAGATGTTGCATTACCAGAAACTAAATATATCCTGCCTTCCCCAGAGTTC
GGGGAGGACACCTGGGATTATAAGAACCATCGGTGGTCCAAAAAGAAATCCAATCTTTGTACTTATGAATGCTGGAAATGTCAAAAGCCGGGACACCTTGCTGAAGATTG
TTTGGTGAAAACCAGTAACCAGGTAATGCCACAAAATACTTCCAATCCTGTACCCGGCGATCTCCTTGGACTATATAAAAGATGTTACCATATAGGAAAGAATTTGTCAA
ATGCGCTGTGCAATGAATGCAGCTGTTCATTTAGTTTGGCGACATGTCTTGACTGTAGTACTGTTTACTGTGACAGTGCAGGTCATCTAAACGAGCATATACACTCACAC
CCAACTCATGGACTATATTACTCGCATAAACTCAAACGTCTTGTAAAATGCTGCAAATCAACATGCAGGGTGACTGACATCAAGGATCTTCTGGTGTGTCATTACTGTTT
TGATAAAGCTTTCGACAAGTTCTATGATATGTATACTGCATCTTGGAAAGGAGCTGGACTTTCAATCATATCAGGTTCCATTTGCTGCGAAGATCACTTTGCCTGGCATC
GCATGAACTGCTTCAATGCAGATGTAGAGGACACTGCCTATATCATCAAAAGGAACGCGCAAAAAGGCAAGTCCATTTCTATTAGTAAAAGGTCAAAATTGAAACTGAAA
ACATTAACCCCCTTCCCCTTCGTCTTCTCCTTCTTCCCTTCGTCTTCTCCTCCTTCGTCTTCTCTTCTCGTGTCCAGCAGCTCCGACGACCCAACGGCTCCGGCGTTTCC
TTCTCTCTCGTTCGACTCCGGCGTGTCCAGCAACGTAACTCCGCGGCGTCTTCATCCCCGTTCTAGCAGCTCCAACTCCACGCCGTCTTCATCCCACGGATTTTTCAGGC
CGTTTCCAGCAGCTCCGAAGACTCCACCAGTCAAGAAGAGATTGACTGAAACCCAATTAGATATGTTTAGGCAAACTATATTTGGCCCTATTTTAGACAGCAACATATTG
TTTAATGGTCAGTTAATCCACCATCTACTACTTAGGGAGGTTGAGGATCCCAGGAAGGATGTAATTAGTTTCGATATATTTGGAAATAAGGTGTCGTTTGGCAAGGAAGA
ATTCGATCTAATCACCGGATTTAGACACAATAGGAGGATAGTTGATAGACATGAGTCGGGGGTTAGATTGAGGCGTCTGTACTTTAATGACAGTGTCAAAGATACCGTAA
TGGATGCTGAAAAAAGATTCTTAGACATACAGTTTCAGTCAGATGAAGATGCGGTGAAGGTAGCGCTCGCATATTTTATCGAGCTAGCAATGTTTGGGCGGGAGAGGAAA
CAAAAATTCAATTGGTCTTTATTGGGTATCGTGGACGATTGGGAGATATTCTGCAATTATGACTGGAGCAAAGTAATTTTTGAGATGACGATAAGGAGTTTGAAGAAAGC
ACTCAGTCATGCCACCCAAAGAGACGTTGTGGCCGGAGAGGCTAGTCGATTGGAAAGATATAGTCTTTACGGCTTTCCACATGCTTTTCAGGTATGGGCGTATGAGACTA
TTTCGTCTCTAACGAACCGTGTTGCGAACCGGATGAACCAGGATGCGATCCCACGGTTTTCTCGGTGGTCATGCTCTCATTCTCCTACGTACACCCAACTTAGCAGTGAG
ATATTTGGCTTGACGGAGGCAAGGGTGACAGTGCAATTGGTTCCAAGCGAAGCAGAGCTCGAACATATGCGTCGTATTGTTTTGCCGCCACAACTACAAGCCCCTGTTTT
GCCGCCACAACTAGAGGCCCCTGTTTCGCCACCACATCCAGAGGCCCCTGTTTTGTCGCCACAACCAGATGCAAACCTAGATGATCCTGTGGGGAGTGATAGAGGGTCAG
AGGAGGCTGGTTTGGATATGAGTTCACCGAAAAAGGATGTAGAAATGGTTAGGCTCGATGAACAATCGACACACGACGGTCTACCTGAAGGCGTCGGCAAGACCTGCCAA
TGTGACTGCAAGCAAGCATACGAGTCACTAGACCGACGGATGAAGGTGGTGGAGTCCGATGTAAAAGAGATGAAATCTGACTTAAAGTCGATCAAGAAGTATTTGCGCCG
GTTATCTAAGGGTCAAATGGTGGTTGATCCTACCAAGTATTTGGGTCCCGACCGTAGTGCAGCTGCATCAGGTGATGAACCATCCGAGAAAGGAAAGAACCATGTCGTGG
AGGAGGGGGGTGGTGGGGTTTCAATAGATGCGATGGTAGAGCACCATGATATGGACAAGGGTGTTGAATCAGACTCCCATGAGGTTGAAGAGATCCCGAAACCTGGAGAA
ATGGTGAAACGTCGGGGAGATCGGAAAAGAACTCTTTCTTGGAAACTTCGAACTCCGTGGAAGGATACGAGGGAAGGGGCCAAAAAACAAAAGGTCATACCATACAACCC
CTTAGTTGAGATTCCTGGGAAGCTTGATAGACGTTTCCAAAAGTGGTTGGACGACACGGAGGTGGACAATGCTCCAAGGAAGACGGCATATGCTTTTAGGGACAAAGTGT
GGTTTCAAAACCTTTTGAAACCCTGCTATTGGATGAGCGATGAGGTCATTGACTCACTTTTTATGTTCGTCCGGAAGAAAATGCAACAGCGGGCAGACTTATGTCGTTGG
AAGTTTGTCACTGCAGATATTGTTGTTACCGATTTTCTGAGGCGTAGCGACGACATAGCTGAAGAGTTGAAGAAGGTGCAAGATCCTTCGTTGATTACGTACGACTGGAG
TACGGCCAATACTGTGATAGACTACGTTTTGGGTCGACACTCGGACCACGATACACATTGGAGTACAGTTGATGCGATCTACATGCCATTGAACCTTGGGGGGAACCATT
GGGTTATGGTATGTGCTGATCTCCTAGTGGGAAAATTGAATGTCCTCGATTCATTCATAGCGTTGACATCAGATGCAACCTTGAAGAAAGAGTTGAGCACTCTAGCCACA
GTATTGCCAGTGCTACTGTTCAAGTGCGATGTCATGAAAGCGAAGCCACATCTCCCAGTTCACGAATGGGAAATACATAGAGATAGTTCAGTGCCTCAACAAACGAACGG
TGGGGATTGTGGTATGTTCGCGGTAAAGTTTTTTGAATATGATGTTACTGGAAGTGAAATAAACACTCTGAATCAAGATAGGATTAATTTTTGTAGACGTCAATTTGCTG
TTCAAATTTGGGCCAACAGGCCGATATTTTAG
Protein sequenceShow/hide protein sequence
MGSRTNFYKNPSISYKKDLNLSSALQNLRAYNIATGNAPPTDDQPPPVVKKNENRKRRREPELSGNPKYDVGNSDGPMSHQDYIERRRKEANKSQPYETLTEDVLGTSSL
GLNLVEYESDESSSSESAVKPDHQNSSLLNEYKEVKSKTEQRFAIAGEPVCVVCGRYGEYICNETNDDICSMECKFKLLEILKHSEEPSNCEVKDVALPETKYILPSPEF
GEDTWDYKNHRWSKKKSNLCTYECWKCQKPGHLAEDCLVKTSNQVMPQNTSNPVPGDLLGLYKRCYHIGKNLSNALCNECSCSFSLATCLDCSTVYCDSAGHLNEHIHSH
PTHGLYYSHKLKRLVKCCKSTCRVTDIKDLLVCHYCFDKAFDKFYDMYTASWKGAGLSIISGSICCEDHFAWHRMNCFNADVEDTAYIIKRNAQKGKSISISKRSKLKLK
TLTPFPFVFSFFPSSSPPSSSLLVSSSSDDPTAPAFPSLSFDSGVSSNVTPRRLHPRSSSSNSTPSSSHGFFRPFPAAPKTPPVKKRLTETQLDMFRQTIFGPILDSNIL
FNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRHNRRIVDRHESGVRLRRLYFNDSVKDTVMDAEKRFLDIQFQSDEDAVKVALAYFIELAMFGRERK
QKFNWSLLGIVDDWEIFCNYDWSKVIFEMTIRSLKKALSHATQRDVVAGEASRLERYSLYGFPHAFQVWAYETISSLTNRVANRMNQDAIPRFSRWSCSHSPTYTQLSSE
IFGLTEARVTVQLVPSEAELEHMRRIVLPPQLQAPVLPPQLEAPVSPPHPEAPVLSPQPDANLDDPVGSDRGSEEAGLDMSSPKKDVEMVRLDEQSTHDGLPEGVGKTCQ
CDCKQAYESLDRRMKVVESDVKEMKSDLKSIKKYLRRLSKGQMVVDPTKYLGPDRSAAASGDEPSEKGKNHVVEEGGGGVSIDAMVEHHDMDKGVESDSHEVEEIPKPGE
MVKRRGDRKRTLSWKLRTPWKDTREGAKKQKVIPYNPLVEIPGKLDRRFQKWLDDTEVDNAPRKTAYAFRDKVWFQNLLKPCYWMSDEVIDSLFMFVRKKMQQRADLCRW
KFVTADIVVTDFLRRSDDIAEELKKVQDPSLITYDWSTANTVIDYVLGRHSDHDTHWSTVDAIYMPLNLGGNHWVMVCADLLVGKLNVLDSFIALTSDATLKKELSTLAT
VLPVLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAVKFFEYDVTGSEINTLNQDRINFCRRQFAVQIWANRPIF