; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg24904 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg24904
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionJmjC domain-containing protein
Genome locationCarg_Chr08:1845109..1853672
RNA-Seq ExpressionCarg24904
SyntenyCarg24904
Gene Ontology termsNA
InterPro domainsIPR003347 - JmjC domain
IPR027445 - Hypoxia-inducible factor 1-alpha inhibitor
IPR041667 - Cupin-like domain 8


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7025504.1 Lysine-specific demethylase JMJ30, partial [Cucurbita argyrosperma subsp. argyrosperma]5.8e-274100Show/hide
Query:  MVTLEAMREYDPYQDSLIFNFEVQVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQ
        MVTLEAMREYDPYQDSLIFNFEVQVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQ
Subjt:  MVTLEAMREYDPYQDSLIFNFEVQVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQ

Query:  DIQTPAFLEKKKLASINLWMNSAQSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKR
        DIQTPAFLEKKKLASINLWMNSAQSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKR
Subjt:  DIQTPAFLEKKKLASINLWMNSAQSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKR

Query:  SIFGKIFGRFHQVDSDDLTIAVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDVPNNRDKGVRCLGQALDGGDVRE
        SIFGKIFGRFHQVDSDDLTIAVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDVPNNRDKGVRCLGQALDGGDVRE
Subjt:  SIFGKIFGRFHQVDSDDLTIAVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDVPNNRDKGVRCLGQALDGGDVRE

Query:  KEPREETSHELEPRTAQALHGLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHL
        KEPREETSHELEPRTAQALHGLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHL
Subjt:  KEPREETSHELEPRTAQALHGLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHL

Query:  LSPVGAEVLTQKFDQMDKENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS
        LSPVGAEVLTQKFDQMDKENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS
Subjt:  LSPVGAEVLTQKFDQMDKENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS

XP_022960123.1 uncharacterized protein LOC111460965 isoform X1 [Cucurbita moschata]2.1e-24796.3Show/hide
Query:  QVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA
        +VPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA
Subjt:  QVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA

Query:  QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIF---GRFHQVDSDDLTI
        QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQT  V       +F   G FHQVDSDDLTI
Subjt:  QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIF---GRFHQVDSDDLTI

Query:  AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDVPNNR--DKGVRCLGQALDGGDVREKEPREETSHELEPRTAQA
        AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDVPNNR  DKGVRCLGQALDGGDVREKEPREETSHELEPRTAQA
Subjt:  AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDVPNNR--DKGVRCLGQALDGGDVREKEPREETSHELEPRTAQA

Query:  LHGLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQMDK
        LHGLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQMDK
Subjt:  LHGLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQMDK

Query:  ENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS
        ENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS
Subjt:  ENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS

XP_022960126.1 uncharacterized protein LOC111460965 isoform X2 [Cucurbita moschata]6.5e-24996.72Show/hide
Query:  QVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA
        +VPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA
Subjt:  QVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA

Query:  QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIF---GRFHQVDSDDLTI
        QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQT  V       +F   G FHQVDSDDLTI
Subjt:  QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIF---GRFHQVDSDDLTI

Query:  AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDVPNNRDKGVRCLGQALDGGDVREKEPREETSHELEPRTAQALH
        AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDVPNNRDKGVRCLGQALDGGDVREKEPREETSHELEPRTAQALH
Subjt:  AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDVPNNRDKGVRCLGQALDGGDVREKEPREETSHELEPRTAQALH

Query:  GLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQMDKEN
        GLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQMDKEN
Subjt:  GLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQMDKEN

Query:  TEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS
        TEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS
Subjt:  TEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS

XP_023004289.1 uncharacterized protein LOC111497660 [Cucurbita maxima]7.7e-24294.55Show/hide
Query:  QVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA
        +VPLPFSTFIQFCKQRLQERNE NVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLR+DIQTPAFLEKKKLASINLW+NSA
Subjt:  QVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA

Query:  QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIF---GRFHQVDSDDLTI
        +SRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQT  V       +F   G FHQVDSDDLTI
Subjt:  QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIF---GRFHQVDSDDLTI

Query:  AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDVPNNR--DKGVRCLGQALDGGDVREKEPREETSHELEPRTAQA
        AVNFWWQSHLMSSMSEHMDAYYLRRILRRL DKEMNEVLHVPCSLADMDDMKSHERDVPNNR  DKGVRCLGQALDGGD REKEPREETSHELEPRTAQA
Subjt:  AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDVPNNR--DKGVRCLGQALDGGDVREKEPREETSHELEPRTAQA

Query:  LHGLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQMDK
        LHGLIALVHDQVSVSDQIGPLQSSSTNCSADEE MKITSLHSLENDRVATFIWNLEP ILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQMDK
Subjt:  LHGLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQMDK

Query:  ENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS
        ENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS
Subjt:  ENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS

XP_023514397.1 uncharacterized protein LOC111778672 [Cucurbita pepo subsp. pepo]1.7e-24495.21Show/hide
Query:  QVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA
        +VPLPFSTFIQFCKQRLQERNE NVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA
Subjt:  QVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA

Query:  QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIF---GRFHQVDSDDLTI
        QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQT  V       +F   G FHQVDSDDLTI
Subjt:  QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIF---GRFHQVDSDDLTI

Query:  AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDVPNNR--DKGVRCLGQALDGGDVREKEPREETSHELEPRTAQA
        AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDD KSHERDVPNNR  DKGVRCLGQALDGGD REKEPREETSHELEPRTAQA
Subjt:  AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDVPNNR--DKGVRCLGQALDGGDVREKEPREETSHELEPRTAQA

Query:  LHGLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQMDK
        LHGLIALVHDQVSVSDQIGPL+SSSTNCS+DEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQMDK
Subjt:  LHGLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQMDK

Query:  ENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS
        ENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS
Subjt:  ENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS

TrEMBL top hitse value%identityAlignment
A0A1S3BS01 uncharacterized protein LOC103492647 isoform X41.7e-20280.04Show/hide
Query:  QVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA
        +VP+PFSTFIQ CKQRL E+++ NVVS+E NSNR T PD++K CLPFEDDSK+LYLAQVPILDVINEERAQL PLR+DIQTPAFLEKKKLASINLWMNSA
Subjt:  QVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA

Query:  QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIF---GRFHQVDSDDLTI
         SRSSTHYDPHHNVLCIVSG KQVILWPPS+TPSLYPMHIYGEASNHSSV+LEKPDYSLYPRAKYS KSSQT  ++      +F   G FHQVDSD+LTI
Subjt:  QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIF---GRFHQVDSDDLTI

Query:  AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDV--PNNRDKGVRCLGQALDGGDVREKEPREET-SHELEPRTAQ
        AVNFWWQSH+MSS+S+HMDAYYLRRILRRLMD+EMNEVL VPCSLA+MD+ KSHE DV      D+GV+CL QA +GGD++EKE REET SHELE  +A+
Subjt:  AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDV--PNNRDKGVRCLGQALDGGDVREKEPREET-SHELEPRTAQ

Query:  ALHGLIALVHDQVSVSDQIGPLQSSSTNCSAD-EEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQM
        ALHGL+ LVHD VSVSDQ G LQSSSTN SAD EE M  TSL+SLEND+VA  IWNLEPC+LQKVLLTMANNFPRTLEALILHLLSP+GAEVLT+KFDQM
Subjt:  ALHGLIALVHDQVSVSDQIGPLQSSSTNCSAD-EEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQM

Query:  DKENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS
        D+ NT EDQKRFYEVFYSSFDDQFAVMDAILN KESFARQ FKSVLDKY+GVNLNGPN G+
Subjt:  DKENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS

A0A6J1DQD1 uncharacterized protein LOC111022839 isoform X34.8e-19779.53Show/hide
Query:  QVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA
        +VPLPFSTFIQFC QRL+ER++ NVVS++  SNR T PD+EKGCLPFEDD +KLYLAQVPILD  NEER QL PLR+DI TPAFLEKKKLASINLWMNSA
Subjt:  QVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA

Query:  QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIF---GRFHQVDSDDLTI
        QSRSSTHYDPHHNVLCIVSG+KQVILWPPS+ PSLYPM IYGEASNHSSVTLEKPDYSLYPRAKYSM+SSQ   V       +F   G FHQVDSDDLTI
Subjt:  QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIF---GRFHQVDSDDLTI

Query:  AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVP-CSLADMDDMKSHERDVPNNRDKGVR--CLGQALDGGDVREKEPREET-SHELEPRTA
        AVNFWW+SH MSSMSEHMDAYYLRRILRRLMDKEMN+VL VP CS+A MD+MK HERDV N ++ GV   CL QA + GD+RE E R+ET +HELEP + 
Subjt:  AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVP-CSLADMDDMKSHERDVPNNRDKGVR--CLGQALDGGDVREKEPREET-SHELEPRTA

Query:  QALHGLIALVHDQ--VSVSDQIGPLQSSSTNCSAD-EEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKF
        QALHGLI LVHD+  VSVSDQI   QSSSTN SAD EE MK+TS HSLEND+VA FIWNL PCILQKVLL+MANNFPRTLEALILHLLSPVGAEVLT+KF
Subjt:  QALHGLIALVHDQ--VSVSDQIGPLQSSSTNCSAD-EEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKF

Query:  DQMDKENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS
        DQMD+ N+EEDQK+FYEVFYSSFDDQFAVMDAILN KE+FARQAFKSVLDKYLGVNL+GPNLGS
Subjt:  DQMDKENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS

A0A6J1H6S1 uncharacterized protein LOC111460965 isoform X23.1e-24996.72Show/hide
Query:  QVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA
        +VPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA
Subjt:  QVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA

Query:  QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIF---GRFHQVDSDDLTI
        QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQT  V       +F   G FHQVDSDDLTI
Subjt:  QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIF---GRFHQVDSDDLTI

Query:  AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDVPNNRDKGVRCLGQALDGGDVREKEPREETSHELEPRTAQALH
        AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDVPNNRDKGVRCLGQALDGGDVREKEPREETSHELEPRTAQALH
Subjt:  AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDVPNNRDKGVRCLGQALDGGDVREKEPREETSHELEPRTAQALH

Query:  GLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQMDKEN
        GLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQMDKEN
Subjt:  GLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQMDKEN

Query:  TEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS
        TEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS
Subjt:  TEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS

A0A6J1H7X9 uncharacterized protein LOC111460965 isoform X11.0e-24796.3Show/hide
Query:  QVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA
        +VPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA
Subjt:  QVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA

Query:  QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIF---GRFHQVDSDDLTI
        QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQT  V       +F   G FHQVDSDDLTI
Subjt:  QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIF---GRFHQVDSDDLTI

Query:  AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDVPNNR--DKGVRCLGQALDGGDVREKEPREETSHELEPRTAQA
        AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDVPNNR  DKGVRCLGQALDGGDVREKEPREETSHELEPRTAQA
Subjt:  AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDVPNNR--DKGVRCLGQALDGGDVREKEPREETSHELEPRTAQA

Query:  LHGLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQMDK
        LHGLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQMDK
Subjt:  LHGLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQMDK

Query:  ENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS
        ENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS
Subjt:  ENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS

A0A6J1KVV2 uncharacterized protein LOC1114976603.7e-24294.55Show/hide
Query:  QVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA
        +VPLPFSTFIQFCKQRLQERNE NVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLR+DIQTPAFLEKKKLASINLW+NSA
Subjt:  QVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKLASINLWMNSA

Query:  QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIF---GRFHQVDSDDLTI
        +SRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQT  V       +F   G FHQVDSDDLTI
Subjt:  QSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIF---GRFHQVDSDDLTI

Query:  AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDVPNNR--DKGVRCLGQALDGGDVREKEPREETSHELEPRTAQA
        AVNFWWQSHLMSSMSEHMDAYYLRRILRRL DKEMNEVLHVPCSLADMDDMKSHERDVPNNR  DKGVRCLGQALDGGD REKEPREETSHELEPRTAQA
Subjt:  AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDVPNNR--DKGVRCLGQALDGGDVREKEPREETSHELEPRTAQA

Query:  LHGLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQMDK
        LHGLIALVHDQVSVSDQIGPLQSSSTNCSADEE MKITSLHSLENDRVATFIWNLEP ILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQMDK
Subjt:  LHGLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQMDK

Query:  ENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS
        ENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS
Subjt:  ENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS

SwissProt top hitse value%identityAlignment
P0C872 Bifunctional peptidase and (3S)-lysyl hydroxylase Jmjd77.5e-0627.48Show/hide
Query:  SINLWMNSAQSRSSTHYDPHHNVLCIVSGSKQVILWPPSS---------TPSLYPMHIYGEASNHSSVTLEK----------PDYSLYPRAKYSMKSSQT
        ++N W+  A + +S H D + N+ C+VSG K  +L PPS          TP+ Y +   G         +EK          PD + YP    +     T
Subjt:  SINLWMNSAQSRSSTHYDPHHNVLCIVSGSKQVILWPPSS---------TPSLYPMHIYGEASNHSSVTLEK----------PDYSLYPRAKYSMKSSQT

Query:  GKVKRSIFGKIFGRFHQVDSDDLTIAVNFWW
         +    ++      FH V      IAVNFW+
Subjt:  GKVKRSIFGKIFGRFHQVDSDDLTIAVNFWW

Q55DF5 JmjC domain-containing protein D8.0e-0831.06Show/hide
Query:  EDDSKKL-YLAQVPILDVINEERAQLAPLRQDIQTPAFLE-------------KKKLASINLWMNSAQSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTP
        +DD+  + YLAQ  + +       Q+  LR DI  P + +             K+    IN W+    + +  HYDP HN LC + G K + L+ P  + 
Subjt:  EDDSKKL-YLAQVPILDVINEERAQLAPLRQDIQTPAFLE-------------KKKLASINLWMNSAQSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTP

Query:  SLYPMHIYGEA-SNHSSVTLEKPDYSLYPRAK
        +LYP H+  +   N S V +E PD+S +P  K
Subjt:  SLYPMHIYGEA-SNHSSVTLEKPDYSLYPRAK

Q8RWR1 Lysine-specific demethylase JMJ307.0e-1233.33Show/hide
Query:  YLAQVPILDVINEERAQLAPLRQDIQTP--AFLEKKKLASINLWMNSAQSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEA--SNHSSV
        YLAQ P+ D INE       LR DI  P   F+   +L S+N W   A + +  H+DPHHN+L  V G K + L+P      LYP   Y E    N S V
Subjt:  YLAQVPILDVINEERAQLAPLRQDIQTP--AFLEKKKLASINLWMNSAQSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEA--SNHSSV

Query:  TLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIFGRFHQVDSDDLTIAVNFWWQSHLMSSMS
         L+  D + +P+A   ++       +  +       +H V S  ++++V+FWW +   SS S
Subjt:  TLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIFGRFHQVDSDDLTIAVNFWWQSHLMSSMS

Arabidopsis top hitse value%identityAlignment
AT3G20810.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein5.0e-1333.33Show/hide
Query:  YLAQVPILDVINEERAQLAPLRQDIQTP--AFLEKKKLASINLWMNSAQSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEA--SNHSSV
        YLAQ P+ D INE       LR DI  P   F+   +L S+N W   A + +  H+DPHHN+L  V G K + L+P      LYP   Y E    N S V
Subjt:  YLAQVPILDVINEERAQLAPLRQDIQTP--AFLEKKKLASINLWMNSAQSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEA--SNHSSV

Query:  TLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIFGRFHQVDSDDLTIAVNFWWQSHLMSSMS
         L+  D + +P+A   ++       +  +       +H V S  ++++V+FWW +   SS S
Subjt:  TLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIFGRFHQVDSDDLTIAVNFWWQSHLMSSMS

AT3G20810.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein5.0e-1333.33Show/hide
Query:  YLAQVPILDVINEERAQLAPLRQDIQTP--AFLEKKKLASINLWMNSAQSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEA--SNHSSV
        YLAQ P+ D INE       LR DI  P   F+   +L S+N W   A + +  H+DPHHN+L  V G K + L+P      LYP   Y E    N S V
Subjt:  YLAQVPILDVINEERAQLAPLRQDIQTP--AFLEKKKLASINLWMNSAQSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEA--SNHSSV

Query:  TLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIFGRFHQVDSDDLTIAVNFWWQSHLMSSMS
         L+  D + +P+A   ++       +  +       +H V S  ++++V+FWW +   SS S
Subjt:  TLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIFGRFHQVDSDDLTIAVNFWWQSHLMSSMS

AT3G20810.3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.6e-1138.94Show/hide
Query:  YLAQVPILDVINEERAQLAPLRQDIQTP--AFLEKKKLASINLWMNSAQSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEA--SNHSSV
        YLAQ P+ D INE       LR DI  P   F+   +L S+N W   A + +  H+DPHHN+L  V G K + L+P      LYP   Y E    N S V
Subjt:  YLAQVPILDVINEERAQLAPLRQDIQTP--AFLEKKKLASINLWMNSAQSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEA--SNHSSV

Query:  TLEKPDYSLYPRA
         L+  D + +P+A
Subjt:  TLEKPDYSLYPRA

AT5G19840.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.3e-11449.57Show/hide
Query:  IFNFEV----QVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKL
        +FN ++    +V LPFS FI+FCKQ ++ +   + V  +S        D   G         ++YLAQ PIL+   EE+  L  LRQDIQTP FL+ K L
Subjt:  IFNFEV----QVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKL

Query:  ASINLWMNSAQSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIF---GRF
        +SIN WMNSA++RSSTHYDPHHN+LC+VSG K+V+LWPPS++PSLYPM IYGEASNHSSV LE P+ S YPRA++S+K SQ  ++  +    +F   G F
Subjt:  ASINLWMNSAQSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIF---GRF

Query:  HQVDSDDLTIAVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCS-----LADMDDMKSHERDVPNNRDKGVRCLGQALDGGDVREKEPRE
        HQVDSD+LT+AVNFWWQS+ MS+M EHMD+YYLRRI RRL+D+EM+ ++  P S     L++  D    E     N + G   + + L    + EK    
Subjt:  HQVDSDDLTIAVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCS-----LADMDDMKSHERDVPNNRDKGVRCLGQALDGGDVREKEPRE

Query:  ETSHELEPRTAQALHGLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVG
         + H+L+P  +QALH LI+LVHD V+  D                           ++DRVA  +WNLE   L+ VLL MA  FPRTLEALILH+LSP+ 
Subjt:  ETSHELEPRTAQALHGLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVG

Query:  AEVLTQKFDQMDKENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGP
        AEVLTQKFD++D++  EED+ +F+  FYS+FDD+ A MD IL+ KE+FA QAFK+VLDK+LGVN+  P
Subjt:  AEVLTQKFDQMDKENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQAFKSVLDKYLGVNLNGP

AT5G19840.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.8e-10347.12Show/hide
Query:  IFNFEV----QVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKL
        +FN ++    +V LPFS FI+FCKQ ++ +   + V  +S        D   G         ++YLAQ PIL+   EE+  L  LRQDIQTP FL+ K L
Subjt:  IFNFEV----QVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEKKKL

Query:  ASINLWMNSAQSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIF---GRF
        +SIN WMNSA++RSSTHYDPHHN+LC+VSG K+V+LWPPS++PSLYPM IYGEASNHSSV LE P+ S YPRA++S+K SQ  ++  +    +F   G F
Subjt:  ASINLWMNSAQSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIF---GRF

Query:  HQVDSDDLTIAVNFWWQSHLMSSMSEHMDAYYLRRILRRLM--DKEMNEVLHVPCSLADMDDMKSHERDVPNNRDKGVRCLGQALDGGDVREKEPREETS
        HQVDSD+LT+AVNFWWQS+ MS+M EHMD+YYLRRI R L+       ++ H    L++  D    E     N + G   + + L    + EK     + 
Subjt:  HQVDSDDLTIAVNFWWQSHLMSSMSEHMDAYYLRRILRRLM--DKEMNEVLHVPCSLADMDDMKSHERDVPNNRDKGVRCLGQALDGGDVREKEPREETS

Query:  HELEPRTAQALHGLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMK--ITSLHSLENDRVATFIWNLEPCILQKVLLTMA-------------------N
        H+L+P  +QALH LI+LVHD V+  D    LQ +S +CS   E  K  + ++  LE+DRVA  +WNLE   L+ VLL MA                   +
Subjt:  HELEPRTAQALHGLIALVHDQVSVSDQIGPLQSSSTNCSADEEGMK--ITSLHSLENDRVATFIWNLEPCILQKVLLTMA-------------------N

Query:  NFPRTLEALILHLLSPVGAEVLTQKFDQMDKENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQ
            TLEALILH+LSP+ AEVLTQKFD++D++  EED+ +F+  FYS+FDD+ A MD IL+ KE+FA Q
Subjt:  NFPRTLEALILHLLSPVGAEVLTQKFDQMDKENTEEDQKRFYEVFYSSFDDQFAVMDAILNSKESFARQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGACCTTAGAAGCCATGAGAGAGTACGATCCTTATCAGGATTCTTTAATTTTTAATTTTGAAGTTCAAGTTCCGCTTCCATTTTCTACTTTTATTCAATTTTGCAA
GCAGCGTCTGCAAGAAAGGAATGAAGAAAATGTTGTTTCCACTGAATCAAATTCTAACAGGACGACCTATCCTGACATGGAGAAAGGCTGCTTGCCTTTTGAAGATGATT
CTAAAAAATTATACTTAGCACAGGTGCCAATATTGGATGTTATAAACGAAGAAAGGGCACAATTGGCACCTCTGAGACAAGATATTCAAACACCTGCTTTTCTGGAGAAA
AAGAAGTTAGCCTCTATAAACCTTTGGATGAACAGTGCTCAATCTAGGTCAAGTACTCACTATGACCCTCACCATAATGTTCTATGCATAGTTTCTGGCAGTAAACAAGT
CATTTTGTGGCCCCCTTCTTCTACTCCCTCGCTGTACCCGATGCATATTTATGGAGAGGCCTCTAATCATAGCTCTGTTACTTTAGAAAAGCCCGACTATTCACTTTATC
CAAGAGCAAAATATTCCATGAAATCCTCTCAGACCGGGAAGGTTAAAAGGTCTATTTTTGGGAAGATCTTTGGTAGGTTTCATCAGGTTGATAGTGACGATTTAACCATT
GCTGTTAACTTTTGGTGGCAGTCGCACCTGATGTCTAGCATGTCAGAGCACATGGATGCATATTACTTGCGTAGAATATTGAGAAGATTGATGGACAAAGAAATGAACGA
AGTTTTGCATGTGCCTTGTTCTCTTGCTGACATGGATGACATGAAGAGCCATGAGCGTGATGTACCTAATAACAGAGATAAAGGTGTTCGTTGTCTGGGTCAAGCACTTG
ATGGTGGGGATGTTAGGGAGAAAGAGCCAAGGGAAGAAACTTCTCATGAACTTGAACCCCGCACTGCCCAGGCTCTGCATGGGCTTATTGCATTAGTACATGACCAAGTA
AGCGTTTCTGATCAAATTGGACCCCTTCAGTCGAGTTCTACAAATTGTTCTGCAGATGAAGAGGGGATGAAGATCACTAGTTTACATAGTTTGGAGAATGATCGAGTTGC
TACCTTTATTTGGAATTTGGAACCATGTATTCTTCAGAAGGTCCTTCTCACCATGGCGAATAACTTCCCAAGAACTCTCGAGGCTCTTATACTGCACTTGCTTTCACCTG
TTGGAGCAGAAGTTCTTACTCAAAAATTTGATCAGATGGACAAAGAAAACACCGAGGAGGACCAGAAAAGATTCTACGAAGTTTTTTATAGTTCATTTGATGATCAGTTT
GCTGTGATGGATGCAATTCTAAATAGCAAAGAGTCTTTCGCTCGTCAGGCGTTCAAAAGTGTGCTGGATAAATATTTGGGAGTGAACTTAAATGGGCCAAATCTGGGATC
TTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGACCTTAGAAGCCATGAGAGAGTACGATCCTTATCAGGATTCTTTAATTTTTAATTTTGAAGTTCAAGTTCCGCTTCCATTTTCTACTTTTATTCAATTTTGCAA
GCAGCGTCTGCAAGAAAGGAATGAAGAAAATGTTGTTTCCACTGAATCAAATTCTAACAGGACGACCTATCCTGACATGGAGAAAGGCTGCTTGCCTTTTGAAGATGATT
CTAAAAAATTATACTTAGCACAGGTGCCAATATTGGATGTTATAAACGAAGAAAGGGCACAATTGGCACCTCTGAGACAAGATATTCAAACACCTGCTTTTCTGGAGAAA
AAGAAGTTAGCCTCTATAAACCTTTGGATGAACAGTGCTCAATCTAGGTCAAGTACTCACTATGACCCTCACCATAATGTTCTATGCATAGTTTCTGGCAGTAAACAAGT
CATTTTGTGGCCCCCTTCTTCTACTCCCTCGCTGTACCCGATGCATATTTATGGAGAGGCCTCTAATCATAGCTCTGTTACTTTAGAAAAGCCCGACTATTCACTTTATC
CAAGAGCAAAATATTCCATGAAATCCTCTCAGACCGGGAAGGTTAAAAGGTCTATTTTTGGGAAGATCTTTGGTAGGTTTCATCAGGTTGATAGTGACGATTTAACCATT
GCTGTTAACTTTTGGTGGCAGTCGCACCTGATGTCTAGCATGTCAGAGCACATGGATGCATATTACTTGCGTAGAATATTGAGAAGATTGATGGACAAAGAAATGAACGA
AGTTTTGCATGTGCCTTGTTCTCTTGCTGACATGGATGACATGAAGAGCCATGAGCGTGATGTACCTAATAACAGAGATAAAGGTGTTCGTTGTCTGGGTCAAGCACTTG
ATGGTGGGGATGTTAGGGAGAAAGAGCCAAGGGAAGAAACTTCTCATGAACTTGAACCCCGCACTGCCCAGGCTCTGCATGGGCTTATTGCATTAGTACATGACCAAGTA
AGCGTTTCTGATCAAATTGGACCCCTTCAGTCGAGTTCTACAAATTGTTCTGCAGATGAAGAGGGGATGAAGATCACTAGTTTACATAGTTTGGAGAATGATCGAGTTGC
TACCTTTATTTGGAATTTGGAACCATGTATTCTTCAGAAGGTCCTTCTCACCATGGCGAATAACTTCCCAAGAACTCTCGAGGCTCTTATACTGCACTTGCTTTCACCTG
TTGGAGCAGAAGTTCTTACTCAAAAATTTGATCAGATGGACAAAGAAAACACCGAGGAGGACCAGAAAAGATTCTACGAAGTTTTTTATAGTTCATTTGATGATCAGTTT
GCTGTGATGGATGCAATTCTAAATAGCAAAGAGTCTTTCGCTCGTCAGGCGTTCAAAAGTGTGCTGGATAAATATTTGGGAGTGAACTTAAATGGGCCAAATCTGGGATC
TTGAAGGAAGTCCTATAAAGAAATTGGTCCTGCAGATTCTGGAGCACATCGATGCTATCCATTTTCTAAAGCTTTTCCTCTCAATTGGTACTTTTACATATAAGTTATAC
ACAGCTATGTATGATGTAAATCATTACCTACTCTAGGCATTACTTTGTAATTTATTCCAGATCCATCTTATTTTGTATAGAGAAGGAAA
Protein sequenceShow/hide protein sequence
MVTLEAMREYDPYQDSLIFNFEVQVPLPFSTFIQFCKQRLQERNEENVVSTESNSNRTTYPDMEKGCLPFEDDSKKLYLAQVPILDVINEERAQLAPLRQDIQTPAFLEK
KKLASINLWMNSAQSRSSTHYDPHHNVLCIVSGSKQVILWPPSSTPSLYPMHIYGEASNHSSVTLEKPDYSLYPRAKYSMKSSQTGKVKRSIFGKIFGRFHQVDSDDLTI
AVNFWWQSHLMSSMSEHMDAYYLRRILRRLMDKEMNEVLHVPCSLADMDDMKSHERDVPNNRDKGVRCLGQALDGGDVREKEPREETSHELEPRTAQALHGLIALVHDQV
SVSDQIGPLQSSSTNCSADEEGMKITSLHSLENDRVATFIWNLEPCILQKVLLTMANNFPRTLEALILHLLSPVGAEVLTQKFDQMDKENTEEDQKRFYEVFYSSFDDQF
AVMDAILNSKESFARQAFKSVLDKYLGVNLNGPNLGS