; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009316 (gene) of Snake gourd v1 genome

Gene IDTan0009316
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionE3 ubiquitin-protein ligase rnf8-A isoform X1
Genome locationLG04:70911473..70928655
RNA-Seq ExpressionTan0009316
SyntenyTan0009316
Gene Ontology termsGO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001357 - BRCT domain
IPR001841 - Zinc finger, RING-type
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017907 - Zinc finger, RING-type, conserved site
IPR024675 - Eukaryotic translation initiation factor 3 subunit G, N-terminal
IPR036420 - BRCT domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011651366.1 uncharacterized protein LOC101213123 [Cucumis sativus]3.8e-24180.7Show/hide
Query:  EDVGSRLTVVSTEEILPERPHAPGCPIEGMESVVATVSGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCI
        ED G   T++S     P R   PGCPIEGMESVV TVSGYHGTERFNLIKMISY GASYVGAMSRSITHLICWEL+GRKF+LA+KF+TIIVNHRWLEDCI
Subjt:  EDVGSRLTVVSTEEILPERPHAPGCPIEGMESVVATVSGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCI

Query:  KQGKRVPEGSYILQSGQSAGPLSMKLPLAVEGSVSTKKYNVLSKKLHNCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNI
        K GKRVPEG YILQSGQS GPLSMKLPLA +G VS KKYN+LS+KLHN GNVEDQSIK +CSFGD IL  S LLDKDL SDFRKSDDTA K K+K+   I
Subjt:  KQGKRVPEGSYILQSGQSAGPLSMKLPLAVEGSVSTKKYNVLSKKLHNCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNI

Query:  SKHEDPSSSSSRNHFEESPS-GFFAIERGSSSILARDEGKGENSNLNPTVKSSRRRR-LANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFD
        SK EDPSSSSSRN FEE  S G FAIE GS S LARDE KGE+SN + TVKSSRRRR L + NS +D N+ DI NFDPE Y+LGTR   N+ TV S  +D
Subjt:  SKHEDPSSSSSRNHFEESPS-GFFAIERGSSSILARDEGKGENSNLNPTVKSSRRRR-LANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFD

Query:  DETDIEVVNIGGTSDRDQLC-ERGLASDRFEGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDLQKNSEDPVIELNASLPSTSTELSCVICWTD
         ETDIEVVNIGGTSDR+QLC ERGLAS RFEGVEACENQSTS+D NL  +NAPR L+ITSEDELHN+NDLQKN EDPVIEL+ASLPSTSTELSCVICWTD
Subjt:  DETDIEVVNIGGTSDRDQLC-ERGLASDRFEGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDLQKNSEDPVIELNASLPSTSTELSCVICWTD

Query:  FSSTRGVLPCGHRFCYSCIQNWADHMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDERILSSGVQPSVAAVCSACRCRE
        FSSTRGVLPCGHRFCYSCIQNWADHMA SRKISTCPLCKASFLSITKVE AA+SDQKIYSQTIPCG SLLDIY+L DER L++ VQPSVAAVCSACRCRE
Subjt:  FSSTRGVLPCGHRFCYSCIQNWADHMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDERILSSGVQPSVAAVCSACRCRE

Query:  PEDLLMSCHLCQIRHIHSYCLDPPLLPWTCIHCKDLQTLYHRSH
        PEDLLMSCHLCQIR IHSYCLDPPLLPWTCIHCKDLQTLYHRSH
Subjt:  PEDLLMSCHLCQIRHIHSYCLDPPLLPWTCIHCKDLQTLYHRSH

XP_022148261.1 uncharacterized protein LOC111016967 isoform X1 [Momordica charantia]9.3e-24081.15Show/hide
Query:  GCPIEGMESVVATVSGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIKQGKRVPEGSYILQSGQSAGPLS
        GCPIEGMESVVATVSGYHGTERFNLIKMISY GASYVGAMSRSITHLICW+LEGRKF+LAKKFKTIIVNHRWLEDCI+QGKRVPEG YILQSGQSAGPLS
Subjt:  GCPIEGMESVVATVSGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIKQGKRVPEGSYILQSGQSAGPLS

Query:  MKLPLAVEGSVSTKKYNVLSKKLHNCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNISKHEDPSSSSSRNHFEE-SPSGF
        ++LPLA + SVST KYNVLS+K  NCGNVE+QSIKS+ SFG  I   SCLLDKDL+SDF KSDDT+ K K+KL   ISK EDPS+SSSRN+F+E +PS F
Subjt:  MKLPLAVEGSVSTKKYNVLSKKLHNCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNISKHEDPSSSSSRNHFEE-SPSGF

Query:  FAIERGSSSILARDEGKGENSNLNPTVKSSRRRRLANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFDDETDIEVVNIGGTSDRDQLC-ERG
          IERGSSS LARDE KGENSNL+PTVKSSRRRRL N+N+S+D  + D+W+FDPECY LGTRNHGNNFTV S H ++E DIEVV IGG+SD   LC E G
Subjt:  FAIERGSSSILARDEGKGENSNLNPTVKSSRRRRLANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFDDETDIEVVNIGGTSDRDQLC-ERG

Query:  LASDRFEGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDLQKNSEDPVIELNASLPSTSTELSCVICWTDFSSTRGVLPCGHRFCYSCIQNWAD
        + SD FEGVEA ENQ TS+D NLP ENAP  L I+SEDEL NI+ LQK  EDP +E NASLP TS ELSCVICWTDFSS RGVLPCGHRFCYSCIQNWAD
Subjt:  LASDRFEGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDLQKNSEDPVIELNASLPSTSTELSCVICWTDFSSTRGVLPCGHRFCYSCIQNWAD

Query:  HMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDERILSSGVQPSVAAVCSACRCREPEDLLMSCHLCQIRHIHSYCLDPP
        HMASSRKISTCPLCKASFLSITKVE AA+SDQKIYSQTIPCGPSLLDI+ILPDER L+SGVQ SV  VCSACRCREPEDLLMSCHLCQIRHIHSYCLDPP
Subjt:  HMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDERILSSGVQPSVAAVCSACRCREPEDLLMSCHLCQIRHIHSYCLDPP

Query:  LLPWTCIHCKDLQTLYHRSH
        LLPWTCIHCKDLQTLYHRSH
Subjt:  LLPWTCIHCKDLQTLYHRSH

XP_022935568.1 uncharacterized protein LOC111442404 [Cucurbita moschata]2.7e-23981.63Show/hide
Query:  PERPHAPGCPIEGMESVVATVSGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIKQGKRVPEGSYILQSG
        P RP   GCPIEGMESVVATVSGYHGTERFNLIKMISY GASYVG MSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIK GKRVPE  YILQSG
Subjt:  PERPHAPGCPIEGMESVVATVSGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIKQGKRVPEGSYILQSG

Query:  QSAGPLSMKLPLAVEGSVSTKKYNVLSKKLHNCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNISKHEDPSSSSSRNHFE
        QSAGPLSMKLP + +GSVSTKKY V S+KLHNCGN+EDQ IK +CSFGD IL HS LLDK++Y DFR SDDTA K K+KL   ISK E+PSSSSS+NHF+
Subjt:  QSAGPLSMKLPLAVEGSVSTKKYNVLSKKLHNCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNISKHEDPSSSSSRNHFE

Query:  E-SPSGFFAIERGSSSILARDEGKGENSNLNPTVKSSRR-RRLANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFDDETDIEVVNIGGTSDR
        E +PS FFAI  GSSS LARDE KG+  N N TV+SSRR RRL  KNSS+D NE D+WNFDPE Y L  R   N+ TV SSH DDETDIE VNIGGT+D 
Subjt:  E-SPSGFFAIERGSSSILARDEGKGENSNLNPTVKSSRR-RRLANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFDDETDIEVVNIGGTSDR

Query:  DQLC-ERGLASDRFEGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDLQKNSEDPVIELNASLPSTSTELSCVICWTDFSSTRGVLPCGHRFCY
        DQLC ERGLASD FEGVEAC NQSTSR  NL  ENAPR LT TSEDELH  N LQKN ED VIELN S+PSTSTELSCVICWTDFSSTRGVLPCGHRFCY
Subjt:  DQLC-ERGLASDRFEGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDLQKNSEDPVIELNASLPSTSTELSCVICWTDFSSTRGVLPCGHRFCY

Query:  SCIQNWADHMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDERILSSGVQPSVAAVCSACRCREPEDLLMSCHLCQIRHI
        SCIQNWADHMAS RKISTCPLCKASFLSITKVE AA+SDQKIYSQTIPCG SLLDIYILPDER L S VQPSVAAVCS CRCREPEDLLMSCHLCQIRHI
Subjt:  SCIQNWADHMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDERILSSGVQPSVAAVCSACRCREPEDLLMSCHLCQIRHI

Query:  HSYCLDPPLLPWTCIHCKDLQTLYHRSH
        HSYCLDPPLLPW CIHCKDLQTLYHR H
Subjt:  HSYCLDPPLLPWTCIHCKDLQTLYHRSH

XP_022983591.1 E3 ubiquitin-protein ligase rnf8-A isoform X1 [Cucurbita maxima]1.2e-24282.01Show/hide
Query:  PERPHAPGCPIEGMESVVATVSGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIKQGKRVPEGSYILQSG
        P RP   GCPIEGMESVVATVSGYHGTERFNLIKMISY GASYVG MSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIKQG RVPE  YILQSG
Subjt:  PERPHAPGCPIEGMESVVATVSGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIKQGKRVPEGSYILQSG

Query:  QSAGPLSMKLPLAVEGSVSTKKYNVLSKKLHNCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNISKHEDPSSSSSRNHFE
        QSAGPLSMKLP + +GSVSTKKY V S+KLHNCGNVEDQ IK +CSFGD IL HS LLDK++Y DFR SDDTA K K+KL   ISK E+PSSSSS+NHF+
Subjt:  QSAGPLSMKLPLAVEGSVSTKKYNVLSKKLHNCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNISKHEDPSSSSSRNHFE

Query:  E-SPSGFFAIERGSSSILARDEGKGENSNLNPTVKSSRR-RRLANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFDDETDIEVVNIGGTSDR
        E +PS FFAI  GSSS LARDE KG+  N N TV+SSRR RRL  KNSS+D N+ D+WNFDPE Y L TR   N+ TV SSH DDETD EVVN+GGT+DR
Subjt:  E-SPSGFFAIERGSSSILARDEGKGENSNLNPTVKSSRR-RRLANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFDDETDIEVVNIGGTSDR

Query:  DQLC-ERGLASDRFEGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDLQKNSEDPVIELNASLPSTSTELSCVICWTDFSSTRGVLPCGHRFCY
        DQLC ERGLASD FEGVEACENQSTSR  NL  ENAPR LT+TSEDELH   DLQK  EDPVIELN S+PST+TELSCVICWTDFSSTRGVLPCGHRFCY
Subjt:  DQLC-ERGLASDRFEGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDLQKNSEDPVIELNASLPSTSTELSCVICWTDFSSTRGVLPCGHRFCY

Query:  SCIQNWADHMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDERILSSGVQPSVAAVCSACRCREPEDLLMSCHLCQIRHI
        SCIQNWADHMASSRKISTCPLCKASFLSITKVE AA+SDQKIYSQTIPCGPSLLDIYILPDER L S VQPSVAAVCS CRC+EPEDLLMSCHLCQIRHI
Subjt:  SCIQNWADHMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDERILSSGVQPSVAAVCSACRCREPEDLLMSCHLCQIRHI

Query:  HSYCLDPPLLPWTCIHCKDLQTLYHRSH
        HSYCLDPPLLPW CIHCKDLQTLYHR H
Subjt:  HSYCLDPPLLPWTCIHCKDLQTLYHRSH

XP_038887153.1 uncharacterized protein LOC120077302 [Benincasa hispida]1.1e-24583.72Show/hide
Query:  PGCPIEGMESVVATVSGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIKQGKRVPEGSYILQSGQSAGPL
        PGCPIEGMESVVATVSGYHGTERFNLIKMISY GASYVGAMSRSITHLICWEL+GRKFNLAKKF+TIIVNHRWLEDCIK GKRVPEG Y+LQSGQ AGPL
Subjt:  PGCPIEGMESVVATVSGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIKQGKRVPEGSYILQSGQSAGPL

Query:  SMKLPLAVEGSVSTKKYNVLSKKLHNCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNISKHEDPSSSSSRNHFEE-SPSG
        SMKLPLA +GSVSTKK N+L +KLHN GNV+DQSIK +CSF D IL HS LLDKDLYSDFR SD TA K K  L   ISK ED SSSSSRNHFEE +PSG
Subjt:  SMKLPLAVEGSVSTKKYNVLSKKLHNCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNISKHEDPSSSSSRNHFEE-SPSG

Query:  FFAIERGSSSILARDEGKGENSNLNPTVKSSRR-RRLANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFDDETDIEVVNIGGTSDRDQLC-E
        FFAIERGSSS LARDE KGE+SN + TV SSRR RRL NKNS++D N+ DIWNFD     LGTR   N+ T  SSH+DDET+IEVVNIGGTSDRDQL  E
Subjt:  FFAIERGSSSILARDEGKGENSNLNPTVKSSRR-RRLANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFDDETDIEVVNIGGTSDRDQLC-E

Query:  RGLASDRFEGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDLQKNSEDPVIELNASLPSTSTELSCVICWTDFSSTRGVLPCGHRFCYSCIQNW
        RGL+SD FEG+EACE+QSTSRD NL  ENAPR  +ITSEDELHNIN+LQKN EDPVIELNASLPSTSTELSCVICWTDFSSTRGVLPCGHRFCYSCIQNW
Subjt:  RGLASDRFEGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDLQKNSEDPVIELNASLPSTSTELSCVICWTDFSSTRGVLPCGHRFCYSCIQNW

Query:  ADHMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDERILSSGVQPSVAAVCSACRCREPEDLLMSCHLCQIRHIHSYCLD
        ADHMA SRKISTCPLCKA+FLSITKVE AA+SDQKIYSQTIPCGPSLLDIYILPDER L++ VQPSVAAVCSACRCREPEDLLMSCHLCQIRHIHSYCLD
Subjt:  ADHMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDERILSSGVQPSVAAVCSACRCREPEDLLMSCHLCQIRHIHSYCLD

Query:  PPLLPWTCIHCKDLQTLYHRSH
        PPLLPWTCIHCKDLQ LYHRSH
Subjt:  PPLLPWTCIHCKDLQTLYHRSH

TrEMBL top hitse value%identityAlignment
A0A0A0L7F3 Uncharacterized protein1.8e-24180.7Show/hide
Query:  EDVGSRLTVVSTEEILPERPHAPGCPIEGMESVVATVSGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCI
        ED G   T++S     P R   PGCPIEGMESVV TVSGYHGTERFNLIKMISY GASYVGAMSRSITHLICWEL+GRKF+LA+KF+TIIVNHRWLEDCI
Subjt:  EDVGSRLTVVSTEEILPERPHAPGCPIEGMESVVATVSGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCI

Query:  KQGKRVPEGSYILQSGQSAGPLSMKLPLAVEGSVSTKKYNVLSKKLHNCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNI
        K GKRVPEG YILQSGQS GPLSMKLPLA +G VS KKYN+LS+KLHN GNVEDQSIK +CSFGD IL  S LLDKDL SDFRKSDDTA K K+K+   I
Subjt:  KQGKRVPEGSYILQSGQSAGPLSMKLPLAVEGSVSTKKYNVLSKKLHNCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNI

Query:  SKHEDPSSSSSRNHFEESPS-GFFAIERGSSSILARDEGKGENSNLNPTVKSSRRRR-LANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFD
        SK EDPSSSSSRN FEE  S G FAIE GS S LARDE KGE+SN + TVKSSRRRR L + NS +D N+ DI NFDPE Y+LGTR   N+ TV S  +D
Subjt:  SKHEDPSSSSSRNHFEESPS-GFFAIERGSSSILARDEGKGENSNLNPTVKSSRRRR-LANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFD

Query:  DETDIEVVNIGGTSDRDQLC-ERGLASDRFEGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDLQKNSEDPVIELNASLPSTSTELSCVICWTD
         ETDIEVVNIGGTSDR+QLC ERGLAS RFEGVEACENQSTS+D NL  +NAPR L+ITSEDELHN+NDLQKN EDPVIEL+ASLPSTSTELSCVICWTD
Subjt:  DETDIEVVNIGGTSDRDQLC-ERGLASDRFEGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDLQKNSEDPVIELNASLPSTSTELSCVICWTD

Query:  FSSTRGVLPCGHRFCYSCIQNWADHMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDERILSSGVQPSVAAVCSACRCRE
        FSSTRGVLPCGHRFCYSCIQNWADHMA SRKISTCPLCKASFLSITKVE AA+SDQKIYSQTIPCG SLLDIY+L DER L++ VQPSVAAVCSACRCRE
Subjt:  FSSTRGVLPCGHRFCYSCIQNWADHMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDERILSSGVQPSVAAVCSACRCRE

Query:  PEDLLMSCHLCQIRHIHSYCLDPPLLPWTCIHCKDLQTLYHRSH
        PEDLLMSCHLCQIR IHSYCLDPPLLPWTCIHCKDLQTLYHRSH
Subjt:  PEDLLMSCHLCQIRHIHSYCLDPPLLPWTCIHCKDLQTLYHRSH

A0A6J1D3H7 uncharacterized protein LOC111016967 isoform X28.8e-23680.93Show/hide
Query:  MESVVATVSGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIKQGKRVPEGSYILQSGQSAGPLSMKLPLA
        MESVVATVSGYHGTERFNLIKMISY GASYVGAMSRSITHLICW+LEGRKF+LAKKFKTIIVNHRWLEDCI+QGKRVPEG YILQSGQSAGPLS++LPLA
Subjt:  MESVVATVSGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIKQGKRVPEGSYILQSGQSAGPLSMKLPLA

Query:  VEGSVSTKKYNVLSKKLHNCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNISKHEDPSSSSSRNHFEE-SPSGFFAIERG
         + SVST KYNVLS+K  NCGNVE+QSIKS+ SFG  I   SCLLDKDL+SDF KSDDT+ K K+KL   ISK EDPS+SSSRN+F+E +PS F  IERG
Subjt:  VEGSVSTKKYNVLSKKLHNCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNISKHEDPSSSSSRNHFEE-SPSGFFAIERG

Query:  SSSILARDEGKGENSNLNPTVKSSRRRRLANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFDDETDIEVVNIGGTSDRDQLC-ERGLASDRF
        SSS LARDE KGENSNL+PTVKSSRRRRL N+N+S+D  + D+W+FDPECY LGTRNHGNNFTV S H ++E DIEVV IGG+SD   LC E G+ SD F
Subjt:  SSSILARDEGKGENSNLNPTVKSSRRRRLANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFDDETDIEVVNIGGTSDRDQLC-ERGLASDRF

Query:  EGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDLQKNSEDPVIELNASLPSTSTELSCVICWTDFSSTRGVLPCGHRFCYSCIQNWADHMASSR
        EGVEA ENQ TS+D NLP ENAP  L I+SEDEL NI+ LQK  EDP +E NASLP TS ELSCVICWTDFSS RGVLPCGHRFCYSCIQNWADHMASSR
Subjt:  EGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDLQKNSEDPVIELNASLPSTSTELSCVICWTDFSSTRGVLPCGHRFCYSCIQNWADHMASSR

Query:  KISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDERILSSGVQPSVAAVCSACRCREPEDLLMSCHLCQIRHIHSYCLDPPLLPWTC
        KISTCPLCKASFLSITKVE AA+SDQKIYSQTIPCGPSLLDI+ILPDER L+SGVQ SV  VCSACRCREPEDLLMSCHLCQIRHIHSYCLDPPLLPWTC
Subjt:  KISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDERILSSGVQPSVAAVCSACRCREPEDLLMSCHLCQIRHIHSYCLDPPLLPWTC

Query:  IHCKDLQTLYHRSH
        IHCKDLQTLYHRSH
Subjt:  IHCKDLQTLYHRSH

A0A6J1D4V1 uncharacterized protein LOC111016967 isoform X14.5e-24081.15Show/hide
Query:  GCPIEGMESVVATVSGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIKQGKRVPEGSYILQSGQSAGPLS
        GCPIEGMESVVATVSGYHGTERFNLIKMISY GASYVGAMSRSITHLICW+LEGRKF+LAKKFKTIIVNHRWLEDCI+QGKRVPEG YILQSGQSAGPLS
Subjt:  GCPIEGMESVVATVSGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIKQGKRVPEGSYILQSGQSAGPLS

Query:  MKLPLAVEGSVSTKKYNVLSKKLHNCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNISKHEDPSSSSSRNHFEE-SPSGF
        ++LPLA + SVST KYNVLS+K  NCGNVE+QSIKS+ SFG  I   SCLLDKDL+SDF KSDDT+ K K+KL   ISK EDPS+SSSRN+F+E +PS F
Subjt:  MKLPLAVEGSVSTKKYNVLSKKLHNCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNISKHEDPSSSSSRNHFEE-SPSGF

Query:  FAIERGSSSILARDEGKGENSNLNPTVKSSRRRRLANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFDDETDIEVVNIGGTSDRDQLC-ERG
          IERGSSS LARDE KGENSNL+PTVKSSRRRRL N+N+S+D  + D+W+FDPECY LGTRNHGNNFTV S H ++E DIEVV IGG+SD   LC E G
Subjt:  FAIERGSSSILARDEGKGENSNLNPTVKSSRRRRLANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFDDETDIEVVNIGGTSDRDQLC-ERG

Query:  LASDRFEGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDLQKNSEDPVIELNASLPSTSTELSCVICWTDFSSTRGVLPCGHRFCYSCIQNWAD
        + SD FEGVEA ENQ TS+D NLP ENAP  L I+SEDEL NI+ LQK  EDP +E NASLP TS ELSCVICWTDFSS RGVLPCGHRFCYSCIQNWAD
Subjt:  LASDRFEGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDLQKNSEDPVIELNASLPSTSTELSCVICWTDFSSTRGVLPCGHRFCYSCIQNWAD

Query:  HMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDERILSSGVQPSVAAVCSACRCREPEDLLMSCHLCQIRHIHSYCLDPP
        HMASSRKISTCPLCKASFLSITKVE AA+SDQKIYSQTIPCGPSLLDI+ILPDER L+SGVQ SV  VCSACRCREPEDLLMSCHLCQIRHIHSYCLDPP
Subjt:  HMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDERILSSGVQPSVAAVCSACRCREPEDLLMSCHLCQIRHIHSYCLDPP

Query:  LLPWTCIHCKDLQTLYHRSH
        LLPWTCIHCKDLQTLYHRSH
Subjt:  LLPWTCIHCKDLQTLYHRSH

A0A6J1FAW6 uncharacterized protein LOC1114424041.3e-23981.63Show/hide
Query:  PERPHAPGCPIEGMESVVATVSGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIKQGKRVPEGSYILQSG
        P RP   GCPIEGMESVVATVSGYHGTERFNLIKMISY GASYVG MSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIK GKRVPE  YILQSG
Subjt:  PERPHAPGCPIEGMESVVATVSGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIKQGKRVPEGSYILQSG

Query:  QSAGPLSMKLPLAVEGSVSTKKYNVLSKKLHNCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNISKHEDPSSSSSRNHFE
        QSAGPLSMKLP + +GSVSTKKY V S+KLHNCGN+EDQ IK +CSFGD IL HS LLDK++Y DFR SDDTA K K+KL   ISK E+PSSSSS+NHF+
Subjt:  QSAGPLSMKLPLAVEGSVSTKKYNVLSKKLHNCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNISKHEDPSSSSSRNHFE

Query:  E-SPSGFFAIERGSSSILARDEGKGENSNLNPTVKSSRR-RRLANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFDDETDIEVVNIGGTSDR
        E +PS FFAI  GSSS LARDE KG+  N N TV+SSRR RRL  KNSS+D NE D+WNFDPE Y L  R   N+ TV SSH DDETDIE VNIGGT+D 
Subjt:  E-SPSGFFAIERGSSSILARDEGKGENSNLNPTVKSSRR-RRLANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFDDETDIEVVNIGGTSDR

Query:  DQLC-ERGLASDRFEGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDLQKNSEDPVIELNASLPSTSTELSCVICWTDFSSTRGVLPCGHRFCY
        DQLC ERGLASD FEGVEAC NQSTSR  NL  ENAPR LT TSEDELH  N LQKN ED VIELN S+PSTSTELSCVICWTDFSSTRGVLPCGHRFCY
Subjt:  DQLC-ERGLASDRFEGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDLQKNSEDPVIELNASLPSTSTELSCVICWTDFSSTRGVLPCGHRFCY

Query:  SCIQNWADHMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDERILSSGVQPSVAAVCSACRCREPEDLLMSCHLCQIRHI
        SCIQNWADHMAS RKISTCPLCKASFLSITKVE AA+SDQKIYSQTIPCG SLLDIYILPDER L S VQPSVAAVCS CRCREPEDLLMSCHLCQIRHI
Subjt:  SCIQNWADHMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDERILSSGVQPSVAAVCSACRCREPEDLLMSCHLCQIRHI

Query:  HSYCLDPPLLPWTCIHCKDLQTLYHRSH
        HSYCLDPPLLPW CIHCKDLQTLYHR H
Subjt:  HSYCLDPPLLPWTCIHCKDLQTLYHRSH

A0A6J1J2R6 E3 ubiquitin-protein ligase rnf8-A isoform X15.7e-24382.01Show/hide
Query:  PERPHAPGCPIEGMESVVATVSGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIKQGKRVPEGSYILQSG
        P RP   GCPIEGMESVVATVSGYHGTERFNLIKMISY GASYVG MSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIKQG RVPE  YILQSG
Subjt:  PERPHAPGCPIEGMESVVATVSGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIKQGKRVPEGSYILQSG

Query:  QSAGPLSMKLPLAVEGSVSTKKYNVLSKKLHNCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNISKHEDPSSSSSRNHFE
        QSAGPLSMKLP + +GSVSTKKY V S+KLHNCGNVEDQ IK +CSFGD IL HS LLDK++Y DFR SDDTA K K+KL   ISK E+PSSSSS+NHF+
Subjt:  QSAGPLSMKLPLAVEGSVSTKKYNVLSKKLHNCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNISKHEDPSSSSSRNHFE

Query:  E-SPSGFFAIERGSSSILARDEGKGENSNLNPTVKSSRR-RRLANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFDDETDIEVVNIGGTSDR
        E +PS FFAI  GSSS LARDE KG+  N N TV+SSRR RRL  KNSS+D N+ D+WNFDPE Y L TR   N+ TV SSH DDETD EVVN+GGT+DR
Subjt:  E-SPSGFFAIERGSSSILARDEGKGENSNLNPTVKSSRR-RRLANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFDDETDIEVVNIGGTSDR

Query:  DQLC-ERGLASDRFEGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDLQKNSEDPVIELNASLPSTSTELSCVICWTDFSSTRGVLPCGHRFCY
        DQLC ERGLASD FEGVEACENQSTSR  NL  ENAPR LT+TSEDELH   DLQK  EDPVIELN S+PST+TELSCVICWTDFSSTRGVLPCGHRFCY
Subjt:  DQLC-ERGLASDRFEGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDLQKNSEDPVIELNASLPSTSTELSCVICWTDFSSTRGVLPCGHRFCY

Query:  SCIQNWADHMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDERILSSGVQPSVAAVCSACRCREPEDLLMSCHLCQIRHI
        SCIQNWADHMASSRKISTCPLCKASFLSITKVE AA+SDQKIYSQTIPCGPSLLDIYILPDER L S VQPSVAAVCS CRC+EPEDLLMSCHLCQIRHI
Subjt:  SCIQNWADHMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDERILSSGVQPSVAAVCSACRCREPEDLLMSCHLCQIRHI

Query:  HSYCLDPPLLPWTCIHCKDLQTLYHRSH
        HSYCLDPPLLPW CIHCKDLQTLYHR H
Subjt:  HSYCLDPPLLPWTCIHCKDLQTLYHRSH

SwissProt top hitse value%identityAlignment
O04251 BRCT domain-containing protein At4g021103.3e-1441.38Show/hide
Query:  IEGMESVVATVSGYHGTERFNLIKMISYAGASYVGAM-SRSITHLICWELEGRKFNLAKKFKTI-IVNHRWLEDCIKQGKRVPEGSY
        I G +++V  ++GY G +R ++++M+   G  +   + +  +THLIC++ EG K+ LAK+ K I +VNHRWLEDC+K  K +PE  Y
Subjt:  IEGMESVVATVSGYHGTERFNLIKMISYAGASYVGAM-SRSITHLICWELEGRKFNLAKKFKTI-IVNHRWLEDCIKQGKRVPEGSY

Q7ZZY3 DNA topoisomerase 2-binding protein 1-B4.0e-0427.64Show/hide
Query:  VSTEEILPERPHAPGCPIEGME------SVVATVSGYHGTERFNLIKMISYAGASY-----------VGAMSRSITHLICWELEGRKFNLAKKFKTIIVN
        +  E++L    +A   P+  +E        V +VS + G ER +L+ +    GA              G  +RS  HL   + EG K+  AKK+    V 
Subjt:  VSTEEILPERPHAPGCPIEGME------SVVATVSGYHGTERFNLIKMISYAGASY-----------VGAMSRSITHLICWELEGRKFNLAKKFKTIIVN

Query:  HRWLEDCIKQGKRVPEGSYILQS
          WL  C + GK+  E SY++ +
Subjt:  HRWLEDCIKQGKRVPEGSYILQS

Q800K6 DNA topoisomerase 2-binding protein 1-A8.2e-0526.45Show/hide
Query:  VSTEEILPERPHAPGCPIEGME------SVVATVSGYHGTERFNLIKMISYAGASYVGAMSR---------SITHLICWELEGRKFNLAKKFKTIIVNHR
        +  E++L    +A   P+  +E        V +VS + G ER +L+ +    GA       R         + THL+  + EG K+  AKK+    V   
Subjt:  VSTEEILPERPHAPGCPIEGME------SVVATVSGYHGTERFNLIKMISYAGASYVGAMSR---------SITHLICWELEGRKFNLAKKFKTIIVNHR

Query:  WLEDCIKQGKRVPEGSYILQS
        WL  C + G++  E SY++ +
Subjt:  WLEDCIKQGKRVPEGSYILQS

Q9NS56 E3 ubiquitin-protein ligase Topors5.3e-0431.68Show/hide
Query:  ELNASLPS-TSTELSCVICWTDFSSTRGVLPCGHRFCYSCIQNWADHMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDE
        +L  ++P+  S +  C IC   F +   +  C H+FC+ C+Q W+ + A       CPLCK  F SI      A  D K Y       PS    ++ PD 
Subjt:  ELNASLPS-TSTELSCVICWTDFSSTRGVLPCGHRFCYSCIQNWADHMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDE

Query:  R
        R
Subjt:  R

Arabidopsis top hitse value%identityAlignment
AT1G67180.1 zinc finger (C3HC4-type RING finger) family protein / BRCT domain-containing protein6.2e-9339.39Show/hide
Query:  MESVVATVSGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIKQGKRVPEGSYILQSGQSAGPLSMKLPLA
        ME+VVATVSGYHG++RF LIK+IS++GASYVGAMSRSITHL+CW+ EG+K++LAKKF T++VNHRW+E+C+K+G+RV E  Y+  SG+  GPL ++LP  
Subjt:  MESVVATVSGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIKQGKRVPEGSYILQSGQSAGPLSMKLPLA

Query:  VEGSVSTKKYNVLS----KKLHNCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNISKHEDPSSSSSRNHFEESPSGFFAI
         E +  TKK N  S    K   N G     S   L ++          ++K++ ++        ++P                                 
Subjt:  VEGSVSTKKYNVLS----KKLHNCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNISKHEDPSSSSSRNHFEESPSGFFAI

Query:  ERGSSSILARDEGKGENSNLNPTVKSSRRRRLANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFDDETDIEVVNIGG----TSDRDQLCERG
            SSIL   E  G         +SSR+ +         RN +D+     E  +    NH +N   + +   D  +    N+ G      +   L   G
Subjt:  ERGSSSILARDEGKGENSNLNPTVKSSRRRRLANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFDDETDIEVVNIGG----TSDRDQLCERG

Query:  -LASDRF--EGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDLQKNSEDPVIELNASLPSTSTELSCVICWTDFSSTRGVLPCGHRFCYSCIQN
         LA+  +  + +E  EN S S     P   +P    I  +D+           ++   E   +      ++SC+ICWT+FSS+RG+LPCGHRFCYSCIQ 
Subjt:  -LASDRF--EGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDLQKNSEDPVIELNASLPSTSTELSCVICWTDFSSTRGVLPCGHRFCYSCIQN

Query:  WADHMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDI-YILPDERILSSGVQP-SVAAVCSACRCREPEDLLMSCHLCQIRHIHSY
        WAD + S RK +TCPLCK++F++ITK+E A SSDQKIYSQT+P   S  +I  +LP+E      + P + A+ CS C   EPE+LL+ CHLC  R IHSY
Subjt:  WADHMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDI-YILPDERILSSGVQP-SVAAVCSACRCREPEDLLMSCHLCQIRHIHSY

Query:  CLDPPLLPWTCIHCKDLQTLYHR
        CLDP LLPWTC HC DLQ +YHR
Subjt:  CLDPPLLPWTCIHCKDLQTLYHR

AT3G11400.1 eukaryotic translation initiation factor 3G11.6e-2456.19Show/hide
Query:  SKLRWGELDGDDGDLD---------------LKKTVEYRFNEDGNKVKVSTTSRTRKLAHARLSKHPMERASWAKFGDAVHEDVGSRLTVVSTEEILPER
        SK RWGE+D +D DLD               LK T+EY+FN++ NKVK++T +R RKLA ARL+K  MER +W KFGDA +E+ GS LT+VSTEEIL ER
Subjt:  SKLRWGELDGDDGDLD---------------LKKTVEYRFNEDGNKVKVSTTSRTRKLAHARLSKHPMERASWAKFGDAVHEDVGSRLTVVSTEEILPER

Query:  PHAPG
        P APG
Subjt:  PHAPG

AT3G11400.2 eukaryotic translation initiation factor 3G11.6e-2456.19Show/hide
Query:  SKLRWGELDGDDGDLD---------------LKKTVEYRFNEDGNKVKVSTTSRTRKLAHARLSKHPMERASWAKFGDAVHEDVGSRLTVVSTEEILPER
        SK RWGE+D +D DLD               LK T+EY+FN++ NKVK++T +R RKLA ARL+K  MER +W KFGDA +E+ GS LT+VSTEEIL ER
Subjt:  SKLRWGELDGDDGDLD---------------LKKTVEYRFNEDGNKVKVSTTSRTRKLAHARLSKHPMERASWAKFGDAVHEDVGSRLTVVSTEEILPER

Query:  PHAPG
        P APG
Subjt:  PHAPG

AT4G02110.1 transcription coactivators2.3e-1541.38Show/hide
Query:  IEGMESVVATVSGYHGTERFNLIKMISYAGASYVGAM-SRSITHLICWELEGRKFNLAKKFKTI-IVNHRWLEDCIKQGKRVPEGSY
        I G +++V  ++GY G +R ++++M+   G  +   + +  +THLIC++ EG K+ LAK+ K I +VNHRWLEDC+K  K +PE  Y
Subjt:  IEGMESVVATVSGYHGTERFNLIKMISYAGASYVGAM-SRSITHLICWELEGRKFNLAKKFKTI-IVNHRWLEDCIKQGKRVPEGSY

AT5G06000.1 eukaryotic translation initiation factor 3G22.7e-1946.22Show/hide
Query:  KLRWGELDGDDGDLD---------------LKKTVEYRFNEDGNKVKVSTTSRTRKLAHARLSKHPMERASWAKFGDAVHEDVGSRLTVVSTEEILPERP
        KLRWGE+D ++GD D               +KK +EY+FNE+  KVK++TT+R +K A   L+K  +ER SW KFGDA HE+  S LT+ STE+I+ ER 
Subjt:  KLRWGELDGDDGDLD---------------LKKTVEYRFNEDGNKVKVSTTSRTRKLAHARLSKHPMERASWAKFGDAVHEDVGSRLTVVSTEEILPERP

Query:  HAPGCPIEGMESVVATVSG
         APG   E      +TVSG
Subjt:  HAPGCPIEGMESVVATVSG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAGCCGACAAATCCATCCAAATTGCGATGGGGGGAGCTAGACGGCGACGATGGGGATCTCGATTTGAAGAAAACTGTTGAATATCGGTTTAACGAAGATGGTAA
CAAGGTCAAAGTCTCAACGACCTCTCGAACTCGGAAGCTCGCCCATGCTCGCCTGAGTAAACATCCCATGGAGCGCGCATCTTGGGCCAAGTTCGGGGATGCCGTACATG
AAGACGTCGGCAGTAGGCTCACTGTGGTCTCCACTGAGGAGATTCTTCCCGAGCGCCCTCATGCTCCTGGTTGTCCAATTGAAGGTATGGAATCTGTGGTTGCAACTGTC
AGTGGCTACCATGGCACAGAAAGATTCAACCTTATCAAGATGATATCTTATGCTGGTGCCAGCTATGTGGGTGCCATGTCAAGGTCTATTACTCATTTGATTTGTTGGGA
ATTGGAAGGGAGGAAATTTAACCTTGCTAAGAAGTTCAAAACAATAATAGTCAATCATCGTTGGCTTGAAGATTGCATCAAACAAGGAAAGCGTGTCCCAGAAGGTTCGT
ACATTCTTCAAAGTGGCCAATCAGCAGGTCCCTTGTCAATGAAACTCCCTCTTGCTGTCGAGGGTTCTGTCTCAACCAAAAAGTATAACGTGCTTTCTAAAAAGTTACAT
AATTGTGGAAATGTTGAAGATCAAAGCATTAAAAGCTTATGCTCTTTTGGTGATCCAATCTTGTCCCATTCTTGTTTGCTGGATAAGGATCTGTACTCTGATTTTAGAAA
GAGCGACGACACTGCTCGTAAACCAAAGTACAAATTGTGGAATAATATTTCTAAGCACGAAGACCCATCAAGCTCGAGTAGCAGAAACCATTTTGAGGAGTCTCCCTCTG
GATTTTTTGCAATTGAGCGTGGTAGCTCTTCAATCTTGGCGAGAGATGAAGGAAAAGGGGAAAATAGTAACCTGAACCCTACTGTTAAATCTTCAAGGAGGCGGCGGCTT
GCGAATAAAAATTCTAGCAAAGATCGTAACGAGGTTGACATTTGGAATTTTGACCCAGAGTGCTATCAGTTGGGAACTCGTAACCATGGTAATAATTTTACAGTTTCATC
TAGCCATTTTGATGATGAAACCGATATTGAAGTGGTAAACATTGGAGGAACATCTGATCGTGATCAGTTGTGTGAAAGGGGACTAGCAAGTGACCGCTTTGAAGGTGTTG
AAGCTTGTGAGAATCAATCTACTTCCAGAGATCCAAACTTACCATTCGAAAATGCACCCAGAGACCTTACGATAACTTCAGAAGATGAGTTGCACAATATTAATGATTTA
CAAAAGAACTCTGAGGATCCAGTCATAGAACTCAATGCAAGCTTACCATCCACTTCAACAGAGTTATCATGTGTTATCTGTTGGACAGATTTTAGTTCGACAAGGGGAGT
TTTGCCCTGTGGGCACCGATTTTGCTATTCATGCATTCAAAACTGGGCAGATCACATGGCTTCGAGCAGAAAGATCTCAACTTGCCCTTTGTGCAAAGCCAGTTTTCTGA
GCATCACAAAGGTTGAAGCTGCTGCCTCTTCGGATCAGAAGATATACTCCCAAACAATTCCATGTGGTCCATCACTATTGGATATTTACATTCTTCCTGATGAAAGAATT
CTTAGCAGTGGGGTTCAGCCCTCTGTAGCTGCTGTTTGTAGTGCATGCCGATGTCGGGAACCAGAGGACCTCCTCATGAGCTGCCATCTTTGCCAGATTCGACATATTCA
TTCGTACTGTCTGGACCCTCCCTTGTTACCATGGACTTGTATTCACTGTAAGGATCTGCAGACACTCTACCATCGAAGCCATTAA
mRNA sequenceShow/hide mRNA sequence
CAACAATTTCCCGCCTTCTTCCTCATCGAAACCCTGGGCCCTGGCCACCAAAAATCTGTGTCGCCAGCTCTAGCCTCCCGATTCTCCATCTGCTGCGCAATGGTTCAGCC
GACAAATCCATCCAAATTGCGATGGGGGGAGCTAGACGGCGACGATGGGGATCTCGATTTGAAGAAAACTGTTGAATATCGGTTTAACGAAGATGGTAACAAGGTCAAAG
TCTCAACGACCTCTCGAACTCGGAAGCTCGCCCATGCTCGCCTGAGTAAACATCCCATGGAGCGCGCATCTTGGGCCAAGTTCGGGGATGCCGTACATGAAGACGTCGGC
AGTAGGCTCACTGTGGTCTCCACTGAGGAGATTCTTCCCGAGCGCCCTCATGCTCCTGGTTGTCCAATTGAAGGTATGGAATCTGTGGTTGCAACTGTCAGTGGCTACCA
TGGCACAGAAAGATTCAACCTTATCAAGATGATATCTTATGCTGGTGCCAGCTATGTGGGTGCCATGTCAAGGTCTATTACTCATTTGATTTGTTGGGAATTGGAAGGGA
GGAAATTTAACCTTGCTAAGAAGTTCAAAACAATAATAGTCAATCATCGTTGGCTTGAAGATTGCATCAAACAAGGAAAGCGTGTCCCAGAAGGTTCGTACATTCTTCAA
AGTGGCCAATCAGCAGGTCCCTTGTCAATGAAACTCCCTCTTGCTGTCGAGGGTTCTGTCTCAACCAAAAAGTATAACGTGCTTTCTAAAAAGTTACATAATTGTGGAAA
TGTTGAAGATCAAAGCATTAAAAGCTTATGCTCTTTTGGTGATCCAATCTTGTCCCATTCTTGTTTGCTGGATAAGGATCTGTACTCTGATTTTAGAAAGAGCGACGACA
CTGCTCGTAAACCAAAGTACAAATTGTGGAATAATATTTCTAAGCACGAAGACCCATCAAGCTCGAGTAGCAGAAACCATTTTGAGGAGTCTCCCTCTGGATTTTTTGCA
ATTGAGCGTGGTAGCTCTTCAATCTTGGCGAGAGATGAAGGAAAAGGGGAAAATAGTAACCTGAACCCTACTGTTAAATCTTCAAGGAGGCGGCGGCTTGCGAATAAAAA
TTCTAGCAAAGATCGTAACGAGGTTGACATTTGGAATTTTGACCCAGAGTGCTATCAGTTGGGAACTCGTAACCATGGTAATAATTTTACAGTTTCATCTAGCCATTTTG
ATGATGAAACCGATATTGAAGTGGTAAACATTGGAGGAACATCTGATCGTGATCAGTTGTGTGAAAGGGGACTAGCAAGTGACCGCTTTGAAGGTGTTGAAGCTTGTGAG
AATCAATCTACTTCCAGAGATCCAAACTTACCATTCGAAAATGCACCCAGAGACCTTACGATAACTTCAGAAGATGAGTTGCACAATATTAATGATTTACAAAAGAACTC
TGAGGATCCAGTCATAGAACTCAATGCAAGCTTACCATCCACTTCAACAGAGTTATCATGTGTTATCTGTTGGACAGATTTTAGTTCGACAAGGGGAGTTTTGCCCTGTG
GGCACCGATTTTGCTATTCATGCATTCAAAACTGGGCAGATCACATGGCTTCGAGCAGAAAGATCTCAACTTGCCCTTTGTGCAAAGCCAGTTTTCTGAGCATCACAAAG
GTTGAAGCTGCTGCCTCTTCGGATCAGAAGATATACTCCCAAACAATTCCATGTGGTCCATCACTATTGGATATTTACATTCTTCCTGATGAAAGAATTCTTAGCAGTGG
GGTTCAGCCCTCTGTAGCTGCTGTTTGTAGTGCATGCCGATGTCGGGAACCAGAGGACCTCCTCATGAGCTGCCATCTTTGCCAGATTCGACATATTCATTCGTACTGTC
TGGACCCTCCCTTGTTACCATGGACTTGTATTCACTGTAAGGATCTGCAGACACTCTACCATCGAAGCCATTAATTTGTTGTCTTGAAATCAGGTGATTGCAAAGAGCAA
ATGGGAAGACAATTGAAGTTAGCTCAAGATTTTGAAGCAAAGAAAAGCAACCTGCCTCTCACAAATCTTCACTGTTAAGGTTGTGCCCAACACTGATTATTGTTATTATT
TTGAAAAGTTTCAATTTGTTTGGCTGGTTATGATGTGGTTTTTGAATGCATGTTTTTACTTATAAAGTTGCATTTAACCGAAAGAAAATAGGGAAAAAGATCCATCCGTA
AATATTTTTGAATTTCTTATATGTGCATAAAGTATAATTATTAGTAGTACGTTGAAATTATTGAAAAACTTCTCAACTGTTCATGTGGTCTTGAGAGTTTCAACGTATCC
TGCTTCTGTCTGTCTGTCTTTATTTATTTTATTTTTTGTAATCTTTAACGATTGAGATTTTATAATCATAATATTCATGATCTCCACCACTGAGATTTTGTGACCAAGAT
A
Protein sequenceShow/hide protein sequence
MVQPTNPSKLRWGELDGDDGDLDLKKTVEYRFNEDGNKVKVSTTSRTRKLAHARLSKHPMERASWAKFGDAVHEDVGSRLTVVSTEEILPERPHAPGCPIEGMESVVATV
SGYHGTERFNLIKMISYAGASYVGAMSRSITHLICWELEGRKFNLAKKFKTIIVNHRWLEDCIKQGKRVPEGSYILQSGQSAGPLSMKLPLAVEGSVSTKKYNVLSKKLH
NCGNVEDQSIKSLCSFGDPILSHSCLLDKDLYSDFRKSDDTARKPKYKLWNNISKHEDPSSSSSRNHFEESPSGFFAIERGSSSILARDEGKGENSNLNPTVKSSRRRRL
ANKNSSKDRNEVDIWNFDPECYQLGTRNHGNNFTVSSSHFDDETDIEVVNIGGTSDRDQLCERGLASDRFEGVEACENQSTSRDPNLPFENAPRDLTITSEDELHNINDL
QKNSEDPVIELNASLPSTSTELSCVICWTDFSSTRGVLPCGHRFCYSCIQNWADHMASSRKISTCPLCKASFLSITKVEAAASSDQKIYSQTIPCGPSLLDIYILPDERI
LSSGVQPSVAAVCSACRCREPEDLLMSCHLCQIRHIHSYCLDPPLLPWTCIHCKDLQTLYHRSH