; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS013226 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS013226
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionE3 ubiquitin-protein ligase rnf8-A isoform X1
Genome locationscaffold459:1375914..1382079
RNA-Seq ExpressionMS013226
SyntenyMS013226
Gene Ontology termsGO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001357 - BRCT domain
IPR001841 - Zinc finger, RING-type
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017907 - Zinc finger, RING-type, conserved site
IPR036420 - BRCT domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011651366.1 uncharacterized protein LOC101213123 [Cucumis sativus]1.9e-22577.93Show/hide
Query:  GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLS
        GCPIEGMESVV TVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICW+L+GRKF LA+KF+TIIVNHRWLEDCI+ GKRVPEGPYILQSGQS GPLS
Subjt:  GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLS

Query:  IELPLAAKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFG-AIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEF
        ++LPLA K  VS  KYN+LSEK  N GNVE+QSIK I SFG +I PRS LLDKDL SDF KSDDT+HK KHK+RK+ISK EDPS+SSSRN F+EPT +  
Subjt:  IELPLAAKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFG-AIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEF

Query:  LEIERGSSSSLARDERKGENSNLPPTVKSSRRRRLL-NRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEG
          IE GS SSLARDERKGE+SN   TVKSSRRRRLL + N+ +DH KPD+ +FDPE   LGTR   N+ TV S   + E DIEVV IGG+SD  QLCDE 
Subjt:  LEIERGSSSSLARDERKGENSNLPPTVKSSRRRRLL-NRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEG

Query:  GIVSDSFEGVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLP-TSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWA
        G+ S  FEGVEA ENQ TSKD NL V+NAP VL I+SEDEL N++ LQK IEDP +E +ASLP TS ELSCVICWTDFSS RGVLPCGHRFCYSCIQNWA
Subjt:  GIVSDSFEGVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLP-TSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWA

Query:  DHMASSRKISTCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDP
        DHMA SRKISTCPLCKASFLSITKVE AATSDQKIYSQTIPCG SLLDI++L DERTLN+ VQ SV  VCSACRCREPEDLLMSCHLCQIR IHSYCLDP
Subjt:  DHMASSRKISTCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDP

Query:  PLLPWTCIHCKDLQTLYHRSH
        PLLPWTCIHCKDLQTLYHRSH
Subjt:  PLLPWTCIHCKDLQTLYHRSH

XP_022148261.1 uncharacterized protein LOC111016967 isoform X1 [Momordica charantia]3.4e-30499.23Show/hide
Query:  GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLS
        GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLS
Subjt:  GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLS

Query:  IELPLAAKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFGAIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEFL
        IELPLAAKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFGAIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEFL
Subjt:  IELPLAAKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFGAIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEFL

Query:  EIERGSSSSLARDERKGENSNLPPTVKSSRRRRLLNRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEGGI
        EIERGSSSSLARDERKGENSNL PTVKSSRRRRLLNRNTS+DHCKPDVWDFDPEC HLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHG LCDEGGI
Subjt:  EIERGSSSSLARDERKGENSNLPPTVKSSRRRRLLNRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEGGI

Query:  VSDSFEGVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLPTSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWADHM
        VSDSFEGVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLPTSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWADHM
Subjt:  VSDSFEGVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLPTSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWADHM

Query:  ASSRKISTCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDPPLL
        ASSRKISTCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDPPLL
Subjt:  ASSRKISTCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDPPLL

Query:  PWTCIHCKDLQTLYHRSH
        PWTCIHCKDLQTLYHRSH
Subjt:  PWTCIHCKDLQTLYHRSH

XP_022148264.1 uncharacterized protein LOC111016967 isoform X2 [Momordica charantia]6.7e-30099.22Show/hide
Query:  MESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLSIELPLA
        MESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLSIELPLA
Subjt:  MESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLSIELPLA

Query:  AKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFGAIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEFLEIERGS
        AKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFGAIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEFLEIERGS
Subjt:  AKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFGAIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEFLEIERGS

Query:  SSSLARDERKGENSNLPPTVKSSRRRRLLNRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEGGIVSDSFE
        SSSLARDERKGENSNL PTVKSSRRRRLLNRNTS+DHCKPDVWDFDPEC HLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHG LCDEGGIVSDSFE
Subjt:  SSSLARDERKGENSNLPPTVKSSRRRRLLNRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEGGIVSDSFE

Query:  GVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLPTSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWADHMASSRKI
        GVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLPTSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWADHMASSRKI
Subjt:  GVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLPTSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWADHMASSRKI

Query:  STCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDPPLLPWTCIH
        STCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDPPLLPWTCIH
Subjt:  STCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDPPLLPWTCIH

Query:  CKDLQTLYHRSH
        CKDLQTLYHRSH
Subjt:  CKDLQTLYHRSH

XP_022983591.1 E3 ubiquitin-protein ligase rnf8-A isoform X1 [Cucurbita maxima]2.4e-22576.2Show/hide
Query:  GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLS
        GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVG MSRSITHLICW+LEGRKF+LAKKFKTIIVNHRWLEDCI+QG RVPE PYILQSGQSAGPLS
Subjt:  GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLS

Query:  IELPLAAKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFG-AIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEF
        ++LP + K SVST KY V SEK  NCGNVE+Q IK + SFG +I P S LLDK+++ DF  SDDT+HK KHKLRK+ISK E+PS+SSS+N+F+EPTPS+F
Subjt:  IELPLAAKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFG-AIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEF

Query:  LEIERGSSSSLARDERKGENSNLPPTVKSSRR-RRLLNRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEG
          I  GSSSSLARDE KG+  N   TV+SSRR RRL+ +N+S+DH  PDVW+FDPE  HL TR   N+ TVLS H ++E D EVV +GG++D  QLCDE 
Subjt:  LEIERGSSSSLARDERKGENSNLPPTVKSSRR-RRLLNRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEG

Query:  GIVSDSFEGVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLP-TSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWA
        G+ SDSFEGVEA ENQ TS+  NL VENAP +L ++SEDEL     LQK IEDP +E N S+P T+ ELSCVICWTDFSS RGVLPCGHRFCYSCIQNWA
Subjt:  GIVSDSFEGVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLP-TSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWA

Query:  DHMASSRKISTCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDP
        DHMASSRKISTCPLCKASFLSITKVE+AATSDQKIYSQTIPCGPSLLDI+ILPDERTL+S VQ SV  VCS CRC+EPEDLLMSCHLCQIRHIHSYCLDP
Subjt:  DHMASSRKISTCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDP

Query:  PLLPWTCIHCKDLQTLYHRSH
        PLLPW CIHCKDLQTLYHR H
Subjt:  PLLPWTCIHCKDLQTLYHRSH

XP_038887153.1 uncharacterized protein LOC120077302 [Benincasa hispida]4.4e-22777.35Show/hide
Query:  GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLS
        GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICW+L+GRKF+LAKKF+TIIVNHRWLEDCI+ GKRVPEGPY+LQSGQ AGPLS
Subjt:  GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLS

Query:  IELPLAAKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSF-GAIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEF
        ++LPLA K SVST K N+L EK  N GNV++QSIK I SF  +I P S LLDKDL+SDF  SD T+HK+K  LR++ISKQED S+SSSRN+F+EPTPS F
Subjt:  IELPLAAKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSF-GAIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEF

Query:  LEIERGSSSSLARDERKGENSNLPPTVKSSRR-RRLLNRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEG
          IERGSSS LARDERKGE+SN   TV SSRR RRL+N+N+++DH K D+W+FD   NHLGTR   N+ T  S H ++E +IEVV IGG+SD  QL DE 
Subjt:  LEIERGSSSSLARDERKGENSNLPPTVKSSRR-RRLLNRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEG

Query:  GIVSDSFEGVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLP-TSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWA
        G+ SDSFEG+EA E+Q TS+D NL VENAP    I+SEDEL NI+ LQK IEDP +E NASLP TS ELSCVICWTDFSS RGVLPCGHRFCYSCIQNWA
Subjt:  GIVSDSFEGVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLP-TSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWA

Query:  DHMASSRKISTCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDP
        DHMA SRKISTCPLCKA+FLSITKVE+AATSDQKIYSQTIPCGPSLLDI+ILPDERTLN+ VQ SV  VCSACRCREPEDLLMSCHLCQIRHIHSYCLDP
Subjt:  DHMASSRKISTCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDP

Query:  PLLPWTCIHCKDLQTLYHRSH
        PLLPWTCIHCKDLQ LYHRSH
Subjt:  PLLPWTCIHCKDLQTLYHRSH

TrEMBL top hitse value%identityAlignment
A0A0A0L7F3 Uncharacterized protein9.0e-22677.93Show/hide
Query:  GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLS
        GCPIEGMESVV TVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICW+L+GRKF LA+KF+TIIVNHRWLEDCI+ GKRVPEGPYILQSGQS GPLS
Subjt:  GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLS

Query:  IELPLAAKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFG-AIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEF
        ++LPLA K  VS  KYN+LSEK  N GNVE+QSIK I SFG +I PRS LLDKDL SDF KSDDT+HK KHK+RK+ISK EDPS+SSSRN F+EPT +  
Subjt:  IELPLAAKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFG-AIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEF

Query:  LEIERGSSSSLARDERKGENSNLPPTVKSSRRRRLL-NRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEG
          IE GS SSLARDERKGE+SN   TVKSSRRRRLL + N+ +DH KPD+ +FDPE   LGTR   N+ TV S   + E DIEVV IGG+SD  QLCDE 
Subjt:  LEIERGSSSSLARDERKGENSNLPPTVKSSRRRRLL-NRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEG

Query:  GIVSDSFEGVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLP-TSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWA
        G+ S  FEGVEA ENQ TSKD NL V+NAP VL I+SEDEL N++ LQK IEDP +E +ASLP TS ELSCVICWTDFSS RGVLPCGHRFCYSCIQNWA
Subjt:  GIVSDSFEGVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLP-TSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWA

Query:  DHMASSRKISTCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDP
        DHMA SRKISTCPLCKASFLSITKVE AATSDQKIYSQTIPCG SLLDI++L DERTLN+ VQ SV  VCSACRCREPEDLLMSCHLCQIR IHSYCLDP
Subjt:  DHMASSRKISTCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDP

Query:  PLLPWTCIHCKDLQTLYHRSH
        PLLPWTCIHCKDLQTLYHRSH
Subjt:  PLLPWTCIHCKDLQTLYHRSH

A0A6J1D3H7 uncharacterized protein LOC111016967 isoform X23.3e-30099.22Show/hide
Query:  MESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLSIELPLA
        MESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLSIELPLA
Subjt:  MESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLSIELPLA

Query:  AKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFGAIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEFLEIERGS
        AKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFGAIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEFLEIERGS
Subjt:  AKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFGAIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEFLEIERGS

Query:  SSSLARDERKGENSNLPPTVKSSRRRRLLNRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEGGIVSDSFE
        SSSLARDERKGENSNL PTVKSSRRRRLLNRNTS+DHCKPDVWDFDPEC HLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHG LCDEGGIVSDSFE
Subjt:  SSSLARDERKGENSNLPPTVKSSRRRRLLNRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEGGIVSDSFE

Query:  GVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLPTSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWADHMASSRKI
        GVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLPTSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWADHMASSRKI
Subjt:  GVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLPTSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWADHMASSRKI

Query:  STCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDPPLLPWTCIH
        STCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDPPLLPWTCIH
Subjt:  STCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDPPLLPWTCIH

Query:  CKDLQTLYHRSH
        CKDLQTLYHRSH
Subjt:  CKDLQTLYHRSH

A0A6J1D4V1 uncharacterized protein LOC111016967 isoform X11.7e-30499.23Show/hide
Query:  GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLS
        GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLS
Subjt:  GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLS

Query:  IELPLAAKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFGAIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEFL
        IELPLAAKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFGAIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEFL
Subjt:  IELPLAAKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFGAIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEFL

Query:  EIERGSSSSLARDERKGENSNLPPTVKSSRRRRLLNRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEGGI
        EIERGSSSSLARDERKGENSNL PTVKSSRRRRLLNRNTS+DHCKPDVWDFDPEC HLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHG LCDEGGI
Subjt:  EIERGSSSSLARDERKGENSNLPPTVKSSRRRRLLNRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEGGI

Query:  VSDSFEGVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLPTSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWADHM
        VSDSFEGVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLPTSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWADHM
Subjt:  VSDSFEGVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLPTSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWADHM

Query:  ASSRKISTCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDPPLL
        ASSRKISTCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDPPLL
Subjt:  ASSRKISTCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDPPLL

Query:  PWTCIHCKDLQTLYHRSH
        PWTCIHCKDLQTLYHRSH
Subjt:  PWTCIHCKDLQTLYHRSH

A0A6J1FAW6 uncharacterized protein LOC1114424043.6e-22275.62Show/hide
Query:  GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLS
        GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVG MSRSITHLICW+LEGRKF+LAKKFKTIIVNHRWLEDCI+ GKRVPE PYILQSGQSAGPLS
Subjt:  GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLS

Query:  IELPLAAKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFG-AIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEF
        ++LP + + SVST KY V SEK  NCGN+E+Q IK + SFG +I P S LLDK+++ DF  SDDT+HK KHKLRK+ISK E+PS+SSS+N+F+EPTPS+F
Subjt:  IELPLAAKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFG-AIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEF

Query:  LEIERGSSSSLARDERKGENSNLPPTVKSSRR-RRLLNRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEG
          I  GSSSSLARDE KG+  N   TV+SSRR RRL+ +N+S+DH +PDVW+FDPE  HL  R   N+ TVLS H ++E DIE V IGG++D  QLCDE 
Subjt:  LEIERGSSSSLARDERKGENSNLPPTVKSSRR-RRLLNRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEG

Query:  GIVSDSFEGVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLP-TSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWA
        G+ SDSFEGVEA  NQ TS+  NL VENAP +L  +SEDEL N   LQK IED  +E N S+P TS ELSCVICWTDFSS RGVLPCGHRFCYSCIQNWA
Subjt:  GIVSDSFEGVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLP-TSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWA

Query:  DHMASSRKISTCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDP
        DHMAS RKISTCPLCKASFLSITKVE+AATSDQKIYSQTIPCG SLLDI+ILPDERTL+S VQ SV  VCS CRCREPEDLLMSCHLCQIRHIHSYCLDP
Subjt:  DHMASSRKISTCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDP

Query:  PLLPWTCIHCKDLQTLYHRSH
        PLLPW CIHCKDLQTLYHR H
Subjt:  PLLPWTCIHCKDLQTLYHRSH

A0A6J1J2R6 E3 ubiquitin-protein ligase rnf8-A isoform X11.2e-22576.2Show/hide
Query:  GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLS
        GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVG MSRSITHLICW+LEGRKF+LAKKFKTIIVNHRWLEDCI+QG RVPE PYILQSGQSAGPLS
Subjt:  GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLS

Query:  IELPLAAKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFG-AIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEF
        ++LP + K SVST KY V SEK  NCGNVE+Q IK + SFG +I P S LLDK+++ DF  SDDT+HK KHKLRK+ISK E+PS+SSS+N+F+EPTPS+F
Subjt:  IELPLAAKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFG-AIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEF

Query:  LEIERGSSSSLARDERKGENSNLPPTVKSSRR-RRLLNRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEG
          I  GSSSSLARDE KG+  N   TV+SSRR RRL+ +N+S+DH  PDVW+FDPE  HL TR   N+ TVLS H ++E D EVV +GG++D  QLCDE 
Subjt:  LEIERGSSSSLARDERKGENSNLPPTVKSSRR-RRLLNRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEG

Query:  GIVSDSFEGVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLP-TSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWA
        G+ SDSFEGVEA ENQ TS+  NL VENAP +L ++SEDEL     LQK IEDP +E N S+P T+ ELSCVICWTDFSS RGVLPCGHRFCYSCIQNWA
Subjt:  GIVSDSFEGVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLP-TSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWA

Query:  DHMASSRKISTCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDP
        DHMASSRKISTCPLCKASFLSITKVE+AATSDQKIYSQTIPCGPSLLDI+ILPDERTL+S VQ SV  VCS CRC+EPEDLLMSCHLCQIRHIHSYCLDP
Subjt:  DHMASSRKISTCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDP

Query:  PLLPWTCIHCKDLQTLYHRSH
        PLLPW CIHCKDLQTLYHR H
Subjt:  PLLPWTCIHCKDLQTLYHRSH

SwissProt top hitse value%identityAlignment
O04251 BRCT domain-containing protein At4g021103.6e-1441.38Show/hide
Query:  IEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAM-SRSITHLICWKLEGRKFSLAKKFKTI-IVNHRWLEDCIRQGKRVPEGPY
        I G +++V  ++GY G +R ++++M+   G  +   + +  +THLIC+K EG K+ LAK+ K I +VNHRWLEDC++  K +PE  Y
Subjt:  IEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAM-SRSITHLICWKLEGRKFSLAKKFKTI-IVNHRWLEDCIRQGKRVPEGPY

Q80Z37 E3 ubiquitin-protein ligase Topors9.9e-0427.96Show/hide
Query:  APGVLMISSEDELSNIDTLQKEIEDPAVEHNASLPTSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWADHMASSRKISTCPLCKASFLSI
        A   +M S+  E   +D    +     ++       S +  C IC   F ++  +  C H+FC+ C+Q W+ + A       CPLCK  F SI
Subjt:  APGVLMISSEDELSNIDTLQKEIEDPAVEHNASLPTSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWADHMASSRKISTCPLCKASFLSI

Q9NS56 E3 ubiquitin-protein ligase Topors3.1e-0528.68Show/hide
Query:  ANLPVENAP--GVLMISSEDELSNIDTLQKEIEDPAVEHNASLPTSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWADHMASSRKISTCPLCKASFL
        A+ P   AP    +M S+  E   +D    +     ++       S +  C IC   F ++  +  C H+FC+ C+Q W+ + A       CPLCK  F 
Subjt:  ANLPVENAP--GVLMISSEDELSNIDTLQKEIEDPAVEHNASLPTSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWADHMASSRKISTCPLCKASFL

Query:  SITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDER
        SI      A  D K Y       PS    F+ PD R
Subjt:  SITKVEEAATSDQKIYSQTIPCGPSLLDIFILPDER

Arabidopsis top hitse value%identityAlignment
AT1G67180.1 zinc finger (C3HC4-type RING finger) family protein / BRCT domain-containing protein2.5e-9540.08Show/hide
Query:  MESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLSIELPLA
        ME+VVATVSGYHG++RF LIK+IS++GASYVGAMSRSITHL+CWK EG+K+ LAKKF T++VNHRW+E+C+++G+RV E PY+  SG+  GPL IELP  
Subjt:  MESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLSIELPLA

Query:  AKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFGAIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEFLEIERGS
        ++++  T K N  SE                             DK     F    +    S  +L   + K  + +  S R   + P+      +E   
Subjt:  AKDSVSTTKYNVLSEKSQNCGNVEEQSIKSIYSFGAIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEFLEIERGS

Query:  SSSLARDERKGENSNLPPTVKSSRRRRLLNRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGG----SSDHGQLCDEGGIVS
        +S +A   RKG+       VK    R L+                D E +     NH +N         + R+     + G      +   L   G + +
Subjt:  SSSLARDERKGENSNLPPTVKSSRRRRLLNRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGG----SSDHGQLCDEGGIVS

Query:  DSFEGVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLPTSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWADHMAS
         +++  E  E++  S  A           +       S     Q + E    E  A+    A++SC+ICWT+FSS RG+LPCGHRFCYSCIQ WAD + S
Subjt:  DSFEGVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIEDPAVEHNASLPTSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWADHMAS

Query:  SRKISTCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIF-ILPDE----RTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDP
         RK +TCPLCK++F++ITK+E+A +SDQKIYSQT+P   S  +I  +LP+E    +TLN   +AS    CS C   EPE+LL+ CHLC  R IHSYCLDP
Subjt:  SRKISTCPLCKASFLSITKVEEAATSDQKIYSQTIPCGPSLLDIF-ILPDE----RTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDP

Query:  PLLPWTCIHCKDLQTLYHR
         LLPWTC HC DLQ +YHR
Subjt:  PLLPWTCIHCKDLQTLYHR

AT1G77320.1 transcription coactivators2.0e-0729.91Show/hide
Query:  PIEGMESVVATVSGYHGTERFNLIKMISYT-GASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPY-----ILQSGQSA
        P+ G ES +   S  H  +   L++ +S   GA +V  ++R +THLIC   +G K+  A K+  I V   WL +C+RQ + V    +       Q  ++ 
Subjt:  PIEGMESVVATVSGYHGTERFNLIKMISYT-GASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPY-----ILQSGQSA

Query:  GPLSIE-LPLAAKDSVS
             + +P+A++DS+S
Subjt:  GPLSIE-LPLAAKDSVS

AT1G77320.2 transcription coactivators2.0e-0729.91Show/hide
Query:  PIEGMESVVATVSGYHGTERFNLIKMISYT-GASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPY-----ILQSGQSA
        P+ G ES +   S  H  +   L++ +S   GA +V  ++R +THLIC   +G K+  A K+  I V   WL +C+RQ + V    +       Q  ++ 
Subjt:  PIEGMESVVATVSGYHGTERFNLIKMISYT-GASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPY-----ILQSGQSA

Query:  GPLSIE-LPLAAKDSVS
             + +P+A++DS+S
Subjt:  GPLSIE-LPLAAKDSVS

AT2G26350.1 peroxin 102.7e-0426.99Show/hide
Query:  HLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEG------GIVSDSFEGVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIE
        H+  R  G  +  +   LN     +++G+        L  EG        ++ S +       Q TS    LPV N  G L I+SE E  N  T      
Subjt:  HLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEG------GIVSDSFEGVEARENQFTSKDANLPVENAPGVLMISSEDELSNIDTLQKEIE

Query:  DPAVEHNASLPTSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWADHMASSRKISTCPLCK
              + S  T A   C +C +         PCGH FC+SCI  W +          CPLC+
Subjt:  DPAVEHNASLPTSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWADHMASSRKISTCPLCK

AT4G02110.1 transcription coactivators2.6e-1541.38Show/hide
Query:  IEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAM-SRSITHLICWKLEGRKFSLAKKFKTI-IVNHRWLEDCIRQGKRVPEGPY
        I G +++V  ++GY G +R ++++M+   G  +   + +  +THLIC+K EG K+ LAK+ K I +VNHRWLEDC++  K +PE  Y
Subjt:  IEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAM-SRSITHLICWKLEGRKFSLAKKFKTI-IVNHRWLEDCIRQGKRVPEGPY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GGTTGTCCAATTGAAGGTATGGAATCTGTTGTTGCGACTGTCAGTGGCTACCATGGCACAGAAAGATTCAACCTTATCAAGATGATATCTTATACTGGTGCCAGCTATGT
GGGTGCAATGTCAAGGTCTATTACTCATTTGATTTGTTGGAAATTGGAAGGGAGGAAATTTAGTCTTGCTAAGAAGTTCAAAACAATAATAGTCAACCATCGTTGGCTTG
AAGATTGCATCAGGCAAGGAAAGCGTGTCCCAGAAGGTCCTTACATTCTTCAAAGTGGCCAATCAGCAGGTCCCTTGTCAATTGAACTCCCTCTTGCTGCCAAGGATTCT
GTCTCAACCACAAAGTATAACGTGCTTTCTGAAAAGTCACAGAATTGTGGAAATGTTGAAGAGCAGAGCATTAAAAGCATATACTCTTTTGGTGCAATCTGGCCACGTTC
TTGTTTGCTGGATAAGGATCTGTTCTCTGATTTTGGAAAAAGTGACGACACCTCTCATAAATCAAAGCACAAATTGCGGAAGAAGATTTCTAAGCAAGAAGACCCATCAA
ACTCAAGTAGCAGAAATAATTTTCAGGAACCAACTCCCTCCGAGTTTCTCGAAATTGAGCGTGGTAGCTCTTCAAGCTTGGCCAGAGATGAAAGAAAAGGTGAAAATAGT
AATCTGCCCCCTACTGTTAAATCTTCACGGAGGCGGCGGCTTCTGAACAGAAACACTAGCAAAGATCATTGTAAGCCTGACGTTTGGGATTTTGACCCAGAGTGTAATCA
TTTGGGAACTCGTAACCATGGCAATAATTTTACAGTTTTGTCTTGCCATTTGAATAATGAAAGAGATATTGAAGTGGTAGGCATTGGAGGATCATCTGATCATGGTCAGT
TGTGTGACGAAGGGGGAATTGTAAGTGACAGTTTTGAAGGTGTTGAAGCTCGCGAGAATCAATTTACTTCCAAAGATGCGAACTTACCAGTTGAGAATGCACCAGGAGTC
CTCATGATATCTTCAGAAGATGAATTGTCTAATATTGATACTTTACAAAAGGAAATTGAGGACCCAGCTGTAGAACACAATGCGAGCCTACCCACTTCAGCAGAGTTATC
ATGTGTTATCTGTTGGACAGATTTTAGTTCGATGAGGGGAGTTTTGCCTTGTGGGCACCGGTTTTGCTATTCATGCATTCAGAATTGGGCAGATCACATGGCTTCGAGCA
GAAAGATCTCAACTTGCCCTTTGTGCAAAGCCAGTTTTCTGAGCATCACAAAGGTTGAGGAAGCTGCCACTTCAGATCAGAAGATATATTCTCAAACAATTCCATGTGGC
CCGTCACTATTGGATATTTTCATTCTTCCCGATGAAAGAACTCTTAACAGCGGTGTTCAGGCCTCTGTAGGAGGTGTTTGTAGTGCATGCCGATGCCGGGAACCAGAAGA
TCTCCTCATGAGTTGCCATCTTTGCCAGATTCGACATATTCATTCGTACTGTCTGGACCCTCCCTTGTTACCATGGACTTGCATTCACTGCAAGGATCTGCAGACACTCT
ACCATCGCAGCCAT
mRNA sequenceShow/hide mRNA sequence
GGTTGTCCAATTGAAGGTATGGAATCTGTTGTTGCGACTGTCAGTGGCTACCATGGCACAGAAAGATTCAACCTTATCAAGATGATATCTTATACTGGTGCCAGCTATGT
GGGTGCAATGTCAAGGTCTATTACTCATTTGATTTGTTGGAAATTGGAAGGGAGGAAATTTAGTCTTGCTAAGAAGTTCAAAACAATAATAGTCAACCATCGTTGGCTTG
AAGATTGCATCAGGCAAGGAAAGCGTGTCCCAGAAGGTCCTTACATTCTTCAAAGTGGCCAATCAGCAGGTCCCTTGTCAATTGAACTCCCTCTTGCTGCCAAGGATTCT
GTCTCAACCACAAAGTATAACGTGCTTTCTGAAAAGTCACAGAATTGTGGAAATGTTGAAGAGCAGAGCATTAAAAGCATATACTCTTTTGGTGCAATCTGGCCACGTTC
TTGTTTGCTGGATAAGGATCTGTTCTCTGATTTTGGAAAAAGTGACGACACCTCTCATAAATCAAAGCACAAATTGCGGAAGAAGATTTCTAAGCAAGAAGACCCATCAA
ACTCAAGTAGCAGAAATAATTTTCAGGAACCAACTCCCTCCGAGTTTCTCGAAATTGAGCGTGGTAGCTCTTCAAGCTTGGCCAGAGATGAAAGAAAAGGTGAAAATAGT
AATCTGCCCCCTACTGTTAAATCTTCACGGAGGCGGCGGCTTCTGAACAGAAACACTAGCAAAGATCATTGTAAGCCTGACGTTTGGGATTTTGACCCAGAGTGTAATCA
TTTGGGAACTCGTAACCATGGCAATAATTTTACAGTTTTGTCTTGCCATTTGAATAATGAAAGAGATATTGAAGTGGTAGGCATTGGAGGATCATCTGATCATGGTCAGT
TGTGTGACGAAGGGGGAATTGTAAGTGACAGTTTTGAAGGTGTTGAAGCTCGCGAGAATCAATTTACTTCCAAAGATGCGAACTTACCAGTTGAGAATGCACCAGGAGTC
CTCATGATATCTTCAGAAGATGAATTGTCTAATATTGATACTTTACAAAAGGAAATTGAGGACCCAGCTGTAGAACACAATGCGAGCCTACCCACTTCAGCAGAGTTATC
ATGTGTTATCTGTTGGACAGATTTTAGTTCGATGAGGGGAGTTTTGCCTTGTGGGCACCGGTTTTGCTATTCATGCATTCAGAATTGGGCAGATCACATGGCTTCGAGCA
GAAAGATCTCAACTTGCCCTTTGTGCAAAGCCAGTTTTCTGAGCATCACAAAGGTTGAGGAAGCTGCCACTTCAGATCAGAAGATATATTCTCAAACAATTCCATGTGGC
CCGTCACTATTGGATATTTTCATTCTTCCCGATGAAAGAACTCTTAACAGCGGTGTTCAGGCCTCTGTAGGAGGTGTTTGTAGTGCATGCCGATGCCGGGAACCAGAAGA
TCTCCTCATGAGTTGCCATCTTTGCCAGATTCGACATATTCATTCGTACTGTCTGGACCCTCCCTTGTTACCATGGACTTGCATTCACTGCAAGGATCTGCAGACACTCT
ACCATCGCAGCCAT
Protein sequenceShow/hide protein sequence
GCPIEGMESVVATVSGYHGTERFNLIKMISYTGASYVGAMSRSITHLICWKLEGRKFSLAKKFKTIIVNHRWLEDCIRQGKRVPEGPYILQSGQSAGPLSIELPLAAKDS
VSTTKYNVLSEKSQNCGNVEEQSIKSIYSFGAIWPRSCLLDKDLFSDFGKSDDTSHKSKHKLRKKISKQEDPSNSSSRNNFQEPTPSEFLEIERGSSSSLARDERKGENS
NLPPTVKSSRRRRLLNRNTSKDHCKPDVWDFDPECNHLGTRNHGNNFTVLSCHLNNERDIEVVGIGGSSDHGQLCDEGGIVSDSFEGVEARENQFTSKDANLPVENAPGV
LMISSEDELSNIDTLQKEIEDPAVEHNASLPTSAELSCVICWTDFSSMRGVLPCGHRFCYSCIQNWADHMASSRKISTCPLCKASFLSITKVEEAATSDQKIYSQTIPCG
PSLLDIFILPDERTLNSGVQASVGGVCSACRCREPEDLLMSCHLCQIRHIHSYCLDPPLLPWTCIHCKDLQTLYHRSH