; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g06170 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g06170
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionProtein SQS1
Genome locationchr6:4456419..4461968
RNA-Seq ExpressionMoc06g06170
SyntenyMoc06g06170
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000467 - G-patch domain
IPR001374 - R3H domain
IPR034082 - Protein SQS1, R3H domain
IPR036867 - R3H domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587849.1 Zinc finger CCCH-type with G patch domain-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0078.93Show/hide
Query:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT HAK SDGFRKNK ++GRRRSDPSSS+RGNLFVDGGFLSD QFQ+SPPSSAREGNSRSKG+SGSKS +LDR K AS+SG+K+SNGNAIGYEY
Subjt:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ
        PP  HQE GLHSESQGL NDADCSLDNSQPFILL  +SK+TQIVAYVD+KP LK DD+EFTYDYGT+F+LGDSSHRGLGFH ++DEL R+ ++DD SP  
Subjt:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ

Query:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE
        VEEQEGLC GSL S+KE G+DERVECR E ++ +E+LAE S+PNKY+  VCS +NSGFLSIGG+RLYTQDVS EESDDDGE S+GSS YSE LES ESSE
Subjt:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE

Query:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD
        SDSS EM+C+ SDIDDEVAEDYLEGIGGSE+IL SKWLVKQEL ESDDD SSS  DDTLEKL GIALQEASKE+GMKKT PSRKKS+IVS D+WSSLALD
Subjt:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD

Query:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLA
        DLL+KDSR+ SA+KKKNAA    SWPPKAPKSK   KYPGEKKKYRKETIAAKRRERM++RGVDL +INLKLEHMVLNREDMFSFQPMHPRDCSQVRRLA
Subjt:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLA

Query:  AIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSS
        AIYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSDQVRL QLIGARDED+DFSVA+G N+KS G NRSREKKNA+ S   ILEL+QSGS KSR++GSAGK S
Subjt:  AIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSS

Query:  SQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQ
        SQKKTG KKYAD+PVSFVSCGVMQP+ VEITTS+V D DK KD I  A E ++++TS VKN DI+ D+IGAFE HT GFGSKMMAKMGFV GGGLGKDGQ
Subjt:  SQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQ

Query:  GMARPIEVIKRPKSLGLGVEFSEAFSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLG
        GMA PIEVIKRPKSLGLGVEFSEA +S G NQ+S GS    TGA GK+ K+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+  ++RPK+ GLG
Subjt:  GMARPIEVIKRPKSLGLGVEFSEAFSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLG

Query:  AK
         +
Subjt:  AK

XP_022134831.1 uncharacterized protein LOC111007007 [Momordica charantia]0.0e+00100Show/hide
Query:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
Subjt:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQVEE
        PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQVEE
Subjt:  PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQVEE

Query:  QEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDS
        QEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDS
Subjt:  QEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDS

Query:  SEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALDDLL
        SEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALDDLL
Subjt:  SEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALDDLL

Query:  VKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAAIY
        VKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAAIY
Subjt:  VKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAAIY

Query:  RLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQK
        RLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQK
Subjt:  RLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQK

Query:  KTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFVGGGLGKDGQGMAR
        KTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFVGGGLGKDGQGMAR
Subjt:  KTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFVGGGLGKDGQGMAR

Query:  PIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKA
        PIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKA
Subjt:  PIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKA

XP_022929809.1 uncharacterized protein LOC111436305 [Cucurbita moschata]0.0e+0078.8Show/hide
Query:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT HAK SDGFRKNK ++GRRRSDPSSS+RGNLFVDGGFLSD QFQ+SPPSSAREGNSRSKG+SGSKS +LDR K AS+SG+K+SNGNAIGYEY
Subjt:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ
        PP  HQE GLHSESQGL NDADCSLDNSQPFILL  +SK+TQIVAYVD+KP LK DD+EFTYDYGT+F+LGDSSHRGLGFH ++DEL R+ ++DD SP  
Subjt:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ

Query:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE
        VEEQEGLC GSL S+KE G+DERVECR E ++ +E+LAE S+PNKY+  VCS +NSGFLSIGG+RLYTQDVS EESDDDGE S+GSS YSE LES ESSE
Subjt:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE

Query:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD
        SDSS EM+C+ SDIDDEVAEDYLEGIGGSE+IL SKWLVKQEL ESDDD SSS  DDTLEKL GIALQEASKE+GMKKT PSRKKS+IVS D+WSSLALD
Subjt:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD

Query:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLA
        DLL+KDSR+ SA+KKKNAA    SWPPKAPKSK   KYPGEKKKYRKETIAAKRRERM++RGVDL +INLKLEHMVLNREDMFSFQPMHPRDCSQVRRLA
Subjt:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLA

Query:  AIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSS
        AIYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSDQVRL QLIGARD D+DFSVA+G N+KS G NRSREKKNA+ S   ILEL+QSGS KSR++GSAGK S
Subjt:  AIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSS

Query:  SQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQ
        SQKKTG KKYAD+PVSFVSCGVMQP+ VEITTS+V D DK KD I  A E ++++TS VKN DI+ D+IGAFE HT GFGSKMMAKMGFV GGGLGKDGQ
Subjt:  SQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQ

Query:  GMARPIEVIKRPKSLGLGVEFSEAFSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLG
        GMA PIEVIKRPKSLGLGVEFSEA +S G NQ+S GS    TGA GK+ K+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+  ++RPK+ GLG
Subjt:  GMARPIEVIKRPKSLGLGVEFSEAFSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLG

Query:  AK
         +
Subjt:  AK

XP_023003174.1 uncharacterized protein LOC111496865 isoform X1 [Cucurbita maxima]0.0e+0078.68Show/hide
Query:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT HAK SDGFRKNK ++GRRRSDPSSS+RGNLFVDGGFLSD QFQ++PPSSAREGNSRSKGQSGSKS +LDR KTAS+SG+K+SNGNAIGYEY
Subjt:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ
        PP  HQE GLHSES+GL NDADCSLDNSQPFILL  +SK+TQIVAYVD+KP LK D LE TYDYGT F+LGDSSHRGLGFH D+DEL R+ +TDD SP  
Subjt:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ

Query:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE
        VEEQEGLC GSL S+KE G+DERVECR E ++ +E+LAE S+PNKY+   CS +NSGFLSIGG+RLYTQDVS EESDDDGE S+GSS YSE LES ESSE
Subjt:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE

Query:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD
        SDSS EM+C+ SDIDDEVAEDYLEGIGG E+IL SKWLVKQEL ESDDD SSS  DDTLEKL  IALQEASKE+GMKKT PSR KS IVS D+WSSLALD
Subjt:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD

Query:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLA
        DLL+KDSR+ SA+KKKNAA    SWPPKAPKSK   KYPGEKKKYRKETIAAKRRERM++RGVDL QINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLA
Subjt:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLA

Query:  AIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSS
        AIYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSDQVRL QLIGARD+D+DFSVA+G N+KS G NRSREKKNA+ S   ILEL+QSGS KSR+KGSAGK S
Subjt:  AIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSS

Query:  SQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQ
        SQKKTG KKYAD+PVSFVSCGVMQP+ VEITTS+V D DK KD I  A E ++++TS VKN DI+ D+IGAFE HT GFGSKMMAKMGFV GGGLGKDGQ
Subjt:  SQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQ

Query:  GMARPIEVIKRPKSLGLGVEFSEAFSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLG
        GMA PIEVIKRPKSLGLG+EFSEA +S G NQES GS    TG  GK+KK+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+  ++RPK+ GLG
Subjt:  GMARPIEVIKRPKSLGLGVEFSEAFSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLG

Query:  AK
         +
Subjt:  AK

XP_023003182.1 uncharacterized protein LOC111496865 isoform X2 [Cucurbita maxima]0.0e+0080.22Show/hide
Query:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT HAK SDGFRKNK ++GRRRSDPSSS+RGNLFVDGGFLSD QFQ++PPSSAREGNSRSKGQSGSKS +LDR KTAS+SG+K+SNGNAIGYEY
Subjt:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ
        PP  HQE GLHSES+GL NDADCSLDNSQPFILL  +SK+TQIVAYVD+KP LK D LE TYDYGT F+LGDSSHRGLGFH D+DEL R+ +TDD SP  
Subjt:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ

Query:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE
        VEEQEGLC GSL S+KE G+DERVECR E ++ +E+LAE S+PNKY+   CS +NSGFLSIGG+RLYTQDVS EESDDDGE S+GSS YSE LES ESSE
Subjt:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE

Query:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD
        SDSS EM+C+ SDIDDEVAEDYLEGIGG E+IL SKWLVKQEL ESDDD SSS  DDTLEKL  IALQEASKE+GMKKT PSR KS IVS D+WSSLALD
Subjt:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD

Query:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLA
        DLL+KDSR+ SA+KKKNAA    SWPPKAPKSK   KYPGEKKKYRKETIAAKRRERM++RGVDL QINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLA
Subjt:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLA

Query:  AIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSS
        AIYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSDQVRL QLIGARD+D+DFSVA+G N+KS G NRSREKKNA+ S   ILEL+QSGS KSR+KGSAGK S
Subjt:  AIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSS

Query:  SQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQ
        SQKKTG KKYAD+PVSFVSCGVMQP+ VEITTS+V D DK KD I  A E ++++TS VKN DI+ D+IGAFE HT GFGSKMMAKMGFV GGGLGKDGQ
Subjt:  SQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQ

Query:  GMARPIEVIKRPKSLGLGVEFS--EAFSSGGNQES-RGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARG
        GMA PIEVIKRPKSLGLGVEFS   + S+G NQES R S R  TGA GK+KK+GAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARG
Subjt:  GMARPIEVIKRPKSLGLGVEFS--EAFSSGGNQES-RGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARG

Query:  LGAK
        LGAK
Subjt:  LGAK

TrEMBL top hitse value%identityAlignment
A0A1S3B9V9 Protein SQS13.0e-31074.84Show/hide
Query:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT HAK+SD FRKNK N+ RRRSDP S    NLFVDGGFLSD QFQ SPP SAREGNSR+KG+SGSKS  LDR K AS+SG+K+SNG AIGYEY
Subjt:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQVEE
        P   +QE L SES+GL NDA+CSLD+SQPFILL+SK+ QIVAYVDE P L  D+LEFTYDYGTSFVLGDSSHRGLGF  DDDEL    +TDD S  QVEE
Subjt:  PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQVEE

Query:  QEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDS
        Q GLC GSLS EKE GTDE+V+ R+  +  NE++AE S  NK +DD+ S KNSGFLSIGG+RLYTQDVS EESDDDGE+SDG+S YSE LES ESSE DS
Subjt:  QEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDS

Query:  SEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALDDLL
        S EMSC+ SDIDDEVAEDYLEG+GGS+NIL SKWLVKQEL+ES DDSSSS  DDTL+KLGGIALQEASKE+GM KT  SR K S+VS D WS+LALDD+L
Subjt:  SEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALDDLL

Query:  VKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAAIY
        VKD R+TS +K+KNA+Q   SWPPKA  SK  RKYPGEKKKYRKE IAAKRRERM++RGVDL QI+L+LEHMVLN+EDM++FQPMHPRDCSQVRRLAAIY
Subjt:  VKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAAIY

Query:  RLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQK
        RLHSGCQGSGKKRFVTVTRTQ+TGMPS+SDQVRL QLIGARDED+DFSVA+GSN+KS+G +RSREKKN +V   H LE  QS S KSRSKGSAGK SSQK
Subjt:  RLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQK

Query:  KTGNKKYADKPVSFVSCGVMQPETVEI-TTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQGM
         TG KKYAD+PVSFVSCGVMQPE +E+    D  D DK KD++ T+ E I++ TS V NTDI+ D+IG+FE+HTKGFGSKMMAKMGFV GGGLGKDGQGM
Subjt:  KTGNKKYADKPVSFVSCGVMQPETVEI-TTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQGM

Query:  ARPIEVIKRPKSLGLGVEFSEAFSS--GGNQES-RGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLG
          PIEV+KRPKSLGLG+EFSEA +S  G NQ S   S R ++GA  K+K+IGAFE HTKGFGSKMMAKMGFVEGMGLGKDSQG+VNPLLPVRRPKARGLG
Subjt:  ARPIEVIKRPKSLGLGVEFSEAFSS--GGNQES-RGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLG

Query:  AKA
        AK+
Subjt:  AKA

A0A6J1BYW1 Protein SQS10.0e+00100Show/hide
Query:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
Subjt:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQVEE
        PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQVEE
Subjt:  PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQVEE

Query:  QEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDS
        QEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDS
Subjt:  QEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDS

Query:  SEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALDDLL
        SEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALDDLL
Subjt:  SEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALDDLL

Query:  VKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAAIY
        VKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAAIY
Subjt:  VKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAAIY

Query:  RLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQK
        RLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQK
Subjt:  RLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQK

Query:  KTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFVGGGLGKDGQGMAR
        KTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFVGGGLGKDGQGMAR
Subjt:  KTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFVGGGLGKDGQGMAR

Query:  PIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKA
        PIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKA
Subjt:  PIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKA

A0A6J1EPV3 Protein SQS10.0e+0078.8Show/hide
Query:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT HAK SDGFRKNK ++GRRRSDPSSS+RGNLFVDGGFLSD QFQ+SPPSSAREGNSRSKG+SGSKS +LDR K AS+SG+K+SNGNAIGYEY
Subjt:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ
        PP  HQE GLHSESQGL NDADCSLDNSQPFILL  +SK+TQIVAYVD+KP LK DD+EFTYDYGT+F+LGDSSHRGLGFH ++DEL R+ ++DD SP  
Subjt:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ

Query:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE
        VEEQEGLC GSL S+KE G+DERVECR E ++ +E+LAE S+PNKY+  VCS +NSGFLSIGG+RLYTQDVS EESDDDGE S+GSS YSE LES ESSE
Subjt:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE

Query:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD
        SDSS EM+C+ SDIDDEVAEDYLEGIGGSE+IL SKWLVKQEL ESDDD SSS  DDTLEKL GIALQEASKE+GMKKT PSRKKS+IVS D+WSSLALD
Subjt:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD

Query:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLA
        DLL+KDSR+ SA+KKKNAA    SWPPKAPKSK   KYPGEKKKYRKETIAAKRRERM++RGVDL +INLKLEHMVLNREDMFSFQPMHPRDCSQVRRLA
Subjt:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLA

Query:  AIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSS
        AIYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSDQVRL QLIGARD D+DFSVA+G N+KS G NRSREKKNA+ S   ILEL+QSGS KSR++GSAGK S
Subjt:  AIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSS

Query:  SQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQ
        SQKKTG KKYAD+PVSFVSCGVMQP+ VEITTS+V D DK KD I  A E ++++TS VKN DI+ D+IGAFE HT GFGSKMMAKMGFV GGGLGKDGQ
Subjt:  SQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQ

Query:  GMARPIEVIKRPKSLGLGVEFSEAFSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLG
        GMA PIEVIKRPKSLGLGVEFSEA +S G NQ+S GS    TGA GK+ K+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+  ++RPK+ GLG
Subjt:  GMARPIEVIKRPKSLGLGVEFSEAFSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLG

Query:  AK
         +
Subjt:  AK

A0A6J1KLR0 Protein SQS10.0e+0080.22Show/hide
Query:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT HAK SDGFRKNK ++GRRRSDPSSS+RGNLFVDGGFLSD QFQ++PPSSAREGNSRSKGQSGSKS +LDR KTAS+SG+K+SNGNAIGYEY
Subjt:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ
        PP  HQE GLHSES+GL NDADCSLDNSQPFILL  +SK+TQIVAYVD+KP LK D LE TYDYGT F+LGDSSHRGLGFH D+DEL R+ +TDD SP  
Subjt:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ

Query:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE
        VEEQEGLC GSL S+KE G+DERVECR E ++ +E+LAE S+PNKY+   CS +NSGFLSIGG+RLYTQDVS EESDDDGE S+GSS YSE LES ESSE
Subjt:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE

Query:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD
        SDSS EM+C+ SDIDDEVAEDYLEGIGG E+IL SKWLVKQEL ESDDD SSS  DDTLEKL  IALQEASKE+GMKKT PSR KS IVS D+WSSLALD
Subjt:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD

Query:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLA
        DLL+KDSR+ SA+KKKNAA    SWPPKAPKSK   KYPGEKKKYRKETIAAKRRERM++RGVDL QINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLA
Subjt:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLA

Query:  AIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSS
        AIYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSDQVRL QLIGARD+D+DFSVA+G N+KS G NRSREKKNA+ S   ILEL+QSGS KSR+KGSAGK S
Subjt:  AIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSS

Query:  SQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQ
        SQKKTG KKYAD+PVSFVSCGVMQP+ VEITTS+V D DK KD I  A E ++++TS VKN DI+ D+IGAFE HT GFGSKMMAKMGFV GGGLGKDGQ
Subjt:  SQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQ

Query:  GMARPIEVIKRPKSLGLGVEFS--EAFSSGGNQES-RGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARG
        GMA PIEVIKRPKSLGLGVEFS   + S+G NQES R S R  TGA GK+KK+GAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARG
Subjt:  GMARPIEVIKRPKSLGLGVEFS--EAFSSGGNQES-RGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARG

Query:  LGAK
        LGAK
Subjt:  LGAK

A0A6J1KSJ4 Protein SQS10.0e+0078.68Show/hide
Query:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT HAK SDGFRKNK ++GRRRSDPSSS+RGNLFVDGGFLSD QFQ++PPSSAREGNSRSKGQSGSKS +LDR KTAS+SG+K+SNGNAIGYEY
Subjt:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ
        PP  HQE GLHSES+GL NDADCSLDNSQPFILL  +SK+TQIVAYVD+KP LK D LE TYDYGT F+LGDSSHRGLGFH D+DEL R+ +TDD SP  
Subjt:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ

Query:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE
        VEEQEGLC GSL S+KE G+DERVECR E ++ +E+LAE S+PNKY+   CS +NSGFLSIGG+RLYTQDVS EESDDDGE S+GSS YSE LES ESSE
Subjt:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE

Query:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD
        SDSS EM+C+ SDIDDEVAEDYLEGIGG E+IL SKWLVKQEL ESDDD SSS  DDTLEKL  IALQEASKE+GMKKT PSR KS IVS D+WSSLALD
Subjt:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD

Query:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLA
        DLL+KDSR+ SA+KKKNAA    SWPPKAPKSK   KYPGEKKKYRKETIAAKRRERM++RGVDL QINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLA
Subjt:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLA

Query:  AIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSS
        AIYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSDQVRL QLIGARD+D+DFSVA+G N+KS G NRSREKKNA+ S   ILEL+QSGS KSR+KGSAGK S
Subjt:  AIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSS

Query:  SQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQ
        SQKKTG KKYAD+PVSFVSCGVMQP+ VEITTS+V D DK KD I  A E ++++TS VKN DI+ D+IGAFE HT GFGSKMMAKMGFV GGGLGKDGQ
Subjt:  SQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQ

Query:  GMARPIEVIKRPKSLGLGVEFSEAFSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLG
        GMA PIEVIKRPKSLGLG+EFSEA +S G NQES GS    TG  GK+KK+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+  ++RPK+ GLG
Subjt:  GMARPIEVIKRPKSLGLGVEFSEAFSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLG

Query:  AK
         +
Subjt:  AK

SwissProt top hitse value%identityAlignment
A1XD95 Tuftelin-interacting protein 118.7e-0732.26Show/hide
Query:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL
        A + F+  GL K         D     RP++    PK  G          +GGN   + S +   G +      G++E HTKG G K++ KMG+V G GL
Subjt:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL

Query:  GKDSQGMVNPLLPVRRPKARGLGA
        GK++QG++NP+   +R     +GA
Subjt:  GKDSQGMVNPLLPVRRPKARGLGA

A1XD97 Tuftelin-interacting protein 118.7e-0731.45Show/hide
Query:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL
        A + F+  GL K         D +   +P++    PK  G      +   +GGN   + S +   G +      G++E HTKG G K++ KMG+V G GL
Subjt:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL

Query:  GKDSQGMVNPLLPVRRPKARGLGA
        GK++QG++NP+   +R     +GA
Subjt:  GKDSQGMVNPLLPVRRPKARGLGA

Q29RR5 Tuftelin-interacting protein 118.7e-0731.45Show/hide
Query:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL
        A + F+  GL K         D     +P++  + PK  G      +   +GGN   + S +   G +      G++E HTKG G K++ KMG+V G GL
Subjt:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL

Query:  GKDSQGMVNPLLPVRRPKARGLGA
        GK++QG++NP+   +R     +GA
Subjt:  GKDSQGMVNPLLPVRRPKARGLGA

Q5U2Y6 Tuftelin-interacting protein 113.9e-0732.26Show/hide
Query:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL
        A + F+  GL K         D     +P++    PK LG      +   +GGN   + S +   G +      G++E HTKG G K++ KMG+V G GL
Subjt:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL

Query:  GKDSQGMVNPLLPVRRPKARGLGA
        GK++QG++NP+   +R     +GA
Subjt:  GKDSQGMVNPLLPVRRPKARGLGA

Q9ERA6 Tuftelin-interacting protein 113.0e-0732.26Show/hide
Query:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL
        A + F+  GL K         D     +P++    PK LG      +   +GGN   + S +  +G +      G++E HTKG G K++ KMG+V G GL
Subjt:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL

Query:  GKDSQGMVNPLLPVRRPKARGLGA
        GK++QG++NP+   +R     +GA
Subjt:  GKDSQGMVNPLLPVRRPKARGLGA

Arabidopsis top hitse value%identityAlignment
AT1G17070.1 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain2.6e-0628.93Show/hide
Query:  GARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITT-----SDVT
        G R E D+F   K          + + K+    +T  I     S S  S   GS  K    + +G K    KPV+FVS G + P   EI       +D  
Subjt:  GARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITT-----SDVT

Query:  DTDK--RKDMITTALET---IQISTSEV-------------------------------------------------KNTDINIDTIGAFEVHTKGFGSK
        D DK    DMI   +E    + I +S +                                                 K   +  D IG FE  TKG G K
Subjt:  DTDK--RKDMITTALET---IQISTSEV-------------------------------------------------KNTDINIDTIGAFEVHTKGFGSK

Query:  MMAKMGFVGGGLGKDGQGMARPIEVIKRPKSLGLGV-EFSEA
        ++ KMG+ GGGLGK+ QG+  PIE   RPK++G+G  +F EA
Subjt:  MMAKMGFVGGGLGKDGQGMARPIEVIKRPKSLGLGV-EFSEA

AT2G24830.1 zinc finger (CCCH-type) family protein / D111/G-patch domain-containing protein4.4e-0644.44Show/hide
Query:  STRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGL
        ST    G          +E HT+G  SKMMA MG+ EGMGLG   QG++NP+L    P  R L
Subjt:  STRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGL

AT2G42330.1 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain2.0e-0652.17Show/hide
Query:  FEVHTKGFGSKMMAKMGFVGGGLGKDGQGMARPIEVIKRPKSLGLG
        FE  + G G K++ KMG+ G GLGK+ QG+  PIEV  RPK++G+G
Subjt:  FEVHTKGFGSKMMAKMGFVGGGLGKDGQGMARPIEVIKRPKSLGLG

AT2G42330.2 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain2.0e-0652.17Show/hide
Query:  FEVHTKGFGSKMMAKMGFVGGGLGKDGQGMARPIEVIKRPKSLGLG
        FE  + G G K++ KMG+ G GLGK+ QG+  PIEV  RPK++G+G
Subjt:  FEVHTKGFGSKMMAKMGFVGGGLGKDGQGMARPIEVIKRPKSLGLG

AT3G09850.1 D111/G-patch domain-containing protein3.4e-14745.01Show/hide
Query:  MAGGRRRTIHAKASDGFRKNK---RNTGRRRSDPSSSIRG---------NLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTK-TASASG
        M GG RR  +   S G  K K   ++T    +  + S  G          LFV+GG LSD +   +  + +R G+S  K   G +S +++R K +AS SG
Subjt:  MAGGRRRTIHAKASDGFRKNK---RNTGRRRSDPSSSIRG---------NLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTK-TASASG

Query:  SKKSNGNAIGYEYPPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKP-SLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDR
         +K +GN   Y+YP    +EGL   S G+ +D      N  P +L  S+ TQIVA++D+ P S K   + + Y+Y  S+VLGD SH+GLGF DD    D 
Subjt:  SKKSNGNAIGYEYPPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKP-SLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDR

Query:  SPSTDDGSPPQVEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNY
        +PS     P  + +Q     GS S E+E   D   E    DD     + +T             +NSGF+SIGGM+LYT+DVS EESD + E +D   + 
Subjt:  SPSTDDGSPPQVEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNY

Query:  S---EQLESSESSESDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQEL----VESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPS
        S       SSE SESDSSE+M  + S+IDD+VA+DYLEGIGGSE +L + WL +Q L    + SDD SSS   D    KL GI LQ+AS E+G KKT  S
Subjt:  S---EQLESSESSESDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQEL----VESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPS

Query:  RKKSSIVSGDDWSSLALDDLL-VKDSRTTSAK--KKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNR
                G   + LA+DDL+ VKD R+ S K  KKK  A+ P SWP +APKSK +R +PGE KK+RKE IA KRRERM+ RGVDL  IN +LE+ VL  
Subjt:  RKKSSIVSGDDWSSLALDDLL-VKDSRTTSAK--KKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNR

Query:  EDMFSFQPMHPRDCSQVRRLAAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHI
         DM  FQ MH RDCSQVRRLA +YRL S C GSGKK FVTVTRT  T MPS+SD++R+ +LIGA DED DF+V+ G   KS   +R + K +A+   +  
Subjt:  EDMFSFQPMHPRDCSQVRRLAAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHI

Query:  LELHQSGSMKSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGF
                 + R+K +  KSS         YAD+PVSFVS G++  E                 +  T++E +    +E      N   IGAFEVHT+GF
Subjt:  LELHQSGSMKSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGF

Query:  GSKMMAKMGFV-GGGLGKDGQGMARPIEVIKRPKSLGLGVEFS-----EAFSSGGNQESRGSTRAKTGASGKA---------------KKIGAFEEHTKG
        GSKMMAKMGF+ GGGLGKDG+G+A+PIE ++RPKSLGLG++FS      + SS  N  ++ +  + +G  GK                K++GAFE+HT G
Subjt:  GSKMMAKMGFV-GGGLGKDGQGMARPIEVIKRPKSLGLGVEFS-----EAFSSGGNQESRGSTRAKTGASGKA---------------KKIGAFEEHTKG

Query:  FGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
        FGS+MMA+MGFVEG GLG++SQG+VNPL+ VRRP+ARG+GA+
Subjt:  FGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGGAGGAAGACGGAGAACGATTCATGCCAAAGCCAGTGACGGCTTCAGGAAGAATAAGAGAAATACTGGCAGAAGAAGATCAGACCCGTCGTCATCTATT
CGAGGAAATTTGTTTGTCGACGGTGGCTTTTTATCCGATTTGCAGTTCCAGTCCTCGCCGCCTAGTTCTGCTCGAGAGGGAAACTCTAGGTCGAAGGGCCAGTCT
GGATCAAAATCGGTTAATTTAGATCGTACAAAAACTGCTTCCGCTAGTGGGAGCAAAAAATCGAATGGTAATGCTATTGGCTACGAATACCCTCCTACTCATCAT
CAGGAGGGCTTGCACTCTGAATCACAGGGTTTGCACAATGATGCAGATTGTAGCTTGGACAACTCACAGCCTTTTATATTACTTGATTCTAAGAACACTCAAATT
GTTGCTTATGTAGATGAAAAACCGTCTTTAAAGGTGGATGATTTGGAATTCACTTATGATTATGGTACGAGTTTTGTGTTAGGTGACAGTTCTCATAGAGGACTA
GGGTTTCATGATGATGATGATGAACTTGATAGAAGTCCCAGCACAGATGATGGCTCGCCTCCACAAGTGGAAGAACAAGAAGGATTATGCATTGGTTCATTGTCA
TCCGAGAAGGAAATGGGTACTGACGAGAGGGTTGAGTGTAGGATAGAGGATGATATGACTAATGAGGTGCTAGCAGAAACATCAGCCCCCAACAAATATACAGAT
GATGTGTGCTCTGAGAAAAATTCGGGTTTCTTGTCAATTGGTGGTATGAGATTATACACCCAAGATGTATCTTATGAAGAAAGTGATGACGATGGAGAGGCATCG
GATGGGAGTTCTAATTACTCTGAGCAGTTAGAGTCAAGTGAATCATCCGAAAGTGATAGCTCAGAAGAAATGTCTTGTACTGATTCAGATATTGATGACGAGGTG
GCTGAAGATTACCTCGAAGGAATTGGTGGAAGTGAGAATATTTTAAGTTCTAAGTGGCTAGTAAAACAAGAGTTGGTAGAGTCTGATGATGATAGTTCTAGCAGT
TGCTTTGATGATACTTTAGAGAAATTAGGCGGCATTGCTTTGCAGGAGGCATCAAAAGAATTTGGAATGAAGAAAACCCCACCATCAAGGAAGAAAAGCTCCATT
GTATCGGGAGATGATTGGTCATCATTGGCTCTGGATGACTTGCTAGTAAAAGATAGTAGAACTACATCAGCTAAAAAGAAGAAGAATGCTGCCCAGTCTCCATGT
TCTTGGCCTCCAAAAGCTCCAAAAAGTAAAGTAGCTAGAAAGTATCCTGGTGAAAAGAAGAAATATCGTAAAGAAACAATTGCAGCAAAGCGTCGAGAAAGAATG
ATTAGTCGTGGTGTTGATCTAGGGCAAATAAATCTGAAATTGGAGCACATGGTTCTGAATAGAGAAGATATGTTTTCTTTCCAACCTATGCATCCTCGTGATTGT
TCCCAGGTACGACGGTTGGCGGCAATTTACCGCTTGCATAGTGGATGCCAAGGTTCTGGTAAAAAAAGGTTTGTTACGGTAACTCGAACACAATATACAGGAATG
CCATCATCAAGTGATCAAGTTCGCCTGGTGCAGCTAATAGGAGCAAGAGACGAGGATGATGACTTTTCTGTTGCTAAAGGCTCAAATATGAAATCACAGGGTGGC
AACAGAAGCAGAGAAAAGAAGAATGCCGAAGTGAGCACTTCGCACATATTGGAGCTCCATCAATCCGGGAGCATGAAGTCGAGGTCGAAGGGTTCTGCAGGTAAG
AGTTCAAGTCAGAAGAAGACTGGCAACAAAAAGTACGCGGATAAACCTGTCTCATTTGTATCATGTGGAGTGATGCAACCAGAAACAGTCGAGATAACAACGAGC
GATGTTACAGATACAGATAAAAGAAAGGATATGATCACCACTGCATTGGAAACGATCCAGATATCAACTAGCGAAGTCAAGAACACAGATATTAATATAGATACG
ATCGGTGCATTTGAGGTGCACACCAAGGGTTTTGGGTCAAAAATGATGGCAAAAATGGGATTTGTAGGAGGAGGATTGGGGAAAGATGGTCAAGGTATGGCCCGC
CCCATTGAAGTGATCAAACGGCCTAAATCGCTCGGGTTAGGGGTCGAGTTCTCAGAGGCCTTCAGTTCTGGTGGTAACCAGGAAAGTCGAGGATCTACTCGTGCC
AAAACTGGAGCTTCTGGTAAAGCTAAAAAAATTGGGGCTTTTGAAGAGCATACCAAAGGATTTGGGTCCAAGATGATGGCAAAGATGGGATTTGTTGAAGGCATG
GGATTGGGGAAAGATTCTCAAGGAATGGTTAACCCTTTGCTTCCTGTTAGGCGGCCTAAAGCGCGAGGGTTAGGGGCCAAAGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGGAGGAAGACGGAGAACGATTCATGCCAAAGCCAGTGACGGCTTCAGGAAGAATAAGAGAAATACTGGCAGAAGAAGATCAGACCCGTCGTCATCTATT
CGAGGAAATTTGTTTGTCGACGGTGGCTTTTTATCCGATTTGCAGTTCCAGTCCTCGCCGCCTAGTTCTGCTCGAGAGGGAAACTCTAGGTCGAAGGGCCAGTCT
GGATCAAAATCGGTTAATTTAGATCGTACAAAAACTGCTTCCGCTAGTGGGAGCAAAAAATCGAATGGTAATGCTATTGGCTACGAATACCCTCCTACTCATCAT
CAGGAGGGCTTGCACTCTGAATCACAGGGTTTGCACAATGATGCAGATTGTAGCTTGGACAACTCACAGCCTTTTATATTACTTGATTCTAAGAACACTCAAATT
GTTGCTTATGTAGATGAAAAACCGTCTTTAAAGGTGGATGATTTGGAATTCACTTATGATTATGGTACGAGTTTTGTGTTAGGTGACAGTTCTCATAGAGGACTA
GGGTTTCATGATGATGATGATGAACTTGATAGAAGTCCCAGCACAGATGATGGCTCGCCTCCACAAGTGGAAGAACAAGAAGGATTATGCATTGGTTCATTGTCA
TCCGAGAAGGAAATGGGTACTGACGAGAGGGTTGAGTGTAGGATAGAGGATGATATGACTAATGAGGTGCTAGCAGAAACATCAGCCCCCAACAAATATACAGAT
GATGTGTGCTCTGAGAAAAATTCGGGTTTCTTGTCAATTGGTGGTATGAGATTATACACCCAAGATGTATCTTATGAAGAAAGTGATGACGATGGAGAGGCATCG
GATGGGAGTTCTAATTACTCTGAGCAGTTAGAGTCAAGTGAATCATCCGAAAGTGATAGCTCAGAAGAAATGTCTTGTACTGATTCAGATATTGATGACGAGGTG
GCTGAAGATTACCTCGAAGGAATTGGTGGAAGTGAGAATATTTTAAGTTCTAAGTGGCTAGTAAAACAAGAGTTGGTAGAGTCTGATGATGATAGTTCTAGCAGT
TGCTTTGATGATACTTTAGAGAAATTAGGCGGCATTGCTTTGCAGGAGGCATCAAAAGAATTTGGAATGAAGAAAACCCCACCATCAAGGAAGAAAAGCTCCATT
GTATCGGGAGATGATTGGTCATCATTGGCTCTGGATGACTTGCTAGTAAAAGATAGTAGAACTACATCAGCTAAAAAGAAGAAGAATGCTGCCCAGTCTCCATGT
TCTTGGCCTCCAAAAGCTCCAAAAAGTAAAGTAGCTAGAAAGTATCCTGGTGAAAAGAAGAAATATCGTAAAGAAACAATTGCAGCAAAGCGTCGAGAAAGAATG
ATTAGTCGTGGTGTTGATCTAGGGCAAATAAATCTGAAATTGGAGCACATGGTTCTGAATAGAGAAGATATGTTTTCTTTCCAACCTATGCATCCTCGTGATTGT
TCCCAGGTACGACGGTTGGCGGCAATTTACCGCTTGCATAGTGGATGCCAAGGTTCTGGTAAAAAAAGGTTTGTTACGGTAACTCGAACACAATATACAGGAATG
CCATCATCAAGTGATCAAGTTCGCCTGGTGCAGCTAATAGGAGCAAGAGACGAGGATGATGACTTTTCTGTTGCTAAAGGCTCAAATATGAAATCACAGGGTGGC
AACAGAAGCAGAGAAAAGAAGAATGCCGAAGTGAGCACTTCGCACATATTGGAGCTCCATCAATCCGGGAGCATGAAGTCGAGGTCGAAGGGTTCTGCAGGTAAG
AGTTCAAGTCAGAAGAAGACTGGCAACAAAAAGTACGCGGATAAACCTGTCTCATTTGTATCATGTGGAGTGATGCAACCAGAAACAGTCGAGATAACAACGAGC
GATGTTACAGATACAGATAAAAGAAAGGATATGATCACCACTGCATTGGAAACGATCCAGATATCAACTAGCGAAGTCAAGAACACAGATATTAATATAGATACG
ATCGGTGCATTTGAGGTGCACACCAAGGGTTTTGGGTCAAAAATGATGGCAAAAATGGGATTTGTAGGAGGAGGATTGGGGAAAGATGGTCAAGGTATGGCCCGC
CCCATTGAAGTGATCAAACGGCCTAAATCGCTCGGGTTAGGGGTCGAGTTCTCAGAGGCCTTCAGTTCTGGTGGTAACCAGGAAAGTCGAGGATCTACTCGTGCC
AAAACTGGAGCTTCTGGTAAAGCTAAAAAAATTGGGGCTTTTGAAGAGCATACCAAAGGATTTGGGTCCAAGATGATGGCAAAGATGGGATTTGTTGAAGGCATG
GGATTGGGGAAAGATTCTCAAGGAATGGTTAACCCTTTGCTTCCTGTTAGGCGGCCTAAAGCGCGAGGGTTAGGGGCCAAAGCTTAG
Protein sequenceShow/hide protein sequence
MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEYPPTHH
QEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQVEEQEGLCIGSLS
SEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDSSEEMSCTDSDIDDEV
AEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALDDLLVKDSRTTSAKKKKNAAQSPC
SWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAAIYRLHSGCQGSGKKRFVTVTRTQYTGM
PSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTS
DVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFVGGGLGKDGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRA
KTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKA