; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS002371 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS002371
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein SQS1
Genome locationscaffold30:4445804..4451350
RNA-Seq ExpressionMS002371
SyntenyMS002371
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000467 - G-patch domain
IPR001374 - R3H domain
IPR034082 - Protein SQS1, R3H domain
IPR036867 - R3H domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587849.1 Zinc finger CCCH-type with G patch domain-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0079.15Show/hide
Query:  MAGGRRRTIHVKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT H K SDGFRKNK ++GRRRSDPSSS+RGNLFVDGGFLSD QFQ+SPPSSAREGNSRSKG+SGSKS +LDR K AS+SG+K+SNGNAIGYEY
Subjt:  MAGGRRRTIHVKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDELDRSPSTDDGSPPQV
        PP  HQE GLHSESQGL NDADCSLDNSQPFILL  +SK+TQIVAYVD+KP LK DD+EFTYDYGT+F+LGDSSHRGLGFH++DEL R+ ++DD SP  V
Subjt:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDELDRSPSTDDGSPPQV

Query:  EEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSES
        EEQEGLC GSL S+KE G+DERVECR E ++ +E+LAE S+PNKY+  VCS +NSGFLSIGG+RLYTQDVS EESDDDGE S+GSS YSE LES ESSES
Subjt:  EEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSES

Query:  DSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSRKKSSIVSGDDWSSLALDD
        DSS EM+C+ SDIDDEVAEDYLEGIGGSE+IL SKWLVKQEL ESDDD SSS  DDTLEKL GIALQEASKEYGMKKT PSRKKS+IVS D+WSSLALDD
Subjt:  DSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSRKKSSIVSGDDWSSLALDD

Query:  LLVKDSRSTSAKKKKNDAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAA
        LL+KDSRS SA+KKKN A    SWPPKAPKSK   KYPGEKKKYRKETIAAKRRERM++RGVDL +INLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAA
Subjt:  LLVKDSRSTSAKKKKNDAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSS
        IYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSDQVRL QLIGARDED+DFSVA+G N+KS G NRSREKKNA+ S   ILEL+QSGS KSR++GSAGK SS
Subjt:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSS

Query:  QKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSKVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQG
        QKKTG KKYAD+PVSFVSCGVMQP+ VEITTS+V D DK KD I  A E ++++TS VKN DI+ D+IGAFE HT GFGSKMMAKMGFV GGGLGKDGQG
Subjt:  QKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSKVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQG

Query:  MARPIEVIKRPKSLGLGVEFSEASSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA
        MA PIEVIKRPKSLGLGVEFSEAS+S G NQ+S GS    TGA GK+ K+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+  ++RPK+ GLG 
Subjt:  MARPIEVIKRPKSLGLGVEFSEASSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA

Query:  K
        +
Subjt:  K

XP_022134831.1 uncharacterized protein LOC111007007 [Momordica charantia]0.0e+0099.12Show/hide
Query:  MAGGRRRTIHVKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRTIH KASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
Subjt:  MAGGRRRTIHVKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFH-DDDELDRSPSTDDGSPPQVEE
        PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFH DDDELDRSPSTDDGSPPQVEE
Subjt:  PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFH-DDDELDRSPSTDDGSPPQVEE

Query:  QEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDS
        QEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDS
Subjt:  QEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDS

Query:  SEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSRKKSSIVSGDDWSSLALDDLL
        SEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKE+GMKKTPPSRKKSSIVSGDDWSSLALDDLL
Subjt:  SEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSRKKSSIVSGDDWSSLALDDLL

Query:  VKDSRSTSAKKKKNDAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAAIY
        VKDSR+TSAKKKKN AQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAAIY
Subjt:  VKDSRSTSAKKKKNDAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAAIY

Query:  RLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQK
        RLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQK
Subjt:  RLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQK

Query:  KTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSKVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFVGGGLGKDGQGMAR
        KTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTS+VKNTDINIDTIGAFEVHTKGFGSKMMAKMGFVGGGLGKDGQGMAR
Subjt:  KTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSKVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFVGGGLGKDGQGMAR

Query:  PIEVIKRPKSLGLGVEFSEASSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKA
        PIEVIKRPKSLGLGVEFSEA SSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKA
Subjt:  PIEVIKRPKSLGLGVEFSEASSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKA

XP_022929809.1 uncharacterized protein LOC111436305 [Cucurbita moschata]0.0e+0079.03Show/hide
Query:  MAGGRRRTIHVKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT H K SDGFRKNK ++GRRRSDPSSS+RGNLFVDGGFLSD QFQ+SPPSSAREGNSRSKG+SGSKS +LDR K AS+SG+K+SNGNAIGYEY
Subjt:  MAGGRRRTIHVKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDELDRSPSTDDGSPPQV
        PP  HQE GLHSESQGL NDADCSLDNSQPFILL  +SK+TQIVAYVD+KP LK DD+EFTYDYGT+F+LGDSSHRGLGFH++DEL R+ ++DD SP  V
Subjt:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDELDRSPSTDDGSPPQV

Query:  EEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSES
        EEQEGLC GSL S+KE G+DERVECR E ++ +E+LAE S+PNKY+  VCS +NSGFLSIGG+RLYTQDVS EESDDDGE S+GSS YSE LES ESSES
Subjt:  EEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSES

Query:  DSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSRKKSSIVSGDDWSSLALDD
        DSS EM+C+ SDIDDEVAEDYLEGIGGSE+IL SKWLVKQEL ESDDD SSS  DDTLEKL GIALQEASKEYGMKKT PSRKKS+IVS D+WSSLALDD
Subjt:  DSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSRKKSSIVSGDDWSSLALDD

Query:  LLVKDSRSTSAKKKKNDAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAA
        LL+KDSRS SA+KKKN A    SWPPKAPKSK   KYPGEKKKYRKETIAAKRRERM++RGVDL +INLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAA
Subjt:  LLVKDSRSTSAKKKKNDAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSS
        IYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSDQVRL QLIGARD D+DFSVA+G N+KS G NRSREKKNA+ S   ILEL+QSGS KSR++GSAGK SS
Subjt:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSS

Query:  QKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSKVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQG
        QKKTG KKYAD+PVSFVSCGVMQP+ VEITTS+V D DK KD I  A E ++++TS VKN DI+ D+IGAFE HT GFGSKMMAKMGFV GGGLGKDGQG
Subjt:  QKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSKVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQG

Query:  MARPIEVIKRPKSLGLGVEFSEASSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA
        MA PIEVIKRPKSLGLGVEFSEAS+S G NQ+S GS    TGA GK+ K+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+  ++RPK+ GLG 
Subjt:  MARPIEVIKRPKSLGLGVEFSEASSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA

Query:  K
        +
Subjt:  K

XP_023003174.1 uncharacterized protein LOC111496865 isoform X1 [Cucurbita maxima]0.0e+0078.9Show/hide
Query:  MAGGRRRTIHVKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT H K SDGFRKNK ++GRRRSDPSSS+RGNLFVDGGFLSD QFQ++PPSSAREGNSRSKGQSGSKS +LDR KTAS+SG+K+SNGNAIGYEY
Subjt:  MAGGRRRTIHVKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDELDRSPSTDDGSPPQV
        PP  HQE GLHSES+GL NDADCSLDNSQPFILL  +SK+TQIVAYVD+KP LK D LE TYDYGT F+LGDSSHRGLGFHD+DEL R+ +TDD SP  V
Subjt:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDELDRSPSTDDGSPPQV

Query:  EEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSES
        EEQEGLC GSL S+KE G+DERVECR E ++ +E+LAE S+PNKY+   CS +NSGFLSIGG+RLYTQDVS EESDDDGE S+GSS YSE LES ESSES
Subjt:  EEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSES

Query:  DSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSRKKSSIVSGDDWSSLALDD
        DSS EM+C+ SDIDDEVAEDYLEGIGG E+IL SKWLVKQEL ESDDD SSS  DDTLEKL  IALQEASKEYGMKKT PSR KS IVS D+WSSLALDD
Subjt:  DSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSRKKSSIVSGDDWSSLALDD

Query:  LLVKDSRSTSAKKKKNDAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAA
        LL+KDSRS SA+KKKN A    SWPPKAPKSK   KYPGEKKKYRKETIAAKRRERM++RGVDL QINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAA
Subjt:  LLVKDSRSTSAKKKKNDAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSS
        IYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSDQVRL QLIGARD+D+DFSVA+G N+KS G NRSREKKNA+ S   ILEL+QSGS KSR+KGSAGK SS
Subjt:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSS

Query:  QKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSKVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQG
        QKKTG KKYAD+PVSFVSCGVMQP+ VEITTS+V D DK KD I  A E ++++TS VKN DI+ D+IGAFE HT GFGSKMMAKMGFV GGGLGKDGQG
Subjt:  QKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSKVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQG

Query:  MARPIEVIKRPKSLGLGVEFSEASSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA
        MA PIEVIKRPKSLGLG+EFSEAS+S G NQES GS    TG  GK+KK+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+  ++RPK+ GLG 
Subjt:  MARPIEVIKRPKSLGLGVEFSEASSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA

Query:  K
        +
Subjt:  K

XP_023003182.1 uncharacterized protein LOC111496865 isoform X2 [Cucurbita maxima]0.0e+0080.7Show/hide
Query:  MAGGRRRTIHVKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT H K SDGFRKNK ++GRRRSDPSSS+RGNLFVDGGFLSD QFQ++PPSSAREGNSRSKGQSGSKS +LDR KTAS+SG+K+SNGNAIGYEY
Subjt:  MAGGRRRTIHVKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDELDRSPSTDDGSPPQV
        PP  HQE GLHSES+GL NDADCSLDNSQPFILL  +SK+TQIVAYVD+KP LK D LE TYDYGT F+LGDSSHRGLGFHD+DEL R+ +TDD SP  V
Subjt:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDELDRSPSTDDGSPPQV

Query:  EEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSES
        EEQEGLC GSL S+KE G+DERVECR E ++ +E+LAE S+PNKY+   CS +NSGFLSIGG+RLYTQDVS EESDDDGE S+GSS YSE LES ESSES
Subjt:  EEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSES

Query:  DSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSRKKSSIVSGDDWSSLALDD
        DSS EM+C+ SDIDDEVAEDYLEGIGG E+IL SKWLVKQEL ESDDD SSS  DDTLEKL  IALQEASKEYGMKKT PSR KS IVS D+WSSLALDD
Subjt:  DSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSRKKSSIVSGDDWSSLALDD

Query:  LLVKDSRSTSAKKKKNDAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAA
        LL+KDSRS SA+KKKN A    SWPPKAPKSK   KYPGEKKKYRKETIAAKRRERM++RGVDL QINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAA
Subjt:  LLVKDSRSTSAKKKKNDAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSS
        IYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSDQVRL QLIGARD+D+DFSVA+G N+KS G NRSREKKNA+ S   ILEL+QSGS KSR+KGSAGK SS
Subjt:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSS

Query:  QKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSKVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQG
        QKKTG KKYAD+PVSFVSCGVMQP+ VEITTS+V D DK KD I  A E ++++TS VKN DI+ D+IGAFE HT GFGSKMMAKMGFV GGGLGKDGQG
Subjt:  QKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSKVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQG

Query:  MARPIEVIKRPKSLGLGVEFS-EASSSGG-NQES-RGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGL
        MA PIEVIKRPKSLGLGVEFS EAS+S G NQES R S R  TGA GK+KK+GAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGL
Subjt:  MARPIEVIKRPKSLGLGVEFS-EASSSGG-NQES-RGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGL

Query:  GAK
        GAK
Subjt:  GAK

TrEMBL top hitse value%identityAlignment
A0A1S3B9V9 Protein SQS10.0e+0075.06Show/hide
Query:  MAGGRRRTIHVKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT H K+SD FRKNK N+ RRRSDP S    NLFVDGGFLSD QFQ SPP SAREGNSR+KG+SGSKS  LDR K AS+SG+K+SNG AIGYEY
Subjt:  MAGGRRRTIHVKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDELDRSPSTDDGSPPQVEEQ
        P   +QE L SES+GL NDA+CSLD+SQPFILL+SK+ QIVAYVDE P L  D+LEFTYDYGTSFVLGDSSHRGLGF DDDEL    +TDD S  QVEEQ
Subjt:  PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDELDRSPSTDDGSPPQVEEQ

Query:  EGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDSS
         GLC GSLS EKE GTDE+V+ R+  +  NE++AE S  NK +DD+ S KNSGFLSIGG+RLYTQDVS EESDDDGE+SDG+S YSE LES ESSE DSS
Subjt:  EGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDSS

Query:  EEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSRKKSSIVSGDDWSSLALDDLLV
         EMSC+ SDIDDEVAEDYLEG+GGS+NIL SKWLVKQEL+ES DDSSSS  DDTL+KLGGIALQEASKEYGM KT  SR K S+VS D WS+LALDD+LV
Subjt:  EEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSRKKSSIVSGDDWSSLALDDLLV

Query:  KDSRSTSAKKKKNDAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAAIYR
        KD RSTS +K+KN +Q   SWPPKA  SK  RKYPGEKKKYRKE IAAKRRERM++RGVDL QI+L+LEHMVLN+EDM++FQPMHPRDCSQVRRLAAIYR
Subjt:  KDSRSTSAKKKKNDAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAAIYR

Query:  LHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQKK
        LHSGCQGSGKKRFVTVTRTQ+TGMPS+SDQVRL QLIGARDED+DFSVA+GSN+KS+G +RSREKKN +V   H LE  QS S KSRSKGSAGK SSQK 
Subjt:  LHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQKK

Query:  TGNKKYADKPVSFVSCGVMQPETVEI-TTSDVTDTDKRKDMITTALETIQISTSKVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQGMA
        TG KKYAD+PVSFVSCGVMQPE +E+    D  D DK KD++ T+ E I++ TS V NTDI+ D+IG+FE+HTKGFGSKMMAKMGFV GGGLGKDGQGM 
Subjt:  TGNKKYADKPVSFVSCGVMQPETVEI-TTSDVTDTDKRKDMITTALETIQISTSKVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQGMA

Query:  RPIEVIKRPKSLGLGVEFSEASSS--GGNQES-RGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA
         PIEV+KRPKSLGLG+EFSEAS+S  G NQ S   S R ++GA  K+K+IGAFE HTKGFGSKMMAKMGFVEGMGLGKDSQG+VNPLLPVRRPKARGLGA
Subjt:  RPIEVIKRPKSLGLGVEFSEASSS--GGNQES-RGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA

Query:  KA
        K+
Subjt:  KA

A0A6J1BYW1 Protein SQS10.0e+0099.12Show/hide
Query:  MAGGRRRTIHVKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRTIH KASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
Subjt:  MAGGRRRTIHVKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFH-DDDELDRSPSTDDGSPPQVEE
        PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFH DDDELDRSPSTDDGSPPQVEE
Subjt:  PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFH-DDDELDRSPSTDDGSPPQVEE

Query:  QEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDS
        QEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDS
Subjt:  QEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDS

Query:  SEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSRKKSSIVSGDDWSSLALDDLL
        SEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKE+GMKKTPPSRKKSSIVSGDDWSSLALDDLL
Subjt:  SEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSRKKSSIVSGDDWSSLALDDLL

Query:  VKDSRSTSAKKKKNDAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAAIY
        VKDSR+TSAKKKKN AQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAAIY
Subjt:  VKDSRSTSAKKKKNDAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAAIY

Query:  RLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQK
        RLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQK
Subjt:  RLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQK

Query:  KTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSKVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFVGGGLGKDGQGMAR
        KTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTS+VKNTDINIDTIGAFEVHTKGFGSKMMAKMGFVGGGLGKDGQGMAR
Subjt:  KTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSKVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFVGGGLGKDGQGMAR

Query:  PIEVIKRPKSLGLGVEFSEASSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKA
        PIEVIKRPKSLGLGVEFSEA SSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKA
Subjt:  PIEVIKRPKSLGLGVEFSEASSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKA

A0A6J1EPV3 Protein SQS10.0e+0079.03Show/hide
Query:  MAGGRRRTIHVKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT H K SDGFRKNK ++GRRRSDPSSS+RGNLFVDGGFLSD QFQ+SPPSSAREGNSRSKG+SGSKS +LDR K AS+SG+K+SNGNAIGYEY
Subjt:  MAGGRRRTIHVKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDELDRSPSTDDGSPPQV
        PP  HQE GLHSESQGL NDADCSLDNSQPFILL  +SK+TQIVAYVD+KP LK DD+EFTYDYGT+F+LGDSSHRGLGFH++DEL R+ ++DD SP  V
Subjt:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDELDRSPSTDDGSPPQV

Query:  EEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSES
        EEQEGLC GSL S+KE G+DERVECR E ++ +E+LAE S+PNKY+  VCS +NSGFLSIGG+RLYTQDVS EESDDDGE S+GSS YSE LES ESSES
Subjt:  EEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSES

Query:  DSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSRKKSSIVSGDDWSSLALDD
        DSS EM+C+ SDIDDEVAEDYLEGIGGSE+IL SKWLVKQEL ESDDD SSS  DDTLEKL GIALQEASKEYGMKKT PSRKKS+IVS D+WSSLALDD
Subjt:  DSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSRKKSSIVSGDDWSSLALDD

Query:  LLVKDSRSTSAKKKKNDAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAA
        LL+KDSRS SA+KKKN A    SWPPKAPKSK   KYPGEKKKYRKETIAAKRRERM++RGVDL +INLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAA
Subjt:  LLVKDSRSTSAKKKKNDAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSS
        IYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSDQVRL QLIGARD D+DFSVA+G N+KS G NRSREKKNA+ S   ILEL+QSGS KSR++GSAGK SS
Subjt:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSS

Query:  QKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSKVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQG
        QKKTG KKYAD+PVSFVSCGVMQP+ VEITTS+V D DK KD I  A E ++++TS VKN DI+ D+IGAFE HT GFGSKMMAKMGFV GGGLGKDGQG
Subjt:  QKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSKVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQG

Query:  MARPIEVIKRPKSLGLGVEFSEASSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA
        MA PIEVIKRPKSLGLGVEFSEAS+S G NQ+S GS    TGA GK+ K+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+  ++RPK+ GLG 
Subjt:  MARPIEVIKRPKSLGLGVEFSEASSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA

Query:  K
        +
Subjt:  K

A0A6J1KLR0 Protein SQS10.0e+0080.7Show/hide
Query:  MAGGRRRTIHVKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT H K SDGFRKNK ++GRRRSDPSSS+RGNLFVDGGFLSD QFQ++PPSSAREGNSRSKGQSGSKS +LDR KTAS+SG+K+SNGNAIGYEY
Subjt:  MAGGRRRTIHVKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDELDRSPSTDDGSPPQV
        PP  HQE GLHSES+GL NDADCSLDNSQPFILL  +SK+TQIVAYVD+KP LK D LE TYDYGT F+LGDSSHRGLGFHD+DEL R+ +TDD SP  V
Subjt:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDELDRSPSTDDGSPPQV

Query:  EEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSES
        EEQEGLC GSL S+KE G+DERVECR E ++ +E+LAE S+PNKY+   CS +NSGFLSIGG+RLYTQDVS EESDDDGE S+GSS YSE LES ESSES
Subjt:  EEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSES

Query:  DSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSRKKSSIVSGDDWSSLALDD
        DSS EM+C+ SDIDDEVAEDYLEGIGG E+IL SKWLVKQEL ESDDD SSS  DDTLEKL  IALQEASKEYGMKKT PSR KS IVS D+WSSLALDD
Subjt:  DSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSRKKSSIVSGDDWSSLALDD

Query:  LLVKDSRSTSAKKKKNDAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAA
        LL+KDSRS SA+KKKN A    SWPPKAPKSK   KYPGEKKKYRKETIAAKRRERM++RGVDL QINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAA
Subjt:  LLVKDSRSTSAKKKKNDAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSS
        IYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSDQVRL QLIGARD+D+DFSVA+G N+KS G NRSREKKNA+ S   ILEL+QSGS KSR+KGSAGK SS
Subjt:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSS

Query:  QKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSKVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQG
        QKKTG KKYAD+PVSFVSCGVMQP+ VEITTS+V D DK KD I  A E ++++TS VKN DI+ D+IGAFE HT GFGSKMMAKMGFV GGGLGKDGQG
Subjt:  QKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSKVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQG

Query:  MARPIEVIKRPKSLGLGVEFS-EASSSGG-NQES-RGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGL
        MA PIEVIKRPKSLGLGVEFS EAS+S G NQES R S R  TGA GK+KK+GAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGL
Subjt:  MARPIEVIKRPKSLGLGVEFS-EASSSGG-NQES-RGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGL

Query:  GAK
        GAK
Subjt:  GAK

A0A6J1KSJ4 Protein SQS10.0e+0078.9Show/hide
Query:  MAGGRRRTIHVKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT H K SDGFRKNK ++GRRRSDPSSS+RGNLFVDGGFLSD QFQ++PPSSAREGNSRSKGQSGSKS +LDR KTAS+SG+K+SNGNAIGYEY
Subjt:  MAGGRRRTIHVKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDELDRSPSTDDGSPPQV
        PP  HQE GLHSES+GL NDADCSLDNSQPFILL  +SK+TQIVAYVD+KP LK D LE TYDYGT F+LGDSSHRGLGFHD+DEL R+ +TDD SP  V
Subjt:  PPTHHQE-GLHSESQGLHNDADCSLDNSQPFILL--DSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDELDRSPSTDDGSPPQV

Query:  EEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSES
        EEQEGLC GSL S+KE G+DERVECR E ++ +E+LAE S+PNKY+   CS +NSGFLSIGG+RLYTQDVS EESDDDGE S+GSS YSE LES ESSES
Subjt:  EEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSES

Query:  DSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSRKKSSIVSGDDWSSLALDD
        DSS EM+C+ SDIDDEVAEDYLEGIGG E+IL SKWLVKQEL ESDDD SSS  DDTLEKL  IALQEASKEYGMKKT PSR KS IVS D+WSSLALDD
Subjt:  DSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSRKKSSIVSGDDWSSLALDD

Query:  LLVKDSRSTSAKKKKNDAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAA
        LL+KDSRS SA+KKKN A    SWPPKAPKSK   KYPGEKKKYRKETIAAKRRERM++RGVDL QINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAA
Subjt:  LLVKDSRSTSAKKKKNDAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSS
        IYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSDQVRL QLIGARD+D+DFSVA+G N+KS G NRSREKKNA+ S   ILEL+QSGS KSR+KGSAGK SS
Subjt:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSS

Query:  QKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSKVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQG
        QKKTG KKYAD+PVSFVSCGVMQP+ VEITTS+V D DK KD I  A E ++++TS VKN DI+ D+IGAFE HT GFGSKMMAKMGFV GGGLGKDGQG
Subjt:  QKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSKVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQG

Query:  MARPIEVIKRPKSLGLGVEFSEASSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA
        MA PIEVIKRPKSLGLG+EFSEAS+S G NQES GS    TG  GK+KK+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+  ++RPK+ GLG 
Subjt:  MARPIEVIKRPKSLGLGVEFSEASSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA

Query:  K
        +
Subjt:  K

SwissProt top hitse value%identityAlignment
A1XD97 Tuftelin-interacting protein 114.3e-0631.45Show/hide
Query:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEASSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL
        A + F+  GL K         D +   +P++    PK  G      +   +GGN   + S +   G +      G++E HTKG G K++ KMG+V G GL
Subjt:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEASSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL

Query:  GKDSQGMVNPLLPVRRPKARGLGA
        GK++QG++NP+   +R     +GA
Subjt:  GKDSQGMVNPLLPVRRPKARGLGA

A7SBN6 Zinc finger CCCH-type with G patch domain-containing protein1.5e-0636.46Show/hide
Query:  LGVEFSEASSSGGNQESRGSTRAKTGA--------------SGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGL
        LG E  E+ S   +Q   G + +   A              +G    +G +E+HTKG GSK+MAKMG++ G GLGKD +G V P+  V  P+ + L
Subjt:  LGVEFSEASSSGGNQESRGSTRAKTGA--------------SGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGL

Q29RR5 Tuftelin-interacting protein 114.3e-0631.45Show/hide
Query:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEASSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL
        A + F+  GL K         D     +P++  + PK  G      +   +GGN   + S +   G +      G++E HTKG G K++ KMG+V G GL
Subjt:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEASSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL

Query:  GKDSQGMVNPLLPVRRPKARGLGA
        GK++QG++NP+   +R     +GA
Subjt:  GKDSQGMVNPLLPVRRPKARGLGA

Q5U2Y6 Tuftelin-interacting protein 111.9e-0632.26Show/hide
Query:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEASSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL
        A + F+  GL K         D     +P++    PK LG      +   +GGN   + S +   G +      G++E HTKG G K++ KMG+V G GL
Subjt:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEASSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL

Query:  GKDSQGMVNPLLPVRRPKARGLGA
        GK++QG++NP+   +R     +GA
Subjt:  GKDSQGMVNPLLPVRRPKARGLGA

Q9ERA6 Tuftelin-interacting protein 111.5e-0632.26Show/hide
Query:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEASSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL
        A + F+  GL K         D     +P++    PK LG      +   +GGN   + S +  +G +      G++E HTKG G K++ KMG+V G GL
Subjt:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEASSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL

Query:  GKDSQGMVNPLLPVRRPKARGLGA
        GK++QG++NP+   +R     +GA
Subjt:  GKDSQGMVNPLLPVRRPKARGLGA

Arabidopsis top hitse value%identityAlignment
AT1G17070.1 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain2.6e-0628.93Show/hide
Query:  GARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITT-----SDVT
        G R E D+F   K          + + K+    +T  I     S S  S   GS  K    + +G K    KPV+FVS G + P   EI       +D  
Subjt:  GARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITT-----SDVT

Query:  DTDK--RKDMITTALET---IQISTSKV-------------------------------------------------KNTDINIDTIGAFEVHTKGFGSK
        D DK    DMI   +E    + I +S +                                                 K   +  D IG FE  TKG G K
Subjt:  DTDK--RKDMITTALET---IQISTSKV-------------------------------------------------KNTDINIDTIGAFEVHTKGFGSK

Query:  MMAKMGFVGGGLGKDGQGMARPIEVIKRPKSLGLGV-EFSEA
        ++ KMG+ GGGLGK+ QG+  PIE   RPK++G+G  +F EA
Subjt:  MMAKMGFVGGGLGKDGQGMARPIEVIKRPKSLGLGV-EFSEA

AT2G24830.1 zinc finger (CCCH-type) family protein / D111/G-patch domain-containing protein5.8e-0644.44Show/hide
Query:  STRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGL
        ST    G          +E HT+G  SKMMA MG+ EGMGLG   QG++NP+L    P  R L
Subjt:  STRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGL

AT2G42330.1 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain2.6e-0647.27Show/hide
Query:  FEVHTKGFGSKMMAKMGFVGGGLGKDGQGMARPIEVIKRPKSLGLGV-EFSEASS
        FE  + G G K++ KMG+ G GLGK+ QG+  PIEV  RPK++G+G  +F E ++
Subjt:  FEVHTKGFGSKMMAKMGFVGGGLGKDGQGMARPIEVIKRPKSLGLGV-EFSEASS

AT2G42330.2 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain2.6e-0647.27Show/hide
Query:  FEVHTKGFGSKMMAKMGFVGGGLGKDGQGMARPIEVIKRPKSLGLGV-EFSEASS
        FE  + G G K++ KMG+ G GLGK+ QG+  PIEV  RPK++G+G  +F E ++
Subjt:  FEVHTKGFGSKMMAKMGFVGGGLGKDGQGMARPIEVIKRPKSLGLGV-EFSEASS

AT3G09850.1 D111/G-patch domain-containing protein3.4e-14745.18Show/hide
Query:  MAGGRRRTIHVKASDGFRKNK---RNTGRRRSDPSSSIRG---------NLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTK-TASASG
        M GG RR  +   S G  K K   ++T    +  + S  G          LFV+GG LSD +   +  + +R G+S  K   G +S +++R K +AS SG
Subjt:  MAGGRRRTIHVKASDGFRKNK---RNTGRRRSDPSSSIRG---------NLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTK-TASASG

Query:  SKKSNGNAIGYEYPPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKP-SLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDELDRS
         +K +GN   Y+YP    +EGL   S G+ +D      N  P +L  S+ TQIVA++D+ P S K   + + Y+Y  S+VLGD SH+GLGF DD   D +
Subjt:  SKKSNGNAIGYEYPPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKP-SLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDELDRS

Query:  PSTDDGSPPQVEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYS
        PS     P  + +Q     GS S E+E   D   E    DD     + +T             +NSGF+SIGGM+LYT+DVS EESD + E +D   + S
Subjt:  PSTDDGSPPQVEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYS

Query:  ---EQLESSESSESDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQEL----VESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSR
               SSE SESDSSE+M  + S+IDD+VA+DYLEGIGGSE +L + WL +Q L    + SDD SSS   D    KL GI LQ+AS EYG KKT  S 
Subjt:  ---EQLESSESSESDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQEL----VESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSR

Query:  KKSSIVSGDDWSSLALDDLL-VKDSRSTSAK--KKKNDAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNRE
               G   + LA+DDL+ VKD RS S K  KKK  A+ P SWP +APKSK +R +PGE KK+RKE IA KRRERM+ RGVDL  IN +LE+ VL   
Subjt:  KKSSIVSGDDWSSLALDDLL-VKDSRSTSAK--KKKNDAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNRE

Query:  DMFSFQPMHPRDCSQVRRLAAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHIL
        DM  FQ MH RDCSQVRRLA +YRL S C GSGKK FVTVTRT  T MPS+SD++R+ +LIGA DED DF+V+ G   KS   +R + K +A+   +   
Subjt:  DMFSFQPMHPRDCSQVRRLAAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHIL

Query:  ELHQSGSMKSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSKVKNTDINIDTIGAFEVHTKGFG
                + R+K +  KSS         YAD+PVSFVS G++  E                 +  T++E +    ++      N   IGAFEVHT+GFG
Subjt:  ELHQSGSMKSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSKVKNTDINIDTIGAFEVHTKGFG

Query:  SKMMAKMGFV-GGGLGKDGQGMARPIEVIKRPKSLGLGVEFS-----EASSSGGNQESRGSTRAKTGASGKA---------------KKIGAFEEHTKGF
        SKMMAKMGF+ GGGLGKDG+G+A+PIE ++RPKSLGLG++FS      + SS  N  ++ +  + +G  GK                K++GAFE+HT GF
Subjt:  SKMMAKMGFV-GGGLGKDGQGMARPIEVIKRPKSLGLGVEFS-----EASSSGGNQESRGSTRAKTGASGKA---------------KKIGAFEEHTKGF

Query:  GSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
        GS+MMA+MGFVEG GLG++SQG+VNPL+ VRRP+ARG+GA+
Subjt:  GSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGGAGGAAGACGGAGAACGATTCATGTCAAAGCCAGTGACGGCTTCAGGAAGAATAAGAGAAATACTGGCAGAAGAAGATCAGACCCGTCGTCATCTATTCGAGG
AAATTTGTTTGTCGACGGTGGCTTTTTATCCGATTTGCAGTTCCAGTCCTCGCCGCCTAGTTCTGCTCGAGAGGGAAACTCTAGGTCGAAGGGCCAGTCTGGATCAAAAT
CGGTTAATTTAGATCGTACAAAAACTGCTTCCGCTAGTGGGAGCAAAAAATCGAATGGTAATGCTATTGGCTACGAATACCCTCCTACTCATCATCAGGAGGGCTTGCAC
TCTGAATCACAGGGTTTGCACAATGATGCAGATTGTAGCTTGGACAACTCACAGCCTTTTATATTACTTGATTCTAAGAACACTCAAATTGTTGCTTATGTAGATGAAAA
ACCGTCTTTAAAGGTGGATGATTTGGAATTCACTTATGATTATGGTACGAGCTTTGTGTTAGGTGACAGTTCTCATAGAGGACTAGGGTTTCATGATGATGATGAACTTG
ATAGAAGTCCCAGCACAGATGATGGCTCGCCTCCACAAGTGGAAGAACAAGAAGGATTATGCATTGGTTCATTGTCATCCGAGAAGGAAATGGGTACTGACGAGAGGGTT
GAGTGTAGGATAGAGGATGATATGACTAATGAGGTGCTAGCAGAAACATCAGCCCCCAACAAATATACAGATGATGTGTGCTCTGAGAAAAATTCGGGTTTCTTGTCAAT
TGGTGGTATGAGATTATACACCCAAGATGTATCTTATGAAGAAAGTGATGACGATGGAGAGGCATCGGATGGGAGTTCTAATTACTCTGAGCAGTTAGAGTCAAGTGAAT
CATCCGAAAGTGATAGCTCAGAAGAAATGTCTTGTACTGATTCAGATATTGATGACGAGGTGGCTGAAGATTACCTCGAAGGAATTGGTGGAAGTGAGAATATTTTAAGT
TCTAAGTGGCTAGTAAAACAAGAGTTGGTAGAGTCTGATGATGATAGTTCTAGCAGTTGCTTTGATGATACTTTAGAGAAATTAGGCGGCATTGCTTTGCAGGAGGCATC
AAAAGAATATGGAATGAAGAAAACCCCACCATCAAGGAAGAAAAGCTCCATTGTATCGGGAGATGATTGGTCATCATTGGCTCTGGATGACTTGCTAGTAAAAGATAGTA
GAAGTACATCAGCTAAAAAGAAGAAGAATGATGCCCAGTCTCCATGTTCTTGGCCTCCAAAAGCTCCAAAAAGTAAAGTAGCTAGAAAGTATCCTGGTGAAAAGAAGAAA
TATCGTAAAGAAACAATTGCAGCAAAGCGTCGAGAAAGAATGATTAGTCGTGGTGTTGATCTAGGGCAAATAAATCTGAAATTGGAGCACATGGTTCTGAATAGAGAAGA
TATGTTTTCTTTCCAACCTATGCATCCTCGTGATTGTTCCCAGGTACGACGGTTGGCGGCAATTTACCGCTTGCATAGTGGATGCCAAGGTTCTGGTAAAAAAAGGTTTG
TTACGGTAACTCGAACACAATATACAGGAATGCCATCATCAAGTGATCAAGTTCGCCTGGTGCAGCTAATAGGAGCAAGAGACGAGGATGATGACTTTTCTGTTGCTAAA
GGCTCAAATATGAAATCACAGGGTGGCAACAGAAGCAGAGAAAAGAAGAATGCCGAAGTGAGCACTTCGCACATATTGGAGCTCCATCAATCCGGGAGCATGAAGTCGAG
GTCGAAGGGTTCTGCAGGTAAGAGTTCAAGTCAGAAGAAGACTGGCAACAAAAAGTACGCGGATAAACCTGTCTCATTTGTATCATGTGGAGTGATGCAACCAGAAACAG
TCGAGATAACAACGAGCGATGTTACAGATACAGATAAAAGAAAGGATATGATCACCACTGCATTGGAAACGATCCAGATATCAACTAGCAAAGTCAAGAACACAGATATT
AATATAGATACGATCGGTGCATTTGAGGTGCACACCAAGGGTTTTGGGTCAAAAATGATGGCAAAAATGGGATTTGTAGGAGGAGGATTGGGGAAAGATGGTCAAGGTAT
GGCCCGCCCCATTGAAGTGATCAAACGGCCTAAATCGCTCGGGTTAGGGGTCGAGTTCTCAGAGGCCTCCAGTTCTGGTGGTAACCAGGAAAGTCGAGGATCTACTCGTG
CCAAAACTGGAGCTTCTGGTAAAGCTAAAAAAATTGGGGCTTTTGAAGAGCATACCAAAGGATTTGGGTCCAAGATGATGGCAAAGATGGGATTTGTTGAAGGCATGGGA
TTGGGGAAAGATTCTCAAGGAATGGTTAACCCTTTGCTTCCTGTTAGGCGGCCTAAAGCGCGAGGGTTAGGGGCCAAAGCT
mRNA sequenceShow/hide mRNA sequence
ATGGCCGGAGGAAGACGGAGAACGATTCATGTCAAAGCCAGTGACGGCTTCAGGAAGAATAAGAGAAATACTGGCAGAAGAAGATCAGACCCGTCGTCATCTATTCGAGG
AAATTTGTTTGTCGACGGTGGCTTTTTATCCGATTTGCAGTTCCAGTCCTCGCCGCCTAGTTCTGCTCGAGAGGGAAACTCTAGGTCGAAGGGCCAGTCTGGATCAAAAT
CGGTTAATTTAGATCGTACAAAAACTGCTTCCGCTAGTGGGAGCAAAAAATCGAATGGTAATGCTATTGGCTACGAATACCCTCCTACTCATCATCAGGAGGGCTTGCAC
TCTGAATCACAGGGTTTGCACAATGATGCAGATTGTAGCTTGGACAACTCACAGCCTTTTATATTACTTGATTCTAAGAACACTCAAATTGTTGCTTATGTAGATGAAAA
ACCGTCTTTAAAGGTGGATGATTTGGAATTCACTTATGATTATGGTACGAGCTTTGTGTTAGGTGACAGTTCTCATAGAGGACTAGGGTTTCATGATGATGATGAACTTG
ATAGAAGTCCCAGCACAGATGATGGCTCGCCTCCACAAGTGGAAGAACAAGAAGGATTATGCATTGGTTCATTGTCATCCGAGAAGGAAATGGGTACTGACGAGAGGGTT
GAGTGTAGGATAGAGGATGATATGACTAATGAGGTGCTAGCAGAAACATCAGCCCCCAACAAATATACAGATGATGTGTGCTCTGAGAAAAATTCGGGTTTCTTGTCAAT
TGGTGGTATGAGATTATACACCCAAGATGTATCTTATGAAGAAAGTGATGACGATGGAGAGGCATCGGATGGGAGTTCTAATTACTCTGAGCAGTTAGAGTCAAGTGAAT
CATCCGAAAGTGATAGCTCAGAAGAAATGTCTTGTACTGATTCAGATATTGATGACGAGGTGGCTGAAGATTACCTCGAAGGAATTGGTGGAAGTGAGAATATTTTAAGT
TCTAAGTGGCTAGTAAAACAAGAGTTGGTAGAGTCTGATGATGATAGTTCTAGCAGTTGCTTTGATGATACTTTAGAGAAATTAGGCGGCATTGCTTTGCAGGAGGCATC
AAAAGAATATGGAATGAAGAAAACCCCACCATCAAGGAAGAAAAGCTCCATTGTATCGGGAGATGATTGGTCATCATTGGCTCTGGATGACTTGCTAGTAAAAGATAGTA
GAAGTACATCAGCTAAAAAGAAGAAGAATGATGCCCAGTCTCCATGTTCTTGGCCTCCAAAAGCTCCAAAAAGTAAAGTAGCTAGAAAGTATCCTGGTGAAAAGAAGAAA
TATCGTAAAGAAACAATTGCAGCAAAGCGTCGAGAAAGAATGATTAGTCGTGGTGTTGATCTAGGGCAAATAAATCTGAAATTGGAGCACATGGTTCTGAATAGAGAAGA
TATGTTTTCTTTCCAACCTATGCATCCTCGTGATTGTTCCCAGGTACGACGGTTGGCGGCAATTTACCGCTTGCATAGTGGATGCCAAGGTTCTGGTAAAAAAAGGTTTG
TTACGGTAACTCGAACACAATATACAGGAATGCCATCATCAAGTGATCAAGTTCGCCTGGTGCAGCTAATAGGAGCAAGAGACGAGGATGATGACTTTTCTGTTGCTAAA
GGCTCAAATATGAAATCACAGGGTGGCAACAGAAGCAGAGAAAAGAAGAATGCCGAAGTGAGCACTTCGCACATATTGGAGCTCCATCAATCCGGGAGCATGAAGTCGAG
GTCGAAGGGTTCTGCAGGTAAGAGTTCAAGTCAGAAGAAGACTGGCAACAAAAAGTACGCGGATAAACCTGTCTCATTTGTATCATGTGGAGTGATGCAACCAGAAACAG
TCGAGATAACAACGAGCGATGTTACAGATACAGATAAAAGAAAGGATATGATCACCACTGCATTGGAAACGATCCAGATATCAACTAGCAAAGTCAAGAACACAGATATT
AATATAGATACGATCGGTGCATTTGAGGTGCACACCAAGGGTTTTGGGTCAAAAATGATGGCAAAAATGGGATTTGTAGGAGGAGGATTGGGGAAAGATGGTCAAGGTAT
GGCCCGCCCCATTGAAGTGATCAAACGGCCTAAATCGCTCGGGTTAGGGGTCGAGTTCTCAGAGGCCTCCAGTTCTGGTGGTAACCAGGAAAGTCGAGGATCTACTCGTG
CCAAAACTGGAGCTTCTGGTAAAGCTAAAAAAATTGGGGCTTTTGAAGAGCATACCAAAGGATTTGGGTCCAAGATGATGGCAAAGATGGGATTTGTTGAAGGCATGGGA
TTGGGGAAAGATTCTCAAGGAATGGTTAACCCTTTGCTTCCTGTTAGGCGGCCTAAAGCGCGAGGGTTAGGGGCCAAAGCT
Protein sequenceShow/hide protein sequence
MAGGRRRTIHVKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEYPPTHHQEGLH
SESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDELDRSPSTDDGSPPQVEEQEGLCIGSLSSEKEMGTDERV
ECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILS
SKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEYGMKKTPPSRKKSSIVSGDDWSSLALDDLLVKDSRSTSAKKKKNDAQSPCSWPPKAPKSKVARKYPGEKKK
YRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVRRLAAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAK
GSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSKVKNTDI
NIDTIGAFEVHTKGFGSKMMAKMGFVGGGLGKDGQGMARPIEVIKRPKSLGLGVEFSEASSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMG
LGKDSQGMVNPLLPVRRPKARGLGAKA