; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g0543 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g0543
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionProtein SQS1
Genome locationMC06:4458819..4464822
RNA-Seq ExpressionMC06g0543
SyntenyMC06g0543
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000467 - G-patch domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587849.1 Zinc finger CCCH-type with G patch domain-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.075.8Show/hide
Query:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT HAK SDGFRKNK ++GRRRSDPSSS+RGNLFVDGGFLSD QFQ+SPPSSAREGNSRSKG+SGSKS +LDR K AS+SG+K+SNGNAIGYEY
Subjt:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQEG-LHSESQGLHNDADCSLDNSQPFILLDS--KNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ
        PP  HQEG LHSESQGL NDADCSLDNSQPFILL+S  K+TQIVAYVD+KP LK DD+EFTYDYGT+F+LGDSSHRGLGFH++D EL R+ ++DD SP  
Subjt:  PPTHHQEG-LHSESQGLHNDADCSLDNSQPFILLDS--KNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ

Query:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE
        VEEQEGLC GSL S+KE G+DERVECR E ++ +E+LAE S+PNKY+  VCS +NSGFLSIGG+RLYTQDVS EESDDDGE S+GSS YSE LES ESSE
Subjt:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE

Query:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD
        SDSS EM+C+ SDIDDEVAEDYLEGIGGSE+IL SKWLVKQEL ESDDD SSS  DDTLEKL GIALQEASKE+GMKKTP SRKKS+IVS D+WSSLALD
Subjt:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD

Query:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVHCLL
        DLL+KDSR+ SA+KKKNAA    SWPPKAPKSK   KYPGEKKKYRKETIAAKRRERM++RGVDL +INLKLEHMVLNREDMFSFQPMHPRDCSQV  L 
Subjt:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVHCLL

Query:  LHLQYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSM
               +R  + C          GK  FVTVTRTQYTG+PSSSDQVRL QLIGARDED+DFSVA+G N+KS G NRSREKKNA+ S   ILEL+QSGS 
Subjt:  LHLQYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSM

Query:  KSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMG
        KSR++GSAGK SSQKKTG KKYAD+PVSFVSCGVMQP+ VEITTS+V D DK KD++  A E ++++TS VKN DI+ D+IGAFE HT GFGSKMMAKMG
Subjt:  KSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMG

Query:  FV-GGGLGKDGQGMARPIEVIKRPKSLGLGVEFSEAFSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPL
        FV GGGLGKDGQGMA PIEVIKRPKSLGLGVEFSEA +S G NQ+S GS    TGA GK+ K+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+
Subjt:  FV-GGGLGKDGQGMARPIEVIKRPKSLGLGVEFSEAFSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPL

Query:  LPVRRPKARGLGAK
          ++RPK+ GLG +
Subjt:  LPVRRPKARGLGAK

XP_022134831.1 uncharacterized protein LOC111007007 [Momordica charantia]0.096.67Show/hide
Query:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
Subjt:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQVEE
        PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQVEE
Subjt:  PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQVEE

Query:  QEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDS
        QEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDS
Subjt:  QEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDS

Query:  SEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALDDLL
        SEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALDDLL
Subjt:  SEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALDDLL

Query:  VKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVHCLLLHL
        VKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQV  L    
Subjt:  VKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVHCLLLHL

Query:  QYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSR
            +R  + C          GK  FVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSR
Subjt:  QYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSR

Query:  SKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFVG
        SKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFVG
Subjt:  SKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFVG

Query:  GGLGKDGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRR
        GGLGKDGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRR
Subjt:  GGLGKDGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRR

Query:  PKARGLGAKA
        PKARGLGAKA
Subjt:  PKARGLGAKA

XP_022929809.1 uncharacterized protein LOC111436305 [Cucurbita moschata]0.075.68Show/hide
Query:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT HAK SDGFRKNK ++GRRRSDPSSS+RGNLFVDGGFLSD QFQ+SPPSSAREGNSRSKG+SGSKS +LDR K AS+SG+K+SNGNAIGYEY
Subjt:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQEG-LHSESQGLHNDADCSLDNSQPFILLDS--KNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ
        PP  HQEG LHSESQGL NDADCSLDNSQPFILL+S  K+TQIVAYVD+KP LK DD+EFTYDYGT+F+LGDSSHRGLGFH++D EL R+ ++DD SP  
Subjt:  PPTHHQEG-LHSESQGLHNDADCSLDNSQPFILLDS--KNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ

Query:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE
        VEEQEGLC GSL S+KE G+DERVECR E ++ +E+LAE S+PNKY+  VCS +NSGFLSIGG+RLYTQDVS EESDDDGE S+GSS YSE LES ESSE
Subjt:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE

Query:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD
        SDSS EM+C+ SDIDDEVAEDYLEGIGGSE+IL SKWLVKQEL ESDDD SSS  DDTLEKL GIALQEASKE+GMKKTP SRKKS+IVS D+WSSLALD
Subjt:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD

Query:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVHCLL
        DLL+KDSR+ SA+KKKNAA    SWPPKAPKSK   KYPGEKKKYRKETIAAKRRERM++RGVDL +INLKLEHMVLNREDMFSFQPMHPRDCSQV  L 
Subjt:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVHCLL

Query:  LHLQYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSM
               +R  + C          GK  FVTVTRTQYTG+PSSSDQVRL QLIGARD D+DFSVA+G N+KS G NRSREKKNA+ S   ILEL+QSGS 
Subjt:  LHLQYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSM

Query:  KSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMG
        KSR++GSAGK SSQKKTG KKYAD+PVSFVSCGVMQP+ VEITTS+V D DK KD++  A E ++++TS VKN DI+ D+IGAFE HT GFGSKMMAKMG
Subjt:  KSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMG

Query:  FV-GGGLGKDGQGMARPIEVIKRPKSLGLGVEFSEAFSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPL
        FV GGGLGKDGQGMA PIEVIKRPKSLGLGVEFSEA +S G NQ+S GS    TGA GK+ K+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+
Subjt:  FV-GGGLGKDGQGMARPIEVIKRPKSLGLGVEFSEAFSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPL

Query:  LPVRRPKARGLGAK
          ++RPK+ GLG +
Subjt:  LPVRRPKARGLGAK

XP_023003174.1 uncharacterized protein LOC111496865 isoform X1 [Cucurbita maxima]0.075.55Show/hide
Query:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT HAK SDGFRKNK ++GRRRSDPSSS+RGNLFVDGGFLSD QFQ++PPSSAREGNSRSKGQSGSKS +LDR KTAS+SG+K+SNGNAIGYEY
Subjt:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQEG-LHSESQGLHNDADCSLDNSQPFILLDS--KNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ
        PP  HQEG LHSES+GL NDADCSLDNSQPFILL+S  K+TQIVAYVD+KP LK D LE TYDYGT F+LGDSSHRGLGFHD+D EL R+ +TDD SP  
Subjt:  PPTHHQEG-LHSESQGLHNDADCSLDNSQPFILLDS--KNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ

Query:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE
        VEEQEGLC GSL S+KE G+DERVECR E ++ +E+LAE S+PNKY+   CS +NSGFLSIGG+RLYTQDVS EESDDDGE S+GSS YSE LES ESSE
Subjt:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE

Query:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD
        SDSS EM+C+ SDIDDEVAEDYLEGIGG E+IL SKWLVKQEL ESDDD SSS  DDTLEKL  IALQEASKE+GMKKTP SR KS IVS D+WSSLALD
Subjt:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD

Query:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVHCLL
        DLL+KDSR+ SA+KKKNAA    SWPPKAPKSK   KYPGEKKKYRKETIAAKRRERM++RGVDL QINLKLEHMVLNREDMFSFQPMHPRDCSQV  L 
Subjt:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVHCLL

Query:  LHLQYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSM
               +R  + C          GK  FVTVTRTQYTG+PSSSDQVRL QLIGARD+D+DFSVA+G N+KS G NRSREKKNA+ S   ILEL+QSGS 
Subjt:  LHLQYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSM

Query:  KSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMG
        KSR+KGSAGK SSQKKTG KKYAD+PVSFVSCGVMQP+ VEITTS+V D DK KD++  A E ++++TS VKN DI+ D+IGAFE HT GFGSKMMAKMG
Subjt:  KSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMG

Query:  FV-GGGLGKDGQGMARPIEVIKRPKSLGLGVEFSEAFSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPL
        FV GGGLGKDGQGMA PIEVIKRPKSLGLG+EFSEA +S G NQES GS    TG  GK+KK+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+
Subjt:  FV-GGGLGKDGQGMARPIEVIKRPKSLGLGVEFSEAFSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPL

Query:  LPVRRPKARGLGAK
          ++RPK+ GLG +
Subjt:  LPVRRPKARGLGAK

XP_023003182.1 uncharacterized protein LOC111496865 isoform X2 [Cucurbita maxima]0.077.08Show/hide
Query:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT HAK SDGFRKNK ++GRRRSDPSSS+RGNLFVDGGFLSD QFQ++PPSSAREGNSRSKGQSGSKS +LDR KTAS+SG+K+SNGNAIGYEY
Subjt:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQEG-LHSESQGLHNDADCSLDNSQPFILLDS--KNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ
        PP  HQEG LHSES+GL NDADCSLDNSQPFILL+S  K+TQIVAYVD+KP LK D LE TYDYGT F+LGDSSHRGLGFHD+D EL R+ +TDD SP  
Subjt:  PPTHHQEG-LHSESQGLHNDADCSLDNSQPFILLDS--KNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ

Query:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE
        VEEQEGLC GSL S+KE G+DERVECR E ++ +E+LAE S+PNKY+   CS +NSGFLSIGG+RLYTQDVS EESDDDGE S+GSS YSE LES ESSE
Subjt:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE

Query:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD
        SDSS EM+C+ SDIDDEVAEDYLEGIGG E+IL SKWLVKQEL ESDDD SSS  DDTLEKL  IALQEASKE+GMKKTP SR KS IVS D+WSSLALD
Subjt:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD

Query:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVHCLL
        DLL+KDSR+ SA+KKKNAA    SWPPKAPKSK   KYPGEKKKYRKETIAAKRRERM++RGVDL QINLKLEHMVLNREDMFSFQPMHPRDCSQV  L 
Subjt:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVHCLL

Query:  LHLQYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSM
               +R  + C          GK  FVTVTRTQYTG+PSSSDQVRL QLIGARD+D+DFSVA+G N+KS G NRSREKKNA+ S   ILEL+QSGS 
Subjt:  LHLQYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSM

Query:  KSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMG
        KSR+KGSAGK SSQKKTG KKYAD+PVSFVSCGVMQP+ VEITTS+V D DK KD++  A E ++++TS VKN DI+ D+IGAFE HT GFGSKMMAKMG
Subjt:  KSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMG

Query:  FV-GGGLGKDGQGMARPIEVIKRPKSLGLGVEFSE--AFSSGGNQESRGST-RAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVN
        FV GGGLGKDGQGMA PIEVIKRPKSLGLGVEFS   + S+G NQESR S+ R  TGA GK+KK+GAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVN
Subjt:  FV-GGGLGKDGQGMARPIEVIKRPKSLGLGVEFSE--AFSSGGNQESRGST-RAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVN

Query:  PLLPVRRPKARGLGAK
        PLLPVRRPKARGLGAK
Subjt:  PLLPVRRPKARGLGAK

TrEMBL top hitse value%identityAlignment
A0A1S3B932 uncharacterized protein LOC103487380 isoform X10.073.37Show/hide
Query:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT HAK+SD FRKNK N+ RRRSDP S    NLFVDGGFLSD QFQ SPP SAREGNSR+KG+SGSKS  LDR K AS+SG+K+SNG AIGYEY
Subjt:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQVEE
        P   +QE L SES+GL NDA+CSLD+SQPFILL+SK+ QIVAYVDE P L  D+LEFTYDYGTSFVLGDSSHRGLGF DDD EL    +TDD S  QVEE
Subjt:  PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQVEE

Query:  QEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDS
        Q GLC GSLS EKE GTDE+V+ R+  +  NE++AE S  NK +DD+ S KNSGFLSIGG+RLYTQDVS EESDDDGE+SDG+S YSE LES ESSE DS
Subjt:  QEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDS

Query:  SEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALDDLL
        S EMSC+ SDIDDEVAEDYLEG+GGS+NIL SKWLVKQEL+ES DDSSSS  DDTL+KLGGIALQEASKE+GM KT  SR K S+VS D WS+LALDD+L
Subjt:  SEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALDDLL

Query:  VKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVHCLLLHL
        VKD R+TS +K+KNA+Q   SWPPKA  SK  RKYPGEKKKYRKE IAAKRRERM++RGVDL QI+L+LEHMVLN+EDM++FQPMHPRDCSQ+  +    
Subjt:  VKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVHCLLLHL

Query:  QYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSR
        +YD W+QFTACIVDAKVLVKK    FVTVTRTQ+TGMPS+SDQVRL QLIGARDED+DFSVA+GSN+KS+G +RSREKKN +V   H LE  QS S KSR
Subjt:  QYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSR

Query:  SKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEIT-TSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV
        SKGSAGK SSQK TG KKYAD+PVSFVSCGVMQPE +E+    D  D DK KD++ T+ E I++ TS V NTDI+ D+IG+FE+HTKGFGSKMMAKMGFV
Subjt:  SKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEIT-TSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFV

Query:  -GGGLGKDGQGMARPIEVIKRPKSLGLGVEFSEAFSS--GGNQESR-GSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPL
         GGGLGKDGQGM  PIEV+KRPKSLGLG+EFSEA +S  G NQ S   S R ++GA  K+K+IGAFE HTKGFGSKMMAKMGFVEGMGLGKDSQG+VNPL
Subjt:  -GGGLGKDGQGMARPIEVIKRPKSLGLGVEFSEAFSS--GGNQESR-GSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPL

Query:  LPVRRPKARGLGAKA
        LPVRRPKARGLGAK+
Subjt:  LPVRRPKARGLGAKA

A0A6J1BYW1 Protein SQS10.096.67Show/hide
Query:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
Subjt:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQVEE
        PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQVEE
Subjt:  PPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQVEE

Query:  QEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDS
        QEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDS
Subjt:  QEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDS

Query:  SEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALDDLL
        SEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALDDLL
Subjt:  SEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALDDLL

Query:  VKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVHCLLLHL
        VKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQV  L    
Subjt:  VKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVHCLLLHL

Query:  QYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSR
            +R  + C          GK  FVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSR
Subjt:  QYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSR

Query:  SKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFVG
        SKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFVG
Subjt:  SKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFVG

Query:  GGLGKDGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRR
        GGLGKDGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRR
Subjt:  GGLGKDGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRR

Query:  PKARGLGAKA
        PKARGLGAKA
Subjt:  PKARGLGAKA

A0A6J1EPV3 Protein SQS10.075.68Show/hide
Query:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT HAK SDGFRKNK ++GRRRSDPSSS+RGNLFVDGGFLSD QFQ+SPPSSAREGNSRSKG+SGSKS +LDR K AS+SG+K+SNGNAIGYEY
Subjt:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQEG-LHSESQGLHNDADCSLDNSQPFILLDS--KNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ
        PP  HQEG LHSESQGL NDADCSLDNSQPFILL+S  K+TQIVAYVD+KP LK DD+EFTYDYGT+F+LGDSSHRGLGFH++D EL R+ ++DD SP  
Subjt:  PPTHHQEG-LHSESQGLHNDADCSLDNSQPFILLDS--KNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ

Query:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE
        VEEQEGLC GSL S+KE G+DERVECR E ++ +E+LAE S+PNKY+  VCS +NSGFLSIGG+RLYTQDVS EESDDDGE S+GSS YSE LES ESSE
Subjt:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE

Query:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD
        SDSS EM+C+ SDIDDEVAEDYLEGIGGSE+IL SKWLVKQEL ESDDD SSS  DDTLEKL GIALQEASKE+GMKKTP SRKKS+IVS D+WSSLALD
Subjt:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD

Query:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVHCLL
        DLL+KDSR+ SA+KKKNAA    SWPPKAPKSK   KYPGEKKKYRKETIAAKRRERM++RGVDL +INLKLEHMVLNREDMFSFQPMHPRDCSQV  L 
Subjt:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVHCLL

Query:  LHLQYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSM
               +R  + C          GK  FVTVTRTQYTG+PSSSDQVRL QLIGARD D+DFSVA+G N+KS G NRSREKKNA+ S   ILEL+QSGS 
Subjt:  LHLQYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSM

Query:  KSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMG
        KSR++GSAGK SSQKKTG KKYAD+PVSFVSCGVMQP+ VEITTS+V D DK KD++  A E ++++TS VKN DI+ D+IGAFE HT GFGSKMMAKMG
Subjt:  KSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMG

Query:  FV-GGGLGKDGQGMARPIEVIKRPKSLGLGVEFSEAFSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPL
        FV GGGLGKDGQGMA PIEVIKRPKSLGLGVEFSEA +S G NQ+S GS    TGA GK+ K+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+
Subjt:  FV-GGGLGKDGQGMARPIEVIKRPKSLGLGVEFSEAFSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPL

Query:  LPVRRPKARGLGAK
          ++RPK+ GLG +
Subjt:  LPVRRPKARGLGAK

A0A6J1KLR0 Protein SQS10.077.08Show/hide
Query:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT HAK SDGFRKNK ++GRRRSDPSSS+RGNLFVDGGFLSD QFQ++PPSSAREGNSRSKGQSGSKS +LDR KTAS+SG+K+SNGNAIGYEY
Subjt:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQEG-LHSESQGLHNDADCSLDNSQPFILLDS--KNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ
        PP  HQEG LHSES+GL NDADCSLDNSQPFILL+S  K+TQIVAYVD+KP LK D LE TYDYGT F+LGDSSHRGLGFHD+D EL R+ +TDD SP  
Subjt:  PPTHHQEG-LHSESQGLHNDADCSLDNSQPFILLDS--KNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ

Query:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE
        VEEQEGLC GSL S+KE G+DERVECR E ++ +E+LAE S+PNKY+   CS +NSGFLSIGG+RLYTQDVS EESDDDGE S+GSS YSE LES ESSE
Subjt:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE

Query:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD
        SDSS EM+C+ SDIDDEVAEDYLEGIGG E+IL SKWLVKQEL ESDDD SSS  DDTLEKL  IALQEASKE+GMKKTP SR KS IVS D+WSSLALD
Subjt:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD

Query:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVHCLL
        DLL+KDSR+ SA+KKKNAA    SWPPKAPKSK   KYPGEKKKYRKETIAAKRRERM++RGVDL QINLKLEHMVLNREDMFSFQPMHPRDCSQV  L 
Subjt:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVHCLL

Query:  LHLQYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSM
               +R  + C          GK  FVTVTRTQYTG+PSSSDQVRL QLIGARD+D+DFSVA+G N+KS G NRSREKKNA+ S   ILEL+QSGS 
Subjt:  LHLQYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSM

Query:  KSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMG
        KSR+KGSAGK SSQKKTG KKYAD+PVSFVSCGVMQP+ VEITTS+V D DK KD++  A E ++++TS VKN DI+ D+IGAFE HT GFGSKMMAKMG
Subjt:  KSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMG

Query:  FV-GGGLGKDGQGMARPIEVIKRPKSLGLGVEFSE--AFSSGGNQESRGST-RAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVN
        FV GGGLGKDGQGMA PIEVIKRPKSLGLGVEFS   + S+G NQESR S+ R  TGA GK+KK+GAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVN
Subjt:  FV-GGGLGKDGQGMARPIEVIKRPKSLGLGVEFSE--AFSSGGNQESRGST-RAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVN

Query:  PLLPVRRPKARGLGAK
        PLLPVRRPKARGLGAK
Subjt:  PLLPVRRPKARGLGAK

A0A6J1KSJ4 Protein SQS10.075.55Show/hide
Query:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY
        MAGGRRRT HAK SDGFRKNK ++GRRRSDPSSS+RGNLFVDGGFLSD QFQ++PPSSAREGNSRSKGQSGSKS +LDR KTAS+SG+K+SNGNAIGYEY
Subjt:  MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEY

Query:  PPTHHQEG-LHSESQGLHNDADCSLDNSQPFILLDS--KNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ
        PP  HQEG LHSES+GL NDADCSLDNSQPFILL+S  K+TQIVAYVD+KP LK D LE TYDYGT F+LGDSSHRGLGFHD+D EL R+ +TDD SP  
Subjt:  PPTHHQEG-LHSESQGLHNDADCSLDNSQPFILLDS--KNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQ

Query:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE
        VEEQEGLC GSL S+KE G+DERVECR E ++ +E+LAE S+PNKY+   CS +NSGFLSIGG+RLYTQDVS EESDDDGE S+GSS YSE LES ESSE
Subjt:  VEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSE

Query:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD
        SDSS EM+C+ SDIDDEVAEDYLEGIGG E+IL SKWLVKQEL ESDDD SSS  DDTLEKL  IALQEASKE+GMKKTP SR KS IVS D+WSSLALD
Subjt:  SDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALD

Query:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVHCLL
        DLL+KDSR+ SA+KKKNAA    SWPPKAPKSK   KYPGEKKKYRKETIAAKRRERM++RGVDL QINLKLEHMVLNREDMFSFQPMHPRDCSQV  L 
Subjt:  DLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVHCLL

Query:  LHLQYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSM
               +R  + C          GK  FVTVTRTQYTG+PSSSDQVRL QLIGARD+D+DFSVA+G N+KS G NRSREKKNA+ S   ILEL+QSGS 
Subjt:  LHLQYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSM

Query:  KSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMG
        KSR+KGSAGK SSQKKTG KKYAD+PVSFVSCGVMQP+ VEITTS+V D DK KD++  A E ++++TS VKN DI+ D+IGAFE HT GFGSKMMAKMG
Subjt:  KSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMG

Query:  FV-GGGLGKDGQGMARPIEVIKRPKSLGLGVEFSEAFSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPL
        FV GGGLGKDGQGMA PIEVIKRPKSLGLG+EFSEA +S G NQES GS    TG  GK+KK+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+
Subjt:  FV-GGGLGKDGQGMARPIEVIKRPKSLGLGVEFSEAFSSGG-NQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPL

Query:  LPVRRPKARGLGAK
          ++RPK+ GLG +
Subjt:  LPVRRPKARGLGAK

SwissProt top hitse value%identityAlignment
A1XD95 Tuftelin-interacting protein 118.8e-0732.26Show/hide
Query:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL
        A + F+  GL K         D     RP++    PK  G          +GGN   + S +   G +      G++E HTKG G K++ KMG+V G GL
Subjt:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL

Query:  GKDSQGMVNPLLPVRRPKARGLGA
        GK++QG++NP+   +R     +GA
Subjt:  GKDSQGMVNPLLPVRRPKARGLGA

A1XD97 Tuftelin-interacting protein 118.8e-0731.45Show/hide
Query:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL
        A + F+  GL K         D +   +P++    PK  G      +   +GGN   + S +   G +      G++E HTKG G K++ KMG+V G GL
Subjt:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL

Query:  GKDSQGMVNPLLPVRRPKARGLGA
        GK++QG++NP+   +R     +GA
Subjt:  GKDSQGMVNPLLPVRRPKARGLGA

Q29RR5 Tuftelin-interacting protein 118.8e-0731.45Show/hide
Query:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL
        A + F+  GL K         D     +P++  + PK  G      +   +GGN   + S +   G +      G++E HTKG G K++ KMG+V G GL
Subjt:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL

Query:  GKDSQGMVNPLLPVRRPKARGLGA
        GK++QG++NP+   +R     +GA
Subjt:  GKDSQGMVNPLLPVRRPKARGLGA

Q5U2Y6 Tuftelin-interacting protein 114.0e-0732.26Show/hide
Query:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL
        A + F+  GL K         D     +P++    PK LG      +   +GGN   + S +   G +      G++E HTKG G K++ KMG+V G GL
Subjt:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL

Query:  GKDSQGMVNPLLPVRRPKARGLGA
        GK++QG++NP+   +R     +GA
Subjt:  GKDSQGMVNPLLPVRRPKARGLGA

Q9ERA6 Tuftelin-interacting protein 113.0e-0732.26Show/hide
Query:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL
        A + F+  GL K         D     +P++    PK LG      +   +GGN   + S +  +G +      G++E HTKG G K++ KMG+V G GL
Subjt:  AKMGFVGGGLGK---------DGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGL

Query:  GKDSQGMVNPLLPVRRPKARGLGA
        GK++QG++NP+   +R     +GA
Subjt:  GKDSQGMVNPLLPVRRPKARGLGA

Arabidopsis top hitse value%identityAlignment
AT1G17070.1 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain2.6e-0628.93Show/hide
Query:  GARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITT-----SDVT
        G R E D+F   K          + + K+    +T  I     S S  S   GS  K    + +G K    KPV+FVS G + P   EI       +D  
Subjt:  GARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITT-----SDVT

Query:  DTDK--RKDMITTALET---IQISTSEV-------------------------------------------------KNTDINIDTIGAFEVHTKGFGSK
        D DK    DMI   +E    + I +S +                                                 K   +  D IG FE  TKG G K
Subjt:  DTDK--RKDMITTALET---IQISTSEV-------------------------------------------------KNTDINIDTIGAFEVHTKGFGSK

Query:  MMAKMGFVGGGLGKDGQGMARPIEVIKRPKSLGLGV-EFSEA
        ++ KMG+ GGGLGK+ QG+  PIE   RPK++G+G  +F EA
Subjt:  MMAKMGFVGGGLGKDGQGMARPIEVIKRPKSLGLGV-EFSEA

AT2G24830.1 zinc finger (CCCH-type) family protein / D111/G-patch domain-containing protein4.5e-0644.44Show/hide
Query:  STRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGL
        ST    G          +E HT+G  SKMMA MG+ EGMGLG   QG++NP+L    P  R L
Subjt:  STRAKTGASGKAKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGL

AT2G42330.1 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain2.0e-0652.17Show/hide
Query:  FEVHTKGFGSKMMAKMGFVGGGLGKDGQGMARPIEVIKRPKSLGLG
        FE  + G G K++ KMG+ G GLGK+ QG+  PIEV  RPK++G+G
Subjt:  FEVHTKGFGSKMMAKMGFVGGGLGKDGQGMARPIEVIKRPKSLGLG

AT2G42330.2 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain2.0e-0652.17Show/hide
Query:  FEVHTKGFGSKMMAKMGFVGGGLGKDGQGMARPIEVIKRPKSLGLG
        FE  + G G K++ KMG+ G GLGK+ QG+  PIEV  RPK++G+G
Subjt:  FEVHTKGFGSKMMAKMGFVGGGLGKDGQGMARPIEVIKRPKSLGLG

AT3G09850.1 D111/G-patch domain-containing protein1.9e-13743.44Show/hide
Query:  MAGGRRRTIHAKASDGFRKNK---RNTGRRRSDPSSSIRG---------NLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTK-TASASG
        M GG RR  +   S G  K K   ++T    +  + S  G          LFV+GG LSD +   +  + +R G+S  K   G +S +++R K +AS SG
Subjt:  MAGGRRRTIHAKASDGFRKNK---RNTGRRRSDPSSSIRG---------NLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTK-TASASG

Query:  SKKSNGNAIGYEYPPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKP-SLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDR
         +K +GN   Y+YP    +EGL   S G+ +D      N  P +L  S+ TQIVA++D+ P S K   + + Y+Y  S+VLGD SH+GLGF DD    D 
Subjt:  SKKSNGNAIGYEYPPTHHQEGLHSESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKP-SLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDR

Query:  SPSTDDGSPPQVEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNY
        +PS     P  + +Q     GS S E+E   D   E    DD     + +T             +NSGF+SIGGM+LYT+DVS EESD + E +D   + 
Subjt:  SPSTDDGSPPQVEEQEGLCIGSLSSEKEMGTDERVECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNY

Query:  S---EQLESSESSESDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQEL----VESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPS
        S       SSE SESDSSE+M  + S+IDD+VA+DYLEGIGGSE +L + WL +Q L    + SDD SSS   D    KL GI LQ+AS E+G KKT  S
Subjt:  S---EQLESSESSESDSSEEMSCTDSDIDDEVAEDYLEGIGGSENILSSKWLVKQEL----VESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPS

Query:  RKKSSIVSGDDWSSLALDDLL-VKDSRTTSAK--KKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNR
                G   + LA+DDL+ VKD R+ S K  KKK  A+ P SWP +APKSK +R +PGE KK+RKE IA KRRERM+ RGVDL  IN +LE+ VL  
Subjt:  RKKSSIVSGDDWSSLALDDLL-VKDSRTTSAK--KKKNAAQSPCSWPPKAPKSKVARKYPGEKKKYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNR

Query:  EDMFSFQPMHPRDCSQVHCLLLHLQYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSR
         DM  FQ MH RDCSQV  L      D +R  ++C          GK  FVTVTRT  T MPS+SD++R+ +LIGA DED DF+V+ G   KS   +R +
Subjt:  EDMFSFQPMHPRDCSQVHCLLLHLQYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLIGARDEDDDFSVAKGSNMKSQGGNRSR

Query:  EKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINID
         K +A+   +           + R+K +  KSS         YAD+PVSFVS G++  E                 +  T++E +    +E      N  
Subjt:  EKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALETIQISTSEVKNTDINID

Query:  TIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQGMARPIEVIKRPKSLGLGVEFS-----EAFSSGGNQESRGSTRAKTGASGKA---------------
         IGAFEVHT+GFGSKMMAKMGF+ GGGLGKDG+G+A+PIE ++RPKSLGLG++FS      + SS  N  ++ +  + +G  GK                
Subjt:  TIGAFEVHTKGFGSKMMAKMGFV-GGGLGKDGQGMARPIEVIKRPKSLGLGVEFS-----EAFSSGGNQESRGSTRAKTGASGKA---------------

Query:  KKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
        K++GAFE+HT GFGS+MMA+MGFVEG GLG++SQG+VNPL+ VRRP+ARG+GA+
Subjt:  KKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGGAGGAAGACGGAGAACGATTCATGCCAAAGCCAGTGACGGCTTCAGGAAGAATAAGAGAAATACTGGCAGAAGAAGATCAGACCCGTCGTCATCTATTCGAGG
AAATTTGTTTGTCGACGGTGGCTTTTTATCCGATTTGCAGTTCCAGTCCTCGCCGCCTAGTTCTGCTCGAGAGGGAAACTCTAGGTCGAAGGGCCAGTCTGGATCAAAAT
CGGTTAATTTAGATCGTACAAAAACTGCTTCCGCTAGTGGGAGCAAAAAATCGAATGGTAATGCTATTGGCTACGAATACCCTCCTACTCATCATCAGGAGGGCTTGCAC
TCTGAATCACAGGGTTTGCACAATGATGCAGATTGTAGCTTGGACAACTCACAGCCTTTTATATTACTTGATTCTAAGAACACTCAAATTGTTGCTTATGTAGATGAAAA
ACCGTCTTTAAAGGTGGATGATTTGGAATTCACTTATGATTATGGTACGAGTTTTGTGTTAGGTGACAGTTCTCATAGAGGACTAGGGTTTCATGATGATGATGATGAAC
TTGATAGAAGTCCCAGCACAGATGATGGCTCGCCTCCACAAGTGGAAGAACAAGAAGGATTATGCATTGGTTCATTGTCATCCGAGAAGGAAATGGGTACTGACGAGAGG
GTTGAGTGTAGGATAGAGGATGATATGACTAATGAGGTGCTAGCAGAAACATCAGCCCCCAACAAATATACAGATGATGTGTGCTCTGAGAAAAATTCGGGTTTCTTGTC
AATTGGTGGTATGAGATTATACACCCAAGATGTATCTTATGAAGAAAGTGATGACGATGGAGAGGCATCGGATGGGAGTTCTAATTACTCTGAGCAGTTAGAGTCAAGTG
AATCATCCGAAAGTGATAGCTCAGAAGAAATGTCTTGTACTGATTCAGATATTGATGACGAGGTGGCTGAAGATTACCTCGAAGGAATTGGTGGAAGTGAGAATATTTTA
AGTTCTAAGTGGCTAGTAAAACAAGAGTTGGTAGAGTCTGATGATGATAGTTCTAGCAGTTGCTTTGATGATACTTTAGAGAAATTAGGCGGCATTGCTTTGCAGGAGGC
ATCAAAAGAATTTGGAATGAAGAAAACCCCACCATCAAGGAAGAAAAGCTCCATTGTATCGGGAGATGATTGGTCATCATTGGCTCTGGATGACTTGCTAGTAAAAGATA
GTAGAACTACATCAGCTAAAAAGAAGAAGAATGCTGCCCAGTCTCCATGTTCTTGGCCTCCAAAAGCTCCAAAAAGTAAAGTAGCTAGAAAGTATCCTGGTGAAAAGAAG
AAATATCGTAAAGAAACAATTGCAGCAAAGCGTCGAGAAAGAATGATTAGTCGTGGTGTTGATCTAGGGCAAATAAATCTGAAATTGGAGCACATGGTTCTGAATAGAGA
AGATATGTTTTCTTTCCAACCTATGCATCCTCGTGATTGTTCCCAGGTACATTGTTTGCTGCTGCATCTGCAGTACGACGGTTGGCGGCAATTTACCGCTTGCATAGTGG
ATGCCAAGGTTCTGGTAAAAAAAGGTAAAAGTTGGTTTGTTACGGTAACTCGAACACAATATACAGGAATGCCATCATCAAGTGATCAAGTTCGCCTGGTGCAGCTAATA
GGAGCAAGAGACGAGGATGATGACTTTTCTGTTGCTAAAGGCTCAAATATGAAATCACAGGGTGGCAACAGAAGCAGAGAAAAGAAGAATGCCGAAGTGAGCACTTCGCA
CATATTGGAGCTCCATCAATCCGGGAGCATGAAGTCGAGGTCGAAGGGTTCTGCAGGTAAGAGTTCAAGTCAGAAGAAGACTGGCAACAAAAAGTACGCGGATAAACCTG
TCTCATTTGTATCATGTGGAGTGATGCAACCAGAAACAGTCGAGATAACAACGAGCGATGTTACAGATACAGATAAAAGAAAGGATATGATCACCACTGCATTGGAAACG
ATCCAGATATCAACTAGCGAAGTCAAGAACACAGATATTAATATAGATACGATCGGTGCATTTGAGGTGCACACCAAGGGTTTTGGGTCAAAAATGATGGCAAAAATGGG
ATTTGTAGGAGGAGGATTGGGGAAAGATGGTCAAGGTATGGCCCGCCCCATTGAAGTGATCAAACGGCCTAAATCGCTCGGGTTAGGGGTCGAGTTCTCAGAGGCCTTCA
GTTCTGGTGGTAACCAGGAAAGTCGAGGATCTACTCGTGCCAAAACTGGAGCTTCTGGTAAAGCTAAAAAAATTGGGGCTTTTGAAGAGCATACCAAAGGATTTGGGTCC
AAGATGATGGCAAAGATGGGATTTGTTGAAGGCATGGGATTGGGGAAAGATTCTCAAGGAATGGTTAACCCTTTGCTTCCTGTTAGGCGGCCTAAAGCGCGAGGGTTAGG
GGCCAAAGCTTAG
mRNA sequenceShow/hide mRNA sequence
GGGTAGAGTCGGCCCACAAAAAGCTGCAGGCTTTGGGCCTTGGAAAGATTCCGAGTGGACCCCACTCTGAGTTTTCTCTCCTTGTTTGTTATCCGTCCACCGCGCCGATC
CACCGTAAGCTCCCCGGTGTGGCGGTTTGCTTCAGATTTCGGGAACTAGACTTTTAGCCTGTAGAGCTCCTCGAGAAAACACCGTCGTTGTGGTTGCGGTGGCGGGTCTC
CGATGGCCGGAGGAAGACGGAGAACGATTCATGCCAAAGCCAGTGACGGCTTCAGGAAGAATAAGAGAAATACTGGCAGAAGAAGATCAGACCCGTCGTCATCTATTCGA
GGAAATTTGTTTGTCGACGGTGGCTTTTTATCCGATTTGCAGTTCCAGTCCTCGCCGCCTAGTTCTGCTCGAGAGGGAAACTCTAGGTCGAAGGGCCAGTCTGGATCAAA
ATCGGTTAATTTAGATCGTACAAAAACTGCTTCCGCTAGTGGGAGCAAAAAATCGAATGGTAATGCTATTGGCTACGAATACCCTCCTACTCATCATCAGGAGGGCTTGC
ACTCTGAATCACAGGGTTTGCACAATGATGCAGATTGTAGCTTGGACAACTCACAGCCTTTTATATTACTTGATTCTAAGAACACTCAAATTGTTGCTTATGTAGATGAA
AAACCGTCTTTAAAGGTGGATGATTTGGAATTCACTTATGATTATGGTACGAGTTTTGTGTTAGGTGACAGTTCTCATAGAGGACTAGGGTTTCATGATGATGATGATGA
ACTTGATAGAAGTCCCAGCACAGATGATGGCTCGCCTCCACAAGTGGAAGAACAAGAAGGATTATGCATTGGTTCATTGTCATCCGAGAAGGAAATGGGTACTGACGAGA
GGGTTGAGTGTAGGATAGAGGATGATATGACTAATGAGGTGCTAGCAGAAACATCAGCCCCCAACAAATATACAGATGATGTGTGCTCTGAGAAAAATTCGGGTTTCTTG
TCAATTGGTGGTATGAGATTATACACCCAAGATGTATCTTATGAAGAAAGTGATGACGATGGAGAGGCATCGGATGGGAGTTCTAATTACTCTGAGCAGTTAGAGTCAAG
TGAATCATCCGAAAGTGATAGCTCAGAAGAAATGTCTTGTACTGATTCAGATATTGATGACGAGGTGGCTGAAGATTACCTCGAAGGAATTGGTGGAAGTGAGAATATTT
TAAGTTCTAAGTGGCTAGTAAAACAAGAGTTGGTAGAGTCTGATGATGATAGTTCTAGCAGTTGCTTTGATGATACTTTAGAGAAATTAGGCGGCATTGCTTTGCAGGAG
GCATCAAAAGAATTTGGAATGAAGAAAACCCCACCATCAAGGAAGAAAAGCTCCATTGTATCGGGAGATGATTGGTCATCATTGGCTCTGGATGACTTGCTAGTAAAAGA
TAGTAGAACTACATCAGCTAAAAAGAAGAAGAATGCTGCCCAGTCTCCATGTTCTTGGCCTCCAAAAGCTCCAAAAAGTAAAGTAGCTAGAAAGTATCCTGGTGAAAAGA
AGAAATATCGTAAAGAAACAATTGCAGCAAAGCGTCGAGAAAGAATGATTAGTCGTGGTGTTGATCTAGGGCAAATAAATCTGAAATTGGAGCACATGGTTCTGAATAGA
GAAGATATGTTTTCTTTCCAACCTATGCATCCTCGTGATTGTTCCCAGGTACATTGTTTGCTGCTGCATCTGCAGTACGACGGTTGGCGGCAATTTACCGCTTGCATAGT
GGATGCCAAGGTTCTGGTAAAAAAAGGTAAAAGTTGGTTTGTTACGGTAACTCGAACACAATATACAGGAATGCCATCATCAAGTGATCAAGTTCGCCTGGTGCAGCTAA
TAGGAGCAAGAGACGAGGATGATGACTTTTCTGTTGCTAAAGGCTCAAATATGAAATCACAGGGTGGCAACAGAAGCAGAGAAAAGAAGAATGCCGAAGTGAGCACTTCG
CACATATTGGAGCTCCATCAATCCGGGAGCATGAAGTCGAGGTCGAAGGGTTCTGCAGGTAAGAGTTCAAGTCAGAAGAAGACTGGCAACAAAAAGTACGCGGATAAACC
TGTCTCATTTGTATCATGTGGAGTGATGCAACCAGAAACAGTCGAGATAACAACGAGCGATGTTACAGATACAGATAAAAGAAAGGATATGATCACCACTGCATTGGAAA
CGATCCAGATATCAACTAGCGAAGTCAAGAACACAGATATTAATATAGATACGATCGGTGCATTTGAGGTGCACACCAAGGGTTTTGGGTCAAAAATGATGGCAAAAATG
GGATTTGTAGGAGGAGGATTGGGGAAAGATGGTCAAGGTATGGCCCGCCCCATTGAAGTGATCAAACGGCCTAAATCGCTCGGGTTAGGGGTCGAGTTCTCAGAGGCCTT
CAGTTCTGGTGGTAACCAGGAAAGTCGAGGATCTACTCGTGCCAAAACTGGAGCTTCTGGTAAAGCTAAAAAAATTGGGGCTTTTGAAGAGCATACCAAAGGATTTGGGT
CCAAGATGATGGCAAAGATGGGATTTGTTGAAGGCATGGGATTGGGGAAAGATTCTCAAGGAATGGTTAACCCTTTGCTTCCTGTTAGGCGGCCTAAAGCGCGAGGGTTA
GGGGCCAAAGCTTAGATTAGATATCATAATGATTGTTTCATCATATGCCTAGTGCATTTATTTTCATACACACTGTCAAAGTCTGTAACAAGGTCAATTTAGTCCTCGAA
AGTTCAAAAAATACTGCCCCTGAACAGCCATAAACTTTTTAAAAATAGCTTTTGAATAAACATCCGTGACAATTAACGTGAAAAGTTTGTATAGGTTAAATCTAAAGAGG
GAGAAAACTGTACCCCATGCATATCCA
Protein sequenceShow/hide protein sequence
MAGGRRRTIHAKASDGFRKNKRNTGRRRSDPSSSIRGNLFVDGGFLSDLQFQSSPPSSAREGNSRSKGQSGSKSVNLDRTKTASASGSKKSNGNAIGYEYPPTHHQEGLH
SESQGLHNDADCSLDNSQPFILLDSKNTQIVAYVDEKPSLKVDDLEFTYDYGTSFVLGDSSHRGLGFHDDDDELDRSPSTDDGSPPQVEEQEGLCIGSLSSEKEMGTDER
VECRIEDDMTNEVLAETSAPNKYTDDVCSEKNSGFLSIGGMRLYTQDVSYEESDDDGEASDGSSNYSEQLESSESSESDSSEEMSCTDSDIDDEVAEDYLEGIGGSENIL
SSKWLVKQELVESDDDSSSSCFDDTLEKLGGIALQEASKEFGMKKTPPSRKKSSIVSGDDWSSLALDDLLVKDSRTTSAKKKKNAAQSPCSWPPKAPKSKVARKYPGEKK
KYRKETIAAKRRERMISRGVDLGQINLKLEHMVLNREDMFSFQPMHPRDCSQVHCLLLHLQYDGWRQFTACIVDAKVLVKKGKSWFVTVTRTQYTGMPSSSDQVRLVQLI
GARDEDDDFSVAKGSNMKSQGGNRSREKKNAEVSTSHILELHQSGSMKSRSKGSAGKSSSQKKTGNKKYADKPVSFVSCGVMQPETVEITTSDVTDTDKRKDMITTALET
IQISTSEVKNTDINIDTIGAFEVHTKGFGSKMMAKMGFVGGGLGKDGQGMARPIEVIKRPKSLGLGVEFSEAFSSGGNQESRGSTRAKTGASGKAKKIGAFEEHTKGFGS
KMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKA