; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001966 (gene) of Snake gourd v1 genome

Gene IDTan0001966
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionHTH myb-type domain-containing protein
Genome locationLG02:4686282..4690619
RNA-Seq ExpressionTan0001966
SyntenyTan0001966
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR001965 - Zinc finger, PHD-type
IPR009057 - Homeobox-like domain superfamily
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017930 - Myb domain
IPR019786 - Zinc finger, PHD-type, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022135725.1 uncharacterized protein LOC111007617 [Momordica charantia]1.1e-19373.35Show/hide
Query:  MRTKSRRGGRSRPSRR-SATSSPSSMASPSVPDRDVSNAEDNTLNEASRKETHDVLDKAKVDCFQKDTCIRCDESGDLLVCTEIGCPIALHEHCMSCEPS
        MRTK+RRGGRS+PSRR SA S+PSS ASPSV DRD  NAEDN LNEAS  ET DV +  +VDCF+KDTCIRCDESG LLVCTEIGCPIALHE+CMSCEP+
Subjt:  MRTKSRRGGRSRPSRR-SATSSPSSMASPSVPDRDVSNAEDNTLNEASRKETHDVLDKAKVDCFQKDTCIRCDESGDLLVCTEIGCPIALHEHCMSCEPS

Query:  FDEEGRFYCPYCSYKRALVRVNELRRKAMVAKRALSDFIDTRMVSGGNSPQIGEAGKKKTSNVSTCGV----ANHESHLGNENSQDQDMLVEQNQSNEGE
        FDEEGRFYCPYCSYK+ALVR NELRRKAMVAK+ALSDFIDTRMVSGGNSPQIGEA KKK    STCGV     +HESHLG+E+  DQ + VE+NQSNEGE
Subjt:  FDEEGRFYCPYCSYKRALVRVNELRRKAMVAKRALSDFIDTRMVSGGNSPQIGEAGKKKTSNVSTCGV----ANHESHLGNENSQDQDMLVEQNQSNEGE

Query:  DHARTTGDVEPVSMVGVNGENHGGPIVSNVSNDILLATEVQPCEDSMNEEETHDADGDAFGTHQDETLEDK-DGKTMEEENLRSTDDIEDEEVVKDQGQP
        DH +T GD    S VGVN ENH  PIVSNVSN I    E+QP EDSMNEEET +A  DA GTH  E+LEDK DGK MEEE LR TDDI++E +VKD+GQP
Subjt:  DHARTTGDVEPVSMVGVNGENHGGPIVSNVSNDILLATEVQPCEDSMNEEETHDADGDAFGTHQDETLEDK-DGKTMEEENLRSTDDIEDEEVVKDQGQP

Query:  ETPSAHHDEEETAVDDVDHGAERESQGTGDGGEQIQLDKEGMLENVDPASGNNDLKNETFVKKKRFKTKANRKVDLQNYSSPRKSLRLRTPEPGNRSPHV
        ET +AH+DEEETAVD VDHG ERE Q  GDGGEQIQLDKEGMLEN++  SGN DLKNET VKKKRFKTKANR++DLQN+SSPRKS R+RTPEP       
Subjt:  ETPSAHHDEEETAVDDVDHGAERESQGTGDGGEQIQLDKEGMLENVDPASGNNDLKNETFVKKKRFKTKANRKVDLQNYSSPRKSLRLRTPEPGNRSPHV

Query:  QTPEPGKKSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNLPWRKILEFGRHVFDDTRT
                         GKNH  KIEKVS  RNL+L+P SPNQFK LD +GGKRKR+ WSTEEEEMLKEGVQ+FS+TANKNLPWRKILEFGRHVFDDTRT
Subjt:  QTPEPGKKSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNLPWRKILEFGRHVFDDTRT

Query:  PVDLKDKWRNILAK
        PVDLKDKWRNI+AK
Subjt:  PVDLKDKWRNILAK

XP_022928510.1 uncharacterized protein LOC111435297 [Cucurbita moschata]9.2e-19371.43Show/hide
Query:  MRTKSRRGGRSRPSRRSATSSPSSMASPSVPDRDVSNAEDNT------------------LNEASRKETHDVLDKAKVDCFQKDTCIRCDESGDLLVCTE
        MRTKSRRGGRSRPSRRS+ S PS MASPSVPDR++ NAEDNT                  LNEASRKETHDVLDK   DCFQKDTCIRCDE G LLVCTE
Subjt:  MRTKSRRGGRSRPSRRSATSSPSSMASPSVPDRDVSNAEDNT------------------LNEASRKETHDVLDKAKVDCFQKDTCIRCDESGDLLVCTE

Query:  IGCPIALHEHCMSCEPSFDEEGRFYCPYCSYKRALVRVNELRRKAMVAKRALSDFIDTRMVSGGNSPQIGEAGKKKTSNVSTCGV----ANHESHLGNEN
        IGCPIALHE+CMSC+PSFDEEGRFYCPYCSYKRALV VNELRRKAM AKR LSDFID RMVSG NSPQ+GEAGKKK ++VSTCGV     NHES      
Subjt:  IGCPIALHEHCMSCEPSFDEEGRFYCPYCSYKRALVRVNELRRKAMVAKRALSDFIDTRMVSGGNSPQIGEAGKKKTSNVSTCGV----ANHESHLGNEN

Query:  SQDQ-DMLVEQNQSNEGEDHARTTGDVEPVSMVGVNGENHGGPIVSNVSNDILLATEVQPCEDSMNEEETHDADGDAFGTHQDETLEDK-DGKTMEEENL
        S+DQ DM V QNQSN+GEDHART GDV+  +++GVN +NH GP VS+VSN I    EVQPCED+MNE+          GTHQDE+LEDK DGKTMEEENL
Subjt:  SQDQ-DMLVEQNQSNEGEDHARTTGDVEPVSMVGVNGENHGGPIVSNVSNDILLATEVQPCEDSMNEEETHDADGDAFGTHQDETLEDK-DGKTMEEENL

Query:  RSTDDIEDEEVVKDQGQPETPSAHHDEEETAVDDVDHGAERESQGTGDGGEQIQLDKEGMLENVDPASGNNDLKNETFVKKKRFKTKANRKVDLQNYSSP
        R TDDIED+E+VKDQGQPE     +DEEE AV+DVD GA R SQ TGDG E+IQ       EN+ PASGNNDLKNETFVKKKRFK KANRKVD ++++SP
Subjt:  RSTDDIEDEEVVKDQGQPETPSAHHDEEETAVDDVDHGAERESQGTGDGGEQIQLDKEGMLENVDPASGNNDLKNETFVKKKRFKTKANRKVDLQNYSSP

Query:  RKSLRLRTPEPGNRSPHVQTPEPGKKSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNL
        R SLR +TP+ G +SPH+QTP+  K SPHIQTPK GK+ + K+EKVS S  L++KP+SPNQFKNL FHG KRKRMRWSTEEE+MLKEGVQRFSS+ NKNL
Subjt:  RKSLRLRTPEPGNRSPHVQTPEPGKKSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNL

Query:  PWRKILEFGRHVFDDTRTPVDLKDKWRNILAK
        PWRKILEFGRH+FDDTRTPVDLKDKWRNI+ K
Subjt:  PWRKILEFGRHVFDDTRTPVDLKDKWRNILAK

XP_022989389.1 uncharacterized protein LOC111486458 [Cucurbita maxima]1.2e-19271.37Show/hide
Query:  MRTKSRRGGRSRPSRRSATSSPSSMASPSVPDRDVSNAEDNT------------------LNEASRKETHDVLDKAKVDCFQKDTCIRCDESGDLLVCTE
        MRTKSRRGGRSRPSRRS+ S PS MASPSVPDRD+ NAEDNT                  LNEASRKETHDV DK   DCFQKDTCIRCDESG LLVCTE
Subjt:  MRTKSRRGGRSRPSRRSATSSPSSMASPSVPDRDVSNAEDNT------------------LNEASRKETHDVLDKAKVDCFQKDTCIRCDESGDLLVCTE

Query:  IGCPIALHEHCMSCEPSFDEEGRFYCPYCSYKRALVRVNELRRKAMVAKRALSDFIDTRMVSGGNSPQIGEAGKKKTSNVSTCGV----ANHESHLGNEN
        IGCPIALHE+CMSC+PSFDEEGRFYCPYCSYKRALV VNELRRKAM AKR LSDFID RMVSG NSPQ+GEAGKKK ++VSTCGV     NHES    E 
Subjt:  IGCPIALHEHCMSCEPSFDEEGRFYCPYCSYKRALVRVNELRRKAMVAKRALSDFIDTRMVSGGNSPQIGEAGKKKTSNVSTCGV----ANHESHLGNEN

Query:  SQDQDMLVEQNQSNEGEDHARTTGDVEPVSMVGVNGENHGGPIVSNVSNDILLATEVQPCEDSMNEEETHDADGDAFGTHQDETLEDK-DGKTMEEENLR
            DM V QNQSN+GEDHART GDV+  +++GVN +NH GP VS VSN I    EVQPCED+MNEE          GTHQDE+LEDK DGKTMEE+NLR
Subjt:  SQDQDMLVEQNQSNEGEDHARTTGDVEPVSMVGVNGENHGGPIVSNVSNDILLATEVQPCEDSMNEEETHDADGDAFGTHQDETLEDK-DGKTMEEENLR

Query:  STDDIEDEEVVKDQGQPETPSAHHDEEETAVDDVDHGAERESQGTGDGGEQIQLDKEGMLENVDPASGNNDLKNETFVKKKRFKTKANRKVDLQNYSSPR
         TDDIED+E+VK+QG+P+     +DEEE AVDDVD GA R SQ TGDG EQIQ       EN+ PASGNNDLKNETFVKKKRFK KANRKVD ++++SPR
Subjt:  STDDIEDEEVVKDQGQPETPSAHHDEEETAVDDVDHGAERESQGTGDGGEQIQLDKEGMLENVDPASGNNDLKNETFVKKKRFKTKANRKVDLQNYSSPR

Query:  KSLRLRTPEPGNRSPHVQTPEPGKKSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNLP
         SLR +TP+ G +SPH+QTP+  K SPHIQTPK GK+ + K+EKVS S  L++KPASPNQFKNL FHG KRKRMRWSTEEE+MLKEGVQRFSS+ NKNLP
Subjt:  KSLRLRTPEPGNRSPHVQTPEPGKKSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNLP

Query:  WRKILEFGRHVFDDTRTPVDLKDKWRNILAK
        WRKILEFGRH+FDDTRTPVDLKDKWRNI+ K
Subjt:  WRKILEFGRHVFDDTRTPVDLKDKWRNILAK

XP_023529671.1 uncharacterized protein LOC111792440 [Cucurbita pepo subsp. pepo]4.9e-19471.99Show/hide
Query:  MRTKSRRGGRSRPSRRSATSSPSSMASPSVPDRDV------------------SNAEDNTLNEASRKETHDVLDKAKVDCFQKDTCIRCDESGDLLVCTE
        MRTKSRRGGRSRPSRRS+ S PS MASPSVPDRD+                   NAEDN LNEASRKETHDV DK   DCFQKD CIRCDESG LLVCTE
Subjt:  MRTKSRRGGRSRPSRRSATSSPSSMASPSVPDRDV------------------SNAEDNTLNEASRKETHDVLDKAKVDCFQKDTCIRCDESGDLLVCTE

Query:  IGCPIALHEHCMSCEPSFDEEGRFYCPYCSYKRALVRVNELRRKAMVAKRALSDFIDTRMVSGGNSPQIGEAGKKKTSNVSTCGV----ANHESHLGNEN
        IGCPIALHE+CMSC+PSFDEEGRFYCPYCSYKRALV VNELRRKAM AKR LSDFID RMVSG NSPQ+GEAGKKK + VSTCGV     NHES      
Subjt:  IGCPIALHEHCMSCEPSFDEEGRFYCPYCSYKRALVRVNELRRKAMVAKRALSDFIDTRMVSGGNSPQIGEAGKKKTSNVSTCGV----ANHESHLGNEN

Query:  SQDQ-DMLVEQNQSNEGEDHARTTGDVEPVSMVGVNGENHGGPIVSNVSNDILLATEVQPCEDSMNEEETHDADGDAFGTHQDETLEDK-DGKTMEEENL
        S+DQ DM V QN+SN+GEDHART GDV+  +++GVN +NH GP VS+VSN I    EVQPCED+MNEE          GTHQDE+LEDK DGKTMEEEN 
Subjt:  SQDQ-DMLVEQNQSNEGEDHARTTGDVEPVSMVGVNGENHGGPIVSNVSNDILLATEVQPCEDSMNEEETHDADGDAFGTHQDETLEDK-DGKTMEEENL

Query:  RSTDDIEDEEVVKDQGQPETPSAHHDEEETAVDDVDHGAERESQGTGDGGEQIQLDKEGMLENVDPASGNNDLKNETFVKKKRFKTKANRKVDLQNYSSP
        R TDDIED+E+VKDQGQP+ P   +DEEETAVDDVD GA R SQ TGDG EQIQ       EN+ PASGNNDLKNETFVKKKRFK KANRKVD ++++SP
Subjt:  RSTDDIEDEEVVKDQGQPETPSAHHDEEETAVDDVDHGAERESQGTGDGGEQIQLDKEGMLENVDPASGNNDLKNETFVKKKRFKTKANRKVDLQNYSSP

Query:  RKSLRLRTPEPGNRSPHVQTPEPGKKSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNL
        R SLR +TP+ G +SPH+QTP+  K SPHIQTPK GK+ + K+EKVS S  L+LKPASPNQFKNL FHG KRKRMRWSTEEE+MLKEGVQRFSS+ NKNL
Subjt:  RKSLRLRTPEPGNRSPHVQTPEPGKKSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNL

Query:  PWRKILEFGRHVFDDTRTPVDLKDKWRNILAK
        PWRKILEFGRH+FDDTRTPVDLKDKWRNI+ K
Subjt:  PWRKILEFGRHVFDDTRTPVDLKDKWRNILAK

XP_038888454.1 uncharacterized protein LOC120078293 [Benincasa hispida]2.8e-19772Show/hide
Query:  MRTKSRRGGRSRPSRRSATSSPSSMASPSVPDRDVSNAEDNTLNEASRKETHDVLDKAKVDCFQKDTCIRCDESGDLLVCTEIGCPIALHEHCMSCEPSF
        M+ K+RRGGRS+ SRRSA S+PS MASP +PD+DV NAEDNT++EAS KET+DVLD  K+DCFQKDTC RCD+SGDLLVCTEIGCPIALH+ CMSCEPSF
Subjt:  MRTKSRRGGRSRPSRRSATSSPSSMASPSVPDRDVSNAEDNTLNEASRKETHDVLDKAKVDCFQKDTCIRCDESGDLLVCTEIGCPIALHEHCMSCEPSF

Query:  DEEGRFYCPYCSYKRALVRVNELRRKAMVAKRALSDFIDTRMVSGGNSPQIGEAGKKKTSNVSTCGV----ANHESHLGNENSQDQDMLVEQNQSNEGED
        DE+GRFYCPYCSYKRALVRVNELRRK MVAKRALSDFIDTRMV GG SPQI E GKKK ++VSTCGV     NH SHLGNE+S+DQD+ VEQNQ NEGED
Subjt:  DEEGRFYCPYCSYKRALVRVNELRRKAMVAKRALSDFIDTRMVSGGNSPQIGEAGKKKTSNVSTCGV----ANHESHLGNENSQDQDMLVEQNQSNEGED

Query:  HARTTGDVEPVSMVGVNGENHGGPIVSNVSNDILLATEVQPCEDSMNEEETHDADGDAFGTHQDETLEDK-DGKTMEEENLRSTDDIEDEEVVKDQGQPE
        HART GD++P SMVGVN ENH GPIV NVS+       +QPCEDSMN EETH+   D  GTHQ E+LEDK DG+TM++ENLRSTDDI+D+ + KDQGQ E
Subjt:  HARTTGDVEPVSMVGVNGENHGGPIVSNVSNDILLATEVQPCEDSMNEEETHDADGDAFGTHQDETLEDK-DGKTMEEENLRSTDDIEDEEVVKDQGQPE

Query:  TPSAHHDEEETAVDDVDHGAERESQGTGDGGEQIQLDKEGMLENVDPASGNNDLKNETFVKKKRFKTKANRKVDLQNYSSPRKSLRLRTPEPGNRSPHVQ
        TPSAHHDEEETA         RE Q   DGGEQ Q D E MLE++  ASGNNDLK+ET VKK+RFKTKANR+ D QN + PRKSLRL+TP+ G +SPH+Q
Subjt:  TPSAHHDEEETAVDDVDHGAERESQGTGDGGEQIQLDKEGMLENVDPASGNNDLKNETFVKKKRFKTKANRKVDLQNYSSPRKSLRLRTPEPGNRSPHVQ

Query:  TPEPGKKSPHIQTPKTGK------------NHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNLPWRKILE
        TP+PGKKSPHIQTP+  K            +HA K EKV  SR+L+ +PAS NQFKNLDFHGGKRKRM WS EEEEMLKEGVQ+FSSTANKNLPWRKILE
Subjt:  TPEPGKKSPHIQTPKTGK------------NHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNLPWRKILE

Query:  FGRHVFDDTRTPVDLKDKWRNILAK
        FGRH+FD TRTPVDLKDKWRN+ AK
Subjt:  FGRHVFDDTRTPVDLKDKWRNILAK

TrEMBL top hitse value%identityAlignment
A0A0A0K518 HTH myb-type domain-containing protein5.6e-18869.66Show/hide
Query:  MRTKSRRGGRSRPSRRSATSSPSSMASPSVPDRDVSNAEDNTLNEASRKETHDVLDKAKVDCFQKDTCIRCDESGDLLVCTEIGCPIALHEHCMSCEPSF
        MR K+RRGGR +PSRRS TS+PS MAS  +PD+DV N EDNTL++AS KET DVLD  K+DCFQKDTC RCDESGDLLVCTE GCPIALHE CMSCEPSF
Subjt:  MRTKSRRGGRSRPSRRSATSSPSSMASPSVPDRDVSNAEDNTLNEASRKETHDVLDKAKVDCFQKDTCIRCDESGDLLVCTEIGCPIALHEHCMSCEPSF

Query:  DEEGRFYCPYCSYKRALVRVNELRRKAMVAKRALSDFIDTRMVSGGNSPQIGEAGKKKTSNVSTCG----VANHESHLGNENSQDQDMLVEQNQSNEGED
        DE+GRFYCPYCSYKRAL+RVNELRRK MVAKRALSDFIDTRMV G NSP++GEAGKKK+ +VSTCG    + NH SHL NE+S+D D+ VEQNQSNEGED
Subjt:  DEEGRFYCPYCSYKRALVRVNELRRKAMVAKRALSDFIDTRMVSGGNSPQIGEAGKKKTSNVSTCG----VANHESHLGNENSQDQDMLVEQNQSNEGED

Query:  HARTTGDVEPVSMVGVNGENHGGPIVSNVSNDILLATEVQPCEDSMNEEETHDADGDAFGTHQDETLEDK-DGKTMEEENLRSTDDIEDEEVVKDQGQPE
         AR  GDVEP SMVGVN E H GPIVSNVSN    A  VQPCED M +EETH+A  +  GTHQ E+LEDK DG TM++E LR  DDI+D+ +  D GQ E
Subjt:  HARTTGDVEPVSMVGVNGENHGGPIVSNVSNDILLATEVQPCEDSMNEEETHDADGDAFGTHQDETLEDK-DGKTMEEENLRSTDDIEDEEVVKDQGQPE

Query:  TPSAHHDEEETAVDDVDHGAERESQGTGDGGEQIQLDKEGMLENVDPASGNNDLKNETFVKKKRFKTKANRKVDLQNYSSPRKSLRL-----------RT
        TP A+H  E TA         +E Q    G EQIQ D E MLEN+ PASGNNDLKN+T VKK+RFKTKANR+ DLQN +SPRKSLRL           RT
Subjt:  TPSAHHDEEETAVDDVDHGAERESQGTGDGGEQIQLDKEGMLENVDPASGNNDLKNETFVKKKRFKTKANRKVDLQNYSSPRKSLRL-----------RT

Query:  PEPGNRSPHVQTPEPGKKSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNLPWRKILEF
        PEP  +SPH+QTPEP K SP +QTPK  K++ IKIEKVS SRNL+ +PAS NQ K+LDFH GKRKRMRWS EEEEMLKEGV++FSST NKNLPWRKILEF
Subjt:  PEPGNRSPHVQTPEPGKKSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNLPWRKILEF

Query:  GRHVFDDTRTPVDLKDKWRNILAK
        GRH+FDDTRTPVDLKDKWR++L +
Subjt:  GRHVFDDTRTPVDLKDKWRNILAK

A0A1S3CHE7 protein CHROMATIN REMODELING 4-like isoform X17.1e-19171.6Show/hide
Query:  MRTKSRRGGRSRPSRRSATS-SPSSMASPSVPDRDVSNAEDNTLNEASRKETHDVLDKAKVDCFQKDTCIRCDESGDLLVCTEIGCPIALHEHCMSCEPS
        MR K+RRGGR +PSR SATS +PS MASP +PD+DV N EDN L EAS KET+DVLD  K+DCFQKD+C RCD+SGDLLVCTE GCPIALHE CMSCEPS
Subjt:  MRTKSRRGGRSRPSRRSATS-SPSSMASPSVPDRDVSNAEDNTLNEASRKETHDVLDKAKVDCFQKDTCIRCDESGDLLVCTEIGCPIALHEHCMSCEPS

Query:  FDEEGRFYCPYCSYKRALVRVNELRRKAMVAKRALSDFIDTRMVSGGNSPQIGEAGKKKTSNVSTCGV----ANHESHLGNENSQDQDMLVEQNQSNEGE
        FDE+GRFYCPYCSYKRAL+RVNELRRK MVAKRALSDFIDTRMV GGNSP++GEAGKKK+ ++STCGV     N+ SHL NE+S+DQD+ VEQNQSNEGE
Subjt:  FDEEGRFYCPYCSYKRALVRVNELRRKAMVAKRALSDFIDTRMVSGGNSPQIGEAGKKKTSNVSTCGV----ANHESHLGNENSQDQDMLVEQNQSNEGE

Query:  DHARTTGDVEPVSMVGVNGENHGGPIVSNVSNDILLATEVQPCEDSMNEEETHDADGDAFGTHQDETLEDK-DGKTMEEENLRSTDDIEDEEVVKDQGQP
        + AR  GDV+P SMVGVN E H GPIVSNVSND   A  VQPCED M +EETH+A  +  GTHQ E+LEDK DGKTM+EE LR  DDI+D+ + +DQGQ 
Subjt:  DHARTTGDVEPVSMVGVNGENHGGPIVSNVSNDILLATEVQPCEDSMNEEETHDADGDAFGTHQDETLEDK-DGKTMEEENLRSTDDIEDEEVVKDQGQP

Query:  ETPSAHHDEEETAVDDVDHGAERESQGTGDGGEQIQLDKEGMLENVDPASGNNDLKNETFVKKKRFKTKANRKVDLQNYSSPRKSLRLRTPEPGNRSPHV
        E P A+HD EETA         +E Q   DG EQIQ D E MLEN+ PAS +NDLKNET  KK+RFKTKANR+ DLQN +SPRKSLRL+TPE   +SP +
Subjt:  ETPSAHHDEEETAVDDVDHGAERESQGTGDGGEQIQLDKEGMLENVDPASGNNDLKNETFVKKKRFKTKANRKVDLQNYSSPRKSLRLRTPEPGNRSPHV

Query:  QTPEPGKKSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNLPWRKILEFGRHVFDDTRT
        +TPEP   SPHIQTPK  K+HAIKIEKVS SRNL+ + AS N FK+LDFHGGKRKRMRWS EEEEML+EGVQ+FSSTANKNLPWRKILEFGRH+FDDTRT
Subjt:  QTPEPGKKSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNLPWRKILEFGRHVFDDTRT

Query:  PVDLKDKWRNILAK
        PVDLKDKWRN+L +
Subjt:  PVDLKDKWRNILAK

A0A6J1C1V0 uncharacterized protein LOC1110076175.2e-19473.35Show/hide
Query:  MRTKSRRGGRSRPSRR-SATSSPSSMASPSVPDRDVSNAEDNTLNEASRKETHDVLDKAKVDCFQKDTCIRCDESGDLLVCTEIGCPIALHEHCMSCEPS
        MRTK+RRGGRS+PSRR SA S+PSS ASPSV DRD  NAEDN LNEAS  ET DV +  +VDCF+KDTCIRCDESG LLVCTEIGCPIALHE+CMSCEP+
Subjt:  MRTKSRRGGRSRPSRR-SATSSPSSMASPSVPDRDVSNAEDNTLNEASRKETHDVLDKAKVDCFQKDTCIRCDESGDLLVCTEIGCPIALHEHCMSCEPS

Query:  FDEEGRFYCPYCSYKRALVRVNELRRKAMVAKRALSDFIDTRMVSGGNSPQIGEAGKKKTSNVSTCGV----ANHESHLGNENSQDQDMLVEQNQSNEGE
        FDEEGRFYCPYCSYK+ALVR NELRRKAMVAK+ALSDFIDTRMVSGGNSPQIGEA KKK    STCGV     +HESHLG+E+  DQ + VE+NQSNEGE
Subjt:  FDEEGRFYCPYCSYKRALVRVNELRRKAMVAKRALSDFIDTRMVSGGNSPQIGEAGKKKTSNVSTCGV----ANHESHLGNENSQDQDMLVEQNQSNEGE

Query:  DHARTTGDVEPVSMVGVNGENHGGPIVSNVSNDILLATEVQPCEDSMNEEETHDADGDAFGTHQDETLEDK-DGKTMEEENLRSTDDIEDEEVVKDQGQP
        DH +T GD    S VGVN ENH  PIVSNVSN I    E+QP EDSMNEEET +A  DA GTH  E+LEDK DGK MEEE LR TDDI++E +VKD+GQP
Subjt:  DHARTTGDVEPVSMVGVNGENHGGPIVSNVSNDILLATEVQPCEDSMNEEETHDADGDAFGTHQDETLEDK-DGKTMEEENLRSTDDIEDEEVVKDQGQP

Query:  ETPSAHHDEEETAVDDVDHGAERESQGTGDGGEQIQLDKEGMLENVDPASGNNDLKNETFVKKKRFKTKANRKVDLQNYSSPRKSLRLRTPEPGNRSPHV
        ET +AH+DEEETAVD VDHG ERE Q  GDGGEQIQLDKEGMLEN++  SGN DLKNET VKKKRFKTKANR++DLQN+SSPRKS R+RTPEP       
Subjt:  ETPSAHHDEEETAVDDVDHGAERESQGTGDGGEQIQLDKEGMLENVDPASGNNDLKNETFVKKKRFKTKANRKVDLQNYSSPRKSLRLRTPEPGNRSPHV

Query:  QTPEPGKKSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNLPWRKILEFGRHVFDDTRT
                         GKNH  KIEKVS  RNL+L+P SPNQFK LD +GGKRKR+ WSTEEEEMLKEGVQ+FS+TANKNLPWRKILEFGRHVFDDTRT
Subjt:  QTPEPGKKSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNLPWRKILEFGRHVFDDTRT

Query:  PVDLKDKWRNILAK
        PVDLKDKWRNI+AK
Subjt:  PVDLKDKWRNILAK

A0A6J1ERV5 uncharacterized protein LOC1114352974.4e-19371.43Show/hide
Query:  MRTKSRRGGRSRPSRRSATSSPSSMASPSVPDRDVSNAEDNT------------------LNEASRKETHDVLDKAKVDCFQKDTCIRCDESGDLLVCTE
        MRTKSRRGGRSRPSRRS+ S PS MASPSVPDR++ NAEDNT                  LNEASRKETHDVLDK   DCFQKDTCIRCDE G LLVCTE
Subjt:  MRTKSRRGGRSRPSRRSATSSPSSMASPSVPDRDVSNAEDNT------------------LNEASRKETHDVLDKAKVDCFQKDTCIRCDESGDLLVCTE

Query:  IGCPIALHEHCMSCEPSFDEEGRFYCPYCSYKRALVRVNELRRKAMVAKRALSDFIDTRMVSGGNSPQIGEAGKKKTSNVSTCGV----ANHESHLGNEN
        IGCPIALHE+CMSC+PSFDEEGRFYCPYCSYKRALV VNELRRKAM AKR LSDFID RMVSG NSPQ+GEAGKKK ++VSTCGV     NHES      
Subjt:  IGCPIALHEHCMSCEPSFDEEGRFYCPYCSYKRALVRVNELRRKAMVAKRALSDFIDTRMVSGGNSPQIGEAGKKKTSNVSTCGV----ANHESHLGNEN

Query:  SQDQ-DMLVEQNQSNEGEDHARTTGDVEPVSMVGVNGENHGGPIVSNVSNDILLATEVQPCEDSMNEEETHDADGDAFGTHQDETLEDK-DGKTMEEENL
        S+DQ DM V QNQSN+GEDHART GDV+  +++GVN +NH GP VS+VSN I    EVQPCED+MNE+          GTHQDE+LEDK DGKTMEEENL
Subjt:  SQDQ-DMLVEQNQSNEGEDHARTTGDVEPVSMVGVNGENHGGPIVSNVSNDILLATEVQPCEDSMNEEETHDADGDAFGTHQDETLEDK-DGKTMEEENL

Query:  RSTDDIEDEEVVKDQGQPETPSAHHDEEETAVDDVDHGAERESQGTGDGGEQIQLDKEGMLENVDPASGNNDLKNETFVKKKRFKTKANRKVDLQNYSSP
        R TDDIED+E+VKDQGQPE     +DEEE AV+DVD GA R SQ TGDG E+IQ       EN+ PASGNNDLKNETFVKKKRFK KANRKVD ++++SP
Subjt:  RSTDDIEDEEVVKDQGQPETPSAHHDEEETAVDDVDHGAERESQGTGDGGEQIQLDKEGMLENVDPASGNNDLKNETFVKKKRFKTKANRKVDLQNYSSP

Query:  RKSLRLRTPEPGNRSPHVQTPEPGKKSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNL
        R SLR +TP+ G +SPH+QTP+  K SPHIQTPK GK+ + K+EKVS S  L++KP+SPNQFKNL FHG KRKRMRWSTEEE+MLKEGVQRFSS+ NKNL
Subjt:  RKSLRLRTPEPGNRSPHVQTPEPGKKSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNL

Query:  PWRKILEFGRHVFDDTRTPVDLKDKWRNILAK
        PWRKILEFGRH+FDDTRTPVDLKDKWRNI+ K
Subjt:  PWRKILEFGRHVFDDTRTPVDLKDKWRNILAK

A0A6J1JP70 uncharacterized protein LOC1114864585.8e-19371.37Show/hide
Query:  MRTKSRRGGRSRPSRRSATSSPSSMASPSVPDRDVSNAEDNT------------------LNEASRKETHDVLDKAKVDCFQKDTCIRCDESGDLLVCTE
        MRTKSRRGGRSRPSRRS+ S PS MASPSVPDRD+ NAEDNT                  LNEASRKETHDV DK   DCFQKDTCIRCDESG LLVCTE
Subjt:  MRTKSRRGGRSRPSRRSATSSPSSMASPSVPDRDVSNAEDNT------------------LNEASRKETHDVLDKAKVDCFQKDTCIRCDESGDLLVCTE

Query:  IGCPIALHEHCMSCEPSFDEEGRFYCPYCSYKRALVRVNELRRKAMVAKRALSDFIDTRMVSGGNSPQIGEAGKKKTSNVSTCGV----ANHESHLGNEN
        IGCPIALHE+CMSC+PSFDEEGRFYCPYCSYKRALV VNELRRKAM AKR LSDFID RMVSG NSPQ+GEAGKKK ++VSTCGV     NHES    E 
Subjt:  IGCPIALHEHCMSCEPSFDEEGRFYCPYCSYKRALVRVNELRRKAMVAKRALSDFIDTRMVSGGNSPQIGEAGKKKTSNVSTCGV----ANHESHLGNEN

Query:  SQDQDMLVEQNQSNEGEDHARTTGDVEPVSMVGVNGENHGGPIVSNVSNDILLATEVQPCEDSMNEEETHDADGDAFGTHQDETLEDK-DGKTMEEENLR
            DM V QNQSN+GEDHART GDV+  +++GVN +NH GP VS VSN I    EVQPCED+MNEE          GTHQDE+LEDK DGKTMEE+NLR
Subjt:  SQDQDMLVEQNQSNEGEDHARTTGDVEPVSMVGVNGENHGGPIVSNVSNDILLATEVQPCEDSMNEEETHDADGDAFGTHQDETLEDK-DGKTMEEENLR

Query:  STDDIEDEEVVKDQGQPETPSAHHDEEETAVDDVDHGAERESQGTGDGGEQIQLDKEGMLENVDPASGNNDLKNETFVKKKRFKTKANRKVDLQNYSSPR
         TDDIED+E+VK+QG+P+     +DEEE AVDDVD GA R SQ TGDG EQIQ       EN+ PASGNNDLKNETFVKKKRFK KANRKVD ++++SPR
Subjt:  STDDIEDEEVVKDQGQPETPSAHHDEEETAVDDVDHGAERESQGTGDGGEQIQLDKEGMLENVDPASGNNDLKNETFVKKKRFKTKANRKVDLQNYSSPR

Query:  KSLRLRTPEPGNRSPHVQTPEPGKKSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNLP
         SLR +TP+ G +SPH+QTP+  K SPHIQTPK GK+ + K+EKVS S  L++KPASPNQFKNL FHG KRKRMRWSTEEE+MLKEGVQRFSS+ NKNLP
Subjt:  KSLRLRTPEPGNRSPHVQTPEPGKKSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNLP

Query:  WRKILEFGRHVFDDTRTPVDLKDKWRNILAK
        WRKILEFGRH+FDDTRTPVDLKDKWRNI+ K
Subjt:  WRKILEFGRHVFDDTRTPVDLKDKWRNILAK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G01150.1 Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain8.7e-1659.65Show/hide
Query:  KRMRWSTEEEEMLKEGVQRFSSTANKNLPWRKILEFGRHVFDDTRTPVDLKDKWRNI
        KR+ W+  EE+ML+EGV++FS T NKN+PW+KILE G+ +F  TR   DLKDKWRN+
Subjt:  KRMRWSTEEEEMLKEGVQRFSSTANKNLPWRKILEFGRHVFDDTRTPVDLKDKWRNI

AT1G14770.1 RING/FYVE/PHD zinc finger superfamily protein2.7e-0934.29Show/hide
Query:  KDTCIRCDESGDLLVCTEIGCPIALHEHCMSCEPSFDEEGRFYCPYCSYKRALVRVNELRRKAMVAKRAL
        ++ C +C++ G LL+C+   C   +H+ C++C  + DE G F CP C Y R  +  +E ++    AKR L
Subjt:  KDTCIRCDESGDLLVCTEIGCPIALHEHCMSCEPSFDEEGRFYCPYCSYKRALVRVNELRRKAMVAKRAL

AT1G14770.2 RING/FYVE/PHD zinc finger superfamily protein2.7e-0934.29Show/hide
Query:  KDTCIRCDESGDLLVCTEIGCPIALHEHCMSCEPSFDEEGRFYCPYCSYKRALVRVNELRRKAMVAKRAL
        ++ C +C++ G LL+C+   C   +H+ C++C  + DE G F CP C Y R  +  +E ++    AKR L
Subjt:  KDTCIRCDESGDLLVCTEIGCPIALHEHCMSCEPSFDEEGRFYCPYCSYKRALVRVNELRRKAMVAKRAL

AT4G12670.1 Homeodomain-like superfamily protein4.0e-1342.31Show/hide
Query:  KSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNLPWRKILEFGRHVFDDTRTPVDLKDK
        K P  Q  K G  H   +EK   + N   + A+            KRKR  W+  E EML+ GVQ+F     +N+PWRKIL+FGR VF D R P DLKDK
Subjt:  KSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNLPWRKILEFGRHVFDDTRTPVDLKDK

Query:  WRNI
        W+ +
Subjt:  WRNI

AT5G03780.1 TRF-like 105.3e-1327.73Show/hide
Query:  DKDGKTMEEENLRSTDDIEDEE----VVKD---QGQPETPSAHHDEEE------TAVDDVDHGAERESQGTGDGGEQIQLDKEGMLENVDPASGNNDLKN
        DK+ K+ +E+   S D+I ++E    +V D   QG+ +  S+  D ++        +D+V   +E+ +       E  +  K+     ++  +G     +
Subjt:  DKDGKTMEEENLRSTDDIEDEE----VVKD---QGQPETPSAHHDEEE------TAVDDVDHGAERESQGTGDGGEQIQLDKEGMLENVDPASGNNDLKN

Query:  ETFVKKKRFKTKANRKVDLQNYSSPRKSLRLRTPEPGNR-SPHVQTPEPGKKSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKR
             ++ F  K   +V  QN    R+ L++   +  ++ S + +  E   +        T  +  ++ ++ +     + K      F  +D    +R+R
Subjt:  ETFVKKKRFKTKANRKVDLQNYSSPRKSLRLRTPEPGNR-SPHVQTPEPGKKSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKNLDFHGGKRKR

Query:  MRWSTEEEEMLKEGVQRFSSTANKNLPWRKILEFGRHVFDDTRTPVDLKDKWRNIL
        + W+ EEEEMLK GV++F++ ANKN+PWRKILE G  VF +TRTP DLKDKWR+++
Subjt:  MRWSTEEEEMLKEGVQRFSSTANKNLPWRKILEFGRHVFDDTRTPVDLKDKWRNIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGACCAAGAGTCGCCGCGGCGGAAGGTCCAGGCCTTCCCGGCGTTCCGCCACTTCTTCCCCGTCGTCTATGGCGTCTCCCTCCGTGCCCGACCGCGATGTGTCCAA
TGCAGAAGATAACACTCTGAATGAGGCATCAAGGAAGGAAACTCACGATGTGTTGGATAAGGCAAAGGTAGATTGCTTTCAGAAAGACACTTGTATTAGATGCGATGAGA
GTGGTGACTTGTTGGTTTGTACTGAAATTGGATGTCCAATTGCTCTTCATGAGCACTGCATGTCCTGCGAACCTTCGTTCGATGAAGAGGGGCGCTTCTATTGTCCTTAT
TGTTCGTATAAAAGAGCTTTGGTGCGTGTGAATGAGTTGAGGAGAAAGGCTATGGTGGCAAAGAGAGCGTTGTCTGATTTTATTGATACCAGGATGGTTAGTGGTGGCAA
TTCACCGCAGATAGGAGAGGCAGGTAAGAAGAAAACATCTAATGTTTCAACTTGTGGAGTGGCTAATCATGAGAGTCATCTGGGCAATGAGAATTCGCAAGATCAGGATA
TGCTGGTCGAGCAAAATCAGAGTAATGAAGGGGAGGACCATGCGAGAACAACAGGAGATGTTGAACCAGTGTCAATGGTGGGGGTTAATGGCGAAAATCACGGTGGTCCT
ATTGTGTCTAATGTTAGCAACGACATACTTTTGGCTACTGAAGTTCAGCCATGTGAGGATTCTATGAACGAAGAGGAAACTCATGATGCTGACGGTGACGCATTTGGAAC
ACATCAGGATGAAACTTTAGAAGATAAAGATGGGAAAACGATGGAAGAAGAAAACTTAAGGTCGACAGATGACATTGAGGATGAAGAGGTTGTGAAGGATCAAGGCCAAC
CAGAGACTCCTAGTGCACATCATGATGAGGAAGAGACTGCTGTAGATGATGTTGATCATGGAGCAGAACGAGAATCTCAAGGCACAGGTGATGGTGGAGAACAAATCCAG
CTCGACAAAGAAGGGATGCTTGAAAATGTTGATCCTGCGTCTGGAAATAATGATTTAAAGAATGAAACATTTGTTAAGAAAAAGCGCTTTAAAACTAAAGCTAATAGGAA
AGTAGACCTGCAGAATTATAGTTCTCCTAGAAAGTCATTGCGTCTACGAACTCCAGAACCTGGGAACAGGTCACCACATGTACAAACTCCAGAACCTGGGAAAAAGTCTC
CTCATATACAAACGCCAAAAACTGGGAAAAATCATGCAATCAAAATTGAGAAAGTTTCTGCATCCAGAAATTTAAGGCTGAAGCCAGCATCTCCTAATCAATTCAAAAAC
TTGGATTTTCATGGTGGGAAGCGAAAGAGAATGCGTTGGTCTACCGAAGAGGAGGAAATGTTGAAGGAAGGAGTTCAGAGATTCTCTTCTACAGCAAATAAAAATCTTCC
TTGGAGGAAAATTTTGGAATTCGGTCGTCACGTATTTGATGACACTCGTACTCCTGTTGACCTGAAAGATAAATGGAGAAATATTCTAGCCAAATAG
mRNA sequenceShow/hide mRNA sequence
GTTTAAAATAAAAATTTCCTCAGTCGTCAGTCTCACCACAAAAGCAGAAATTTGTTCTTCACATTTCGAGCCAAAATTAAAATCGGTTTTTGAATCGAGATTTATCACCG
TTTTTTTTTTCCTATCCACGGAACAAAATTTCGAGGGAAAAGGAAAACGGAATCGTTCTTCGTCTCCGATCGAGAATGAAGTTTTGTGTTGACTGAGCACTGAAGATTCT
GAAACATTAGGGTTCCTACTTGCGGCGGAAGAAGATGAGGACCAAGAGTCGCCGCGGCGGAAGGTCCAGGCCTTCCCGGCGTTCCGCCACTTCTTCCCCGTCGTCTATGG
CGTCTCCCTCCGTGCCCGACCGCGATGTGTCCAATGCAGAAGATAACACTCTGAATGAGGCATCAAGGAAGGAAACTCACGATGTGTTGGATAAGGCAAAGGTAGATTGC
TTTCAGAAAGACACTTGTATTAGATGCGATGAGAGTGGTGACTTGTTGGTTTGTACTGAAATTGGATGTCCAATTGCTCTTCATGAGCACTGCATGTCCTGCGAACCTTC
GTTCGATGAAGAGGGGCGCTTCTATTGTCCTTATTGTTCGTATAAAAGAGCTTTGGTGCGTGTGAATGAGTTGAGGAGAAAGGCTATGGTGGCAAAGAGAGCGTTGTCTG
ATTTTATTGATACCAGGATGGTTAGTGGTGGCAATTCACCGCAGATAGGAGAGGCAGGTAAGAAGAAAACATCTAATGTTTCAACTTGTGGAGTGGCTAATCATGAGAGT
CATCTGGGCAATGAGAATTCGCAAGATCAGGATATGCTGGTCGAGCAAAATCAGAGTAATGAAGGGGAGGACCATGCGAGAACAACAGGAGATGTTGAACCAGTGTCAAT
GGTGGGGGTTAATGGCGAAAATCACGGTGGTCCTATTGTGTCTAATGTTAGCAACGACATACTTTTGGCTACTGAAGTTCAGCCATGTGAGGATTCTATGAACGAAGAGG
AAACTCATGATGCTGACGGTGACGCATTTGGAACACATCAGGATGAAACTTTAGAAGATAAAGATGGGAAAACGATGGAAGAAGAAAACTTAAGGTCGACAGATGACATT
GAGGATGAAGAGGTTGTGAAGGATCAAGGCCAACCAGAGACTCCTAGTGCACATCATGATGAGGAAGAGACTGCTGTAGATGATGTTGATCATGGAGCAGAACGAGAATC
TCAAGGCACAGGTGATGGTGGAGAACAAATCCAGCTCGACAAAGAAGGGATGCTTGAAAATGTTGATCCTGCGTCTGGAAATAATGATTTAAAGAATGAAACATTTGTTA
AGAAAAAGCGCTTTAAAACTAAAGCTAATAGGAAAGTAGACCTGCAGAATTATAGTTCTCCTAGAAAGTCATTGCGTCTACGAACTCCAGAACCTGGGAACAGGTCACCA
CATGTACAAACTCCAGAACCTGGGAAAAAGTCTCCTCATATACAAACGCCAAAAACTGGGAAAAATCATGCAATCAAAATTGAGAAAGTTTCTGCATCCAGAAATTTAAG
GCTGAAGCCAGCATCTCCTAATCAATTCAAAAACTTGGATTTTCATGGTGGGAAGCGAAAGAGAATGCGTTGGTCTACCGAAGAGGAGGAAATGTTGAAGGAAGGAGTTC
AGAGATTCTCTTCTACAGCAAATAAAAATCTTCCTTGGAGGAAAATTTTGGAATTCGGTCGTCACGTATTTGATGACACTCGTACTCCTGTTGACCTGAAAGATAAATGG
AGAAATATTCTAGCCAAATAGTCTGGCTTAATGGACAACTAGTCGGAATAGATTATGAAAAGGTATGTTCCAGATGCGGAGCCATACTGTTCTGTAGCAGGAACCTTTCA
ATTTGGATCTTCAGGAGATCTACATACTTAAGAAAAAAGGATTGCACAAACAGTTTAGTCTTGAAGACTTTGATTTCAAATGGAAGCTGTGGAGCCTGAAGCCTGTGCTT
AGAGATAAGGTTGATGAAGACTGTACAGCTCTATCAAATGTTGGTCTGTAACTGATGGGCATCATCACAGTTTAAGTTTGGGGAAGAACACTATAGATTTTGCCCATTTT
TGTATATGTTCTTTCAAATCCTTTTAAGTTGGTTGTAGGCTAAGATCTCCAATTTTGGTCTGTTAATGTAGGTTGGTTTTGACTAAGCAGAGCCATCAATGGTAACTCTC
CCATGGGAGAAAAGCACGTTGGAGTTGCGGTTTTGACCTTTGCTTGATGATGATTTTTGGTTCTTTGAAATGTTTGTAACATATTTTGTTTATGTAAATGCAGTTTTTTA
AGTTCATATTGTATGTACTACTTAGTGGTCAATAAGGGTGGTATATATTAAAGGATTTAAAGAGATCGAGTTTAAGTAAGGGTGGTCACGTACCCAAGATTTAATATCTT
A
Protein sequenceShow/hide protein sequence
MRTKSRRGGRSRPSRRSATSSPSSMASPSVPDRDVSNAEDNTLNEASRKETHDVLDKAKVDCFQKDTCIRCDESGDLLVCTEIGCPIALHEHCMSCEPSFDEEGRFYCPY
CSYKRALVRVNELRRKAMVAKRALSDFIDTRMVSGGNSPQIGEAGKKKTSNVSTCGVANHESHLGNENSQDQDMLVEQNQSNEGEDHARTTGDVEPVSMVGVNGENHGGP
IVSNVSNDILLATEVQPCEDSMNEEETHDADGDAFGTHQDETLEDKDGKTMEEENLRSTDDIEDEEVVKDQGQPETPSAHHDEEETAVDDVDHGAERESQGTGDGGEQIQ
LDKEGMLENVDPASGNNDLKNETFVKKKRFKTKANRKVDLQNYSSPRKSLRLRTPEPGNRSPHVQTPEPGKKSPHIQTPKTGKNHAIKIEKVSASRNLRLKPASPNQFKN
LDFHGGKRKRMRWSTEEEEMLKEGVQRFSSTANKNLPWRKILEFGRHVFDDTRTPVDLKDKWRNILAK