CuGenDBv2

MS004598 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS004598
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: carbon catabolite repressor protein 4 homolog 4
Genome location: scaffold995:576524..588382
RNA-Seq Expression: MS004598
Synteny: MS004598
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domains:
IPR000504 - RNA recognition motif domain
IPR005135 - Endonuclease/exonuclease/phosphatase
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR035979 - RNA-binding domain superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
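The genome location field uses a `scaffold:start..end` form. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and reports the span length. The names `Location` and `parse_location` are hypothetical helpers introduced here, and the length assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as the Genome location field."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("scaffold995:576524..588382")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 11859 bp on scaffold995.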


Homology
GenBank top hits (e value, % identity, alignment)
CAF2228989.1 unnamed protein product [Brassica napus]; e value: 1.0e-152; % identity: 48.44
Query:  RFGRPREADGSS------SPNLYVANCGPAVGITHASISEVFGQFGDVKAVHAADESGARVIVCFSDEFSARAALEALHGRPCSLLGGRTLHIRYSIIR-
        RF RP  +  SS      S NLYVANCGPAVG++H +I  VF  FG+VK V  ADESG RVIV F+D FSA++ALEAL GRPC  L GRTLHIRYS+++ 
Subjt:  RFGRPREADGSS------SPNLYVANCGPAVGITHASISEVFGQFGDVKAVHAADESGARVIVCFSDEFSARAALEALHGRPCSLLGGRTLHIRYSIIR-

Query:  PTVSPLNDSVSVSLSASELDIPGLYLLHDFVTAREEEELLLEVDARPWNCLAKRRVQHYGYEFCYQTRNVNTRLKLGPLPSFVSHILERISMFPNTE-DA
        P+ + +N+ V VSLS S+L+IPGL+LL DFVTA EE++LL  VD +PW  LAKRRVQHYGYEFCY TRNV+T+ +LG LP+FVS IL+RIS+FPN + D 
Subjt:  PTVSPLNDSVSVSLSASELDIPGLYLLHDFVTAREEEELLLEVDARPWNCLAKRRVQHYGYEFCYQTRNVNTRLKLGPLPSFVSHILERISMFPNTE-DA

Query:  ADAALDQXSPS------------NTEDAADAALDQLTVGNPFLF--------------------------------------NNMLLALQYCCLLNYLVP
        A   LDQ + +            +T  A +  +  L++  P +                                       + +LL+ +     N+ +P
Subjt:  ADAALDQXSPS------------NTEDAADAALDQLTVGNPFLF--------------------------------------NNMLLALQYCCLLNYLVP

Query:  --AIHKQKWIALATMQKNIILAKHIVLFHNVSSLRFQA---NLFL-------------------------------------FLCI--FDEYDSFYKGNL
           I K K   +    + +      V  H    +RF+    N+                                       F C+   DEYDSFY+ N+
Subjt:  --AIHKQKWIALATMQKNIILAKHIVLFHNVSSLRFQA---NLFL-------------------------------------FLCI--FDEYDSFYKGNL

Query:  ERCGYSSLYIQRSGQ-KRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGDHGDPNDPRVRLKRDCVG
        E  GYS +YIQR+GQ KRDGC IF+K   AEL+ ++RIEYNDL+D         E K E      SN+ + +     K +  D  D NDP+VRLKRDCVG
Subjt:  ERCGYSSLYIQRSGQ-KRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGDHGDPNDPRVRLKRDCVG

Query:  IMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSG---DSPECLEEL-PLPLCS
        IMAAF++ KPFHH+VIVANTHLYWDPE ADVKLAQAKYLLSRLA+F+TL+S++FEC PS+LLAGDFNS PGD VY YLVSG    +    EE+ P+P+CS
Subjt:  IMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSG---DSPECLEEL-PLPLCS

Query:  VYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME
        VY    G EP FTNCTPGFT TLDYIFFSPSD I+P+S L+LP+ E P+V+G LPN+ +PSDHLPI AEFEI+ E
Subjt:  VYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME

CAG7897534.1 unnamed protein product, partial [Brassica rapa]; e value: 4.3e-154; % identity: 48.66
Query:  RFGRPREADGSS------SPNLYVANCGPAVGITHASISEVFGQFGDVKAVHAADESGARVIVCFSDEFSARAALEALHGRPCSLLGGRTLHIRYSIIR-
        RF RP  +  SS      S NLYVANCGPAVG++H +I  VF  FG+VK V  ADESG RVIV F+D FSA++ALEAL GRPC  L GRTLHIRYS+++ 
Subjt:  RFGRPREADGSS------SPNLYVANCGPAVGITHASISEVFGQFGDVKAVHAADESGARVIVCFSDEFSARAALEALHGRPCSLLGGRTLHIRYSIIR-

Query:  PTVSPLNDSVSVSLSASELDIPGLYLLHDFVTAREEEELLLEVDARPWNCLAKRRVQHYGYEFCYQTRNVNTRLKLGPLPSFVSHILERISMFPNTE-DA
        P+ + +N+ V VSLS S+L+IPGL+LL DFVTA EE++LL  VD +PW  LAKRRVQHYGYEFCY TRNV+T+ +LG LP+FVS IL+RIS+FPN + D 
Subjt:  PTVSPLNDSVSVSLSASELDIPGLYLLHDFVTAREEEELLLEVDARPWNCLAKRRVQHYGYEFCYQTRNVNTRLKLGPLPSFVSHILERISMFPNTE-DA

Query:  ADAALDQXSPS------------NTEDAADAALDQLTVGNPFLF--------------------------------------NNMLLALQYCCLLNYLVP
        A   LDQ + +            +T  A +  +  L++  P +                                       + +LL+ +     N+ +P
Subjt:  ADAALDQXSPS------------NTEDAADAALDQLTVGNPFLF--------------------------------------NNMLLALQYCCLLNYLVP

Query:  --AIHKQKWIALATMQKNIILAKHIVLFHNVSSLRFQ-------ANLFL--------------------------------FLCI--FDEYDSFYKGNLE
           I K K   +    + +      V  H    +RF+       A +++                                F C+   DEYD+FY+ N+E
Subjt:  --AIHKQKWIALATMQKNIILAKHIVLFHNVSSLRFQ-------ANLFL--------------------------------FLCI--FDEYDSFYKGNLE

Query:  RCGYSSLYIQRSGQ-KRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGDHGDPNDPRVRLKRDCVGI
          GYS +YIQR+GQ KRDGC IFFK   AEL+ ++RIEYNDL+D         E K E      SN+ + +     K +  D  D NDP+VRLKRDCVGI
Subjt:  RCGYSSLYIQRSGQ-KRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGDHGDPNDPRVRLKRDCVGI

Query:  MAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSG-DSPECLEE---LPLPLCSV
        MAAF++ KPFHH+VIVANTHLYWDPE ADVKLAQAKYLLSRLA+F+TL+S++FEC PS+LLAGDFNS PGD VY YLVSG   P  +EE    P+P+CSV
Subjt:  MAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSG-DSPECLEE---LPLPLCSV

Query:  YSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME
        Y    G EP FTNCTPGFT TLDYIFFSPSD I+P+S L+LP+ E P+V+G LPN+ +PSDHLPI AEFEI+ E
Subjt:  YSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME

KAG5389025.1 hypothetical protein IGI04_030566 [Brassica rapa subsp. trilocularis]; e value: 2.8e-153; % identity: 49.03
Query:  RFGRPREADGSS------SPNLYVANCGPAVGITHASISEVFGQFGDVKAVHAADESGARVIVCFSDEFSARAALEALHGRPCSLLGGRTLHIRYSIIR-
        RF RP  +  SS      S NLYVANCGPAVG++H +I+ VF  FG+VK V+ ADESG RVIV F+D FSA++ALEAL GRPC  L GRTLHIRYS+++ 
Subjt:  RFGRPREADGSS------SPNLYVANCGPAVGITHASISEVFGQFGDVKAVHAADESGARVIVCFSDEFSARAALEALHGRPCSLLGGRTLHIRYSIIR-

Query:  PTVSPLNDSVSVSLSASELDIPGLYLLHDFVTAREEEELLLEVDARPWNCLAKRRVQHYGYEFCYQTRNVNTRLKLGPLPSFVSHILERISMFPNTE-DA
        P+ + +N+ V VSLS S+L+IPGL+LL DFVTA EE++LL  VD +PW  LAKRRVQHYGYEFCY TRNV+T+ +LG LP FVS IL+RIS+FPN + D 
Subjt:  PTVSPLNDSVSVSLSASELDIPGLYLLHDFVTAREEEELLLEVDARPWNCLAKRRVQHYGYEFCYQTRNVNTRLKLGPLPSFVSHILERISMFPNTE-DA

Query:  ADAALDQXS----PS--------NTEDAADAALDQLTVGNPFLF--------------------------------------NNMLLALQYCCLLNYLVP
        A   LDQ +    PS        +T  A +  +  L++  P +                                       + +LL+ +     N+ +P
Subjt:  ADAALDQXS----PS--------NTEDAADAALDQLTVGNPFLF--------------------------------------NNMLLALQYCCLLNYLVP

Query:  --AIHKQKWIALATMQKNI-------------------ILAKHIVL--------FHNV-----------SSLRFQANLFLFLC-IFDEYDSFYKGNLERC
           I K K   +    + +                   IL+K +          F +V             +RF+   +  L    DEYDSFY+ N+E  
Subjt:  --AIHKQKWIALATMQKNI-------------------ILAKHIVL--------FHNV-----------SSLRFQANLFLFLC-IFDEYDSFYKGNLERC

Query:  GYSSLYIQRSGQ-KRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGDHGDPNDPRVRLKRDCVGIMA
        GYS +YIQR+GQ KRDGC IF+K   AEL+ ++RIEYNDL+D         E K E      SN+ + ++  + + +  D  D NDP+VRLKRDCVGIMA
Subjt:  GYSSLYIQRSGQ-KRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGDHGDPNDPRVRLKRDCVGIMA

Query:  AFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSG--DSPECLEE---LPLPLCSVY
        AF++ KPFHH+VIVANTHLYWDPE ADVKLAQAKYLLSRLA+F+TL+S++FEC PS+LLAGDFNS PGD VY YLVSG     E +EE    P+P+CSVY
Subjt:  AFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSG--DSPECLEE---LPLPLCSVY

Query:  STVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME
            G EP FTNCTPGFT TLDYIFFSPSD I+P+S L+LP+ E P+V+G LPN+ +PSDHLPI AEFEI+ E
Subjt:  STVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME

XP_008444261.1 PREDICTED: carbon catabolite repressor protein 4 homolog 4 [Cucumis melo]; e value: 6.8e-144; % identity: 85.32
Query:  FLCI--FDEYDSFYKGNLERCGYSSLYIQRSGQKRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGD
        FLC+   DEYD+FYKGNLERCGYSSLYIQRSGQKRDGCGIFFKHE AELIIEDRIEYNDLVDSIQDD  SCEDKSEDVVT ASNDVESN G S KTT   
Subjt:  FLCI--FDEYDSFYKGNLERCGYSSLYIQRSGQKRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGD

Query:  HGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGD-
         GDPNDPRVRLKRDCVGIMAAFKL++PFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARF++LV+EKFEC PS+LLAGDFNSTPGDKVYQYL+SG+ 
Subjt:  HGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGD-

Query:  ----SPECLEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME
            SPECLEELPLPLCSVY+ +LG+EPSFTN TPGFTGTLDYIFFSPSD IRP+SFLELP+S+WPE+IGGLPN++YPSDHLPIAA+FEITME
Subjt:  ----SPECLEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME

XP_022140802.1 carbon catabolite repressor protein 4 homolog 4 [Momordica charantia]; e value: 7.7e-164; % identity: 97.57
Query:  FLCI--FDEYDSFYKGNLERCGYSSLYIQRSGQKRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGD
        FLC+   DEYDSFYKGNLERCGYSSLYIQRSGQKRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGD
Subjt:  FLCI--FDEYDSFYKGNLERCGYSSLYIQRSGQKRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGD

Query:  HGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGDS
        HGDPNDPRVRLKRDCVGIMAAFKLKKPFHH+VIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFEC+PSVLLAGDFNSTPGDKVYQYLVSGDS
Subjt:  HGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGDS

Query:  PECLEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME
        PECLEELPLPLCSVYST+LGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME
Subjt:  PECLEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B9X8 carbon catabolite repressor protein 4 homolog 4; e value: 3.3e-144; % identity: 85.32
Query:  FLCI--FDEYDSFYKGNLERCGYSSLYIQRSGQKRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGD
        FLC+   DEYD+FYKGNLERCGYSSLYIQRSGQKRDGCGIFFKHE AELIIEDRIEYNDLVDSIQDD  SCEDKSEDVVT ASNDVESN G S KTT   
Subjt:  FLCI--FDEYDSFYKGNLERCGYSSLYIQRSGQKRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGD

Query:  HGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGD-
         GDPNDPRVRLKRDCVGIMAAFKL++PFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARF++LV+EKFEC PS+LLAGDFNSTPGDKVYQYL+SG+ 
Subjt:  HGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGD-

Query:  ----SPECLEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME
            SPECLEELPLPLCSVY+ +LG+EPSFTN TPGFTGTLDYIFFSPSD IRP+SFLELP+S+WPE+IGGLPN++YPSDHLPIAA+FEITME
Subjt:  ----SPECLEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME

A0A5A7UEY1 Carbon catabolite repressor protein 4-like protein 4; e value: 3.3e-144; % identity: 85.32
Query:  FLCI--FDEYDSFYKGNLERCGYSSLYIQRSGQKRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGD
        FLC+   DEYD+FYKGNLERCGYSSLYIQRSGQKRDGCGIFFKHE AELIIEDRIEYNDLVDSIQDD  SCEDKSEDVVT ASNDVESN G S KTT   
Subjt:  FLCI--FDEYDSFYKGNLERCGYSSLYIQRSGQKRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGD

Query:  HGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGD-
         GDPNDPRVRLKRDCVGIMAAFKL++PFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARF++LV+EKFEC PS+LLAGDFNSTPGDKVYQYL+SG+ 
Subjt:  HGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGD-

Query:  ----SPECLEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME
            SPECLEELPLPLCSVY+ +LG+EPSFTN TPGFTGTLDYIFFSPSD IRP+SFLELP+S+WPE+IGGLPN++YPSDHLPIAA+FEITME
Subjt:  ----SPECLEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME

A0A5D3BPZ3 Carbon catabolite repressor protein 4-like protein 4; e value: 8.1e-143; % identity: 84.98
Query:  FLCI--FDEYDSFYKGNLERCGYSSLYIQRSGQKRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGD
        FLC+   DEYD+FYKGNLERCGYSSLYIQRSGQKRDGCGIFFKHE AELIIEDRIEYNDLVDSIQDD  SCEDKSEDVVT ASNDVESN G S KTT   
Subjt:  FLCI--FDEYDSFYKGNLERCGYSSLYIQRSGQKRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGD

Query:  HGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGD-
         GDPNDPRVRLKRDCVGIMAAFKL++PFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARF++LV+EKFEC PS+LLAGDFNSTPGDKVYQYLVSG+ 
Subjt:  HGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGD-

Query:  ----SPECLEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME
            SPECLEELPL LCSVY+ +LG+EPSFTN TPGFTGTLDYIFFSP+D IRP+SFLELP+S+WPE+IGGLPN++YPSDHLPIAA+FEITME
Subjt:  ----SPECLEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME

A0A6J1CI40 carbon catabolite repressor protein 4 homolog 4; e value: 3.8e-164; % identity: 97.57
Query:  FLCI--FDEYDSFYKGNLERCGYSSLYIQRSGQKRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGD
        FLC+   DEYDSFYKGNLERCGYSSLYIQRSGQKRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGD
Subjt:  FLCI--FDEYDSFYKGNLERCGYSSLYIQRSGQKRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGD

Query:  HGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGDS
        HGDPNDPRVRLKRDCVGIMAAFKLKKPFHH+VIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFEC+PSVLLAGDFNSTPGDKVYQYLVSGDS
Subjt:  HGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGDS

Query:  PECLEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME
        PECLEELPLPLCSVYST+LGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME
Subjt:  PECLEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME

A0A6J1F8E0 carbon catabolite repressor protein 4 homolog 4 isoform X1; e value: 1.4e-142; % identity: 86.35
Query:  FLCI--FDEYDSFYKGNLERCGYSSLYIQRSGQKRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGD
        FLC+   DEYDSFYKGNLERCGYSSLYIQRSGQKRDGCGIFFKHENAELIIEDRIEYNDLV+SIQDD CSCE+KSE+VVT AS+DVESN G S KTT  D
Subjt:  FLCI--FDEYDSFYKGNLERCGYSSLYIQRSGQKRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGD

Query:  HGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGD-
         GDPNDPRVRLKRDCVGIMAAF+ KKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARF+TLVSEKFEC PS+LLAGDFNSTPGDKVYQYL+SG  
Subjt:  HGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGD-

Query:  ----SPECLEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME
            SPECLEELPLPL SVY+T LG EPSFTN TPGFTGTLDYIFFSPSD IRPISFLELP+SE PEVIGGLPN +YPSDHLPI A+FEITME
Subjt:  ----SPECLEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME

SwissProt top hits (e value, % identity, alignment)
A8MS41 Carbon catabolite repressor protein 4 homolog 4; e value: 8.5e-97; % identity: 62.12
Query:  FLCI--FDEYDSFYKGNLERCGYSSLYIQRSGQ-KRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAG
        F C+   DEYDSFY+ N++  GYS +YIQR+GQ KRDGC IF+K   AEL+ ++RIEYNDLVDSI+ D  SC ++  +          SN G   K +  
Subjt:  FLCI--FDEYDSFYKGNLERCGYSSLYIQRSGQ-KRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAG

Query:  DHGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGD
        D  D NDP VRLKRDCVGIMAAF++ KPF H+VIVANTHLYWDPE ADVKLAQAKYLLSRLA+F+TL+S++FEC PS+LLAGDFNS PGD VY YLVSG+
Subjt:  DHGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGD

Query:  SPEC----LEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME
        +        EE P+PL SVY    G EP FTNCTPGFT TLDYIF SPSD I+P+S L+LP+ + P+V+G LPN+ +PSDHLPI AEFEI  E
Subjt:  SPEC----LEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME

C4V7I7 Probable CCR4-Not complex 3'-5'-exoribonuclease subunit Ccr4; e value: 2.3e-25; % identity: 28.08
Query:  LFHNVSSLRFQANLFLFLCIFD----EYDSFYKGNLE-RCGYSSLYIQRSGQKR-------DGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDK
        + HN+ S+         LC+ +     Y+ FYK  LE RC YSS++  +   K        DGC  F+K        + +I+ N ++             
Subjt:  LFHNVSSLRFQANLFLFLCIFD----EYDSFYKGNLE-RCGYSSLYIQRSGQKR-------DGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDK

Query:  SEDVVTGASNDVESNNGLSTKTTAGDHGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECR
          D  +   ND   N  ++  +  G            K+D + +++ F++ +     +IV N HLYWDPE+ D+K  QA  LL  L +    VS+ ++  
Subjt:  SEDVVTGASNDVESNNGLSTKTTAGDHGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECR

Query:  PSVLLAGDFNSTPGDKVYQYLVSGDSPEC------LEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLP
        PS++L GDFNS     VY ++              +  +P     +    L  E  FTN TP F G +D+IF+S +  +R  S L   ++E+ + + GLP
Subjt:  PSVLLAGDFNSTPGDKVYQYLVSGDSPEC------LEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLP

Query:  NNTYPSDHLPIAAEFEI
        N  +PSDH+ +A++F++
Subjt:  NNTYPSDHLPIAAEFEI

Q5A761 CCR4-Not complex 3'-5'-exoribonuclease subunit Ccr4; e value: 7.1e-19; % identity: 27.93
Query:  YKGNLERCGYSSLYIQRSGQKRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGDHGDPNDPRVRLKR
        YKG       S    +   +K DGC  FFK++   LI +   EYN +        C   DK +                 TK               + +
Subjt:  YKGNLERCGYSSLYIQRSGQKRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGDHGDPNDPRVRLKR

Query:  DCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRL----ARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGDS---------
        D + +++  + K+    + +V NTHL+WDP + DVK  Q   LL  L     ++R   S +     S+++ GDFNS     VYQ   +G S         
Subjt:  DCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRL----ARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGDS---------

Query:  -----PECLEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEI
              E     P  L S Y  V   E  FTN TP FT  +DYI++S +  ++    L     E+     G P+  +PSDH+PI A+F++
Subjt:  -----PECLEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEI

Q8RWY1 Alkylated DNA repair protein ALKBH8 homolog; e value: 3.6e-63; % identity: 62.8
Query:  RFGRPREADGSS------SPNLYVANCGPAVGITHASISEVFGQFGDVKAVHAADESGARVIVCFSDEFSARAALEALHGRPCSLLGGRTLHIRYSIIR-
        RF RP ++  SS      S NLYVANCGPAVG+TH +I+ VF +FG+V  V+AAD+SG RVIV F+D FSA+AALEAL GRPC  L GR+LHIRYS+++ 
Subjt:  RFGRPREADGSS------SPNLYVANCGPAVGITHASISEVFGQFGDVKAVHAADESGARVIVCFSDEFSARAALEALHGRPCSLLGGRTLHIRYSIIR-

Query:  PTVSPLNDSVSVSLSASELDIPGLYLLHDFVTAREEEELLLEVDARPWNCLAKRRVQHYGYEFCYQTRNVNTRLKLGPLPSFVSHILERISMFPNTED-A
        P+ + +ND V VSL  SEL+IPGL+LL DFVT  EE++LL  VDAR W  LAKRRVQHYGYEFCY TRNV+T+ +LG LPSFVS ILERI +FPN ++ +
Subjt:  PTVSPLNDSVSVSLSASELDIPGLYLLHDFVTAREEEELLLEVDARPWNCLAKRRVQHYGYEFCYQTRNVNTRLKLGPLPSFVSHILERISMFPNTED-A

Query:  ADAALDQ
        A   LDQ
Subjt:  ADAALDQ

Q8SU52 Probable CCR4-Not complex 3'-5'-exoribonuclease subunit Ccr4; e value: 2.6e-21; % identity: 29.19
Query:  YLVPAIHKQKWIALATMQKNIILAKHIVLFHNVSSLRFQ-ANLFLFLCIFDEYDSFYKGNLE-RCGYSSLYIQRSGQKR-------DGCGIFFKHENAEL
        Y     +   W+  +  ++  +L +  ++ +NV  L  Q   L+ F   FD    FYK  LE RC Y S+   R   K        DGC IF++     L
Subjt:  YLVPAIHKQKWIALATMQKNIILAKHIVLFHNVSSLRFQ-ANLFLFLCIFDEYDSFYKGNLE-RCGYSSLYIQRSGQKR-------DGCGIFFKHENAEL

Query:  IIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGDHGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVK
        I +  I+++  V  IQD   +   +  D                       +G         K+D + I A   L++P    V+V NTH++WDP++ D+K
Subjt:  IIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGDHGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVK

Query:  LAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVS--------GDSPECLEELP----LPLCSVYSTVLGTEPSFTNCTPGFTGTLD
        L Q   L+  + R    VS +      +LL GDFNS     VY+ + +        GD+ + L        L L   YS     +  FTN TPGF G +D
Subjt:  LAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVS--------GDSPECLEELP----LPLCSVYSTVLGTEPSFTNCTPGFTGTLD

Query:  YIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEF
        YIF+     I   S L   + E+ E + GLPN  +PSDH+ + A+F
Subjt:  YIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEF

Arabidopsis top hits (e value, % identity, alignment)
AT1G31500.1 DNAse I-like superfamily protein; e value: 6.0e-98; % identity: 62.12
Query:  FLCI--FDEYDSFYKGNLERCGYSSLYIQRSGQ-KRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAG
        F C+   DEYDSFY+ N++  GYS +YIQR+GQ KRDGC IF+K   AEL+ ++RIEYNDLVDSI+ D  SC ++  +          SN G   K +  
Subjt:  FLCI--FDEYDSFYKGNLERCGYSSLYIQRSGQ-KRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAG

Query:  DHGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGD
        D  D NDP VRLKRDCVGIMAAF++ KPF H+VIVANTHLYWDPE ADVKLAQAKYLLSRLA+F+TL+S++FEC PS+LLAGDFNS PGD VY YLVSG+
Subjt:  DHGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGD

Query:  SPEC----LEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME
        +        EE P+PL SVY    G EP FTNCTPGFT TLDYIF SPSD I+P+S L+LP+ + P+V+G LPN+ +PSDHLPI AEFEI  E
Subjt:  SPEC----LEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME

AT1G31500.2 DNAse I-like superfamily protein; e value: 6.0e-98; % identity: 62.12
Query:  FLCI--FDEYDSFYKGNLERCGYSSLYIQRSGQ-KRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAG
        F C+   DEYDSFY+ N++  GYS +YIQR+GQ KRDGC IF+K   AEL+ ++RIEYNDLVDSI+ D  SC ++  +          SN G   K +  
Subjt:  FLCI--FDEYDSFYKGNLERCGYSSLYIQRSGQ-KRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAG

Query:  DHGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGD
        D  D NDP VRLKRDCVGIMAAF++ KPF H+VIVANTHLYWDPE ADVKLAQAKYLLSRLA+F+TL+S++FEC PS+LLAGDFNS PGD VY YLVSG+
Subjt:  DHGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGD

Query:  SPEC----LEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME
        +        EE P+PL SVY    G EP FTNCTPGFT TLDYIF SPSD I+P+S L+LP+ + P+V+G LPN+ +PSDHLPI AEFEI  E
Subjt:  SPEC----LEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME

AT1G31500.4 DNAse I-like superfamily protein; e value: 6.0e-98; % identity: 62.12
Query:  FLCI--FDEYDSFYKGNLERCGYSSLYIQRSGQ-KRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAG
        F C+   DEYDSFY+ N++  GYS +YIQR+GQ KRDGC IF+K   AEL+ ++RIEYNDLVDSI+ D  SC ++  +          SN G   K +  
Subjt:  FLCI--FDEYDSFYKGNLERCGYSSLYIQRSGQ-KRDGCGIFFKHENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAG

Query:  DHGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGD
        D  D NDP VRLKRDCVGIMAAF++ KPF H+VIVANTHLYWDPE ADVKLAQAKYLLSRLA+F+TL+S++FEC PS+LLAGDFNS PGD VY YLVSG+
Subjt:  DHGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAKYLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGD

Query:  SPEC----LEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME
        +        EE P+PL SVY    G EP FTNCTPGFT TLDYIF SPSD I+P+S L+LP+ + P+V+G LPN+ +PSDHLPI AEFEI  E
Subjt:  SPEC----LEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVIGGLPNNTYPSDHLPIAAEFEITME

AT1G31600.1 RNA-binding (RRM/RBD/RNP motifs) family protein; e value: 2.6e-64; % identity: 62.8
Query:  RFGRPREADGSS------SPNLYVANCGPAVGITHASISEVFGQFGDVKAVHAADESGARVIVCFSDEFSARAALEALHGRPCSLLGGRTLHIRYSIIR-
        RF RP ++  SS      S NLYVANCGPAVG+TH +I+ VF +FG+V  V+AAD+SG RVIV F+D FSA+AALEAL GRPC  L GR+LHIRYS+++ 
Subjt:  RFGRPREADGSS------SPNLYVANCGPAVGITHASISEVFGQFGDVKAVHAADESGARVIVCFSDEFSARAALEALHGRPCSLLGGRTLHIRYSIIR-

Query:  PTVSPLNDSVSVSLSASELDIPGLYLLHDFVTAREEEELLLEVDARPWNCLAKRRVQHYGYEFCYQTRNVNTRLKLGPLPSFVSHILERISMFPNTED-A
        P+ + +ND V VSL  SEL+IPGL+LL DFVT  EE++LL  VDAR W  LAKRRVQHYGYEFCY TRNV+T+ +LG LPSFVS ILERI +FPN ++ +
Subjt:  PTVSPLNDSVSVSLSASELDIPGLYLLHDFVTAREEEELLLEVDARPWNCLAKRRVQHYGYEFCYQTRNVNTRLKLGPLPSFVSHILERISMFPNTED-A

Query:  ADAALDQ
        A   LDQ
Subjt:  ADAALDQ

AT1G31600.3 RNA-binding (RRM/RBD/RNP motifs) family protein; e value: 2.6e-64; % identity: 62.8
Query:  RFGRPREADGSS------SPNLYVANCGPAVGITHASISEVFGQFGDVKAVHAADESGARVIVCFSDEFSARAALEALHGRPCSLLGGRTLHIRYSIIR-
        RF RP ++  SS      S NLYVANCGPAVG+TH +I+ VF +FG+V  V+AAD+SG RVIV F+D FSA+AALEAL GRPC  L GR+LHIRYS+++ 
Subjt:  RFGRPREADGSS------SPNLYVANCGPAVGITHASISEVFGQFGDVKAVHAADESGARVIVCFSDEFSARAALEALHGRPCSLLGGRTLHIRYSIIR-

Query:  PTVSPLNDSVSVSLSASELDIPGLYLLHDFVTAREEEELLLEVDARPWNCLAKRRVQHYGYEFCYQTRNVNTRLKLGPLPSFVSHILERISMFPNTED-A
        P+ + +ND V VSL  SEL+IPGL+LL DFVT  EE++LL  VDAR W  LAKRRVQHYGYEFCY TRNV+T+ +LG LPSFVS ILERI +FPN ++ +
Subjt:  PTVSPLNDSVSVSLSASELDIPGLYLLHDFVTAREEEELLLEVDARPWNCLAKRRVQHYGYEFCYQTRNVNTRLKLGPLPSFVSHILERISMFPNTED-A

Query:  ADAALDQ
        A   LDQ
Subjt:  ADAALDQ


Sequences
CDS sequence
ATGGAGTTGCCATTGACAAGATTCGGGCGTCCGAGAGAAGCAGATGGATCTTCTAGCCCCAATCTCTACGTTGCGAACTGTGGACCGGCCGTCGGAATCACCCACGCCTC
AATTTCGGAGGTTTTCGGCCAATTTGGGGATGTGAAAGCTGTGCACGCCGCCGACGAGAGCGGCGCCCGCGTCATTGTATGTTTTTCCGACGAGTTCAGTGCTCGAGCCG
CTCTAGAGGCCTTACACGGACGCCCTTGTTCTCTCCTCGGTGGCCGGACTTTGCACATACGCTATTCCATCATCAGACCGACCGTTTCGCCGCTCAATGACTCAGTTTCG
GTTTCTTTATCGGCTTCGGAGTTGGACATTCCCGGACTTTACTTATTGCACGATTTCGTCACTGCTAGAGAAGAGGAGGAATTGCTTCTGGAAGTTGATGCTAGGCCTTG
GAATTGTCTGGCCAAACGAAGGGTTCAGCATTATGGGTATGAGTTTTGTTATCAAACGAGAAATGTTAATACTAGACTAAAGTTGGGTCCACTTCCTTCATTTGTCTCCC
ACATACTTGAAAGGATCTCCATGTTTCCAAACACTGAGGATGCTGCAGATGCTGCTCTTGACCAANGGTCTCCATCAAACACTGAGGATGCTGCAGATGCTGCTCTTGAC
CAATTGACGGTAGGCAATCCGTTTCTATTTAATAATATGCTTCTCGCCCTCCAATATTGCTGTTTATTAAATTATCTTGTCCCAGCAATACATAAGCAGAAGTGGATTGC
TCTTGCCACAATGCAAAAGAACATCATATTAGCTAAACATATTGTTTTATTTCATAATGTTAGTTCTTTGCGTTTTCAAGCAAATTTATTTTTATTTTTATGCATTTTTG
ATGAATATGATAGCTTTTACAAAGGAAATTTGGAAAGATGTGGATATTCCAGTTTATATATCCAGAGAAGTGGGCAGAAGCGTGATGGATGTGGAATTTTTTTCAAGCAT
GAAAATGCTGAGTTGATCATAGAGGATAGAATTGAATACAATGATCTCGTAGACTCTATACAAGATGATGGTTGTTCTTGTGAAGATAAGTCTGAAGATGTGGTAACCGG
TGCAAGTAATGATGTTGAATCAAACAATGGTTTATCAACAAAAACTACTGCAGGGGATCATGGGGATCCTAATGATCCCCGCGTGAGACTAAAACGTGATTGTGTTGGAA
TTATGGCTGCTTTCAAACTCAAGAAGCCTTTTCATCATGTTGTAATTGTAGCGAACACCCATCTTTACTGGGATCCAGAATGGGCTGATGTCAAGCTTGCCCAGGCCAAA
TATCTTTTATCACGCCTAGCTCGATTCAGAACGTTAGTGTCTGAAAAGTTTGAATGCAGGCCTTCAGTACTTTTGGCTGGCGATTTCAATTCAACCCCTGGGGATAAGGT
ATACCAATACCTTGTTTCGGGCGACTCGCCCGAGTGCTTGGAAGAGCTTCCATTGCCCCTTTGTAGCGTGTATTCTACCGTACTAGGAACTGAACCTTCATTCACAAACT
GCACTCCTGGCTTCACTGGTACTCTTGATTATATATTCTTTTCACCTTCTGACCCTATAAGACCAATAAGTTTTCTAGAGCTTCCACAGTCAGAATGGCCAGAGGTTATT
GGTGGGTTACCTAATAACACCTACCCAAGTGATCATCTTCCCATTGCTGCTGAATTTGAAATCACAATGGAA
mRNA sequence
ATGGAGTTGCCATTGACAAGATTCGGGCGTCCGAGAGAAGCAGATGGATCTTCTAGCCCCAATCTCTACGTTGCGAACTGTGGACCGGCCGTCGGAATCACCCACGCCTC
AATTTCGGAGGTTTTCGGCCAATTTGGGGATGTGAAAGCTGTGCACGCCGCCGACGAGAGCGGCGCCCGCGTCATTGTATGTTTTTCCGACGAGTTCAGTGCTCGAGCCG
CTCTAGAGGCCTTACACGGACGCCCTTGTTCTCTCCTCGGTGGCCGGACTTTGCACATACGCTATTCCATCATCAGACCGACCGTTTCGCCGCTCAATGACTCAGTTTCG
GTTTCTTTATCGGCTTCGGAGTTGGACATTCCCGGACTTTACTTATTGCACGATTTCGTCACTGCTAGAGAAGAGGAGGAATTGCTTCTGGAAGTTGATGCTAGGCCTTG
GAATTGTCTGGCCAAACGAAGGGTTCAGCATTATGGGTATGAGTTTTGTTATCAAACGAGAAATGTTAATACTAGACTAAAGTTGGGTCCACTTCCTTCATTTGTCTCCC
ACATACTTGAAAGGATCTCCATGTTTCCAAACACTGAGGATGCTGCAGATGCTGCTCTTGACCAANGGTCTCCATCAAACACTGAGGATGCTGCAGATGCTGCTCTTGAC
CAATTGACGGTAGGCAATCCGTTTCTATTTAATAATATGCTTCTCGCCCTCCAATATTGCTGTTTATTAAATTATCTTGTCCCAGCAATACATAAGCAGAAGTGGATTGC
TCTTGCCACAATGCAAAAGAACATCATATTAGCTAAACATATTGTTTTATTTCATAATGTTAGTTCTTTGCGTTTTCAAGCAAATTTATTTTTATTTTTATGCATTTTTG
ATGAATATGATAGCTTTTACAAAGGAAATTTGGAAAGATGTGGATATTCCAGTTTATATATCCAGAGAAGTGGGCAGAAGCGTGATGGATGTGGAATTTTTTTCAAGCAT
GAAAATGCTGAGTTGATCATAGAGGATAGAATTGAATACAATGATCTCGTAGACTCTATACAAGATGATGGTTGTTCTTGTGAAGATAAGTCTGAAGATGTGGTAACCGG
TGCAAGTAATGATGTTGAATCAAACAATGGTTTATCAACAAAAACTACTGCAGGGGATCATGGGGATCCTAATGATCCCCGCGTGAGACTAAAACGTGATTGTGTTGGAA
TTATGGCTGCTTTCAAACTCAAGAAGCCTTTTCATCATGTTGTAATTGTAGCGAACACCCATCTTTACTGGGATCCAGAATGGGCTGATGTCAAGCTTGCCCAGGCCAAA
TATCTTTTATCACGCCTAGCTCGATTCAGAACGTTAGTGTCTGAAAAGTTTGAATGCAGGCCTTCAGTACTTTTGGCTGGCGATTTCAATTCAACCCCTGGGGATAAGGT
ATACCAATACCTTGTTTCGGGCGACTCGCCCGAGTGCTTGGAAGAGCTTCCATTGCCCCTTTGTAGCGTGTATTCTACCGTACTAGGAACTGAACCTTCATTCACAAACT
GCACTCCTGGCTTCACTGGTACTCTTGATTATATATTCTTTTCACCTTCTGACCCTATAAGACCAATAAGTTTTCTAGAGCTTCCACAGTCAGAATGGCCAGAGGTTATT
GGTGGGTTACCTAATAACACCTACCCAAGTGATCATCTTCCCATTGCTGCTGAATTTGAAATCACAATGGAA
Protein sequence
MELPLTRFGRPREADGSSSPNLYVANCGPAVGITHASISEVFGQFGDVKAVHAADESGARVIVCFSDEFSARAALEALHGRPCSLLGGRTLHIRYSIIRPTVSPLNDSVS
VSLSASELDIPGLYLLHDFVTAREEEELLLEVDARPWNCLAKRRVQHYGYEFCYQTRNVNTRLKLGPLPSFVSHILERISMFPNTEDAADAALDQXSPSNTEDAADAALD
QLTVGNPFLFNNMLLALQYCCLLNYLVPAIHKQKWIALATMQKNIILAKHIVLFHNVSSLRFQANLFLFLCIFDEYDSFYKGNLERCGYSSLYIQRSGQKRDGCGIFFKH
ENAELIIEDRIEYNDLVDSIQDDGCSCEDKSEDVVTGASNDVESNNGLSTKTTAGDHGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKLAQAK
YLLSRLARFRTLVSEKFECRPSVLLAGDFNSTPGDKVYQYLVSGDSPECLEELPLPLCSVYSTVLGTEPSFTNCTPGFTGTLDYIFFSPSDPIRPISFLELPQSEWPEVI
GGLPNNTYPSDHLPIAAEFEITME
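The protein sequence above appears to be the CDS read codon by codon, with the single ambiguous base (`N`) in the CDS showing up as `X` in the protein. A minimal sketch of that relationship in Python follows. It assumes the standard genetic code, which this record does not state explicitly, and `translate` is a hypothetical helper name, not part of any database tooling.

```python
from itertools import product

# Standard genetic code laid out in T, C, A, G order (assumed to apply here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; codons with ambiguous bases become 'X'."""
    seq = "".join(cds.split()).upper()   # drop line breaks and spaces
    usable = len(seq) - len(seq) % 3     # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 17 codons of the CDS listed above.
print(translate("ATGGAGTTGCCATTGACAAGATTCGGGCGTCCGAGAGAAGCAGATGGATCT"))
# The codon containing the ambiguous base N, with its two neighbours.
print(translate("CAANGGTCT"))
```

The first call yields `MELPLTRFGRPREADGS`, matching the start of the protein sequence, and the second yields `QXS`, matching the `QXS` stretch in the protein.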