; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G017110 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G017110
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptioncarbon catabolite repressor protein 4 homolog 6 isoform X1
Genome locationCG_Chr05:29382491..29396844
RNA-Seq ExpressionClCG05G017110
SyntenyClCG05G017110
Gene Ontology termsGO:0090503 - RNA phosphodiester bond hydrolysis, exonucleolytic (biological process)
GO:0000175 - 3'-5'-exoribonuclease activity (molecular function)
InterPro domainsIPR005135 - Endonuclease/exonuclease/phosphatase
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008466384.1 PREDICTED: carbon catabolite repressor protein 4 homolog 6 isoform X1 [Cucumis melo]0.0e+0071.27Show/hide
Query:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYR-AGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYP
        MRR ATPPPPLHQLS AV  ATTATNTS AMSSR PYR  GRY RHRGFSSERPYSGG+GQ V+ DSHFQSV+ESNLGFRQGERGG+ NNAGS+TAP+ P
Subjt:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYR-AGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYP

Query:  RPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHH
        RPPSF GNHQFRQAPPS+QRHQYRGPHPH H+QQPPSFNQNQGVRMPQQ RPRPPK LDFRHWDYAKT PPSTCERFSILSYNILADYLAM+HK KLYHH
Subjt:  RPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHH

Query:  IPPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGF
        IP YMLDWEWRKN+ILFELGLWSTDIMCFQ       ++ +   D     I K  T + V  CA+ ++++                     R K L + F
Subjt:  IPPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGF

Query:  SKCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERK----GLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCG
         +    F ++G LR + VA++              + PP ++    +     + V YN    +         +   VRVLLEKAH ISKIW+NAP+VLCG
Subjt:  SKCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERK----GLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCG

Query:  DFNCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHSFMENENLPSVNRFL
        DFNCTPKSALYNFISEQKLDLS LDRDKVSGQSSAEI+QPSSLYRNP  Q  N SVPLQ R ESS  ERKPD+S++DIQKQ+CSH+ +ENENL SVN  L
Subjt:  DFNCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHSFMENENLPSVNRFL

Query:  PPDGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSPSGTDPEVLHLNDSE
         PD S+   DALDTSCN+ QLGMKGTTLHSEGQKE QHSALFDHKNVGETT CEKTD FNE S TCA+DEF  GHTSK+VGEL SP GTDPEVLHLN++E
Subjt:  PPDGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSPSGTDPEVLHLNDSE

Query:  RRQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDSSDVVTFDHSIAEFEKESSA-RNIEGGP
          Q+E+VDT  LNN  STDGF+DHNFGK+SKD+V+ IILDD QL SQ AFLD  +VSSTP CKNS  DTA DSSDVVTF HSIA  EKESS+ RNIEGGP
Subjt:  RRQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDSSDVVTFDHSIAEFEKESSA-RNIEGGP

Query:  SMVLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEI
        S+ LPR +  VD RP+ILSSDEQDVA LNGSL EDD TFLSALHDV+DPFSS+  HS SHQSLVAPPTG E++LLPGL++KS EV+ + HDRSLWT  EI
Subjt:  SMVLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEI

Query:  ETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFV
        ETATGNADSTL+EHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVL+PIRKQVMQQLT GFPTKKWGSDHIALATELAFV
Subjt:  ETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFV

Query:  RSSEE
         S EE
Subjt:  RSSEE

XP_008466385.1 PREDICTED: carbon catabolite repressor protein 4 homolog 6 isoform X2 [Cucumis melo]0.0e+0071.57Show/hide
Query:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYR-AGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYP
        MRR ATPPPPLHQLS AV  ATTATNTS AMSSR PYR  GRY RHRGFSSERPYSGG+GQ V+ DSHFQSV+ESNLGFRQGERGG+ NNAGS+TAP+ P
Subjt:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYR-AGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYP

Query:  RPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHH
        RPPSF GNHQFRQAPPS+QRHQYRGPHPH H+QQPPSFNQNQGVRMPQQ RPRPPK LDFRHWDYAKT PPSTCERFSILSYNILADYLAM+HK KLYHH
Subjt:  RPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHH

Query:  IPPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGF
        IP YMLDWEWRKN+ILFELGLWSTDIMCFQ       ++ +   D     I K  T + V  CA+ ++++                     R K L + F
Subjt:  IPPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGF

Query:  SKCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERK---GLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCGD
         +    F ++G LR + VA++       +  D     P + S   R     + V YN    +         +   VRVLLEKAH ISKIW+NAP+VLCGD
Subjt:  SKCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERK---GLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCGD

Query:  FNCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHSFMENENLPSVNRFLP
        FNCTPKSALYNFISEQKLDLS LDRDKVSGQSSAEI+QPSSLYRNP  Q  N SVPLQ R ESS  ERKPD+S++DIQKQ+CSH+ +ENENL SVN  L 
Subjt:  FNCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHSFMENENLPSVNRFLP

Query:  PDGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSPSGTDPEVLHLNDSER
        PD S+   DALDTSCN+ QLGMKGTTLHSEGQKE QHSALFDHKNVGETT CEKTD FNE S TCA+DEF  GHTSK+VGEL SP GTDPEVLHLN++E 
Subjt:  PDGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSPSGTDPEVLHLNDSER

Query:  RQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDSSDVVTFDHSIAEFEKESSA-RNIEGGPS
         Q+E+VDT  LNN  STDGF+DHNFGK+SKD+V+ IILDD QL SQ AFLD  +VSSTP CKNS  DTA DSSDVVTF HSIA  EKESS+ RNIEGGPS
Subjt:  RQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDSSDVVTFDHSIAEFEKESSA-RNIEGGPS

Query:  MVLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIE
        + LPR +  VD RP+ILSSDEQDVA LNGSL EDD TFLSALHDV+DPFSS+  HS SHQSLVAPPTG E++LLPGL++KS EV+ + HDRSLWT  EIE
Subjt:  MVLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIE

Query:  TATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVR
        TATGNADSTL+EHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVL+PIRKQVMQQLT GFPTKKWGSDHIALATELAFV 
Subjt:  TATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVR

Query:  SSEE
        S EE
Subjt:  SSEE

XP_011652490.1 carbon catabolite repressor protein 4 homolog 6 isoform X2 [Cucumis sativus]0.0e+0071.87Show/hide
Query:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYRAGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYPR
        MRR ATPPPPLHQLS AV  ATTATNTS AMSSRPPYR G Y RHRG+SSERPYSGG+GQ V+GDSHFQSV+ESNLGFRQGERGG+ NNAGS+TAP+ PR
Subjt:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYRAGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYPR

Query:  PPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHI
        PPSF GNHQFRQAPPSSQRHQYRGP+PH H+QQPPSFNQNQGVRMPQQ R RPPK LDFRHWDYAKT PP TCERFSILSYNILADYLAM+HKQKLYHHI
Subjt:  PPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHI

Query:  PPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFS
        P YMLDWEWRKN+ILFELGLWSTDIMCFQ       ++ +   D     I K  T + V  CA+ ++++                     R K L + F 
Subjt:  PPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFS

Query:  KCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERK---GLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCGDF
        +    F K+G LR + VA++       +  D     P + S   R     + V YN    +         +   VRVLLEKAHAISKIW+NAPIVLCGDF
Subjt:  KCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERK---GLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCGDF

Query:  NCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHSFMENENLPSVNRFLPP
        NCTPKSALYNFISEQKLDLS LDRDKVSGQSSAEI+QPSSLYRNP  Q  N SVPLQ R ESS  E KPD+S+SDIQKQ+CSHS M+NENL S N  L P
Subjt:  NCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHSFMENENLPSVNRFLPP

Query:  DGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSPSGTDPEVLHLNDSERR
        D S++  DALDTSCN+LQLGMKGTTLHSEGQKESQHSALFDHKNVGETT CEKTDSFNE S TCA+DEF  GHTSK++GEL SP GTDP+V HLN++E+R
Subjt:  DGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSPSGTDPEVLHLNDSERR

Query:  QMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDSSDVVTFDHSIAEFEKE-SSARNIEGGPSM
        Q+E+V    LNN SSTDGF+DHN  K+SKD+V+ IILDD QL S+  FLD  +VSSTP CKNS  DTA DS DVVT DHSIAE EKE SSARNIEGGPS+
Subjt:  QMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDSSDVVTFDHSIAEFEKE-SSARNIEGGPSM

Query:  VLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIET
         LPR + VVD RP+ILSSDEQDVA LNGSLTEDD TFLSALHDV+DPFS E  HS SHQSLVAP TG ++DLLPGLN+KS EV+N+ HDRSLWT  +IET
Subjt:  VLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIET

Query:  ATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRS
        ATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTV+VLAPIRKQVMQQLT GFPTKKWGSDHIALATELAFV S
Subjt:  ATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRS

Query:  SEE
         EE
Subjt:  SEE

XP_038897964.1 carbon catabolite repressor protein 4 homolog 6-like isoform X1 [Benincasa hispida]0.0e+0073.27Show/hide
Query:  MRRVATPPPPLHQLSVAV--------ATTATNTSAAMSSRPPYRAGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHT
        MRR ATPPPPLHQLSVAV        ATTATNTSA MSSRPPYR GRY  HRGFSSERPYSGG+GQ VTGDSHFQSVRESNLGF++GERGGFANNAG ++
Subjt:  MRRVATPPPPLHQLSVAV--------ATTATNTSAAMSSRPPYRAGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHT

Query:  APQYPRPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQ
        A Q PRPPSF GNHQFRQAPPS+QRHQYRGPHPH H QQPPSFNQNQGV MPQQ+RPRPPK LD+RHWDYAKTPPPSTCERFSILSYNILADYLAM+HKQ
Subjt:  APQYPRPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQ

Query:  KLYHHIPPYMLDWEWRKNNILFELGLWSTDIMCFQS----FRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFR
        KLY HIP YMLDWEWRKN+ILFELGLWSTDIMCFQ       LE     + F        K  T   V  CA+ + ++                     R
Subjt:  KLYHHIPPYMLDWEWRKNNILFELGLWSTDIMCFQS----FRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFR

Query:  TKCLGKGFSKCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLL----VFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWD
         K L +        F K+G LR + VA++              + PP ++    K ++    V YN    +         +   VRVLLEKAHAISK WD
Subjt:  TKCLGKGFSKCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLL----VFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWD

Query:  NAPIVLCGDFNCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHSFMENEN
        NAPIVLCGDFNCTPKSALYNFISEQKLDLS LDRDKVSGQSSAEI+QPSS +RNP LQT N SVPLQ R ESS IERK D+SLSDIQKQ+CS S MENEN
Subjt:  NAPIVLCGDFNCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHSFMENEN

Query:  LPSVNRFLPPDGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSPSGTDPE
        LPSVN  LPPD S+++FDA DTSCNDLQLGMKGTTLHSEG+KESQ SALFDHKN GETTCCEKTDSFNE+S TCAKDEFT GHTSKKVGEL SP GTDPE
Subjt:  LPSVNRFLPPDGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSPSGTDPE

Query:  VLHLNDSERRQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDSSDVVTFDHSIAEFEKESSA
        ++HLN++ERRQME+ D S L NKSSTDG++DHNFGKESKDTVD +ILDDAQLYSQ    DS +VSSTPACKNS  +TA DSSDVVTFD S  EFEKESS+
Subjt:  VLHLNDSERRQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDSSDVVTFDHSIAEFEKESSA

Query:  -RNIEGGPSMVLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDR
         RNIEGGPS  LP  DS +D RPKI  SDEQDVA LNGSLTEDD TFLSALH V+DPFSS+ HHS  H++LV PPTGVEDDLLPGLN+KSFEV+N+THDR
Subjt:  -RNIEGGPSMVLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDR

Query:  SLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA
        SLWT MEIETATGNAD TLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA
Subjt:  SLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA

Query:  LATELAFVRSSEE
        LATELAFVRS EE
Subjt:  LATELAFVRSSEE

XP_038897965.1 carbon catabolite repressor protein 4 homolog 6-like isoform X2 [Benincasa hispida]0.0e+0073.52Show/hide
Query:  MRRVATPPPPLHQLSVAV--------ATTATNTSAAMSSRPPYRAGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHT
        MRR ATPPPPLHQLSVAV        ATTATNTSA MSSRPPYR GRY  HRGFSSERPYSGG+GQ VTGDSHFQSVRESNLGF++GERGGFANNAG ++
Subjt:  MRRVATPPPPLHQLSVAV--------ATTATNTSAAMSSRPPYRAGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHT

Query:  APQYPRPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQ
        A Q PRPPSF GNHQFRQAPPS+QRHQYRGPHPH H QQPPSFNQNQGV MPQQ+RPRPPK LD+RHWDYAKTPPPSTCERFSILSYNILADYLAM+HKQ
Subjt:  APQYPRPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQ

Query:  KLYHHIPPYMLDWEWRKNNILFELGLWSTDIMCFQS----FRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFR
        KLY HIP YMLDWEWRKN+ILFELGLWSTDIMCFQ       LE     + F        K  T   V  CA+ + ++                     R
Subjt:  KLYHHIPPYMLDWEWRKNNILFELGLWSTDIMCFQS----FRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFR

Query:  TKCLGKGFSKCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGE-ELMPPTNSPGERKGLL----VFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIW
         K L +        F K+G LR + VA++    L  +  DG+  + PP ++    K ++    V YN    +         +   VRVLLEKAHAISK W
Subjt:  TKCLGKGFSKCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGE-ELMPPTNSPGERKGLL----VFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIW

Query:  DNAPIVLCGDFNCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHSFMENE
        DNAPIVLCGDFNCTPKSALYNFISEQKLDLS LDRDKVSGQSSAEI+QPSS +RNP LQT N SVPLQ R ESS IERK D+SLSDIQKQ+CS S MENE
Subjt:  DNAPIVLCGDFNCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHSFMENE

Query:  NLPSVNRFLPPDGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSPSGTDP
        NLPSVN  LPPD S+++FDA DTSCNDLQLGMKGTTLHSEG+KESQ SALFDHKN GETTCCEKTDSFNE+S TCAKDEFT GHTSKKVGEL SP GTDP
Subjt:  NLPSVNRFLPPDGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSPSGTDP

Query:  EVLHLNDSERRQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDSSDVVTFDHSIAEFEKESS
        E++HLN++ERRQME+ D S L NKSSTDG++DHNFGKESKDTVD +ILDDAQLYSQ    DS +VSSTPACKNS  +TA DSSDVVTFD S  EFEKESS
Subjt:  EVLHLNDSERRQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDSSDVVTFDHSIAEFEKESS

Query:  A-RNIEGGPSMVLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHD
        + RNIEGGPS  LP  DS +D RPKI  SDEQDVA LNGSLTEDD TFLSALH V+DPFSS+ HHS  H++LV PPTGVEDDLLPGLN+KSFEV+N+THD
Subjt:  A-RNIEGGPSMVLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHD

Query:  RSLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHI
        RSLWT MEIETATGNAD TLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHI
Subjt:  RSLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHI

Query:  ALATELAFVRSSEE
        ALATELAFVRS EE
Subjt:  ALATELAFVRSSEE

TrEMBL top hitse value%identityAlignment
A0A0A0LHB6 Endo/exonuclease/phosphatase domain-containing protein0.0e+0071.87Show/hide
Query:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYRAGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYPR
        MRR ATPPPPLHQLS AV  ATTATNTS AMSSRPPYR G Y RHRG+SSERPYSGG+GQ V+GDSHFQSV+ESNLGFRQGERGG+ NNAGS+TAP+ PR
Subjt:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYRAGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYPR

Query:  PPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHI
        PPSF GNHQFRQAPPSSQRHQYRGP+PH H+QQPPSFNQNQGVRMPQQ R RPPK LDFRHWDYAKT PP TCERFSILSYNILADYLAM+HKQKLYHHI
Subjt:  PPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHI

Query:  PPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFS
        P YMLDWEWRKN+ILFELGLWSTDIMCFQ       ++ +   D     I K  T + V  CA+ ++++                     R K L + F 
Subjt:  PPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFS

Query:  KCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERK---GLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCGDF
        +    F K+G LR + VA++       +  D     P + S   R     + V YN    +         +   VRVLLEKAHAISKIW+NAPIVLCGDF
Subjt:  KCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERK---GLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCGDF

Query:  NCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHSFMENENLPSVNRFLPP
        NCTPKSALYNFISEQKLDLS LDRDKVSGQSSAEI+QPSSLYRNP  Q  N SVPLQ R ESS  E KPD+S+SDIQKQ+CSHS M+NENL S N  L P
Subjt:  NCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHSFMENENLPSVNRFLPP

Query:  DGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSPSGTDPEVLHLNDSERR
        D S++  DALDTSCN+LQLGMKGTTLHSEGQKESQHSALFDHKNVGETT CEKTDSFNE S TCA+DEF  GHTSK++GEL SP GTDP+V HLN++E+R
Subjt:  DGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSPSGTDPEVLHLNDSERR

Query:  QMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDSSDVVTFDHSIAEFEKE-SSARNIEGGPSM
        Q+E+V    LNN SSTDGF+DHN  K+SKD+V+ IILDD QL S+  FLD  +VSSTP CKNS  DTA DS DVVT DHSIAE EKE SSARNIEGGPS+
Subjt:  QMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDSSDVVTFDHSIAEFEKE-SSARNIEGGPSM

Query:  VLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIET
         LPR + VVD RP+ILSSDEQDVA LNGSLTEDD TFLSALHDV+DPFS E  HS SHQSLVAP TG ++DLLPGLN+KS EV+N+ HDRSLWT  +IET
Subjt:  VLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIET

Query:  ATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRS
        ATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTV+VLAPIRKQVMQQLT GFPTKKWGSDHIALATELAFV S
Subjt:  ATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRS

Query:  SEE
         EE
Subjt:  SEE

A0A1S3CRA9 carbon catabolite repressor protein 4 homolog 6 isoform X10.0e+0071.27Show/hide
Query:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYR-AGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYP
        MRR ATPPPPLHQLS AV  ATTATNTS AMSSR PYR  GRY RHRGFSSERPYSGG+GQ V+ DSHFQSV+ESNLGFRQGERGG+ NNAGS+TAP+ P
Subjt:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYR-AGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYP

Query:  RPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHH
        RPPSF GNHQFRQAPPS+QRHQYRGPHPH H+QQPPSFNQNQGVRMPQQ RPRPPK LDFRHWDYAKT PPSTCERFSILSYNILADYLAM+HK KLYHH
Subjt:  RPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHH

Query:  IPPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGF
        IP YMLDWEWRKN+ILFELGLWSTDIMCFQ       ++ +   D     I K  T + V  CA+ ++++                     R K L + F
Subjt:  IPPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGF

Query:  SKCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERK----GLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCG
         +    F ++G LR + VA++              + PP ++    +     + V YN    +         +   VRVLLEKAH ISKIW+NAP+VLCG
Subjt:  SKCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERK----GLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCG

Query:  DFNCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHSFMENENLPSVNRFL
        DFNCTPKSALYNFISEQKLDLS LDRDKVSGQSSAEI+QPSSLYRNP  Q  N SVPLQ R ESS  ERKPD+S++DIQKQ+CSH+ +ENENL SVN  L
Subjt:  DFNCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHSFMENENLPSVNRFL

Query:  PPDGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSPSGTDPEVLHLNDSE
         PD S+   DALDTSCN+ QLGMKGTTLHSEGQKE QHSALFDHKNVGETT CEKTD FNE S TCA+DEF  GHTSK+VGEL SP GTDPEVLHLN++E
Subjt:  PPDGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSPSGTDPEVLHLNDSE

Query:  RRQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDSSDVVTFDHSIAEFEKESSA-RNIEGGP
          Q+E+VDT  LNN  STDGF+DHNFGK+SKD+V+ IILDD QL SQ AFLD  +VSSTP CKNS  DTA DSSDVVTF HSIA  EKESS+ RNIEGGP
Subjt:  RRQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDSSDVVTFDHSIAEFEKESSA-RNIEGGP

Query:  SMVLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEI
        S+ LPR +  VD RP+ILSSDEQDVA LNGSL EDD TFLSALHDV+DPFSS+  HS SHQSLVAPPTG E++LLPGL++KS EV+ + HDRSLWT  EI
Subjt:  SMVLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEI

Query:  ETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFV
        ETATGNADSTL+EHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVL+PIRKQVMQQLT GFPTKKWGSDHIALATELAFV
Subjt:  ETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFV

Query:  RSSEE
         S EE
Subjt:  RSSEE

A0A1S3CSF9 carbon catabolite repressor protein 4 homolog 6 isoform X20.0e+0071.57Show/hide
Query:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYR-AGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYP
        MRR ATPPPPLHQLS AV  ATTATNTS AMSSR PYR  GRY RHRGFSSERPYSGG+GQ V+ DSHFQSV+ESNLGFRQGERGG+ NNAGS+TAP+ P
Subjt:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYR-AGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYP

Query:  RPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHH
        RPPSF GNHQFRQAPPS+QRHQYRGPHPH H+QQPPSFNQNQGVRMPQQ RPRPPK LDFRHWDYAKT PPSTCERFSILSYNILADYLAM+HK KLYHH
Subjt:  RPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHH

Query:  IPPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGF
        IP YMLDWEWRKN+ILFELGLWSTDIMCFQ       ++ +   D     I K  T + V  CA+ ++++                     R K L + F
Subjt:  IPPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGF

Query:  SKCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERK---GLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCGD
         +    F ++G LR + VA++       +  D     P + S   R     + V YN    +         +   VRVLLEKAH ISKIW+NAP+VLCGD
Subjt:  SKCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERK---GLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCGD

Query:  FNCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHSFMENENLPSVNRFLP
        FNCTPKSALYNFISEQKLDLS LDRDKVSGQSSAEI+QPSSLYRNP  Q  N SVPLQ R ESS  ERKPD+S++DIQKQ+CSH+ +ENENL SVN  L 
Subjt:  FNCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHSFMENENLPSVNRFLP

Query:  PDGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSPSGTDPEVLHLNDSER
        PD S+   DALDTSCN+ QLGMKGTTLHSEGQKE QHSALFDHKNVGETT CEKTD FNE S TCA+DEF  GHTSK+VGEL SP GTDPEVLHLN++E 
Subjt:  PDGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSPSGTDPEVLHLNDSER

Query:  RQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDSSDVVTFDHSIAEFEKESSA-RNIEGGPS
         Q+E+VDT  LNN  STDGF+DHNFGK+SKD+V+ IILDD QL SQ AFLD  +VSSTP CKNS  DTA DSSDVVTF HSIA  EKESS+ RNIEGGPS
Subjt:  RQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDSSDVVTFDHSIAEFEKESSA-RNIEGGPS

Query:  MVLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIE
        + LPR +  VD RP+ILSSDEQDVA LNGSL EDD TFLSALHDV+DPFSS+  HS SHQSLVAPPTG E++LLPGL++KS EV+ + HDRSLWT  EIE
Subjt:  MVLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIE

Query:  TATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVR
        TATGNADSTL+EHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVL+PIRKQVMQQLT GFPTKKWGSDHIALATELAFV 
Subjt:  TATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVR

Query:  SSEE
        S EE
Subjt:  SSEE

A0A5A7T7G2 Carbon catabolite repressor protein 4-like protein 6 isoform X10.0e+0071.27Show/hide
Query:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYR-AGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYP
        MRR ATPPPPLHQLS AV  ATTATNTS AMSSR PYR  GRY RHRGFSSERPYSGG+GQ V+ DSHFQSV+ESNLGFRQGERGG+ NNAGS+TAP+ P
Subjt:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYR-AGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYP

Query:  RPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHH
        RPPSF GNHQFRQAPPS+QRHQYRGPHPH H+QQPPSFNQNQGVRMPQQ RPRPPK LDFRHWDYAKT PPSTCERFSILSYNILADYLAM+HK KLYHH
Subjt:  RPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHH

Query:  IPPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGF
        IP YMLDWEWRKN+ILFELGLWSTDIMCFQ       ++ +   D     I K  T + V  CA+ ++++                     R K L + F
Subjt:  IPPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGF

Query:  SKCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERK----GLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCG
         +    F ++G LR + VA++              + PP ++    +     + V YN    +         +   VRVLLEKAH ISKIW+NAP+VLCG
Subjt:  SKCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERK----GLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCG

Query:  DFNCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHSFMENENLPSVNRFL
        DFNCTPKSALYNFISEQKLDLS LDRDKVSGQSSAEI+QPSSLYRNP  Q  N SVPLQ R ESS  ERKPD+S++DIQKQ+CSH+ +ENENL SVN  L
Subjt:  DFNCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHSFMENENLPSVNRFL

Query:  PPDGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSPSGTDPEVLHLNDSE
         PD S+   DALDTSCN+ QLGMKGTTLHSEGQKE QHSALFDHKNVGETT CEKTD FNE S TCA+DEF  GHTSK+VGEL SP GTDPEVLHLN++E
Subjt:  PPDGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSPSGTDPEVLHLNDSE

Query:  RRQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDSSDVVTFDHSIAEFEKESSA-RNIEGGP
          Q+E+VDT  LNN  STDGF+DHNFGK+SKD+V+ IILDD QL SQ AFLD  +VSSTP CKNS  DTA DSSDVVTF HSIA  EKESS+ RNIEGGP
Subjt:  RRQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDSSDVVTFDHSIAEFEKESSA-RNIEGGP

Query:  SMVLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEI
        S+ LPR +  VD RP+ILSSDEQDVA LNGSL EDD TFLSALHDV+DPFSS+  HS SHQSLVAPPTG E++LLPGL++KS EV+ + HDRSLWT  EI
Subjt:  SMVLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEI

Query:  ETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFV
        ETATGNADSTL+EHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVL+PIRKQVMQQLT GFPTKKWGSDHIALATELAFV
Subjt:  ETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFV

Query:  RSSEE
         S EE
Subjt:  RSSEE

A0A6J1BSH6 carbon catabolite repressor protein 4 homolog 6 isoform X29.0e-30664.87Show/hide
Query:  MRRVATPPPPLHQLSVAVATTAT-NTSAAMSSRPPYRAGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYPRP
        MRR AT PPPL QLS AVAT A+  TSAAMSSRPPYR GRY+R  GFSSERPYSGGKGQ V+GDSH+QSVRESNLGFRQGE G FANNAGS+TAPQ PRP
Subjt:  MRRVATPPPPLHQLSVAVATTAT-NTSAAMSSRPPYRAGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYPRP

Query:  PSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIP
        P + G HQFRQA P  Q HQYRGPHPH H+QQP SFNQNQGVR PQ+ RPRPPK  D+RHWDYAKT  PSTCERF+ILSYNILADYLAM+HKQKLYHHIP
Subjt:  PSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIP

Query:  PYMLDWEWRKNNILFELGLWSTDIMCFQS----FRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKG
         YMLDWEWRK N+LFELGLWSTDIMCFQ       LE     + F        K  T + V  CA+ ++++  +   E                +C+   
Subjt:  PYMLDWEWRKNNILFELGLWSTDIMCFQS----FRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKG

Query:  FSKCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLL----VFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLC
               F K+G LR + VA++       +  D  +  PP ++    K ++    V YN    +         +   VR+LLEKAHAISKIW+NAPIVLC
Subjt:  FSKCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLL----VFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLC

Query:  GDFNCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHSFMENENLPSVNRF
        GDFNCTPKSALYNFISEQKL+LS LDRDKVSGQSSAEI+QPSSLYRNP  QT + SVPLQLR +S  IERKPD+ LSD++ Q+  HS MENENLPSVN+ 
Subjt:  GDFNCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHSFMENENLPSVNRF

Query:  LPPDGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSPSGTDPEVLHLNDS
        LPPD S+ +    + SCNDLQLGMKG  LHSEGQKE Q+ A F HKNVGETT C   DSF +SS  CA+DEFT  H S+KV EL SP GTD E LHLN +
Subjt:  LPPDGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSPSGTDPEVLHLNDS

Query:  ERRQMEEVDTSS-LNNKSSTDGF-QDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDSSDVVTFDHSIAEFEKESSARNIEG
        ERRQME+VD++S L ++SSTD   +D NF K+SK+ V+N+ILDD     Q   L S +V  TPAC+N   D A DSS+VV F H IAEFEKESSARNI+G
Subjt:  ERRQMEEVDTSS-LNNKSSTDGF-QDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDSSDVVTFDHSIAEFEKESSARNIEG

Query:  GPSMVLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDV-KDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTS
        GPS+  P  +  VD R KILSSDEQDVA L+GSLTEDD TFLSALHD+ ++PFSSE H SV HQSLVAP TGV DD LPGLN+KSFEV+N  HDRSLWT 
Subjt:  GPSMVLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDV-KDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTS

Query:  MEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATEL
        MEIE ATGN D TL+EH L+LRSTYTE ED SGTRDLN EPL TSYNRCFLGTVDYIWRSEGLQTV+VLAPI+K VM +LTPGFPTKKWGSDHIALA EL
Subjt:  MEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATEL

Query:  AFVRSSEE
        AF R  EE
Subjt:  AFVRSSEE

SwissProt top hitse value%identityAlignment
B2RYM0 Protein angel homolog 13.5e-1226.85Show/hide
Query:  PHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDF-----RHW-DYAKTPPPSTCE-------RFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWRKNN
        P P  Q+  S    +G+   +QL+P PP  + +     R W D++  P     E       +F+++SYNILA  L M    +LY H  P +L+W +R  N
Subjt:  PHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDF-----RHW-DYAKTPPPSTCE-------RFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWRKNN

Query:  ILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGRLRG
        ++ E   W  DI+C Q  +           D     ++    ++   C  + + T  ++ G C   Y+  R    FR  C     S  ++  P +  L  
Subjt:  ILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGRLRG

Query:  EGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWD--NAPIVLCGDFNCTPKSALYNFISEQ
        + V  V    L +  L  E L   + +P       V YN            + + + + +LL +   ++++ D  + PI+LCGD N  P S LYNFI + 
Subjt:  EGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWD--NAPIVLCGDFNCTPKSALYNFISEQ

Query:  KLDLSELDRDKVSGQS--SAEIYQ
        +L  + +   KVSGQ   S ++YQ
Subjt:  KLDLSELDRDKVSGQS--SAEIYQ

Q0WKY2 Carbon catabolite repressor protein 4 homolog 51.8e-2147.17Show/hide
Query:  WTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALA
        W+  E++ ATG  ++T ++H L+L S Y+       TRD  GEPLAT+Y+  FLGTVDYIW ++ L  VRVL  +   V+++ T G P++ WGSDH+A+A
Subjt:  WTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALA

Query:  TELAFV
         EL FV
Subjt:  TELAFV

Q0WKY2 Carbon catabolite repressor protein 4 homolog 58.5e-1124.12Show/hide
Query:  PHHQQPPSFNQ---------NQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWRKNNILFELG
        PH    P F+Q            +R  ++ + +    ++ R W ++     +  ++  ++SYN+L    A NH   LY+++P   L+W  RK+ I  E+ 
Subjt:  PHHQQPPSFNQ---------NQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWRKNNILFELG

Query:  LWSTDIMCFQSFRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGRLRGEGVAEV
         ++  I+C Q            F D D+L +K             F+       GE             F  + L +        F K G      VA++
Subjt:  LWSTDIMCFQSFRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGRLRGEGVAEV

Query:  FSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTPKSALYNFISEQKLDLSELD
            L +N  +  +      S   R+ ++   +      R    + +    VR+ LEKA+ +S+ W N P+ + GD N TP+SA+Y+FI+   LD    D
Subjt:  FSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTPKSALYNFISEQKLDLSELD

Query:  RDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESS
        R ++SGQ+  E  + S  +RN +  + ++S+   L  E S
Subjt:  RDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESS

Q8VCU0 Protein angel homolog 13.5e-1226.85Show/hide
Query:  PHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDF-----RHW-DYAKTPPPSTCE-------RFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWRKNN
        P P  Q+  S    +G+   +QL+P PP  + +     R W D++  P     E       +F+++SYNILA  L M    +LY H  P +L+W +R  N
Subjt:  PHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDF-----RHW-DYAKTPPPSTCE-------RFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWRKNN

Query:  ILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGRLRG
        ++ E   W  DI+C Q  +           D     ++    ++   C  + + T  ++ G C   Y+  R    FR  C     S  ++  P +  L  
Subjt:  ILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGRLRG

Query:  EGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWD--NAPIVLCGDFNCTPKSALYNFISEQ
        + V  V    L +  L  E L   + +P       V YN            + + + + +LL +   ++++ D  + PI+LCGD N  P S LYNFI + 
Subjt:  EGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWD--NAPIVLCGDFNCTPKSALYNFISEQ

Query:  KLDLSELDRDKVSGQS--SAEIYQ
        +L  + +   KVSGQ   S ++YQ
Subjt:  KLDLSELDRDKVSGQS--SAEIYQ

Q8VYU4 Carbon catabolite repressor protein 4 homolog 61.6e-10235.08Show/hide
Query:  MRRVATPPPPLHQLSVAVATTATNTSAAMSSRPPYRA----GRYERHRGFSSERPYS--GGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAP
        MRR          ++ A A+T +     MS+R PYR     GR    R F S+RPY+   G+ Q VTGDSHFQSV ++N  FR GE          H  P
Subjt:  MRRVATPPPPLHQLSVAVATTATNTSAAMSSRPPYRA----GRYERHRGFSSERPYS--GGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAP

Query:  QYPR-PPSFRGNHQFRQAPPS-SQRHQYRGPHPHPHHQQ-----PPSFNQNQGVRMP--QQLRPRP-PKRLDFRHWDYAKTPPPSTCERFSILSYNILAD
           R  P F  N++FR  PPS  Q  Q+R P+  P +Q      PP F QNQ  R P  Q  R RP  K  D+R W+YAKTPP    E+F +LSYNILAD
Subjt:  QYPR-PPSFRGNHQFRQAPPS-SQRHQYRGPHPHPHHQQ-----PPSFNQNQGVRMP--QQLRPRP-PKRLDFRHWDYAKTPPPSTCERFSILSYNILAD

Query:  YLAMNHKQKLYHHIPPYMLDWEWRKNNILFELGLWSTDIMC------FQSFRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAY
        YLA +H + LY HIP  ML W WRK+ ++FEL LWS DIMC      FQ    E+ +     I       K  T   V  CA                  
Subjt:  YLAMNHKQKLYHHIPPYMLDWEWRKNNILFELGLWSTDIMC------FQSFRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAY

Query:  QELRNDVFFRTKCLGKGFSKCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHA
              +F+R+    K   +    F ++G LR           L  +     E  PP +S G  + ++   +      R    + +    VR LL+KAHA
Subjt:  QELRNDVFFRTKCLGKGFSKCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHA

Query:  ISKIWDNAPIVLCGDFNCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHS
        +SK+WD+APIVLCGDFNCTPKS LYNFIS++KLDLS L RDKVSGQ SAE   P         Q+ N S   Q++P + +                 +++
Subjt:  ISKIWDNAPIVLCGDFNCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHS

Query:  FMENENLPSVNRFLPPDGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSP
         MEN              SN+                                      +VG T   EKT     S   C  D   AGH +    +   P
Subjt:  FMENENLPSVNRFLPPDGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSP

Query:  SGTDPEVLHLNDSERRQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDS--SDVVTFDHSIA
                                   N +S     D  FG E++   D+  L  A+  S     D+    ++ A ++  TD +  S  S+       I 
Subjt:  SGTDPEVLHLNDSERRQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDS--SDVVTFDHSIA

Query:  EFEKE-SSARNIEGGPSMVLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFE
          +++ SS+ + +    +   + D +    P + + DE+       SL ED  TFL+ LHD  +  S +                +  ++    +S++  
Subjt:  EFEKE-SSARNIEGGPSMVLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFE

Query:  VKNITHDRSLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTK
           IT+  S WT MEI TATG+ + T +EH+L L+STY+E E  + TRD NGEP+ TSY+RCF+GTVDYIWRSEGLQTVRVLAPI KQ M Q TPGFPT 
Subjt:  VKNITHDRSLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTK

Query:  KWGSDHIALATELAFVRS
        KWGSDHIAL +ELAF  S
Subjt:  KWGSDHIALATELAFVRS

Q9LS39 Carbon catabolite repressor protein 4 homolog 35.0e-1947.66Show/hide
Query:  SLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA
        S WT  EI  ATG  +S    H L+L S+Y   +  + TRD  GEPLATSY+  FLGTVDY+W S+GL   RVL  +   V+ + T G P ++ GSDH+A
Subjt:  SLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA

Query:  LATELAF
        L +E  F
Subjt:  LATELAF

Q9LS39 Carbon catabolite repressor protein 4 homolog 33.7e-1428.79Show/hide
Query:  PPSSQRHQY--RGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWR
        P SS    Y  R  +P P  Q P     +Q                  R W D   TP     ERF+++SYNIL D  +  H++ LY ++    L W +R
Subjt:  PPSSQRHQY--RGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWR

Query:  KNNILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGR
        K  I  EL   + DI+  Q    E+                             F L ++       G+Y+    D           F K D    + G 
Subjt:  KNNILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGR

Query:  LRGEGVA-EVFSGPLGINGLDGEELMPPTNSPGERKGLL----VFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTPKSALY
        L  E +    F     +  L   EL     S   RK LL    V YN ++         + +   VR L  KAH +SK W + PIVLCGDFN TPKS LY
Subjt:  LRGEGVA-EVFSGPLGINGLDGEELMPPTNSPGERKGLL----VFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTPKSALY

Query:  NFISEQKLDLSELDRDKVSGQSS
        NF++  +L++ E D+ ++SGQ +
Subjt:  NFISEQKLDLSELDRDKVSGQSS

Arabidopsis top hitse value%identityAlignment
AT1G73875.1 DNAse I-like superfamily protein1.3e-2247.17Show/hide
Query:  WTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALA
        W+  E++ ATG  ++T ++H L+L S Y+       TRD  GEPLAT+Y+  FLGTVDYIW ++ L  VRVL  +   V+++ T G P++ WGSDH+A+A
Subjt:  WTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALA

Query:  TELAFV
         EL FV
Subjt:  TELAFV

AT1G73875.1 DNAse I-like superfamily protein6.0e-1224.12Show/hide
Query:  PHHQQPPSFNQ---------NQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWRKNNILFELG
        PH    P F+Q            +R  ++ + +    ++ R W ++     +  ++  ++SYN+L    A NH   LY+++P   L+W  RK+ I  E+ 
Subjt:  PHHQQPPSFNQ---------NQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWRKNNILFELG

Query:  LWSTDIMCFQSFRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGRLRGEGVAEV
         ++  I+C Q            F D D+L +K             F+       GE             F  + L +        F K G      VA++
Subjt:  LWSTDIMCFQSFRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGRLRGEGVAEV

Query:  FSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTPKSALYNFISEQKLDLSELD
            L +N  +  +      S   R+ ++   +      R    + +    VR+ LEKA+ +S+ W N P+ + GD N TP+SA+Y+FI+   LD    D
Subjt:  FSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTPKSALYNFISEQKLDLSELD

Query:  RDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESS
        R ++SGQ+  E  + S  +RN +  + ++S+   L  E S
Subjt:  RDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESS

AT3G18500.1 DNAse I-like superfamily protein3.5e-2047.66Show/hide
Query:  SLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA
        S WT  EI  ATG  +S    H L+L S+Y   +  + TRD  GEPLATSY+  FLGTVDY+W S+GL   RVL  +   V+ + T G P ++ GSDH+A
Subjt:  SLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA

Query:  LATELAF
        L +E  F
Subjt:  LATELAF

AT3G18500.1 DNAse I-like superfamily protein1.4e-1327.36Show/hide
Query:  PPSSQRHQY--RGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWR
        P SS    Y  R  +P P  Q P     +Q                  R W D   TP     ERF+++SYNIL D  +  H++ LY ++    L W +R
Subjt:  PPSSQRHQY--RGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWR

Query:  KNNILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGR
        K  I  EL   + DI+   S +   G+     +D   +  K     V+ R  +EF    ++     Q A  ELR     R   LG               
Subjt:  KNNILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGR

Query:  LRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTPKSALYNFISE
                                            + V YN ++         + +   VR L  KAH +SK W + PIVLCGDFN TPKS LYNF++ 
Subjt:  LRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTPKSALYNFISE

Query:  QKLDLSELDRDKVSGQSS
         +L++ E D+ ++SGQ +
Subjt:  QKLDLSELDRDKVSGQSS

AT3G18500.2 DNAse I-like superfamily protein3.5e-2047.66Show/hide
Query:  SLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA
        S WT  EI  ATG  +S    H L+L S+Y   +  + TRD  GEPLATSY+  FLGTVDY+W S+GL   RVL  +   V+ + T G P ++ GSDH+A
Subjt:  SLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA

Query:  LATELAF
        L +E  F
Subjt:  LATELAF

AT3G18500.2 DNAse I-like superfamily protein2.6e-1528.79Show/hide
Query:  PPSSQRHQY--RGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWR
        P SS    Y  R  +P P  Q P     +Q                  R W D   TP     ERF+++SYNIL D  +  H++ LY ++    L W +R
Subjt:  PPSSQRHQY--RGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWR

Query:  KNNILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGR
        K  I  EL   + DI+  Q    E+                             F L ++       G+Y+    D           F K D    + G 
Subjt:  KNNILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGR

Query:  LRGEGVA-EVFSGPLGINGLDGEELMPPTNSPGERKGLL----VFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTPKSALY
        L  E +    F     +  L   EL     S   RK LL    V YN ++         + +   VR L  KAH +SK W + PIVLCGDFN TPKS LY
Subjt:  LRGEGVA-EVFSGPLGINGLDGEELMPPTNSPGERKGLL----VFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTPKSALY

Query:  NFISEQKLDLSELDRDKVSGQSS
        NF++  +L++ E D+ ++SGQ +
Subjt:  NFISEQKLDLSELDRDKVSGQSS

AT3G18500.3 DNAse I-like superfamily protein3.5e-2047.66Show/hide
Query:  SLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA
        S WT  EI  ATG  +S    H L+L S+Y   +  + TRD  GEPLATSY+  FLGTVDY+W S+GL   RVL  +   V+ + T G P ++ GSDH+A
Subjt:  SLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA

Query:  LATELAF
        L +E  F
Subjt:  LATELAF

AT3G18500.3 DNAse I-like superfamily protein2.6e-1528.79Show/hide
Query:  PPSSQRHQY--RGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWR
        P SS    Y  R  +P P  Q P     +Q                  R W D   TP     ERF+++SYNIL D  +  H++ LY ++    L W +R
Subjt:  PPSSQRHQY--RGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWR

Query:  KNNILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGR
        K  I  EL   + DI+  Q    E+                             F L ++       G+Y+    D           F K D    + G 
Subjt:  KNNILFELGLWSTDIMCFQSFRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGR

Query:  LRGEGVA-EVFSGPLGINGLDGEELMPPTNSPGERKGLL----VFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTPKSALY
        L  E +    F     +  L   EL     S   RK LL    V YN ++         + +   VR L  KAH +SK W + PIVLCGDFN TPKS LY
Subjt:  LRGEGVA-EVFSGPLGINGLDGEELMPPTNSPGERKGLL----VFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTPKSALY

Query:  NFISEQKLDLSELDRDKVSGQSS
        NF++  +L++ E D+ ++SGQ +
Subjt:  NFISEQKLDLSELDRDKVSGQSS

AT5G11350.1 DNAse I-like superfamily protein1.1e-10335.08Show/hide
Query:  MRRVATPPPPLHQLSVAVATTATNTSAAMSSRPPYRA----GRYERHRGFSSERPYS--GGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAP
        MRR          ++ A A+T +     MS+R PYR     GR    R F S+RPY+   G+ Q VTGDSHFQSV ++N  FR GE          H  P
Subjt:  MRRVATPPPPLHQLSVAVATTATNTSAAMSSRPPYRA----GRYERHRGFSSERPYS--GGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAP

Query:  QYPR-PPSFRGNHQFRQAPPS-SQRHQYRGPHPHPHHQQ-----PPSFNQNQGVRMP--QQLRPRP-PKRLDFRHWDYAKTPPPSTCERFSILSYNILAD
           R  P F  N++FR  PPS  Q  Q+R P+  P +Q      PP F QNQ  R P  Q  R RP  K  D+R W+YAKTPP    E+F +LSYNILAD
Subjt:  QYPR-PPSFRGNHQFRQAPPS-SQRHQYRGPHPHPHHQQ-----PPSFNQNQGVRMP--QQLRPRP-PKRLDFRHWDYAKTPPPSTCERFSILSYNILAD

Query:  YLAMNHKQKLYHHIPPYMLDWEWRKNNILFELGLWSTDIMC------FQSFRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAY
        YLA +H + LY HIP  ML W WRK+ ++FEL LWS DIMC      FQ    E+ +     I       K  T   V  CA                  
Subjt:  YLAMNHKQKLYHHIPPYMLDWEWRKNNILFELGLWSTDIMC------FQSFRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAY

Query:  QELRNDVFFRTKCLGKGFSKCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHA
              +F+R+    K   +    F ++G LR           L  +     E  PP +S G  + ++   +      R    + +    VR LL+KAHA
Subjt:  QELRNDVFFRTKCLGKGFSKCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHA

Query:  ISKIWDNAPIVLCGDFNCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHS
        +SK+WD+APIVLCGDFNCTPKS LYNFIS++KLDLS L RDKVSGQ SAE   P         Q+ N S   Q++P + +                 +++
Subjt:  ISKIWDNAPIVLCGDFNCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPWLQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHS

Query:  FMENENLPSVNRFLPPDGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSP
         MEN              SN+                                      +VG T   EKT     S   C  D   AGH +    +   P
Subjt:  FMENENLPSVNRFLPPDGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDSFNESSTTCAKDEFTAGHTSKKVGELFSP

Query:  SGTDPEVLHLNDSERRQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDS--SDVVTFDHSIA
                                   N +S     D  FG E++   D+  L  A+  S     D+    ++ A ++  TD +  S  S+       I 
Subjt:  SGTDPEVLHLNDSERRQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTDTATDS--SDVVTFDHSIA

Query:  EFEKE-SSARNIEGGPSMVLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFE
          +++ SS+ + +    +   + D +    P + + DE+       SL ED  TFL+ LHD  +  S +                +  ++    +S++  
Subjt:  EFEKE-SSARNIEGGPSMVLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLNSKSFE

Query:  VKNITHDRSLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTK
           IT+  S WT MEI TATG+ + T +EH+L L+STY+E E  + TRD NGEP+ TSY+RCF+GTVDYIWRSEGLQTVRVLAPI KQ M Q TPGFPT 
Subjt:  VKNITHDRSLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTK

Query:  KWGSDHIALATELAFVRS
        KWGSDHIAL +ELAF  S
Subjt:  KWGSDHIALATELAFVRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGCGCGTTGCTACTCCTCCTCCTCCGCTCCATCAACTGTCCGTCGCCGTCGCCACGACCGCTACCAACACATCCGCTGCCATGTCTTCTCGACCTCCATACCGAGC
TGGCCGGTACGAACGCCACCGAGGCTTCTCGTCGGAACGGCCATACTCTGGCGGTAAAGGTCAATTAGTCACCGGTGATTCTCATTTTCAGTCTGTTCGGGAGTCAAACC
TAGGGTTCCGGCAAGGAGAGAGGGGAGGCTTTGCGAACAATGCGGGGTCGCATACGGCACCTCAATATCCTAGACCTCCGTCTTTTCGTGGAAATCATCAATTCCGACAG
GCTCCGCCTTCCAGTCAGAGGCACCAGTATCGGGGACCTCATCCTCATCCTCACCATCAGCAGCCACCGTCGTTTAATCAAAATCAAGGTGTTCGTATGCCGCAGCAACT
TCGACCTCGGCCTCCGAAGCGACTAGACTTTCGTCATTGGGATTATGCAAAGACACCACCTCCATCTACTTGCGAGCGGTTTTCAATTCTTTCATACAACATCTTAGCTG
ATTACCTCGCCATGAATCACAAGCAAAAGCTCTACCATCATATTCCCCCTTACATGTTGGATTGGGAGTGGAGGAAAAATAATATTTTATTTGAGCTTGGATTATGGTCT
ACTGACATAATGTGCTTTCAGTCTTTTAGACTGGAGATCGGGAACATGTTCAAACCTTTTATTGATTTCGATCTGTTGCCAATTAAAACCCATACTGATTTGGTAGTGTG
TAGATGCGCACTGGAATTCCAGTTGACGGCTGTGCAATCTTTTGGCGAGTGTCAAGGAGCATACCAGGAGTTAAGGAATGATGTTTTCTTTCGCACAAAATGCCTTGGAA
AAGGCTTCAGCAAGTGTGATTTTCTTTTTCCAAAAGTTGGAAGATTGAGGGGCGAAGGGGTAGCAGAGGTCTTTTCGGGACCCTTGGGAATTAATGGCTTGGATGGAGAA
GAGTTAATGCCGCCTACCAACTCTCCTGGAGAGAGAAAAGGACTACTTGTCTTTTACAATGCTTCAGAGCAACAATCGAGGGACTCTCCTTTCATTAATGAAAGACAATC
GCTGGTCAGGGTTCTCTTGGAGAAGGCTCATGCTATTTCAAAAATCTGGGACAATGCTCCAATCGTTCTCTGTGGGGATTTTAACTGTACACCAAAGAGTGCATTGTATA
ACTTTATTTCAGAGCAGAAGCTAGATTTGTCTGAATTGGATAGAGACAAGGTATCGGGACAATCTTCTGCCGAGATTTATCAACCTTCATCACTCTATCGTAATCCTTGG
CTTCAGACTGACAACAGTTCAGTTCCCCTCCAGCTGAGGCCAGAATCTAGTGTTATTGAAAGGAAGCCAGATAATTCTCTGTCTGACATACAGAAGCAAGAATGTTCACA
TAGCTTCATGGAAAACGAGAATCTTCCATCAGTGAACCGCTTTTTGCCCCCTGATGGTTCTAACGTTATCTTTGATGCACTTGATACTTCTTGTAACGATCTCCAGCTTG
GAATGAAGGGTACTACTCTACATTCTGAAGGCCAAAAGGAAAGTCAGCATAGTGCTTTGTTTGACCACAAAAATGTAGGGGAAACAACTTGCTGTGAGAAGACAGATAGC
TTTAATGAAAGTTCAACCACATGTGCCAAAGATGAGTTTACTGCTGGTCATACCAGTAAAAAAGTTGGTGAACTATTCTCCCCTTCAGGAACCGACCCTGAAGTTCTTCA
TCTGAATGACTCTGAGAGACGACAGATGGAAGAAGTTGATACCTCTAGTTTAAACAATAAATCTTCAACAGATGGTTTTCAGGATCACAATTTTGGCAAGGAAAGCAAAG
ACACTGTTGATAATATTATCTTAGATGACGCACAGCTTTATTCTCAGGCAGCTTTTTTGGATTCAAATAGCGTTTCTTCTACACCTGCTTGCAAAAACTCCACGACCGAC
ACTGCTACAGACTCCTCTGATGTTGTAACTTTTGACCACTCAATTGCTGAATTTGAGAAGGAAAGCTCTGCTAGGAATATTGAAGGTGGCCCATCAATGGTTTTGCCCAG
GACTGACTCGGTGGTGGATGGAAGACCAAAGATTTTATCTTCAGATGAGCAGGATGTAGCTGTGTTAAATGGAAGCTTAACTGAGGATGATCATACATTTCTCTCAGCTC
TGCATGATGTTAAAGATCCCTTTTCATCTGAGAGTCATCATTCTGTTAGTCATCAAAGCTTGGTTGCACCACCCACTGGAGTTGAAGATGATTTGTTGCCAGGATTGAAT
TCCAAGTCTTTTGAAGTCAAAAATATTACTCATGATCGCTCATTATGGACTTCAATGGAAATAGAAACTGCTACTGGCAATGCAGATAGTACTCTAATTGAACACTCTCT
AAGGCTTAGAAGCACGTATACAGAAGCTGAGGACCTTTCTGGGACCAGAGACTTGAATGGGGAACCCCTGGCGACTAGTTACAATAGGTGTTTTCTGGGCACTGTTGACT
ACATATGGCGTTCCGAAGGTCTTCAGACGGTTAGGGTGCTCGCCCCTATAAGAAAACAAGTCATGCAGCAGTTGACTCCTGGATTTCCTACAAAGAAATGGGGCAGCGAT
CACATTGCCTTAGCTACTGAATTGGCGTTTGTAAGGAGTAGTGAAGAGTGA
mRNA sequenceShow/hide mRNA sequence
TGAGGAAGCCCTCCGTGCTATTATTTCGCGAAAGAATCGTCACTGAGTCTTCACACTCTCTCCCGCCATTCAATGAGGCGCGTTGCTACTCCTCCTCCTCCGCTCCATCA
ACTGTCCGTCGCCGTCGCCACGACCGCTACCAACACATCCGCTGCCATGTCTTCTCGACCTCCATACCGAGCTGGCCGGTACGAACGCCACCGAGGCTTCTCGTCGGAAC
GGCCATACTCTGGCGGTAAAGGTCAATTAGTCACCGGTGATTCTCATTTTCAGTCTGTTCGGGAGTCAAACCTAGGGTTCCGGCAAGGAGAGAGGGGAGGCTTTGCGAAC
AATGCGGGGTCGCATACGGCACCTCAATATCCTAGACCTCCGTCTTTTCGTGGAAATCATCAATTCCGACAGGCTCCGCCTTCCAGTCAGAGGCACCAGTATCGGGGACC
TCATCCTCATCCTCACCATCAGCAGCCACCGTCGTTTAATCAAAATCAAGGTGTTCGTATGCCGCAGCAACTTCGACCTCGGCCTCCGAAGCGACTAGACTTTCGTCATT
GGGATTATGCAAAGACACCACCTCCATCTACTTGCGAGCGGTTTTCAATTCTTTCATACAACATCTTAGCTGATTACCTCGCCATGAATCACAAGCAAAAGCTCTACCAT
CATATTCCCCCTTACATGTTGGATTGGGAGTGGAGGAAAAATAATATTTTATTTGAGCTTGGATTATGGTCTACTGACATAATGTGCTTTCAGTCTTTTAGACTGGAGAT
CGGGAACATGTTCAAACCTTTTATTGATTTCGATCTGTTGCCAATTAAAACCCATACTGATTTGGTAGTGTGTAGATGCGCACTGGAATTCCAGTTGACGGCTGTGCAAT
CTTTTGGCGAGTGTCAAGGAGCATACCAGGAGTTAAGGAATGATGTTTTCTTTCGCACAAAATGCCTTGGAAAAGGCTTCAGCAAGTGTGATTTTCTTTTTCCAAAAGTT
GGAAGATTGAGGGGCGAAGGGGTAGCAGAGGTCTTTTCGGGACCCTTGGGAATTAATGGCTTGGATGGAGAAGAGTTAATGCCGCCTACCAACTCTCCTGGAGAGAGAAA
AGGACTACTTGTCTTTTACAATGCTTCAGAGCAACAATCGAGGGACTCTCCTTTCATTAATGAAAGACAATCGCTGGTCAGGGTTCTCTTGGAGAAGGCTCATGCTATTT
CAAAAATCTGGGACAATGCTCCAATCGTTCTCTGTGGGGATTTTAACTGTACACCAAAGAGTGCATTGTATAACTTTATTTCAGAGCAGAAGCTAGATTTGTCTGAATTG
GATAGAGACAAGGTATCGGGACAATCTTCTGCCGAGATTTATCAACCTTCATCACTCTATCGTAATCCTTGGCTTCAGACTGACAACAGTTCAGTTCCCCTCCAGCTGAG
GCCAGAATCTAGTGTTATTGAAAGGAAGCCAGATAATTCTCTGTCTGACATACAGAAGCAAGAATGTTCACATAGCTTCATGGAAAACGAGAATCTTCCATCAGTGAACC
GCTTTTTGCCCCCTGATGGTTCTAACGTTATCTTTGATGCACTTGATACTTCTTGTAACGATCTCCAGCTTGGAATGAAGGGTACTACTCTACATTCTGAAGGCCAAAAG
GAAAGTCAGCATAGTGCTTTGTTTGACCACAAAAATGTAGGGGAAACAACTTGCTGTGAGAAGACAGATAGCTTTAATGAAAGTTCAACCACATGTGCCAAAGATGAGTT
TACTGCTGGTCATACCAGTAAAAAAGTTGGTGAACTATTCTCCCCTTCAGGAACCGACCCTGAAGTTCTTCATCTGAATGACTCTGAGAGACGACAGATGGAAGAAGTTG
ATACCTCTAGTTTAAACAATAAATCTTCAACAGATGGTTTTCAGGATCACAATTTTGGCAAGGAAAGCAAAGACACTGTTGATAATATTATCTTAGATGACGCACAGCTT
TATTCTCAGGCAGCTTTTTTGGATTCAAATAGCGTTTCTTCTACACCTGCTTGCAAAAACTCCACGACCGACACTGCTACAGACTCCTCTGATGTTGTAACTTTTGACCA
CTCAATTGCTGAATTTGAGAAGGAAAGCTCTGCTAGGAATATTGAAGGTGGCCCATCAATGGTTTTGCCCAGGACTGACTCGGTGGTGGATGGAAGACCAAAGATTTTAT
CTTCAGATGAGCAGGATGTAGCTGTGTTAAATGGAAGCTTAACTGAGGATGATCATACATTTCTCTCAGCTCTGCATGATGTTAAAGATCCCTTTTCATCTGAGAGTCAT
CATTCTGTTAGTCATCAAAGCTTGGTTGCACCACCCACTGGAGTTGAAGATGATTTGTTGCCAGGATTGAATTCCAAGTCTTTTGAAGTCAAAAATATTACTCATGATCG
CTCATTATGGACTTCAATGGAAATAGAAACTGCTACTGGCAATGCAGATAGTACTCTAATTGAACACTCTCTAAGGCTTAGAAGCACGTATACAGAAGCTGAGGACCTTT
CTGGGACCAGAGACTTGAATGGGGAACCCCTGGCGACTAGTTACAATAGGTGTTTTCTGGGCACTGTTGACTACATATGGCGTTCCGAAGGTCTTCAGACGGTTAGGGTG
CTCGCCCCTATAAGAAAACAAGTCATGCAGCAGTTGACTCCTGGATTTCCTACAAAGAAATGGGGCAGCGATCACATTGCCTTAGCTACTGAATTGGCGTTTGTAAGGAG
TAGTGAAGAGTGATACCATGAATCTTAGGAGCAAATGATAATAAGTAAATAAATACTTTCCCTTGTAATTGGAGTTTTTTTATTTGAGGAACCCAACTGGTTCCAACGGC
ACAAGCCTGCAGGTGCAATGTGATGAGTGGAGGCAACAGAGCCTAAGACAGTGAGTTTTTTTTTTTTTTTTTTTTTTTTTTTAATTATTAAATTTATAGATGTAGTTCCT
CACGAATGCTCAGTTTCAGGTTACTTTATAGCTGTTGTTTTACTTCATTAATTATTTGGCCTGGATGAGCTATGTCACTGTGTGTTGCAGTTGAGTTTGTCTTTTTTATA
TTTACGAATTAGATTGGAATGGCCTCCCTTACTGAACTCTCATGTGTTGCCTGCATTTTTGTATTCTATGAATCTAACCTAATGTTTGTTATAGGAGTGATAACATGGTT
ACAATACAACCTAATGTTTGTTA
Protein sequenceShow/hide protein sequence
MRRVATPPPPLHQLSVAVATTATNTSAAMSSRPPYRAGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYPRPPSFRGNHQFRQ
APPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWRKNNILFELGLWS
TDIMCFQSFRLEIGNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGRLRGEGVAEVFSGPLGINGLDGE
ELMPPTNSPGERKGLLVFYNASEQQSRDSPFINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTPKSALYNFISEQKLDLSELDRDKVSGQSSAEIYQPSSLYRNPW
LQTDNSSVPLQLRPESSVIERKPDNSLSDIQKQECSHSFMENENLPSVNRFLPPDGSNVIFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDS
FNESSTTCAKDEFTAGHTSKKVGELFSPSGTDPEVLHLNDSERRQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQAAFLDSNSVSSTPACKNSTTD
TATDSSDVVTFDHSIAEFEKESSARNIEGGPSMVLPRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHDVKDPFSSESHHSVSHQSLVAPPTGVEDDLLPGLN
SKSFEVKNITHDRSLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSD
HIALATELAFVRSSEE