; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC05G096510 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC05G096510
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
Descriptioncarbon catabolite repressor protein 4 homolog 6 isoform X1
Genome locationCiama_Chr05:27719006..27733364
RNA-Seq ExpressionCaUC05G096510
SyntenyCaUC05G096510
Gene Ontology termsGO:0090503 - RNA phosphodiester bond hydrolysis, exonucleolytic (biological process)
GO:0000175 - 3'-5'-exoribonuclease activity (molecular function)
InterPro domainsIPR005135 - Endonuclease/exonuclease/phosphatase
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008466384.1 PREDICTED: carbon catabolite repressor protein 4 homolog 6 isoform X1 [Cucumis melo]0.0e+0071.03Show/hide
Query:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYR-AGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYP
        MRR ATPPPPLHQLS AV  ATTATNTS AMSSR PYR  GRY RHRGFSSERPYSGG+GQ V+ DSHFQSV+ESNLGFRQGERGG+ NNAGS+TAP+ P
Subjt:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYR-AGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYP

Query:  RPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHH
        RPPSF GNHQFRQAPPS+QRHQYRGPHPH H+QQPPSFNQNQGVRMPQQ RPRPPK LDFRHWDYAKT PPSTCERFSILSYNILADYLAM+HK KLYHH
Subjt:  RPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHH

Query:  IPPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGF
        IP YMLDWEWRKN+ILFELGLWSTDIMCFQ      R++ +   D     I K  T + V  CA+ ++++                     R K L + F
Subjt:  IPPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGF

Query:  SKCDFLFPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNC
         +    F ++G    + VA++              + PP ++    + ++   +      R    + +    VRVLLEKAH ISKIW+NAP+VLCGDFNC
Subjt:  SKCDFLFPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNC

Query:  TPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSHSLMENENLPSVNRFLPPDG
        TPKSALYNFISEQKLDLSGLDRDKVSGQSS+EI+QPSSLYRNPR Q   GSVPLQ R ESS  ERKPD+S++DIQKQ+CSH+ +ENENL SVN  L PD 
Subjt:  TPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSHSLMENENLPSVNRFLPPDG

Query:  SNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFSRSGTDPEVLHLNDSERQQM
        S+   DALDTSCN+ QLGMKGTTLHSEGQKE QHSALFDHKNVGETT CEKTDGFNE S TCA+DEF  GHTSK+VGEL S  GTDPEVLHLN++E  Q+
Subjt:  SNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFSRSGTDPEVLHLNDSERQQM

Query:  EEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDSSDVVTFDHSIAEFEKESSA-RNIEGGPSMVL
        E+VDT  LNN  STDGF+DHNFGK+SKD+V+ IILDD QL SQ  FLD  +VSSTP CKNS  DTA DSSDVVTF HSIA  EKESS+ RNIEGGPS+ L
Subjt:  EEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDSSDVVTFDHSIAEFEKESSA-RNIEGGPSMVL

Query:  SRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIETAT
         R +  VD RP+ILSSDEQDVA LNGSL EDD TFLSALH V+DPFSS++ HS SHQSLVAPPTG E++LLPGL++KS EV+ + HDRSLWT  EIETAT
Subjt:  SRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIETAT

Query:  GNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSSE
        GNADSTL+EHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVL+PIRKQVMQQLT GFPTKKWGSDHIALATELAFV S E
Subjt:  GNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSSE

Query:  E
        E
Subjt:  E

XP_008466385.1 PREDICTED: carbon catabolite repressor protein 4 homolog 6 isoform X2 [Cucumis melo]0.0e+0070.7Show/hide
Query:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYR-AGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYP
        MRR ATPPPPLHQLS AV  ATTATNTS AMSSR PYR  GRY RHRGFSSERPYSGG+GQ V+ DSHFQSV+ESNLGFRQGERGG+ NNAGS+TAP+ P
Subjt:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYR-AGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYP

Query:  RPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHH
        RPPSF GNHQFRQAPPS+QRHQYRGPHPH H+QQPPSFNQNQGVRMPQQ RPRPPK LDFRHWDYAKT PPSTCERFSILSYNILADYLAM+HK KLYHH
Subjt:  RPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHH

Query:  IPPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGF
        IP YMLDWEWRKN+ILFELGLWSTDIMCFQ      R++ +   D     I K  T + V  CA+ ++++             +L ++ F     LG   
Subjt:  IPPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGF

Query:  SKCDFLFPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNC
        +       +  +  G+                   + PP ++    + ++   +      R    + +    VRVLLEKAH ISKIW+NAP+VLCGDFNC
Subjt:  SKCDFLFPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNC

Query:  TPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSHSLMENENLPSVNRFLPPDG
        TPKSALYNFISEQKLDLSGLDRDKVSGQSS+EI+QPSSLYRNPR Q   GSVPLQ R ESS  ERKPD+S++DIQKQ+CSH+ +ENENL SVN  L PD 
Subjt:  TPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSHSLMENENLPSVNRFLPPDG

Query:  SNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFSRSGTDPEVLHLNDSERQQM
        S+   DALDTSCN+ QLGMKGTTLHSEGQKE QHSALFDHKNVGETT CEKTDGFNE S TCA+DEF  GHTSK+VGEL S  GTDPEVLHLN++E  Q+
Subjt:  SNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFSRSGTDPEVLHLNDSERQQM

Query:  EEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDSSDVVTFDHSIAEFEKESSA-RNIEGGPSMVL
        E+VDT  LNN  STDGF+DHNFGK+SKD+V+ IILDD QL SQ  FLD  +VSSTP CKNS  DTA DSSDVVTF HSIA  EKESS+ RNIEGGPS+ L
Subjt:  EEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDSSDVVTFDHSIAEFEKESSA-RNIEGGPSMVL

Query:  SRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIETAT
         R +  VD RP+ILSSDEQDVA LNGSL EDD TFLSALH V+DPFSS++ HS SHQSLVAPPTG E++LLPGL++KS EV+ + HDRSLWT  EIETAT
Subjt:  SRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIETAT

Query:  GNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSSE
        GNADSTL+EHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVL+PIRKQVMQQLT GFPTKKWGSDHIALATELAFV S E
Subjt:  GNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSSE

Query:  E
        E
Subjt:  E

XP_011652490.1 carbon catabolite repressor protein 4 homolog 6 isoform X2 [Cucumis sativus]0.0e+0071.43Show/hide
Query:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYRAGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYPR
        MRR ATPPPPLHQLS AV  ATTATNTS AMSSRPPYR G Y RHRG+SSERPYSGG+GQ V+GDSHFQSV+ESNLGFRQGERGG+ NNAGS+TAP+ PR
Subjt:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYRAGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYPR

Query:  PPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHI
        PPSF GNHQFRQAPPSSQRHQYRGP+PH H+QQPPSFNQNQGVRMPQQ R RPPK LDFRHWDYAKT PP TCERFSILSYNILADYLAM+HKQKLYHHI
Subjt:  PPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHI

Query:  PPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFS
        P YMLDWEWRKN+ILFELGLWSTDIMCFQ       ++ +   D     I K  T + V  CA+ ++++                     R K L + F 
Subjt:  PPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFS

Query:  KCDFLFPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERK---GLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDF
        +    F K+G    + VA++       +  D     P + S   R     + V YN     PR       +   VRVLLEKAHAISKIW+NAPIVLCGDF
Subjt:  KCDFLFPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERK---GLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDF

Query:  NCTPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSHSLMENENLPSVNRFLPP
        NCTPKSALYNFISEQKLDLSGLDRDKVSGQSS+EI+QPSSLYRNPR Q   GSVPLQ R ESS  E KPD+S+SDIQKQ+CSHS M+NENL S N  L P
Subjt:  NCTPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSHSLMENENLPSVNRFLPP

Query:  DGSNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFSRSGTDPEVLHLNDSERQ
        D S++  DALDTSCN+LQLGMKGTTLHSEGQKESQHSALFDHKNVGETT CEKTD FNE S TCA+DEF  GHTSK++GEL S  GTDP+V HLN++E++
Subjt:  DGSNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFSRSGTDPEVLHLNDSERQ

Query:  QMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDSSDVVTFDHSIAEFEKE-SSARNIEGGPSM
        Q+E+V    LNN SSTDGF+DHN  K+SKD+V+ IILDD QL S+  FLD  +VSSTP CKNS  DTA DS DVVT DHSIAE EKE SSARNIEGGPS+
Subjt:  QMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDSSDVVTFDHSIAEFEKE-SSARNIEGGPSM

Query:  VLSRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIET
         L R + VVD RP+ILSSDEQDVA LNGSLTEDD TFLSALH V+DPFS E+ HS SHQSLVAP TG ++DLLPGLN+KS EV+N+ HDRSLWT  +IET
Subjt:  VLSRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIET

Query:  ATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRS
        ATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTV+VLAPIRKQVMQQLT GFPTKKWGSDHIALATELAFV S
Subjt:  ATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRS

Query:  SEE
         EE
Subjt:  SEE

XP_038897964.1 carbon catabolite repressor protein 4 homolog 6-like isoform X1 [Benincasa hispida]0.0e+0072.94Show/hide
Query:  MRRVATPPPPLHQLSVAV--------ATTATNTSAAMSSRPPYRAGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHT
        MRR ATPPPPLHQLSVAV        ATTATNTSA MSSRPPYR GRY  HRGFSSERPYSGG+GQ VTGDSHFQSVRESNLGF++GERGGFANNAG ++
Subjt:  MRRVATPPPPLHQLSVAV--------ATTATNTSAAMSSRPPYRAGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHT

Query:  APQYPRPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQ
        A Q PRPPSF GNHQFRQAPPS+QRHQYRGPHPH H QQPPSFNQNQGV MPQQ+RPRPPK LD+RHWDYAKTPPPSTCERFSILSYNILADYLAM+HKQ
Subjt:  APQYPRPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQ

Query:  KLYHHIPPYMLDWEWRKNNILFELGLWSTDIMCFQS----FRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFR
        KLY HIP YMLDWEWRKN+ILFELGLWSTDIMCFQ       LE     + F        K  T   V  CA+ + ++                     R
Subjt:  KLYHHIPPYMLDWEWRKNNILFELGLWSTDIMCFQS----FRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFR

Query:  TKCLGKGFSKCDFLFPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPI
         K L +        F K+G    + VA++              + PP ++    K ++   +      R    + +    VRVLLEKAHAISK WDNAPI
Subjt:  TKCLGKGFSKCDFLFPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPI

Query:  VLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSHSLMENENLPSV
        VLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSS+EI+QPSS +RNPRLQ   GSVPLQ R ESS IERK D+SLSDIQKQ+CS S MENENLPSV
Subjt:  VLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSHSLMENENLPSV

Query:  NRFLPPDGSNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFSRSGTDPEVLHL
        N  LPPD S++VFDA DTSCNDLQLGMKGTTLHSEG+KESQ SALFDHKN GETTCCEKTD FNE+S TCAKDEFT GHTSKKVGEL S  GTDPE++HL
Subjt:  NRFLPPDGSNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFSRSGTDPEVLHL

Query:  NDSERQQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDSSDVVTFDHSIAEFEKESSA-RNI
        N++ER+QME+ D S L NKSSTDG++DHNFGKESKDTVD +ILDDAQLYSQ    DS +VSSTPACKNS  +TA DSSDVVTFD S  EFEKESS+ RNI
Subjt:  NDSERQQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDSSDVVTFDHSIAEFEKESSA-RNI

Query:  EGGPSMVLSRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWT
        EGGPS  L   DS +D RPKI  SDEQDVA LNGSLTEDD TFLSALH V+DPFSS+IHHS  H++LV PPTGVEDDLLPGLN+KSFEV+N+THDRSLWT
Subjt:  EGGPSMVLSRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWT

Query:  SMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATE
         MEIETATGNAD TLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATE
Subjt:  SMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATE

Query:  LAFVRSSEE
        LAFVRS EE
Subjt:  LAFVRSSEE

XP_038897965.1 carbon catabolite repressor protein 4 homolog 6-like isoform X2 [Benincasa hispida]0.0e+0073.19Show/hide
Query:  MRRVATPPPPLHQLSVAV--------ATTATNTSAAMSSRPPYRAGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHT
        MRR ATPPPPLHQLSVAV        ATTATNTSA MSSRPPYR GRY  HRGFSSERPYSGG+GQ VTGDSHFQSVRESNLGF++GERGGFANNAG ++
Subjt:  MRRVATPPPPLHQLSVAV--------ATTATNTSAAMSSRPPYRAGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHT

Query:  APQYPRPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQ
        A Q PRPPSF GNHQFRQAPPS+QRHQYRGPHPH H QQPPSFNQNQGV MPQQ+RPRPPK LD+RHWDYAKTPPPSTCERFSILSYNILADYLAM+HKQ
Subjt:  APQYPRPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQ

Query:  KLYHHIPPYMLDWEWRKNNILFELGLWSTDIMCFQS----FRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFR
        KLY HIP YMLDWEWRKN+ILFELGLWSTDIMCFQ       LE     + F        K  T   V  CA+ + ++                     R
Subjt:  KLYHHIPPYMLDWEWRKNNILFELGLWSTDIMCFQS----FRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFR

Query:  TKCLGKGFSKCDFLFPKVGRSRGEGVAEVFSGPLGINGLDGE-ELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAP
         K L +        F K+G    + VA++    L  +  DG+  + PP ++    K ++   +      R    + +    VRVLLEKAHAISK WDNAP
Subjt:  TKCLGKGFSKCDFLFPKVGRSRGEGVAEVFSGPLGINGLDGE-ELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAP

Query:  IVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSHSLMENENLPS
        IVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSS+EI+QPSS +RNPRLQ   GSVPLQ R ESS IERK D+SLSDIQKQ+CS S MENENLPS
Subjt:  IVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSHSLMENENLPS

Query:  VNRFLPPDGSNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFSRSGTDPEVLH
        VN  LPPD S++VFDA DTSCNDLQLGMKGTTLHSEG+KESQ SALFDHKN GETTCCEKTD FNE+S TCAKDEFT GHTSKKVGEL S  GTDPE++H
Subjt:  VNRFLPPDGSNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFSRSGTDPEVLH

Query:  LNDSERQQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDSSDVVTFDHSIAEFEKESSA-RN
        LN++ER+QME+ D S L NKSSTDG++DHNFGKESKDTVD +ILDDAQLYSQ    DS +VSSTPACKNS  +TA DSSDVVTFD S  EFEKESS+ RN
Subjt:  LNDSERQQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDSSDVVTFDHSIAEFEKESSA-RN

Query:  IEGGPSMVLSRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLW
        IEGGPS  L   DS +D RPKI  SDEQDVA LNGSLTEDD TFLSALH V+DPFSS+IHHS  H++LV PPTGVEDDLLPGLN+KSFEV+N+THDRSLW
Subjt:  IEGGPSMVLSRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLW

Query:  TSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALAT
        T MEIETATGNAD TLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALAT
Subjt:  TSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALAT

Query:  ELAFVRSSEE
        ELAFVRS EE
Subjt:  ELAFVRSSEE

TrEMBL top hitse value%identityAlignment
A0A0A0LHB6 Endo/exonuclease/phosphatase domain-containing protein0.0e+0071.43Show/hide
Query:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYRAGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYPR
        MRR ATPPPPLHQLS AV  ATTATNTS AMSSRPPYR G Y RHRG+SSERPYSGG+GQ V+GDSHFQSV+ESNLGFRQGERGG+ NNAGS+TAP+ PR
Subjt:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYRAGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYPR

Query:  PPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHI
        PPSF GNHQFRQAPPSSQRHQYRGP+PH H+QQPPSFNQNQGVRMPQQ R RPPK LDFRHWDYAKT PP TCERFSILSYNILADYLAM+HKQKLYHHI
Subjt:  PPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHI

Query:  PPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFS
        P YMLDWEWRKN+ILFELGLWSTDIMCFQ       ++ +   D     I K  T + V  CA+ ++++                     R K L + F 
Subjt:  PPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFS

Query:  KCDFLFPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERK---GLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDF
        +    F K+G    + VA++       +  D     P + S   R     + V YN     PR       +   VRVLLEKAHAISKIW+NAPIVLCGDF
Subjt:  KCDFLFPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERK---GLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDF

Query:  NCTPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSHSLMENENLPSVNRFLPP
        NCTPKSALYNFISEQKLDLSGLDRDKVSGQSS+EI+QPSSLYRNPR Q   GSVPLQ R ESS  E KPD+S+SDIQKQ+CSHS M+NENL S N  L P
Subjt:  NCTPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSHSLMENENLPSVNRFLPP

Query:  DGSNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFSRSGTDPEVLHLNDSERQ
        D S++  DALDTSCN+LQLGMKGTTLHSEGQKESQHSALFDHKNVGETT CEKTD FNE S TCA+DEF  GHTSK++GEL S  GTDP+V HLN++E++
Subjt:  DGSNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFSRSGTDPEVLHLNDSERQ

Query:  QMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDSSDVVTFDHSIAEFEKE-SSARNIEGGPSM
        Q+E+V    LNN SSTDGF+DHN  K+SKD+V+ IILDD QL S+  FLD  +VSSTP CKNS  DTA DS DVVT DHSIAE EKE SSARNIEGGPS+
Subjt:  QMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDSSDVVTFDHSIAEFEKE-SSARNIEGGPSM

Query:  VLSRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIET
         L R + VVD RP+ILSSDEQDVA LNGSLTEDD TFLSALH V+DPFS E+ HS SHQSLVAP TG ++DLLPGLN+KS EV+N+ HDRSLWT  +IET
Subjt:  VLSRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIET

Query:  ATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRS
        ATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTV+VLAPIRKQVMQQLT GFPTKKWGSDHIALATELAFV S
Subjt:  ATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRS

Query:  SEE
         EE
Subjt:  SEE

A0A1S3CRA9 carbon catabolite repressor protein 4 homolog 6 isoform X10.0e+0071.03Show/hide
Query:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYR-AGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYP
        MRR ATPPPPLHQLS AV  ATTATNTS AMSSR PYR  GRY RHRGFSSERPYSGG+GQ V+ DSHFQSV+ESNLGFRQGERGG+ NNAGS+TAP+ P
Subjt:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYR-AGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYP

Query:  RPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHH
        RPPSF GNHQFRQAPPS+QRHQYRGPHPH H+QQPPSFNQNQGVRMPQQ RPRPPK LDFRHWDYAKT PPSTCERFSILSYNILADYLAM+HK KLYHH
Subjt:  RPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHH

Query:  IPPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGF
        IP YMLDWEWRKN+ILFELGLWSTDIMCFQ      R++ +   D     I K  T + V  CA+ ++++                     R K L + F
Subjt:  IPPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGF

Query:  SKCDFLFPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNC
         +    F ++G    + VA++              + PP ++    + ++   +      R    + +    VRVLLEKAH ISKIW+NAP+VLCGDFNC
Subjt:  SKCDFLFPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNC

Query:  TPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSHSLMENENLPSVNRFLPPDG
        TPKSALYNFISEQKLDLSGLDRDKVSGQSS+EI+QPSSLYRNPR Q   GSVPLQ R ESS  ERKPD+S++DIQKQ+CSH+ +ENENL SVN  L PD 
Subjt:  TPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSHSLMENENLPSVNRFLPPDG

Query:  SNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFSRSGTDPEVLHLNDSERQQM
        S+   DALDTSCN+ QLGMKGTTLHSEGQKE QHSALFDHKNVGETT CEKTDGFNE S TCA+DEF  GHTSK+VGEL S  GTDPEVLHLN++E  Q+
Subjt:  SNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFSRSGTDPEVLHLNDSERQQM

Query:  EEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDSSDVVTFDHSIAEFEKESSA-RNIEGGPSMVL
        E+VDT  LNN  STDGF+DHNFGK+SKD+V+ IILDD QL SQ  FLD  +VSSTP CKNS  DTA DSSDVVTF HSIA  EKESS+ RNIEGGPS+ L
Subjt:  EEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDSSDVVTFDHSIAEFEKESSA-RNIEGGPSMVL

Query:  SRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIETAT
         R +  VD RP+ILSSDEQDVA LNGSL EDD TFLSALH V+DPFSS++ HS SHQSLVAPPTG E++LLPGL++KS EV+ + HDRSLWT  EIETAT
Subjt:  SRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIETAT

Query:  GNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSSE
        GNADSTL+EHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVL+PIRKQVMQQLT GFPTKKWGSDHIALATELAFV S E
Subjt:  GNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSSE

Query:  E
        E
Subjt:  E

A0A1S3CSF9 carbon catabolite repressor protein 4 homolog 6 isoform X20.0e+0070.7Show/hide
Query:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYR-AGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYP
        MRR ATPPPPLHQLS AV  ATTATNTS AMSSR PYR  GRY RHRGFSSERPYSGG+GQ V+ DSHFQSV+ESNLGFRQGERGG+ NNAGS+TAP+ P
Subjt:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYR-AGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYP

Query:  RPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHH
        RPPSF GNHQFRQAPPS+QRHQYRGPHPH H+QQPPSFNQNQGVRMPQQ RPRPPK LDFRHWDYAKT PPSTCERFSILSYNILADYLAM+HK KLYHH
Subjt:  RPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHH

Query:  IPPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGF
        IP YMLDWEWRKN+ILFELGLWSTDIMCFQ      R++ +   D     I K  T + V  CA+ ++++             +L ++ F     LG   
Subjt:  IPPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGF

Query:  SKCDFLFPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNC
        +       +  +  G+                   + PP ++    + ++   +      R    + +    VRVLLEKAH ISKIW+NAP+VLCGDFNC
Subjt:  SKCDFLFPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNC

Query:  TPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSHSLMENENLPSVNRFLPPDG
        TPKSALYNFISEQKLDLSGLDRDKVSGQSS+EI+QPSSLYRNPR Q   GSVPLQ R ESS  ERKPD+S++DIQKQ+CSH+ +ENENL SVN  L PD 
Subjt:  TPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSHSLMENENLPSVNRFLPPDG

Query:  SNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFSRSGTDPEVLHLNDSERQQM
        S+   DALDTSCN+ QLGMKGTTLHSEGQKE QHSALFDHKNVGETT CEKTDGFNE S TCA+DEF  GHTSK+VGEL S  GTDPEVLHLN++E  Q+
Subjt:  SNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFSRSGTDPEVLHLNDSERQQM

Query:  EEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDSSDVVTFDHSIAEFEKESSA-RNIEGGPSMVL
        E+VDT  LNN  STDGF+DHNFGK+SKD+V+ IILDD QL SQ  FLD  +VSSTP CKNS  DTA DSSDVVTF HSIA  EKESS+ RNIEGGPS+ L
Subjt:  EEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDSSDVVTFDHSIAEFEKESSA-RNIEGGPSMVL

Query:  SRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIETAT
         R +  VD RP+ILSSDEQDVA LNGSL EDD TFLSALH V+DPFSS++ HS SHQSLVAPPTG E++LLPGL++KS EV+ + HDRSLWT  EIETAT
Subjt:  SRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIETAT

Query:  GNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSSE
        GNADSTL+EHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVL+PIRKQVMQQLT GFPTKKWGSDHIALATELAFV S E
Subjt:  GNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSSE

Query:  E
        E
Subjt:  E

A0A5A7T7G2 Carbon catabolite repressor protein 4-like protein 6 isoform X10.0e+0071.03Show/hide
Query:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYR-AGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYP
        MRR ATPPPPLHQLS AV  ATTATNTS AMSSR PYR  GRY RHRGFSSERPYSGG+GQ V+ DSHFQSV+ESNLGFRQGERGG+ NNAGS+TAP+ P
Subjt:  MRRVATPPPPLHQLSVAV--ATTATNTSAAMSSRPPYR-AGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYP

Query:  RPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHH
        RPPSF GNHQFRQAPPS+QRHQYRGPHPH H+QQPPSFNQNQGVRMPQQ RPRPPK LDFRHWDYAKT PPSTCERFSILSYNILADYLAM+HK KLYHH
Subjt:  RPPSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHH

Query:  IPPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGF
        IP YMLDWEWRKN+ILFELGLWSTDIMCFQ      R++ +   D     I K  T + V  CA+ ++++                     R K L + F
Subjt:  IPPYMLDWEWRKNNILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPI-KTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGF

Query:  SKCDFLFPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNC
         +    F ++G    + VA++              + PP ++    + ++   +      R    + +    VRVLLEKAH ISKIW+NAP+VLCGDFNC
Subjt:  SKCDFLFPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNC

Query:  TPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSHSLMENENLPSVNRFLPPDG
        TPKSALYNFISEQKLDLSGLDRDKVSGQSS+EI+QPSSLYRNPR Q   GSVPLQ R ESS  ERKPD+S++DIQKQ+CSH+ +ENENL SVN  L PD 
Subjt:  TPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSHSLMENENLPSVNRFLPPDG

Query:  SNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFSRSGTDPEVLHLNDSERQQM
        S+   DALDTSCN+ QLGMKGTTLHSEGQKE QHSALFDHKNVGETT CEKTDGFNE S TCA+DEF  GHTSK+VGEL S  GTDPEVLHLN++E  Q+
Subjt:  SNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFSRSGTDPEVLHLNDSERQQM

Query:  EEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDSSDVVTFDHSIAEFEKESSA-RNIEGGPSMVL
        E+VDT  LNN  STDGF+DHNFGK+SKD+V+ IILDD QL SQ  FLD  +VSSTP CKNS  DTA DSSDVVTF HSIA  EKESS+ RNIEGGPS+ L
Subjt:  EEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDSSDVVTFDHSIAEFEKESSA-RNIEGGPSMVL

Query:  SRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIETAT
         R +  VD RP+ILSSDEQDVA LNGSL EDD TFLSALH V+DPFSS++ HS SHQSLVAPPTG E++LLPGL++KS EV+ + HDRSLWT  EIETAT
Subjt:  SRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIETAT

Query:  GNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSSE
        GNADSTL+EHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVL+PIRKQVMQQLT GFPTKKWGSDHIALATELAFV S E
Subjt:  GNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSSE

Query:  E
        E
Subjt:  E

A0A6J1BSH6 carbon catabolite repressor protein 4 homolog 6 isoform X23.8e-30464.49Show/hide
Query:  MRRVATPPPPLHQLSVAVATTAT-NTSAAMSSRPPYRAGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYPRP
        MRR AT PPPL QLS AVAT A+  TSAAMSSRPPYR GRY+R  GFSSERPYSGGKGQ V+GDSH+QSVRESNLGFRQGE G FANNAGS+TAPQ PRP
Subjt:  MRRVATPPPPLHQLSVAVATTAT-NTSAAMSSRPPYRAGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYPRP

Query:  PSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIP
        P + G HQFRQA P  Q HQYRGPHPH H+QQP SFNQNQGVR PQ+ RPRPPK  D+RHWDYAKT  PSTCERF+ILSYNILADYLAM+HKQKLYHHIP
Subjt:  PSFRGNHQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIP

Query:  PYMLDWEWRKNNILFELGLWSTDIMCFQS----FRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKG
         YMLDWEWRK N+LFELGLWSTDIMCFQ       LE     + F        K  T + V  CA+ ++++  +   E                +C+   
Subjt:  PYMLDWEWRKNNILFELGLWSTDIMCFQS----FRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKG

Query:  FSKCDFLFPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFN
               F K+G    + VA++       +  D  +  PP ++    K ++   +      R    + +    VR+LLEKAHAISKIW+NAPIVLCGDFN
Subjt:  FSKCDFLFPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFN

Query:  CTPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSHSLMENENLPSVNRFLPPD
        CTPKSALYNFISEQKL+LSGLDRDKVSGQSS+EI+QPSSLYRNPR Q   GSVPLQLR +S  IERKPD+ LSD++ Q+  HS MENENLPSVN+ LPPD
Subjt:  CTPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSHSLMENENLPSVNRFLPPD

Query:  GSNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFSRSGTDPEVLHLNDSERQQ
         S+ V    + SCNDLQLGMKG  LHSEGQKE Q+ A F HKNVGETT C   D F +SS  CA+DEFT  H S+KV EL S  GTD E LHLN +ER+Q
Subjt:  GSNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFSRSGTDPEVLHLNDSERQQ

Query:  MEEVDTSS-LNNKSSTDGF-QDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDSSDVVTFDHSIAEFEKESSARNIEGGPSM
        ME+VD++S L ++SSTD   +D NF K+SK+ V+N+ILDD     Q   L S +V  TPAC+N   D A DSS+VV F H IAEFEKESSARNI+GGPS+
Subjt:  MEEVDTSS-LNNKSSTDGF-QDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDSSDVVTFDHSIAEFEKESSARNIEGGPSM

Query:  VLSRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHV-KDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIE
             +  VD R KILSSDEQDVA L+GSLTEDD TFLSALH + ++PFSSEIH SV HQSLVAP TGV DD LPGLN+KSFEV+N  HDRSLWT MEIE
Subjt:  VLSRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHV-KDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIE

Query:  TATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVR
         ATGN D TL+EH L+LRSTYTE ED SGTRDLN EPL TSYNRCFLGTVDYIWRSEGLQTV+VLAPI+K VM +LTPGFPTKKWGSDHIALA ELAF R
Subjt:  TATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVR

Query:  SSEE
          EE
Subjt:  SSEE

SwissProt top hitse value%identityAlignment
B2RYM0 Protein angel homolog 14.1e-1327.78Show/hide
Query:  PHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDF-----RHW-DYAKTPPPSTCE-------RFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWRKNN
        P P  Q+  S    +G+   +QL+P PP  + +     R W D++  P     E       +F+++SYNILA  L M    +LY H  P +L+W +R  N
Subjt:  PHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDF-----RHW-DYAKTPPPSTCE-------RFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWRKNN

Query:  ILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGRSRG
        ++ E   W  DI+C Q    E++       D     ++    ++   C  + + T  ++ G C   Y+  R    FR  C     S  ++  P +     
Subjt:  ILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGRSRG

Query:  EGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWD--NAPIVLCGDFNCTPKSALYNFISEQ
        + V  V    L +  L  E L   + +P       V YN     PR     + + + + +LL +   ++++ D  + PI+LCGD N  P S LYNFI + 
Subjt:  EGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWD--NAPIVLCGDFNCTPKSALYNFISEQ

Query:  KLDLSGLDRDKVSGQS--SSEIYQ
        +L  +G+   KVSGQ   S ++YQ
Subjt:  KLDLSGLDRDKVSGQS--SSEIYQ

Q0WKY2 Carbon catabolite repressor protein 4 homolog 51.8e-2147.17Show/hide
Query:  WTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALA
        W+  E++ ATG  ++T ++H L+L S Y+       TRD  GEPLAT+Y+  FLGTVDYIW ++ L  VRVL  +   V+++ T G P++ WGSDH+A+A
Subjt:  WTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALA

Query:  TELAFV
         EL FV
Subjt:  TELAFV

Q0WKY2 Carbon catabolite repressor protein 4 homolog 51.9e-1024.12Show/hide
Query:  PHHQQPPSFNQ---------NQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWRKNNILFELG
        PH    P F+Q            +R  ++ + +    ++ R W ++     +  ++  ++SYN+L    A NH   LY+++P   L+W  RK+ I  E+ 
Subjt:  PHHQQPPSFNQ---------NQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWRKNNILFELG

Query:  LWSTDIMCFQSFRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGRSRGEGVAEV
         ++  I+C Q            F D D+L +K             F+       GE             F  + L +        F K G      VA++
Subjt:  LWSTDIMCFQSFRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGRSRGEGVAEV

Query:  FSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLD
            L +N  +  +      S   R+ ++   +      R    + +    VR+ LEKA+ +S+ W N P+ + GD N TP+SA+Y+FI+   LD    D
Subjt:  FSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLD

Query:  RDKVSGQSSSE
        R ++SGQ+  E
Subjt:  RDKVSGQSSSE

Q8VCU0 Protein angel homolog 14.1e-1327.78Show/hide
Query:  PHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDF-----RHW-DYAKTPPPSTCE-------RFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWRKNN
        P P  Q+  S    +G+   +QL+P PP  + +     R W D++  P     E       +F+++SYNILA  L M    +LY H  P +L+W +R  N
Subjt:  PHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDF-----RHW-DYAKTPPPSTCE-------RFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWRKNN

Query:  ILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGRSRG
        ++ E   W  DI+C Q    E++       D     ++    ++   C  + + T  ++ G C   Y+  R    FR  C     S  ++  P +     
Subjt:  ILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGRSRG

Query:  EGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWD--NAPIVLCGDFNCTPKSALYNFISEQ
        + V  V    L +  L  E L   + +P       V YN     PR     + + + + +LL +   ++++ D  + PI+LCGD N  P S LYNFI + 
Subjt:  EGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWD--NAPIVLCGDFNCTPKSALYNFISEQ

Query:  KLDLSGLDRDKVSGQS--SSEIYQ
        +L  +G+   KVSGQ   S ++YQ
Subjt:  KLDLSGLDRDKVSGQS--SSEIYQ

Q8VYU4 Carbon catabolite repressor protein 4 homolog 61.0e-10134.93Show/hide
Query:  MRRVATPPPPLHQLSVAVATTATNTSAAMSSRPPYRA----GRYERHRGFSSERPYS--GGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAP
        MRR          ++ A A+T +     MS+R PYR     GR    R F S+RPY+   G+ Q VTGDSHFQSV ++N  FR GE          H  P
Subjt:  MRRVATPPPPLHQLSVAVATTATNTSAAMSSRPPYRA----GRYERHRGFSSERPYS--GGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAP

Query:  QYPR-PPSFRGNHQFRQAPPS-SQRHQYRGPHPHPHHQQ-----PPSFNQNQGVRMP--QQLRPRP-PKRLDFRHWDYAKTPPPSTCERFSILSYNILAD
           R  P F  N++FR  PPS  Q  Q+R P+  P +Q      PP F QNQ  R P  Q  R RP  K  D+R W+YAKTPP    E+F +LSYNILAD
Subjt:  QYPR-PPSFRGNHQFRQAPPS-SQRHQYRGPHPHPHHQQ-----PPSFNQNQGVRMP--QQLRPRP-PKRLDFRHWDYAKTPPPSTCERFSILSYNILAD

Query:  YLAMNHKQKLYHHIPPYMLDWEWRKNNILFELGLWSTDIMC------FQSFRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAY
        YLA +H + LY HIP  ML W WRK+ ++FEL LWS DIMC      FQ    E+++     I       K  T   V  CA+ ++    +   E    +
Subjt:  YLAMNHKQKLYHHIPPYMLDWEWRKNNILFELGLWSTDIMC------FQSFRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAY

Query:  QELRNDVFFRTKCLGKGFSKCDFLFPKVGRSRGEGVAEV-FSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAH
         +L                              + VA++     L  +     E  PP +S G  + ++   +      R    + +    VR LL+KAH
Subjt:  QELRNDVFFRTKCLGKGFSKCDFLFPKVGRSRGEGVAEV-FSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAH

Query:  AISKIWDNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSH
        A+SK+WD+APIVLCGDFNCTPKS LYNFIS++KLDLSGL RDKVSGQ S+E   P                    RPE+     +  N     Q Q    
Subjt:  AISKIWDNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSH

Query:  SLMENENLPSVNRFLPPDGSNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFS
                       PP   N++ +A   + +++ +G                           T   EKT     S   C  D   AGH          
Subjt:  SLMENENLPSVNRFLPPDGSNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFS

Query:  RSGTDPEVLHLNDSERQQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDS--SDVVTFDHSI
                            E  +SS       +   D  FG E++   D+  L  A+  S  T  D+    ++ A ++  TD +  S  S+       I
Subjt:  RSGTDPEVLHLNDSERQQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDS--SDVVTFDHSI

Query:  AEFEKE-SSARNIEGGPSMVLSRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSF
           +++ SS+ + +    +   + D +    P + + DE+       SL ED  TFL+ LH   +  S +    VS   L               +S++ 
Subjt:  AEFEKE-SSARNIEGGPSMVLSRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSF

Query:  EVKNITHDRSLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPT
            IT+  S WT MEI TATG+ + T +EH+L L+STY+E E  + TRD NGEP+ TSY+RCF+GTVDYIWRSEGLQTVRVLAPI KQ M Q TPGFPT
Subjt:  EVKNITHDRSLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPT

Query:  KKWGSDHIALATELAFVRS
         KWGSDHIAL +ELAF  S
Subjt:  KKWGSDHIALATELAFVRS

Q9LS39 Carbon catabolite repressor protein 4 homolog 35.0e-1947.66Show/hide
Query:  SLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA
        S WT  EI  ATG  +S    H L+L S+Y   +  + TRD  GEPLATSY+  FLGTVDY+W S+GL   RVL  +   V+ + T G P ++ GSDH+A
Subjt:  SLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA

Query:  LATELAF
        L +E  F
Subjt:  LATELAF

Q9LS39 Carbon catabolite repressor protein 4 homolog 34.1e-1326.83Show/hide
Query:  PPSSQRHQY--RGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWR
        P SS    Y  R  +P P  Q P     +Q                  R W D   TP     ERF+++SYNIL D  +  H++ LY ++    L W +R
Subjt:  PPSSQRHQY--RGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWR

Query:  KNNILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRND------VFFRTKCLGKGFSKCDFL
        K  I  EL   + DI+  Q             +D                    F L ++       G+Y+    D      +F++    G    + +  
Subjt:  KNNILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRND------VFFRTKCLGKGFSKCDFL

Query:  FPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLL----VFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTP
        F + G    + VA++              ++    S   RK LL    V YN ++         + +   VR L  KAH +SK W + PIVLCGDFN TP
Subjt:  FPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLL----VFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTP

Query:  KSALYNFISEQKLDLSGLDRDKVSGQSS
        KS LYNF++  +L++   D+ ++SGQ +
Subjt:  KSALYNFISEQKLDLSGLDRDKVSGQSS

Arabidopsis top hitse value%identityAlignment
AT1G73875.1 DNAse I-like superfamily protein1.3e-2247.17Show/hide
Query:  WTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALA
        W+  E++ ATG  ++T ++H L+L S Y+       TRD  GEPLAT+Y+  FLGTVDYIW ++ L  VRVL  +   V+++ T G P++ WGSDH+A+A
Subjt:  WTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALA

Query:  TELAFV
         EL FV
Subjt:  TELAFV

AT1G73875.1 DNAse I-like superfamily protein1.3e-1124.12Show/hide
Query:  PHHQQPPSFNQ---------NQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWRKNNILFELG
        PH    P F+Q            +R  ++ + +    ++ R W ++     +  ++  ++SYN+L    A NH   LY+++P   L+W  RK+ I  E+ 
Subjt:  PHHQQPPSFNQ---------NQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWRKNNILFELG

Query:  LWSTDIMCFQSFRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGRSRGEGVAEV
         ++  I+C Q            F D D+L +K             F+       GE             F  + L +        F K G      VA++
Subjt:  LWSTDIMCFQSFRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGRSRGEGVAEV

Query:  FSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLD
            L +N  +  +      S   R+ ++   +      R    + +    VR+ LEKA+ +S+ W N P+ + GD N TP+SA+Y+FI+   LD    D
Subjt:  FSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLD

Query:  RDKVSGQSSSE
        R ++SGQ+  E
Subjt:  RDKVSGQSSSE

AT3G18500.1 DNAse I-like superfamily protein3.5e-2047.66Show/hide
Query:  SLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA
        S WT  EI  ATG  +S    H L+L S+Y   +  + TRD  GEPLATSY+  FLGTVDY+W S+GL   RVL  +   V+ + T G P ++ GSDH+A
Subjt:  SLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA

Query:  LATELAF
        L +E  F
Subjt:  LATELAF

AT3G18500.1 DNAse I-like superfamily protein1.6e-1226.73Show/hide
Query:  PPSSQRHQY--RGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWR
        P SS    Y  R  +P P  Q P     +Q                  R W D   TP     ERF+++SYNIL D  +  H++ LY ++    L W +R
Subjt:  PPSSQRHQY--RGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWR

Query:  KNNILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGR
        K  I  EL   + DI+  Q             +D   +  K     V+ R  +EF    ++     Q A  ELR     R   LG               
Subjt:  KNNILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGR

Query:  SRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTPKSALYNFISE
                                            + V YN ++         + +   VR L  KAH +SK W + PIVLCGDFN TPKS LYNF++ 
Subjt:  SRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTPKSALYNFISE

Query:  QKLDLSGLDRDKVSGQSS
         +L++   D+ ++SGQ +
Subjt:  QKLDLSGLDRDKVSGQSS

AT3G18500.2 DNAse I-like superfamily protein3.5e-2047.66Show/hide
Query:  SLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA
        S WT  EI  ATG  +S    H L+L S+Y   +  + TRD  GEPLATSY+  FLGTVDY+W S+GL   RVL  +   V+ + T G P ++ GSDH+A
Subjt:  SLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA

Query:  LATELAF
        L +E  F
Subjt:  LATELAF

AT3G18500.2 DNAse I-like superfamily protein2.9e-1426.83Show/hide
Query:  PPSSQRHQY--RGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWR
        P SS    Y  R  +P P  Q P     +Q                  R W D   TP     ERF+++SYNIL D  +  H++ LY ++    L W +R
Subjt:  PPSSQRHQY--RGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWR

Query:  KNNILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRND------VFFRTKCLGKGFSKCDFL
        K  I  EL   + DI+  Q             +D                    F L ++       G+Y+    D      +F++    G    + +  
Subjt:  KNNILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRND------VFFRTKCLGKGFSKCDFL

Query:  FPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLL----VFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTP
        F + G    + VA++              ++    S   RK LL    V YN ++         + +   VR L  KAH +SK W + PIVLCGDFN TP
Subjt:  FPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLL----VFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTP

Query:  KSALYNFISEQKLDLSGLDRDKVSGQSS
        KS LYNF++  +L++   D+ ++SGQ +
Subjt:  KSALYNFISEQKLDLSGLDRDKVSGQSS

AT3G18500.3 DNAse I-like superfamily protein3.5e-2047.66Show/hide
Query:  SLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA
        S WT  EI  ATG  +S    H L+L S+Y   +  + TRD  GEPLATSY+  FLGTVDY+W S+GL   RVL  +   V+ + T G P ++ GSDH+A
Subjt:  SLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA

Query:  LATELAF
        L +E  F
Subjt:  LATELAF

AT3G18500.3 DNAse I-like superfamily protein2.9e-1426.83Show/hide
Query:  PPSSQRHQY--RGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWR
        P SS    Y  R  +P P  Q P     +Q                  R W D   TP     ERF+++SYNIL D  +  H++ LY ++    L W +R
Subjt:  PPSSQRHQY--RGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWR

Query:  KNNILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRND------VFFRTKCLGKGFSKCDFL
        K  I  EL   + DI+  Q             +D                    F L ++       G+Y+    D      +F++    G    + +  
Subjt:  KNNILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRND------VFFRTKCLGKGFSKCDFL

Query:  FPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLL----VFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTP
        F + G    + VA++              ++    S   RK LL    V YN ++         + +   VR L  KAH +SK W + PIVLCGDFN TP
Subjt:  FPKVGRSRGEGVAEVFSGPLGINGLDGEELMPPTNSPGERKGLL----VFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTP

Query:  KSALYNFISEQKLDLSGLDRDKVSGQSS
        KS LYNF++  +L++   D+ ++SGQ +
Subjt:  KSALYNFISEQKLDLSGLDRDKVSGQSS

AT5G11350.1 DNAse I-like superfamily protein7.4e-10334.93Show/hide
Query:  MRRVATPPPPLHQLSVAVATTATNTSAAMSSRPPYRA----GRYERHRGFSSERPYS--GGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAP
        MRR          ++ A A+T +     MS+R PYR     GR    R F S+RPY+   G+ Q VTGDSHFQSV ++N  FR GE          H  P
Subjt:  MRRVATPPPPLHQLSVAVATTATNTSAAMSSRPPYRA----GRYERHRGFSSERPYS--GGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAP

Query:  QYPR-PPSFRGNHQFRQAPPS-SQRHQYRGPHPHPHHQQ-----PPSFNQNQGVRMP--QQLRPRP-PKRLDFRHWDYAKTPPPSTCERFSILSYNILAD
           R  P F  N++FR  PPS  Q  Q+R P+  P +Q      PP F QNQ  R P  Q  R RP  K  D+R W+YAKTPP    E+F +LSYNILAD
Subjt:  QYPR-PPSFRGNHQFRQAPPS-SQRHQYRGPHPHPHHQQ-----PPSFNQNQGVRMP--QQLRPRP-PKRLDFRHWDYAKTPPPSTCERFSILSYNILAD

Query:  YLAMNHKQKLYHHIPPYMLDWEWRKNNILFELGLWSTDIMC------FQSFRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAY
        YLA +H + LY HIP  ML W WRK+ ++FEL LWS DIMC      FQ    E+++     I       K  T   V  CA+ ++    +   E    +
Subjt:  YLAMNHKQKLYHHIPPYMLDWEWRKNNILFELGLWSTDIMC------FQSFRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAY

Query:  QELRNDVFFRTKCLGKGFSKCDFLFPKVGRSRGEGVAEV-FSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAH
         +L                              + VA++     L  +     E  PP +S G  + ++   +      R    + +    VR LL+KAH
Subjt:  QELRNDVFFRTKCLGKGFSKCDFLFPKVGRSRGEGVAEV-FSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAH

Query:  AISKIWDNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSH
        A+SK+WD+APIVLCGDFNCTPKS LYNFIS++KLDLSGL RDKVSGQ S+E   P                    RPE+     +  N     Q Q    
Subjt:  AISKIWDNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSH

Query:  SLMENENLPSVNRFLPPDGSNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFS
                       PP   N++ +A   + +++ +G                           T   EKT     S   C  D   AGH          
Subjt:  SLMENENLPSVNRFLPPDGSNVVFDALDTSCNDLQLGMKGTTLHSEGQKESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFS

Query:  RSGTDPEVLHLNDSERQQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDS--SDVVTFDHSI
                            E  +SS       +   D  FG E++   D+  L  A+  S  T  D+    ++ A ++  TD +  S  S+       I
Subjt:  RSGTDPEVLHLNDSERQQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNIILDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDS--SDVVTFDHSI

Query:  AEFEKE-SSARNIEGGPSMVLSRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSF
           +++ SS+ + +    +   + D +    P + + DE+       SL ED  TFL+ LH   +  S +    VS   L               +S++ 
Subjt:  AEFEKE-SSARNIEGGPSMVLSRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALHHVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSF

Query:  EVKNITHDRSLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPT
            IT+  S WT MEI TATG+ + T +EH+L L+STY+E E  + TRD NGEP+ TSY+RCF+GTVDYIWRSEGLQTVRVLAPI KQ M Q TPGFPT
Subjt:  EVKNITHDRSLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPT

Query:  KKWGSDHIALATELAFVRS
         KWGSDHIAL +ELAF  S
Subjt:  KKWGSDHIALATELAFVRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGCGCGTTGCTACTCCTCCTCCTCCGCTCCATCAACTGTCCGTCGCCGTCGCCACGACCGCTACCAACACATCCGCTGCCATGTCTTCTCGACCTCCATAC
CGAGCTGGCCGGTACGAACGCCACCGAGGCTTCTCGTCGGAACGGCCATACTCTGGCGGTAAAGGTCAATTAGTCACCGGTGATTCTCATTTTCAGTCTGTTCGG
GAGTCAAACCTAGGGTTCCGGCAAGGAGAGAGGGGAGGCTTTGCGAACAATGCGGGGTCGCATACGGCACCTCAATATCCTAGACCTCCGTCTTTTCGTGGAAAT
CATCAATTCCGACAGGCTCCGCCTTCCAGTCAGAGACACCAGTATCGGGGACCTCATCCTCATCCTCACCATCAGCAGCCACCGTCGTTTAATCAAAATCAAGGT
GTTCGTATGCCGCAGCAACTTCGACCTCGGCCTCCGAAGCGGCTAGACTTTCGTCATTGGGATTATGCAAAGACACCACCTCCATCTACTTGCGAGCGGTTTTCA
ATTCTTTCATACAACATCTTAGCTGATTACCTTGCCATGAATCACAAGCAAAAGCTCTACCATCATATTCCCCCTTACATGTTGGATTGGGAGTGGAGGAAAAAT
AATATTTTATTTGAGCTTGGATTATGGTCTACTGACATAATGTGCTTTCAGTCTTTTAGACTGGAGATCAGGAACATGTTCAAACCTTTTATTGATTTCGATCTG
TTGCCAATTAAAACCCATACTGATTTGGTAGTGTGTAGATGCGCACTGGAATTCCAGTTGACGGCTGTGCAATCTTTTGGCGAGTGTCAAGGAGCATACCAGGAG
TTAAGGAATGATGTTTTCTTTCGCACAAAATGCCTTGGAAAAGGCTTCAGCAAGTGTGATTTTCTTTTTCCAAAAGTTGGAAGATCGAGGGGCGAAGGGGTAGCA
GAGGTCTTTTCAGGACCCTTGGGAATTAATGGCTTGGATGGAGAAGAGTTAATGCCGCCTACCAACTCTCCTGGAGAGAGAAAAGGACTGCTTGTCTTTTACAAT
GCTTCAGAGCAACAACCGAGGGACTCTCCTTCCATTAATGAAAGACAATCGCTGGTCAGGGTTCTCTTGGAGAAGGCACATGCTATTTCAAAAATCTGGGACAAT
GCTCCAATCGTTCTATGTGGGGATTTTAACTGTACACCGAAGAGTGCATTGTATAACTTTATTTCAGAGCAGAAGCTAGATTTGTCTGGATTGGATAGAGACAAG
GTATCGGGACAATCTTCTTCCGAGATTTATCAACCTTCATCACTCTATCGTAATCCTCGGCTTCAGATTGACAAAGGTTCAGTTCCCCTCCAGCTGAGGCCAGAA
TCTAGTGTTATTGAAAGGAAGCCAGATAATTCTCTGTCTGACATACAGAAGCAAGAATGTTCACATAGCTTGATGGAAAACGAGAATCTTCCATCAGTGAACCGC
TTTTTGCCCCCTGATGGTTCTAACGTTGTCTTTGATGCACTTGATACTTCTTGTAACGATCTCCAGCTTGGAATGAAGGGTACTACTCTACATTCTGAAGGCCAA
AAGGAAAGTCAGCATAGTGCTTTGTTTGACCACAAAAATGTAGGGGAAACAACTTGCTGTGAGAAGACAGATGGCTTCAATGAAAGTTCAACCACATGTGCCAAA
GATGAGTTTACTGCTGGTCATACCAGTAAAAAAGTTGGTGAACTATTCTCCCGTTCAGGAACCGACCCTGAAGTACTTCATCTGAATGACTCTGAGAGACAACAG
ATGGAAGAAGTTGATACCTCTAGTTTAAACAATAAATCTTCAACAGATGGTTTTCAGGATCACAATTTTGGCAAGGAAAGCAAAGACACTGTTGATAATATTATC
TTAGATGACGCACAGCTTTATTCTCAGGCAACTTTTTTGGATTCAAATAGCGTTTCTTCTACACCTGCTTGCAAAAACTCCACGACCGACACTGCTAGAGACTCC
TCTGATGTTGTAACTTTTGACCACTCAATTGCTGAATTTGAGAAGGAAAGCTCTGCTAGGAATATTGAAGGTGGCCCATCAATGGTTTTGTCCAGGACTGACTCG
GTGGTGGATGGAAGACCAAAGATTTTATCTTCAGATGAGCAGGATGTAGCTGTGTTAAATGGAAGCTTAACTGAGGATGATCATACATTTCTCTCAGCCCTGCAT
CATGTTAAAGATCCCTTTTCATCTGAGATTCATCATTCTGTTAGTCATCAAAGCTTGGTTGCACCACCCACTGGAGTTGAAGATGATTTGTTGCCAGGATTGAAT
TCCAAGTCTTTTGAAGTAAAAAATATTACTCATGATCGCTCATTATGGACTTCAATGGAAATAGAAACTGCTACTGGCAATGCAGATAGTACTCTAATTGAACAC
TCTCTAAGGCTTAGAAGCACGTATACAGAAGCTGAGGACCTTTCTGGGACCAGAGACTTGAATGGGGAACCCCTGGCGACTAGTTACAATAGGTGTTTTCTGGGC
ACTGTTGACTACATATGGCGTTCAGAAGGTCTTCAGACGGTTAGGGTGCTCGCCCCTATAAGAAAACAAGTCATGCAGCAGTTGACTCCTGGATTTCCTACAAAG
AAATGGGGCAGCGATCACATTGCCTTAGCTACTGAATTGGCGTTTGTAAGGAGTAGTGAAGAGTGA
mRNA sequenceShow/hide mRNA sequence
TGAGGAAGCCCTCCGTGCTATTATTTCGCGAAAGAATCGTCACTGAGTCTTCACACTCTCTCCCGCCATTCAATGAGGCGCGTTGCTACTCCTCCTCCTCCGCTC
CATCAACTGTCCGTCGCCGTCGCCACGACCGCTACCAACACATCCGCTGCCATGTCTTCTCGACCTCCATACCGAGCTGGCCGGTACGAACGCCACCGAGGCTTC
TCGTCGGAACGGCCATACTCTGGCGGTAAAGGTCAATTAGTCACCGGTGATTCTCATTTTCAGTCTGTTCGGGAGTCAAACCTAGGGTTCCGGCAAGGAGAGAGG
GGAGGCTTTGCGAACAATGCGGGGTCGCATACGGCACCTCAATATCCTAGACCTCCGTCTTTTCGTGGAAATCATCAATTCCGACAGGCTCCGCCTTCCAGTCAG
AGACACCAGTATCGGGGACCTCATCCTCATCCTCACCATCAGCAGCCACCGTCGTTTAATCAAAATCAAGGTGTTCGTATGCCGCAGCAACTTCGACCTCGGCCT
CCGAAGCGGCTAGACTTTCGTCATTGGGATTATGCAAAGACACCACCTCCATCTACTTGCGAGCGGTTTTCAATTCTTTCATACAACATCTTAGCTGATTACCTT
GCCATGAATCACAAGCAAAAGCTCTACCATCATATTCCCCCTTACATGTTGGATTGGGAGTGGAGGAAAAATAATATTTTATTTGAGCTTGGATTATGGTCTACT
GACATAATGTGCTTTCAGTCTTTTAGACTGGAGATCAGGAACATGTTCAAACCTTTTATTGATTTCGATCTGTTGCCAATTAAAACCCATACTGATTTGGTAGTG
TGTAGATGCGCACTGGAATTCCAGTTGACGGCTGTGCAATCTTTTGGCGAGTGTCAAGGAGCATACCAGGAGTTAAGGAATGATGTTTTCTTTCGCACAAAATGC
CTTGGAAAAGGCTTCAGCAAGTGTGATTTTCTTTTTCCAAAAGTTGGAAGATCGAGGGGCGAAGGGGTAGCAGAGGTCTTTTCAGGACCCTTGGGAATTAATGGC
TTGGATGGAGAAGAGTTAATGCCGCCTACCAACTCTCCTGGAGAGAGAAAAGGACTGCTTGTCTTTTACAATGCTTCAGAGCAACAACCGAGGGACTCTCCTTCC
ATTAATGAAAGACAATCGCTGGTCAGGGTTCTCTTGGAGAAGGCACATGCTATTTCAAAAATCTGGGACAATGCTCCAATCGTTCTATGTGGGGATTTTAACTGT
ACACCGAAGAGTGCATTGTATAACTTTATTTCAGAGCAGAAGCTAGATTTGTCTGGATTGGATAGAGACAAGGTATCGGGACAATCTTCTTCCGAGATTTATCAA
CCTTCATCACTCTATCGTAATCCTCGGCTTCAGATTGACAAAGGTTCAGTTCCCCTCCAGCTGAGGCCAGAATCTAGTGTTATTGAAAGGAAGCCAGATAATTCT
CTGTCTGACATACAGAAGCAAGAATGTTCACATAGCTTGATGGAAAACGAGAATCTTCCATCAGTGAACCGCTTTTTGCCCCCTGATGGTTCTAACGTTGTCTTT
GATGCACTTGATACTTCTTGTAACGATCTCCAGCTTGGAATGAAGGGTACTACTCTACATTCTGAAGGCCAAAAGGAAAGTCAGCATAGTGCTTTGTTTGACCAC
AAAAATGTAGGGGAAACAACTTGCTGTGAGAAGACAGATGGCTTCAATGAAAGTTCAACCACATGTGCCAAAGATGAGTTTACTGCTGGTCATACCAGTAAAAAA
GTTGGTGAACTATTCTCCCGTTCAGGAACCGACCCTGAAGTACTTCATCTGAATGACTCTGAGAGACAACAGATGGAAGAAGTTGATACCTCTAGTTTAAACAAT
AAATCTTCAACAGATGGTTTTCAGGATCACAATTTTGGCAAGGAAAGCAAAGACACTGTTGATAATATTATCTTAGATGACGCACAGCTTTATTCTCAGGCAACT
TTTTTGGATTCAAATAGCGTTTCTTCTACACCTGCTTGCAAAAACTCCACGACCGACACTGCTAGAGACTCCTCTGATGTTGTAACTTTTGACCACTCAATTGCT
GAATTTGAGAAGGAAAGCTCTGCTAGGAATATTGAAGGTGGCCCATCAATGGTTTTGTCCAGGACTGACTCGGTGGTGGATGGAAGACCAAAGATTTTATCTTCA
GATGAGCAGGATGTAGCTGTGTTAAATGGAAGCTTAACTGAGGATGATCATACATTTCTCTCAGCCCTGCATCATGTTAAAGATCCCTTTTCATCTGAGATTCAT
CATTCTGTTAGTCATCAAAGCTTGGTTGCACCACCCACTGGAGTTGAAGATGATTTGTTGCCAGGATTGAATTCCAAGTCTTTTGAAGTAAAAAATATTACTCAT
GATCGCTCATTATGGACTTCAATGGAAATAGAAACTGCTACTGGCAATGCAGATAGTACTCTAATTGAACACTCTCTAAGGCTTAGAAGCACGTATACAGAAGCT
GAGGACCTTTCTGGGACCAGAGACTTGAATGGGGAACCCCTGGCGACTAGTTACAATAGGTGTTTTCTGGGCACTGTTGACTACATATGGCGTTCAGAAGGTCTT
CAGACGGTTAGGGTGCTCGCCCCTATAAGAAAACAAGTCATGCAGCAGTTGACTCCTGGATTTCCTACAAAGAAATGGGGCAGCGATCACATTGCCTTAGCTACT
GAATTGGCGTTTGTAAGGAGTAGTGAAGAGTGATACCATGAATCTTAGGAGCAAATGATAATAAGTAAATAAATACTTTCCCTTGTAATTGGAGTTTTTTTATTT
GAGGAACCCAACTGGTTCCAACGGCACAAGCCTGCAGGTGCAATGTGATGAGTGGAGGCAACAGAGCCTAAGACAGTGAGGTTTTTTTTTTTTTTAAATTATTAA
ATTTATAGATGTAGTTCCTCACGAATGCTCAGTTTCAGGTTACTTTATAGCTGTTGTTTTACTTCATTGATTATTTGGCCTGGATGAGCTATGTCACTGTGTGTT
GCAGTTGAGTTTGTCTTTTTTATATTTACGAATTAGATTGGAATTGCCTCCCTTACTGAACTCTCATATGTTGCCTGCATTTTTGTATTCTATGAATCTAACCTA
ATGTTTGTTATAGGAGTGATACCATGGTTACAATACAACCTAATGTTTGTTA
Protein sequenceShow/hide protein sequence
MRRVATPPPPLHQLSVAVATTATNTSAAMSSRPPYRAGRYERHRGFSSERPYSGGKGQLVTGDSHFQSVRESNLGFRQGERGGFANNAGSHTAPQYPRPPSFRGN
HQFRQAPPSSQRHQYRGPHPHPHHQQPPSFNQNQGVRMPQQLRPRPPKRLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMNHKQKLYHHIPPYMLDWEWRKN
NILFELGLWSTDIMCFQSFRLEIRNMFKPFIDFDLLPIKTHTDLVVCRCALEFQLTAVQSFGECQGAYQELRNDVFFRTKCLGKGFSKCDFLFPKVGRSRGEGVA
EVFSGPLGINGLDGEELMPPTNSPGERKGLLVFYNASEQQPRDSPSINERQSLVRVLLEKAHAISKIWDNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDK
VSGQSSSEIYQPSSLYRNPRLQIDKGSVPLQLRPESSVIERKPDNSLSDIQKQECSHSLMENENLPSVNRFLPPDGSNVVFDALDTSCNDLQLGMKGTTLHSEGQ
KESQHSALFDHKNVGETTCCEKTDGFNESSTTCAKDEFTAGHTSKKVGELFSRSGTDPEVLHLNDSERQQMEEVDTSSLNNKSSTDGFQDHNFGKESKDTVDNII
LDDAQLYSQATFLDSNSVSSTPACKNSTTDTARDSSDVVTFDHSIAEFEKESSARNIEGGPSMVLSRTDSVVDGRPKILSSDEQDVAVLNGSLTEDDHTFLSALH
HVKDPFSSEIHHSVSHQSLVAPPTGVEDDLLPGLNSKSFEVKNITHDRSLWTSMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLG
TVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSSEE