; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi04G021490 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi04G021490
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptioncarbon catabolite repressor protein 4 homolog 6 isoform X1
Genome locationchr04:28590395..28600342
RNA-Seq ExpressionLsi04G021490
SyntenyLsi04G021490
Gene Ontology termsGO:0090503 - RNA phosphodiester bond hydrolysis, exonucleolytic (biological process)
GO:0000175 - 3'-5'-exoribonuclease activity (molecular function)
InterPro domainsIPR005135 - Endonuclease/exonuclease/phosphatase
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008466384.1 PREDICTED: carbon catabolite repressor protein 4 homolog 6 isoform X1 [Cucumis melo]0.0e+0083.33Show/hide
Query:  MRRAATPPPPLHQLSVAV-----ATNTSAAMSSRPPYR-GGRYGRNRGFSSERPYSGGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYAAPQNP
        MRRAATPPPPLHQLS AV     ATNTS AMSSR PYR GGRYGR+RGFSSERPYSGGRGQFV+ DSHFQSV+ESNLGFRQGERGG  NNAGSY AP+NP
Subjt:  MRRAATPPPPLHQLSVAV-----ATNTSAAMSSRPPYR-GGRYGRNRGFSSERPYSGGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYAAPQNP

Query:  RPPSFGGNHQFRQAPPSTQRHQYRGPHPRTHYQQPPSFNQNQGVRMPQQLRPRPPKPLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHH
        RPPSFGGNHQFRQAPPSTQRHQYRGPHP THYQQPPSFNQNQGVRMPQQ RPRPPKPLDFRHWDYAKT PPSTCERFSILSYNILADYLAMDHK KLYHH
Subjt:  RPPSFGGNHQFRQAPPSTQRHQYRGPHPRTHYQQPPSFNQNQGVRMPQQLRPRPPKPLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHH

Query:  IPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRS
        IP YMLDWEWRKN+ILFELGLWSTDIMCFQ                      MRTGIPVDGCAIFWRVSRFKLLHEE IEFN+LGLRDNVAQICVLEQRS
Subjt:  IPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRS

Query:  QDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAE
        QDNGDNSVTPP+STSN N+VV+CNIHVLYNPRRGEIKLGQVRVLLEKAH ISKIWNNAP+VLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAE
Subjt:  QDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAE

Query:  IHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDLQLEVKG-TIHSERQKES
        I QPS LY NPR Q ANGSVPLQ RSESSD ERKPDSS++DIQKQDCSH+C+ENENL SVN+ L PD SH   DA DTSCN+ QL +KG T+HSE QKE 
Subjt:  IHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDLQLEVKG-TIHSERQKES

Query:  QHSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADTSRLNNKSSIDDFEDHNFGKESSDTVDN
        QHSAL DHKNVGETTFCEKTD FN+ SITCA+DEF VGHTSK VGELVSPLGTDPEVLHLNETE  Q+E+ DT  LNN  S D F+DHNFGK+S D+V+ 
Subjt:  QHSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADTSRLNNKSSIDDFEDHNFGKESSDTVDN

Query:  IILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFDHSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKILSSDEKDVAPLNGSLIED
        IILDD QL SQT FLD KNVSSTP C+NSMADTAIDS+DVVTF HS IA  EKESSS RNIEGG S+GLPRI+ AVDERP+ILSSDE+DVA LNGSLIED
Subjt:  IILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFDHSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKILSSDEKDVAPLNGSLIED

Query:  DRTFLSALHDVEDPFSSEI-----HQSLVAPPTGVD-DLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLN
        DRTFLSALHDVEDPFSS++     HQSLVAPPTG + +LLPGL+TKS EVE + HDRSLWTP EIETATGNADSTL+EHSLRLRSTYTEAEDLSGTRDLN
Subjt:  DRTFLSALHDVEDPFSSEI-----HQSLVAPPTGVD-DLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLN

Query:  GEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE
        GEPLATSYNRCFLGTVDYIWRSEGLQTVRVL+PIRKQVMQQLT GFPTKKWGSDHIALATELAFV SREE
Subjt:  GEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE

XP_008466385.1 PREDICTED: carbon catabolite repressor protein 4 homolog 6 isoform X2 [Cucumis melo]0.0e+0083.22Show/hide
Query:  MRRAATPPPPLHQLSVAV-----ATNTSAAMSSRPPYR-GGRYGRNRGFSSERPYSGGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYAAPQNP
        MRRAATPPPPLHQLS AV     ATNTS AMSSR PYR GGRYGR+RGFSSERPYSGGRGQFV+ DSHFQSV+ESNLGFRQGERGG  NNAGSY AP+NP
Subjt:  MRRAATPPPPLHQLSVAV-----ATNTSAAMSSRPPYR-GGRYGRNRGFSSERPYSGGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYAAPQNP

Query:  RPPSFGGNHQFRQAPPSTQRHQYRGPHPRTHYQQPPSFNQNQGVRMPQQLRPRPPKPLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHH
        RPPSFGGNHQFRQAPPSTQRHQYRGPHP THYQQPPSFNQNQGVRMPQQ RPRPPKPLDFRHWDYAKT PPSTCERFSILSYNILADYLAMDHK KLYHH
Subjt:  RPPSFGGNHQFRQAPPSTQRHQYRGPHPRTHYQQPPSFNQNQGVRMPQQLRPRPPKPLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHH

Query:  IPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRS
        IP YMLDWEWRKN+ILFELGLWSTDIMCFQ                      MRTGIPVDGCAIFWRVSRFKLLHEE IEFN+LGLRDNVAQICVLE RS
Subjt:  IPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRS

Query:  QDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAE
        QDNGDNSVTPP+STSN N+VV+CNIHVLYNPRRGEIKLGQVRVLLEKAH ISKIWNNAP+VLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAE
Subjt:  QDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAE

Query:  IHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDLQLEVKG-TIHSERQKES
        I QPS LY NPR Q ANGSVPLQ RSESSD ERKPDSS++DIQKQDCSH+C+ENENL SVN+ L PD SH   DA DTSCN+ QL +KG T+HSE QKE 
Subjt:  IHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDLQLEVKG-TIHSERQKES

Query:  QHSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADTSRLNNKSSIDDFEDHNFGKESSDTVDN
        QHSAL DHKNVGETTFCEKTD FN+ SITCA+DEF VGHTSK VGELVSPLGTDPEVLHLNETE  Q+E+ DT  LNN  S D F+DHNFGK+S D+V+ 
Subjt:  QHSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADTSRLNNKSSIDDFEDHNFGKESSDTVDN

Query:  IILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFDHSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKILSSDEKDVAPLNGSLIED
        IILDD QL SQT FLD KNVSSTP C+NSMADTAIDS+DVVTF HS IA  EKESSS RNIEGG S+GLPRI+ AVDERP+ILSSDE+DVA LNGSLIED
Subjt:  IILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFDHSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKILSSDEKDVAPLNGSLIED

Query:  DRTFLSALHDVEDPFSSEI-----HQSLVAPPTGVD-DLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLN
        DRTFLSALHDVEDPFSS++     HQSLVAPPTG + +LLPGL+TKS EVE + HDRSLWTP EIETATGNADSTL+EHSLRLRSTYTEAEDLSGTRDLN
Subjt:  DRTFLSALHDVEDPFSSEI-----HQSLVAPPTGVD-DLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLN

Query:  GEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE
        GEPLATSYNRCFLGTVDYIWRSEGLQTVRVL+PIRKQVMQQLT GFPTKKWGSDHIALATELAFV SREE
Subjt:  GEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE

XP_011652489.1 carbon catabolite repressor protein 4 homolog 6 isoform X1 [Cucumis sativus]0.0e+0083.31Show/hide
Query:  MRRAATPPPPLHQLSVAV-----ATNTSAAMSSRPPYRGGRYGRNRGFSSERPYSGGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYAAPQNPR
        MRRAATPPPPLHQLS AV     ATNTS AMSSRPPYRGG YGR+RG+SSERPYSGGRGQFV+GDSHFQSV+ESNLGFRQGERGG  NNAGSY AP+NPR
Subjt:  MRRAATPPPPLHQLSVAV-----ATNTSAAMSSRPPYRGGRYGRNRGFSSERPYSGGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYAAPQNPR

Query:  PPSFGGNHQFRQAPPSTQRHQYRGPHPRTHYQQPPSFNQNQGVRMPQQLRPRPPKPLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHHI
        PPSFGGNHQFRQAPPS+QRHQYRGP+P THYQQPPSFNQNQGVRMPQQ R RPPKPLDFRHWDYAKT PP TCERFSILSYNILADYLAMDHKQKLYHHI
Subjt:  PPSFGGNHQFRQAPPSTQRHQYRGPHPRTHYQQPPSFNQNQGVRMPQQLRPRPPKPLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHHI

Query:  PRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRSQ
        P YMLDWEWRKN+ILFELGLWSTDIMCFQ                      MRTGIPVDGCAIFWRVSRFKLL EE IEFNKLGLRDNVAQICVLEQR+Q
Subjt:  PRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRSQ

Query:  DNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAEI
        DNGDNSVT PISTSN N+VV+CNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAEI
Subjt:  DNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAEI

Query:  HQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDLQLEVKG-TIHSERQKESQ
         QPS LY NPR Q ANGSVPLQ RSESSD E KPDSS+SDIQKQDCSHSCM+NENL S N+ L PD SHI  DA DTSCN+LQL +KG T+HSE QKESQ
Subjt:  HQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDLQLEVKG-TIHSERQKESQ

Query:  HSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADTSRLNNKSSIDDFEDHNFGKESSDTVDNI
        HSAL DHKNVGETTFCEKTDSFN+ S+TCA+DEF VGHTSK +GELVSPLGTDP+V HLNETE+RQ+E+    RLNN SS D F+DHN  K+S D+V+ I
Subjt:  HSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADTSRLNNKSSIDDFEDHNFGKESSDTVDNI

Query:  ILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFDHSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKILSSDEKDVAPLNGSLIEDD
        ILDD QL S+TVFLD KNVSSTP C+NSMADTAIDS DVVT DHS IAE EKESSSARNIEGG S+GLPRI+  VDERP+ILSSDE+DVA LNGSL EDD
Subjt:  ILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFDHSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKILSSDEKDVAPLNGSLIEDD

Query:  RTFLSALHDVEDPFSSEI-----HQSLVAPPTGV-DDLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNG
        RTFLSALHDVEDPFS E+     HQSLVAP TG  +DLLPGLNTKS EVEN+ HDRSLWTP +IETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNG
Subjt:  RTFLSALHDVEDPFSSEI-----HQSLVAPPTGV-DDLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNG

Query:  EPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE
        EPLATSYNRCFLGTVDYIWRSEGLQTV+VLAPIRKQVMQQLT GFPTKKWGSDHIALATELAFV SREE
Subjt:  EPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE

XP_038897964.1 carbon catabolite repressor protein 4 homolog 6-like isoform X1 [Benincasa hispida]0.0e+0085.37Show/hide
Query:  MRRAATPPPPLHQLSVAV-----------ATNTSAAMSSRPPYRGGRYGRNRGFSSERPYSGGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYA
        MRRAATPPPPLHQLSVAV           ATNTSA MSSRPPYRGGRYG +RGFSSERPYSGGRGQFVTGDSHFQSVRESNLGF++GERGG ANNAG Y+
Subjt:  MRRAATPPPPLHQLSVAV-----------ATNTSAAMSSRPPYRGGRYGRNRGFSSERPYSGGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYA

Query:  APQNPRPPSFGGNHQFRQAPPSTQRHQYRGPHPRTHYQQPPSFNQNQGVRMPQQLRPRPPKPLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQ
        A QNPRPPSFGGNHQFRQAPPSTQRHQYRGPHP TH QQPPSFNQNQGV MPQQ+RPRPPKPLD+RHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQ
Subjt:  APQNPRPPSFGGNHQFRQAPPSTQRHQYRGPHPRTHYQQPPSFNQNQGVRMPQQLRPRPPKPLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQ

Query:  KLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICV
        KLY HIP YMLDWEWRKN+ILFELGLWSTDIMCFQ                      MRTG PVDGCAIFW +SRFKLLHEESIEFNKLGLRDNVAQICV
Subjt:  KLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICV

Query:  LEQRSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSG
        LEQ  +D+GDNSVTPPISTSNHNKVVICNIHVLYNP+RGEIKLGQVRVLLEKAHAISK W+NAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSG
Subjt:  LEQRSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSG

Query:  QSSAEIHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDLQLEVKG-TIHSE
        QSSAEIHQPS  + NPRLQTANGSVPLQ RSESSDIERK DSSLSDIQKQDCS SCMENENLPSVNH LPPDSSHIVFDAPDTSCNDLQL +KG T+HSE
Subjt:  QSSAEIHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDLQLEVKG-TIHSE

Query:  RQKESQHSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADTSRLNNKSSIDDFEDHNFGKESS
         +KESQ SAL DHKN GETT CEKTDSFN++SITCAKDEFTVGHTSK VGELVSPLGTDPE++HLNETERRQME+ D S L NKSS D +EDHNFGKES 
Subjt:  RQKESQHSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADTSRLNNKSSIDDFEDHNFGKESS

Query:  DTVDNIILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFDHSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKILSSDEKDVAPLNG
        DTVD +ILDDAQLYSQTV  DSKNVSSTPAC+NSMA+TAIDS+DVVTFD S   EFEKESSS RNIEGG S GLP ID  +DERPKI  SDE+DVA LNG
Subjt:  DTVDNIILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFDHSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKILSSDEKDVAPLNG

Query:  SLIEDDRTFLSALHDVEDPFSSEIHQS-----LVAPPTGV-DDLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSG
        SL EDD+TFLSALH VEDPFSS+IH S     LV PPTGV DDLLPGLNTKSFEVEN+THDRSLWTPMEIETATGNAD TLIEHSLRLRSTYTEAEDLSG
Subjt:  SLIEDDRTFLSALHDVEDPFSSEIHQS-----LVAPPTGV-DDLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSG

Query:  TRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE
        TRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE
Subjt:  TRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE

XP_038897965.1 carbon catabolite repressor protein 4 homolog 6-like isoform X2 [Benincasa hispida]0.0e+0085.26Show/hide
Query:  MRRAATPPPPLHQLSVAV-----------ATNTSAAMSSRPPYRGGRYGRNRGFSSERPYSGGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYA
        MRRAATPPPPLHQLSVAV           ATNTSA MSSRPPYRGGRYG +RGFSSERPYSGGRGQFVTGDSHFQSVRESNLGF++GERGG ANNAG Y+
Subjt:  MRRAATPPPPLHQLSVAV-----------ATNTSAAMSSRPPYRGGRYGRNRGFSSERPYSGGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYA

Query:  APQNPRPPSFGGNHQFRQAPPSTQRHQYRGPHPRTHYQQPPSFNQNQGVRMPQQLRPRPPKPLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQ
        A QNPRPPSFGGNHQFRQAPPSTQRHQYRGPHP TH QQPPSFNQNQGV MPQQ+RPRPPKPLD+RHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQ
Subjt:  APQNPRPPSFGGNHQFRQAPPSTQRHQYRGPHPRTHYQQPPSFNQNQGVRMPQQLRPRPPKPLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQ

Query:  KLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICV
        KLY HIP YMLDWEWRKN+ILFELGLWSTDIMCFQ                      MRTG PVDGCAIFW +SRFKLLHEESIEFNKLGLRDNVAQICV
Subjt:  KLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICV

Query:  LEQRSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSG
        LE   +D+GDNSVTPPISTSNHNKVVICNIHVLYNP+RGEIKLGQVRVLLEKAHAISK W+NAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSG
Subjt:  LEQRSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSG

Query:  QSSAEIHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDLQLEVKG-TIHSE
        QSSAEIHQPS  + NPRLQTANGSVPLQ RSESSDIERK DSSLSDIQKQDCS SCMENENLPSVNH LPPDSSHIVFDAPDTSCNDLQL +KG T+HSE
Subjt:  QSSAEIHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDLQLEVKG-TIHSE

Query:  RQKESQHSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADTSRLNNKSSIDDFEDHNFGKESS
         +KESQ SAL DHKN GETT CEKTDSFN++SITCAKDEFTVGHTSK VGELVSPLGTDPE++HLNETERRQME+ D S L NKSS D +EDHNFGKES 
Subjt:  RQKESQHSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADTSRLNNKSSIDDFEDHNFGKESS

Query:  DTVDNIILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFDHSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKILSSDEKDVAPLNG
        DTVD +ILDDAQLYSQTV  DSKNVSSTPAC+NSMA+TAIDS+DVVTFD S   EFEKESSS RNIEGG S GLP ID  +DERPKI  SDE+DVA LNG
Subjt:  DTVDNIILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFDHSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKILSSDEKDVAPLNG

Query:  SLIEDDRTFLSALHDVEDPFSSEIHQS-----LVAPPTGV-DDLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSG
        SL EDD+TFLSALH VEDPFSS+IH S     LV PPTGV DDLLPGLNTKSFEVEN+THDRSLWTPMEIETATGNAD TLIEHSLRLRSTYTEAEDLSG
Subjt:  SLIEDDRTFLSALHDVEDPFSSEIHQS-----LVAPPTGV-DDLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSG

Query:  TRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE
        TRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE
Subjt:  TRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE

TrEMBL top hitse value%identityAlignment
A0A0A0LHB6 Endo/exonuclease/phosphatase domain-containing protein0.0e+0083.2Show/hide
Query:  MRRAATPPPPLHQLSVAV-----ATNTSAAMSSRPPYRGGRYGRNRGFSSERPYSGGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYAAPQNPR
        MRRAATPPPPLHQLS AV     ATNTS AMSSRPPYRGG YGR+RG+SSERPYSGGRGQFV+GDSHFQSV+ESNLGFRQGERGG  NNAGSY AP+NPR
Subjt:  MRRAATPPPPLHQLSVAV-----ATNTSAAMSSRPPYRGGRYGRNRGFSSERPYSGGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYAAPQNPR

Query:  PPSFGGNHQFRQAPPSTQRHQYRGPHPRTHYQQPPSFNQNQGVRMPQQLRPRPPKPLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHHI
        PPSFGGNHQFRQAPPS+QRHQYRGP+P THYQQPPSFNQNQGVRMPQQ R RPPKPLDFRHWDYAKT PP TCERFSILSYNILADYLAMDHKQKLYHHI
Subjt:  PPSFGGNHQFRQAPPSTQRHQYRGPHPRTHYQQPPSFNQNQGVRMPQQLRPRPPKPLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHHI

Query:  PRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRSQ
        P YMLDWEWRKN+ILFELGLWSTDIMCFQ                      MRTGIPVDGCAIFWRVSRFKLL EE IEFNKLGLRDNVAQICVLE R+Q
Subjt:  PRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRSQ

Query:  DNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAEI
        DNGDNSVT PISTSN N+VV+CNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAEI
Subjt:  DNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAEI

Query:  HQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDLQLEVKG-TIHSERQKESQ
         QPS LY NPR Q ANGSVPLQ RSESSD E KPDSS+SDIQKQDCSHSCM+NENL S N+ L PD SHI  DA DTSCN+LQL +KG T+HSE QKESQ
Subjt:  HQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDLQLEVKG-TIHSERQKESQ

Query:  HSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADTSRLNNKSSIDDFEDHNFGKESSDTVDNI
        HSAL DHKNVGETTFCEKTDSFN+ S+TCA+DEF VGHTSK +GELVSPLGTDP+V HLNETE+RQ+E+    RLNN SS D F+DHN  K+S D+V+ I
Subjt:  HSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADTSRLNNKSSIDDFEDHNFGKESSDTVDNI

Query:  ILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFDHSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKILSSDEKDVAPLNGSLIEDD
        ILDD QL S+TVFLD KNVSSTP C+NSMADTAIDS DVVT DHS IAE EKESSSARNIEGG S+GLPRI+  VDERP+ILSSDE+DVA LNGSL EDD
Subjt:  ILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFDHSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKILSSDEKDVAPLNGSLIEDD

Query:  RTFLSALHDVEDPFSSEI-----HQSLVAPPTGV-DDLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNG
        RTFLSALHDVEDPFS E+     HQSLVAP TG  +DLLPGLNTKS EVEN+ HDRSLWTP +IETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNG
Subjt:  RTFLSALHDVEDPFSSEI-----HQSLVAPPTGV-DDLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNG

Query:  EPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE
        EPLATSYNRCFLGTVDYIWRSEGLQTV+VLAPIRKQVMQQLT GFPTKKWGSDHIALATELAFV SREE
Subjt:  EPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE

A0A1S3CRA9 carbon catabolite repressor protein 4 homolog 6 isoform X10.0e+0083.33Show/hide
Query:  MRRAATPPPPLHQLSVAV-----ATNTSAAMSSRPPYR-GGRYGRNRGFSSERPYSGGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYAAPQNP
        MRRAATPPPPLHQLS AV     ATNTS AMSSR PYR GGRYGR+RGFSSERPYSGGRGQFV+ DSHFQSV+ESNLGFRQGERGG  NNAGSY AP+NP
Subjt:  MRRAATPPPPLHQLSVAV-----ATNTSAAMSSRPPYR-GGRYGRNRGFSSERPYSGGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYAAPQNP

Query:  RPPSFGGNHQFRQAPPSTQRHQYRGPHPRTHYQQPPSFNQNQGVRMPQQLRPRPPKPLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHH
        RPPSFGGNHQFRQAPPSTQRHQYRGPHP THYQQPPSFNQNQGVRMPQQ RPRPPKPLDFRHWDYAKT PPSTCERFSILSYNILADYLAMDHK KLYHH
Subjt:  RPPSFGGNHQFRQAPPSTQRHQYRGPHPRTHYQQPPSFNQNQGVRMPQQLRPRPPKPLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHH

Query:  IPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRS
        IP YMLDWEWRKN+ILFELGLWSTDIMCFQ                      MRTGIPVDGCAIFWRVSRFKLLHEE IEFN+LGLRDNVAQICVLEQRS
Subjt:  IPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRS

Query:  QDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAE
        QDNGDNSVTPP+STSN N+VV+CNIHVLYNPRRGEIKLGQVRVLLEKAH ISKIWNNAP+VLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAE
Subjt:  QDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAE

Query:  IHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDLQLEVKG-TIHSERQKES
        I QPS LY NPR Q ANGSVPLQ RSESSD ERKPDSS++DIQKQDCSH+C+ENENL SVN+ L PD SH   DA DTSCN+ QL +KG T+HSE QKE 
Subjt:  IHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDLQLEVKG-TIHSERQKES

Query:  QHSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADTSRLNNKSSIDDFEDHNFGKESSDTVDN
        QHSAL DHKNVGETTFCEKTD FN+ SITCA+DEF VGHTSK VGELVSPLGTDPEVLHLNETE  Q+E+ DT  LNN  S D F+DHNFGK+S D+V+ 
Subjt:  QHSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADTSRLNNKSSIDDFEDHNFGKESSDTVDN

Query:  IILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFDHSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKILSSDEKDVAPLNGSLIED
        IILDD QL SQT FLD KNVSSTP C+NSMADTAIDS+DVVTF HS IA  EKESSS RNIEGG S+GLPRI+ AVDERP+ILSSDE+DVA LNGSLIED
Subjt:  IILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFDHSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKILSSDEKDVAPLNGSLIED

Query:  DRTFLSALHDVEDPFSSEI-----HQSLVAPPTGVD-DLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLN
        DRTFLSALHDVEDPFSS++     HQSLVAPPTG + +LLPGL+TKS EVE + HDRSLWTP EIETATGNADSTL+EHSLRLRSTYTEAEDLSGTRDLN
Subjt:  DRTFLSALHDVEDPFSSEI-----HQSLVAPPTGVD-DLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLN

Query:  GEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE
        GEPLATSYNRCFLGTVDYIWRSEGLQTVRVL+PIRKQVMQQLT GFPTKKWGSDHIALATELAFV SREE
Subjt:  GEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE

A0A1S3CSF9 carbon catabolite repressor protein 4 homolog 6 isoform X20.0e+0083.22Show/hide
Query:  MRRAATPPPPLHQLSVAV-----ATNTSAAMSSRPPYR-GGRYGRNRGFSSERPYSGGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYAAPQNP
        MRRAATPPPPLHQLS AV     ATNTS AMSSR PYR GGRYGR+RGFSSERPYSGGRGQFV+ DSHFQSV+ESNLGFRQGERGG  NNAGSY AP+NP
Subjt:  MRRAATPPPPLHQLSVAV-----ATNTSAAMSSRPPYR-GGRYGRNRGFSSERPYSGGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYAAPQNP

Query:  RPPSFGGNHQFRQAPPSTQRHQYRGPHPRTHYQQPPSFNQNQGVRMPQQLRPRPPKPLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHH
        RPPSFGGNHQFRQAPPSTQRHQYRGPHP THYQQPPSFNQNQGVRMPQQ RPRPPKPLDFRHWDYAKT PPSTCERFSILSYNILADYLAMDHK KLYHH
Subjt:  RPPSFGGNHQFRQAPPSTQRHQYRGPHPRTHYQQPPSFNQNQGVRMPQQLRPRPPKPLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHH

Query:  IPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRS
        IP YMLDWEWRKN+ILFELGLWSTDIMCFQ                      MRTGIPVDGCAIFWRVSRFKLLHEE IEFN+LGLRDNVAQICVLE RS
Subjt:  IPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRS

Query:  QDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAE
        QDNGDNSVTPP+STSN N+VV+CNIHVLYNPRRGEIKLGQVRVLLEKAH ISKIWNNAP+VLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAE
Subjt:  QDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAE

Query:  IHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDLQLEVKG-TIHSERQKES
        I QPS LY NPR Q ANGSVPLQ RSESSD ERKPDSS++DIQKQDCSH+C+ENENL SVN+ L PD SH   DA DTSCN+ QL +KG T+HSE QKE 
Subjt:  IHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDLQLEVKG-TIHSERQKES

Query:  QHSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADTSRLNNKSSIDDFEDHNFGKESSDTVDN
        QHSAL DHKNVGETTFCEKTD FN+ SITCA+DEF VGHTSK VGELVSPLGTDPEVLHLNETE  Q+E+ DT  LNN  S D F+DHNFGK+S D+V+ 
Subjt:  QHSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADTSRLNNKSSIDDFEDHNFGKESSDTVDN

Query:  IILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFDHSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKILSSDEKDVAPLNGSLIED
        IILDD QL SQT FLD KNVSSTP C+NSMADTAIDS+DVVTF HS IA  EKESSS RNIEGG S+GLPRI+ AVDERP+ILSSDE+DVA LNGSLIED
Subjt:  IILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFDHSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKILSSDEKDVAPLNGSLIED

Query:  DRTFLSALHDVEDPFSSEI-----HQSLVAPPTGVD-DLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLN
        DRTFLSALHDVEDPFSS++     HQSLVAPPTG + +LLPGL+TKS EVE + HDRSLWTP EIETATGNADSTL+EHSLRLRSTYTEAEDLSGTRDLN
Subjt:  DRTFLSALHDVEDPFSSEI-----HQSLVAPPTGVD-DLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLN

Query:  GEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE
        GEPLATSYNRCFLGTVDYIWRSEGLQTVRVL+PIRKQVMQQLT GFPTKKWGSDHIALATELAFV SREE
Subjt:  GEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE

A0A5A7T7G2 Carbon catabolite repressor protein 4-like protein 6 isoform X10.0e+0083.33Show/hide
Query:  MRRAATPPPPLHQLSVAV-----ATNTSAAMSSRPPYR-GGRYGRNRGFSSERPYSGGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYAAPQNP
        MRRAATPPPPLHQLS AV     ATNTS AMSSR PYR GGRYGR+RGFSSERPYSGGRGQFV+ DSHFQSV+ESNLGFRQGERGG  NNAGSY AP+NP
Subjt:  MRRAATPPPPLHQLSVAV-----ATNTSAAMSSRPPYR-GGRYGRNRGFSSERPYSGGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYAAPQNP

Query:  RPPSFGGNHQFRQAPPSTQRHQYRGPHPRTHYQQPPSFNQNQGVRMPQQLRPRPPKPLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHH
        RPPSFGGNHQFRQAPPSTQRHQYRGPHP THYQQPPSFNQNQGVRMPQQ RPRPPKPLDFRHWDYAKT PPSTCERFSILSYNILADYLAMDHK KLYHH
Subjt:  RPPSFGGNHQFRQAPPSTQRHQYRGPHPRTHYQQPPSFNQNQGVRMPQQLRPRPPKPLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHH

Query:  IPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRS
        IP YMLDWEWRKN+ILFELGLWSTDIMCFQ                      MRTGIPVDGCAIFWRVSRFKLLHEE IEFN+LGLRDNVAQICVLEQRS
Subjt:  IPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRS

Query:  QDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAE
        QDNGDNSVTPP+STSN N+VV+CNIHVLYNPRRGEIKLGQVRVLLEKAH ISKIWNNAP+VLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAE
Subjt:  QDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAE

Query:  IHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDLQLEVKG-TIHSERQKES
        I QPS LY NPR Q ANGSVPLQ RSESSD ERKPDSS++DIQKQDCSH+C+ENENL SVN+ L PD SH   DA DTSCN+ QL +KG T+HSE QKE 
Subjt:  IHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDLQLEVKG-TIHSERQKES

Query:  QHSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADTSRLNNKSSIDDFEDHNFGKESSDTVDN
        QHSAL DHKNVGETTFCEKTD FN+ SITCA+DEF VGHTSK VGELVSPLGTDPEVLHLNETE  Q+E+ DT  LNN  S D F+DHNFGK+S D+V+ 
Subjt:  QHSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADTSRLNNKSSIDDFEDHNFGKESSDTVDN

Query:  IILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFDHSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKILSSDEKDVAPLNGSLIED
        IILDD QL SQT FLD KNVSSTP C+NSMADTAIDS+DVVTF HS IA  EKESSS RNIEGG S+GLPRI+ AVDERP+ILSSDE+DVA LNGSLIED
Subjt:  IILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFDHSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKILSSDEKDVAPLNGSLIED

Query:  DRTFLSALHDVEDPFSSEI-----HQSLVAPPTGVD-DLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLN
        DRTFLSALHDVEDPFSS++     HQSLVAPPTG + +LLPGL+TKS EVE + HDRSLWTP EIETATGNADSTL+EHSLRLRSTYTEAEDLSGTRDLN
Subjt:  DRTFLSALHDVEDPFSSEI-----HQSLVAPPTGVD-DLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLN

Query:  GEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE
        GEPLATSYNRCFLGTVDYIWRSEGLQTVRVL+PIRKQVMQQLT GFPTKKWGSDHIALATELAFV SREE
Subjt:  GEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE

A0A6J1BWG6 carbon catabolite repressor protein 4 homolog 6 isoform X10.0e+0076.13Show/hide
Query:  MLKVKESSPSLHILSRHSMRRAATPPPPLHQLSVAVAT----NTSAAMSSRPPYRGGRYGRNRGFSSERPYSGGRGQFVTGDSHFQSVRESNLGFRQGER
        MLK KE+S SL ILSR SMRRAAT PPPL QLS AVAT     TSAAMSSRPPYRGGRY R  GFSSERPYSGG+GQFV+GDSH+QSVRESNLGFRQGE 
Subjt:  MLKVKESSPSLHILSRHSMRRAATPPPPLHQLSVAVAT----NTSAAMSSRPPYRGGRYGRNRGFSSERPYSGGRGQFVTGDSHFQSVRESNLGFRQGER

Query:  GGLANNAGSYAAPQNPRPPSFGGNHQFRQAPPSTQRHQYRGPHPRTHYQQPPSFNQNQGVRMPQQLRPRPPKPLDFRHWDYAKTPPPSTCERFSILSYNI
        G  ANNAGSY APQNPRPP + G HQFRQA P  Q HQYRGPHP TH QQP SFNQNQGVR PQ+ RPRPPKP D+RHWDYAKT  PSTCERF+ILSYNI
Subjt:  GGLANNAGSYAAPQNPRPPSFGGNHQFRQAPPSTQRHQYRGPHPRTHYQQPPSFNQNQGVRMPQQLRPRPPKPLDFRHWDYAKTPPPSTCERFSILSYNI

Query:  LADYLAMDHKQKLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKL
        LADYLAMDHKQKLYHHIP YMLDWEWRK N+LFELGLWSTDIMCFQ                      MRTGIPVDGCAIFWRVSRFKLLHEE IEFNKL
Subjt:  LADYLAMDHKQKLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKL

Query:  GLRDNVAQICVLEQRSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLD
        GLRDNVAQICVLEQRSQDN DNS  PPISTSN NKVVICNIHVLYNP+RGEIKLGQVR+LLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKL+
Subjt:  GLRDNVAQICVLEQRSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLD

Query:  LSGLDRDKVSGQSSAEIHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDLQ
        LSGLDRDKVSGQSSAEIHQPS LY NPR QTA+GSVPLQLRS+S DIERKPDS LSD++ QD  HS MENENLPSVN  LPPDSS  V    + SCNDLQ
Subjt:  LSGLDRDKVSGQSSAEIHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDLQ

Query:  LEVKG-TIHSERQKESQHSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADT-SRLNNKSSID
        L +KG  +HSE QKE Q+ A   HKNVGETTFC   DSF  SSI CA+DEFTV H S+ V ELVSP GTD E LHLN TERRQME+ D+ S L ++SS D
Subjt:  LEVKG-TIHSERQKESQHSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADT-SRLNNKSSID

Query:  DFE-DHNFGKESSDTVDNIILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFDHSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKI
          + D NF K+S + V+N+ILDD     QT  L SKNV  TPAC+N MAD A+DS++VV F H  IAEFEKE SSARNI+GG S+  P I+LAVDER KI
Subjt:  DFE-DHNFGKESSDTVDNIILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFDHSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKI

Query:  LSSDEKDVAPLNGSLIEDDRTFLSALHDV-EDPFSSEI-----HQSLVAPPTGVDDLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLR
        LSSDE+DVA L+GSL EDDRTFLSALHD+ E+PFSSEI     HQSLVAP TGV D LPGLNTKSFEVEN  HDRSLWTPMEIE ATGN D TL+EH L+
Subjt:  LSSDEKDVAPLNGSLIEDDRTFLSALHDV-EDPFSSEI-----HQSLVAPPTGVDDLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLR

Query:  LRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE
        LRSTYTE ED SGTRDLN EPL TSYNRCFLGTVDYIWRSEGLQTV+VLAPI+K VM +LTPGFPTKKWGSDHIALA ELAF R  EE
Subjt:  LRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE

SwissProt top hitse value%identityAlignment
Q0WKY2 Carbon catabolite repressor protein 4 homolog 55.0e-4536.76Show/hide
Query:  RHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVD
        R W ++     +  ++  ++SYN+L    A +H   LY+++PR  L+W  RK+ I  E+  ++  I+C Q                       RTG   D
Subjt:  RHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVD

Query:  GCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPI
        GCAIFW+ + F+LL  + IEF+K G+R+NVAQ+CVLE   +++  + +   + +S+  ++V+ NIHVL+NP+RG+IKLGQVR+ LEKA+ +S+ W N P+
Subjt:  GCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPI

Query:  VLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAEIHQPSVLYHNPRLQTANGSVPLQLRSESSDIE
         + GD N TP+SA+Y+FI+   LD    DR ++SGQ+  E  + S  + N    +A+ S+   L +E S  E
Subjt:  VLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAEIHQPSVLYHNPRLQTANGSVPLQLRSESSDIE

Q0WKY2 Carbon catabolite repressor protein 4 homolog 51.3e-2147.17Show/hide
Query:  WTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALA
        W+  E++ ATG  ++T ++H L+L S Y+       TRD  GEPLAT+Y+  FLGTVDYIW ++ L  VRVL  +   V+++ T G P++ WGSDH+A+A
Subjt:  WTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALA

Query:  TELAFV
         EL FV
Subjt:  TELAFV

Q5RGT6 Protein angel homolog 23.2e-3132.67Show/hide
Query:  FSILSYNILADYLAMDHKQKLYHHIPRYMLDWEWRKNNILFELGLWSTDIMC-------------------------FQMRTGIPVDGCAIFWRVSRFKL
        FS++SYNIL+  L  D+   LY H    +LDW  R  NI+ EL  +S DIMC                         F+ RTG+  DGCA+ ++  RF L
Subjt:  FSILSYNILADYLAMDHKQKLYHHIPRYMLDWEWRKNNILFELGLWSTDIMC-------------------------FQMRTGIPVDGCAIFWRVSRFKL

Query:  LHEESIEFNKLGL----RDNVAQICVLEQRSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNA--PIVLCGDFN
        +    +E+ + G+    RDNV  I +L           + P +S SN   + + N H+LYNPRRG+IKL Q+ +LL +   +S++ +++  P++LCGDFN
Subjt:  LHEESIEFNKLGL----RDNVAQICVLEQRSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNA--PIVLCGDFN

Query:  CTPKSALYNFISEQKLDLSGLDRDKVSGQSSAEIHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPD
          P S LY FI +++LD  G+   KVSGQ              PR Q    +VP+  RS     + + ++   D + +D   +  E+    S+ H L   
Subjt:  CTPKSALYNFISEQKLDLSGLDRDKVSGQSSAEIHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPD

Query:  SSH
        S++
Subjt:  SSH

Q8VCU0 Protein angel homolog 14.1e-3133.46Show/hide
Query:  QQLRPRPPKPLDF-----RHW-DYAKTPPPSTCE-------RFSILSYNILADYLAMDHKQKLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQ----
        +QL+P PP  + +     R W D++  P     E       +F+++SYNILA  L M    +LY H    +L+W +R  N++ E   W  DI+C Q    
Subjt:  QQLRPRPPKPLDF-----RHW-DYAKTPPPSTCE-------RFSILSYNILADYLAMDHKQKLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQ----

Query:  ---------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGL----RDNVAQICVLEQRSQDN-GDNSVTPPISTSNHNKVVICNIH
                              RTG   DGCA+ ++ +RF+LL    +E+ + GL    RDNV  + +L+    +  G  SV P         + + N H
Subjt:  ---------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGL----RDNVAQICVLEQRSQDN-GDNSVTPPISTSNHNKVVICNIH

Query:  VLYNPRRGEIKLGQVRVLLEKAHAISKI--WNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQ
        VLYNPRRG++KL Q+ +LL +   ++++   ++ PI+LCGD N  P S LYNFI + +L  +G+   KVSGQ
Subjt:  VLYNPRRGEIKLGQVRVLLEKAHAISKI--WNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQ

Q8VYU4 Carbon catabolite repressor protein 4 homolog 62.0e-15041.52Show/hide
Query:  MRRAATPPPPLHQLSVAVATNTSA---AMSSRPPYRGGRYGRNRGFS----SERPYS--GGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYAAP
        MRR+         ++ A A+  SA    MS+R PYRG R GR RG      S+RPY+   GR QFVTGDSHFQSV ++N  FR GE          Y   
Subjt:  MRRAATPPPPLHQLSVAVATNTSA---AMSSRPPYRGGRYGRNRGFS----SERPYS--GGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYAAP

Query:  QNP----RPPSFGGNHQFRQAPPST-QRHQYRGPHPRTHYQQ-----PPSFNQNQGVRMP--QQLRPRP-PKPLDFRHWDYAKTPPPSTCERFSILSYNI
        Q P    + P F  N++FR  PPS  Q  Q+R P+     Q      PP F QNQ  R P  Q  R RP  KP D+R W+YAKTPP    E+F +LSYNI
Subjt:  QNP----RPPSFGGNHQFRQAPPST-QRHQYRGPHPRTHYQQ-----PPSFNQNQGVRMP--QQLRPRP-PKPLDFRHWDYAKTPPPSTCERFSILSYNI

Query:  LADYLAMDHKQKLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKL
        LADYLA DH + LY HIPR ML W WRK+ ++FEL LWS DIMC Q                      MRTG  VDGCAIFWR +RFKL+HEESI+FN+L
Subjt:  LADYLAMDHKQKLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKL

Query:  GLRDNVAQICVLEQ-RSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKL
        GLRDNVAQICVLE   +    +N   PP S++  ++VVICNIHVL+NP+RG+ KLGQVR LL+KAHA+SK+W++APIVLCGDFNCTPKS LYNFIS++KL
Subjt:  GLRDNVAQICVLEQ-RSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKL

Query:  DLSGLDRDKVSGQSSAEIHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDL
        DLSGL RDKVSGQ SAE   P    +  R Q+AN S   Q+         +P + +++          MEN +   V                       
Subjt:  DLSGLDRDKVSGQSSAEIHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDL

Query:  QLEVKGTIHSERQKESQHSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADTSRLNNKSSIDD
             GT  SE+  E                            + C  D    GH + +  + V P                           N +S   
Subjt:  QLEVKGTIHSERQKESQHSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADTSRLNNKSSIDD

Query:  FEDHNFGKESSDTVDNIILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFD-HSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKIL
          D  FG E+    D+  L  A+  S     D++   ++ A  +   D ++ S    T      I   +++ SS+ + +  + +   ++D    + P + 
Subjt:  FEDHNFGKESSDTVDNIILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFD-HSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKIL

Query:  SSDEKDVAPLNGSLIEDDRTFLSALHDVEDPFSSEIHQSLVAPPTGVDDLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTYTE
        + DE+       SL ED  TFL+ LHD  +  S +           V ++    ++++   + IT+  S WTPMEI TATG+ + T +EH+L L+STY+E
Subjt:  SSDEKDVAPLNGSLIEDDRTFLSALHDVEDPFSSEIHQSLVAPPTGVDDLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTYTE

Query:  AEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSR
         E  + TRD NGEP+ TSY+RCF+GTVDYIWRSEGLQTVRVLAPI KQ M Q TPGFPT KWGSDHIAL +ELAF  S+
Subjt:  AEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSR

Q9LS39 Carbon catabolite repressor protein 4 homolog 31.6e-4340.39Show/hide
Query:  MPQQLRP-RPPKPLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQ-------------
        +P++  P + P     R W D   TP     ERF+++SYNIL D  +  H++ LY ++    L W +RK  I  EL   + DI+  Q             
Subjt:  MPQQLRP-RPPKPLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQ-------------

Query:  ---------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRV
                  RTG  VDGCA+FW+  RF +L  E+IEF++ G+RDNVAQ+ VLE R              ++   K+++ NIHVLYNP +G++KLGQVR 
Subjt:  ---------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRV

Query:  LLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSS
        L  KAH +SK W + PIVLCGDFN TPKS LYNF++  +L++   D+ ++SGQ +
Subjt:  LLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSS

Q9LS39 Carbon catabolite repressor protein 4 homolog 33.6e-1947.66Show/hide
Query:  SLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA
        S WT  EI  ATG  +S    H L+L S+Y   +  + TRD  GEPLATSY+  FLGTVDY+W S+GL   RVL  +   V+ + T G P ++ GSDH+A
Subjt:  SLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA

Query:  LATELAF
        L +E  F
Subjt:  LATELAF

Arabidopsis top hitse value%identityAlignment
AT1G73875.1 DNAse I-like superfamily protein3.6e-4636.76Show/hide
Query:  RHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVD
        R W ++     +  ++  ++SYN+L    A +H   LY+++PR  L+W  RK+ I  E+  ++  I+C Q                       RTG   D
Subjt:  RHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVD

Query:  GCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPI
        GCAIFW+ + F+LL  + IEF+K G+R+NVAQ+CVLE   +++  + +   + +S+  ++V+ NIHVL+NP+RG+IKLGQVR+ LEKA+ +S+ W N P+
Subjt:  GCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPI

Query:  VLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAEIHQPSVLYHNPRLQTANGSVPLQLRSESSDIE
         + GD N TP+SA+Y+FI+   LD    DR ++SGQ+  E  + S  + N    +A+ S+   L +E S  E
Subjt:  VLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAEIHQPSVLYHNPRLQTANGSVPLQLRSESSDIE

AT1G73875.1 DNAse I-like superfamily protein9.5e-2347.17Show/hide
Query:  WTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALA
        W+  E++ ATG  ++T ++H L+L S Y+       TRD  GEPLAT+Y+  FLGTVDYIW ++ L  VRVL  +   V+++ T G P++ WGSDH+A+A
Subjt:  WTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALA

Query:  TELAFV
         EL FV
Subjt:  TELAFV

AT3G18500.1 DNAse I-like superfamily protein1.7e-4844.21Show/hide
Query:  MPQQLRP-RPPKPLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQMRTGIPVDGCAIF
        +P++  P + P     R W D   TP     ERF+++SYNIL D  +  H++ LY ++    L W +RK  I  EL   + DI+  Q RTG  VDGCA+F
Subjt:  MPQQLRP-RPPKPLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQMRTGIPVDGCAIF

Query:  WRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGD
        W+  RF +L  E+IEF++ G+RDNVAQ+ VLE R              ++   K+++ NIHVLYNP +G++KLGQVR L  KAH +SK W + PIVLCGD
Subjt:  WRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGD

Query:  FNCTPKSALYNFISEQKLDLSGLDRDKVSGQSS
        FN TPKS LYNF++  +L++   D+ ++SGQ +
Subjt:  FNCTPKSALYNFISEQKLDLSGLDRDKVSGQSS

AT3G18500.1 DNAse I-like superfamily protein2.6e-2047.66Show/hide
Query:  SLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA
        S WT  EI  ATG  +S    H L+L S+Y   +  + TRD  GEPLATSY+  FLGTVDY+W S+GL   RVL  +   V+ + T G P ++ GSDH+A
Subjt:  SLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA

Query:  LATELAF
        L +E  F
Subjt:  LATELAF

AT3G18500.2 DNAse I-like superfamily protein1.1e-4440.39Show/hide
Query:  MPQQLRP-RPPKPLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQ-------------
        +P++  P + P     R W D   TP     ERF+++SYNIL D  +  H++ LY ++    L W +RK  I  EL   + DI+  Q             
Subjt:  MPQQLRP-RPPKPLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQ-------------

Query:  ---------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRV
                  RTG  VDGCA+FW+  RF +L  E+IEF++ G+RDNVAQ+ VLE R              ++   K+++ NIHVLYNP +G++KLGQVR 
Subjt:  ---------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRV

Query:  LLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSS
        L  KAH +SK W + PIVLCGDFN TPKS LYNF++  +L++   D+ ++SGQ +
Subjt:  LLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSS

AT3G18500.2 DNAse I-like superfamily protein2.6e-2047.66Show/hide
Query:  SLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA
        S WT  EI  ATG  +S    H L+L S+Y   +  + TRD  GEPLATSY+  FLGTVDY+W S+GL   RVL  +   V+ + T G P ++ GSDH+A
Subjt:  SLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA

Query:  LATELAF
        L +E  F
Subjt:  LATELAF

AT3G18500.3 DNAse I-like superfamily protein1.1e-4440.39Show/hide
Query:  MPQQLRP-RPPKPLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQ-------------
        +P++  P + P     R W D   TP     ERF+++SYNIL D  +  H++ LY ++    L W +RK  I  EL   + DI+  Q             
Subjt:  MPQQLRP-RPPKPLDFRHW-DYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQ-------------

Query:  ---------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRV
                  RTG  VDGCA+FW+  RF +L  E+IEF++ G+RDNVAQ+ VLE R              ++   K+++ NIHVLYNP +G++KLGQVR 
Subjt:  ---------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRV

Query:  LLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSS
        L  KAH +SK W + PIVLCGDFN TPKS LYNF++  +L++   D+ ++SGQ +
Subjt:  LLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSS

AT3G18500.3 DNAse I-like superfamily protein2.6e-2047.66Show/hide
Query:  SLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA
        S WT  EI  ATG  +S    H L+L S+Y   +  + TRD  GEPLATSY+  FLGTVDY+W S+GL   RVL  +   V+ + T G P ++ GSDH+A
Subjt:  SLWTPMEIETATGNADSTLIEHSLRLRSTYTEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIA

Query:  LATELAF
        L +E  F
Subjt:  LATELAF

AT5G11350.1 DNAse I-like superfamily protein1.4e-15141.52Show/hide
Query:  MRRAATPPPPLHQLSVAVATNTSA---AMSSRPPYRGGRYGRNRGFS----SERPYS--GGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYAAP
        MRR+         ++ A A+  SA    MS+R PYRG R GR RG      S+RPY+   GR QFVTGDSHFQSV ++N  FR GE          Y   
Subjt:  MRRAATPPPPLHQLSVAVATNTSA---AMSSRPPYRGGRYGRNRGFS----SERPYS--GGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYAAP

Query:  QNP----RPPSFGGNHQFRQAPPST-QRHQYRGPHPRTHYQQ-----PPSFNQNQGVRMP--QQLRPRP-PKPLDFRHWDYAKTPPPSTCERFSILSYNI
        Q P    + P F  N++FR  PPS  Q  Q+R P+     Q      PP F QNQ  R P  Q  R RP  KP D+R W+YAKTPP    E+F +LSYNI
Subjt:  QNP----RPPSFGGNHQFRQAPPST-QRHQYRGPHPRTHYQQ-----PPSFNQNQGVRMP--QQLRPRP-PKPLDFRHWDYAKTPPPSTCERFSILSYNI

Query:  LADYLAMDHKQKLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKL
        LADYLA DH + LY HIPR ML W WRK+ ++FEL LWS DIMC Q                      MRTG  VDGCAIFWR +RFKL+HEESI+FN+L
Subjt:  LADYLAMDHKQKLYHHIPRYMLDWEWRKNNILFELGLWSTDIMCFQ----------------------MRTGIPVDGCAIFWRVSRFKLLHEESIEFNKL

Query:  GLRDNVAQICVLEQ-RSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKL
        GLRDNVAQICVLE   +    +N   PP S++  ++VVICNIHVL+NP+RG+ KLGQVR LL+KAHA+SK+W++APIVLCGDFNCTPKS LYNFIS++KL
Subjt:  GLRDNVAQICVLEQ-RSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQVRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKL

Query:  DLSGLDRDKVSGQSSAEIHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDL
        DLSGL RDKVSGQ SAE   P    +  R Q+AN S   Q+         +P + +++          MEN +   V                       
Subjt:  DLSGLDRDKVSGQSSAEIHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHSCMENENLPSVNHFLPPDSSHIVFDAPDTSCNDL

Query:  QLEVKGTIHSERQKESQHSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADTSRLNNKSSIDD
             GT  SE+  E                            + C  D    GH + +  + V P                           N +S   
Subjt:  QLEVKGTIHSERQKESQHSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLNETERRQMEEADTSRLNNKSSIDD

Query:  FEDHNFGKESSDTVDNIILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFD-HSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKIL
          D  FG E+    D+  L  A+  S     D++   ++ A  +   D ++ S    T      I   +++ SS+ + +  + +   ++D    + P + 
Subjt:  FEDHNFGKESSDTVDNIILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFD-HSTIAEFEKESSSARNIEGGSSMGLPRIDLAVDERPKIL

Query:  SSDEKDVAPLNGSLIEDDRTFLSALHDVEDPFSSEIHQSLVAPPTGVDDLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTYTE
        + DE+       SL ED  TFL+ LHD  +  S +           V ++    ++++   + IT+  S WTPMEI TATG+ + T +EH+L L+STY+E
Subjt:  SSDEKDVAPLNGSLIEDDRTFLSALHDVEDPFSSEIHQSLVAPPTGVDDLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTYTE

Query:  AEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSR
         E  + TRD NGEP+ TSY+RCF+GTVDYIWRSEGLQTVRVLAPI KQ M Q TPGFPT KWGSDHIAL +ELAF  S+
Subjt:  AEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTAAAGTGAAAGAATCGTCACCGAGTCTTCACATTCTCTCCCGCCATTCAATGAGGCGCGCTGCTACTCCTCCTCCTCCGCTCCATCAACTGTCCGTCGCCGTCGC
CACCAACACATCCGCTGCCATGTCTTCTCGACCTCCATACCGAGGTGGCCGGTACGGACGAAACCGAGGCTTCTCGTCGGAACGGCCATACTCTGGCGGTAGAGGTCAAT
TCGTCACCGGGGATTCTCATTTTCAGTCTGTTCGGGAGTCCAACCTAGGGTTCCGGCAAGGAGAGAGGGGAGGCTTGGCGAACAATGCGGGGTCATATGCTGCACCTCAA
AATCCTAGACCTCCGTCTTTTGGTGGAAATCATCAATTCCGACAGGCTCCGCCTTCCACTCAGAGGCACCAGTATCGGGGACCTCATCCTCGTACTCACTATCAGCAGCC
ACCGTCGTTTAATCAAAATCAAGGTGTTCGTATGCCGCAGCAACTTCGACCCAGGCCTCCTAAGCCGTTAGATTTTCGTCATTGGGATTATGCAAAAACCCCACCTCCAT
CTACTTGCGAGCGGTTTTCAATTCTTTCATACAACATCTTAGCTGATTACCTTGCCATGGATCACAAGCAAAAGCTCTACCATCATATTCCCCGTTACATGTTGGATTGG
GAGTGGAGGAAAAATAATATTTTATTTGAGCTTGGATTATGGTCTACTGACATAATGTGCTTTCAGATGCGTACTGGAATTCCAGTTGACGGCTGTGCGATCTTTTGGCG
AGTGTCAAGGTTCAAGCTTCTGCATGAGGAGTCTATTGAGTTCAATAAGCTTGGACTACGGGACAATGTTGCTCAGATATGTGTTCTTGAGCAGAGGAGTCAGGATAATG
GTGACAATTCAGTTACTCCACCGATTAGCACATCAAATCACAATAAAGTTGTAATATGTAATATACATGTCCTCTATAACCCTAGAAGAGGAGAAATCAAGCTTGGCCAG
GTCAGGGTTCTCTTGGAGAAGGCTCACGCTATTTCAAAAATCTGGAACAATGCTCCTATCGTTCTCTGTGGGGATTTTAACTGTACACCAAAGAGTGCATTGTATAACTT
TATTTCAGAACAGAAGCTAGATTTGTCTGGATTGGATAGAGACAAGGTATCGGGACAATCTTCTGCTGAGATTCATCAACCTTCAGTACTCTATCATAATCCTCGGCTTC
AGACTGCCAATGGTTCAGTTCCCCTCCAGCTGAGGTCAGAATCTAGTGATATTGAAAGAAAGCCAGATAGTTCTCTGTCTGACATACAGAAGCAAGATTGTTCACATAGC
TGCATGGAAAATGAGAATCTTCCATCAGTGAACCACTTTTTGCCCCCTGATAGTTCTCACATTGTCTTTGATGCGCCTGATACTTCTTGTAACGATCTCCAACTTGAAGT
GAAGGGTACTATACATTCTGAAAGGCAAAAGGAAAGCCAGCATAGTGCTTTGATTGACCACAAAAATGTAGGGGAAACAACTTTCTGTGAGAAGACAGATAGCTTCAACA
AAAGTTCAATCACGTGTGCTAAAGATGAGTTTACTGTTGGTCATACCAGTAAAAATGTCGGTGAACTAGTCTCTCCTTTAGGAACTGACCCTGAAGTACTTCATCTGAAT
GAAACTGAGAGACGACAGATGGAAGAAGCTGATACCTCTCGTTTAAACAATAAATCTTCTATAGATGATTTTGAGGATCACAATTTTGGCAAGGAAAGCAGTGACACTGT
CGATAATATTATCTTAGATGATGCACAGCTTTATTCTCAGACAGTTTTTTTGGATTCAAAGAATGTTTCTTCTACACCTGCTTGCAGAAACTCCATGGCCGACACTGCTA
TAGACTCCGCTGACGTTGTAACTTTTGACCACTCAACAATTGCTGAATTTGAGAAGGAAAGCTCCTCTGCTAGGAATATTGAAGGTGGCTCATCAATGGGTTTGCCCCGG
ATTGACTTGGCGGTGGATGAGAGACCAAAGATTTTATCTTCAGATGAGAAGGATGTAGCCCCGTTAAATGGAAGCTTAATTGAGGATGATCGTACATTTCTCTCAGCTCT
GCATGATGTTGAAGATCCCTTTTCATCCGAGATTCATCAGAGCTTGGTTGCACCACCCACTGGAGTTGATGATTTGTTGCCAGGATTGAATACCAAGTCTTTTGAAGTCG
AAAATATTACTCATGATCGCTCATTATGGACTCCAATGGAAATAGAAACAGCTACTGGCAATGCTGATAGTACTCTAATTGAACACTCTCTAAGGCTTAGAAGCACGTAC
ACAGAAGCTGAGGACCTTTCCGGGACCAGAGACTTGAATGGGGAACCCCTGGCGACTAGTTACAATAGGTGTTTTCTGGGCACTGTTGACTACATATGGCGTTCAGAAGG
TCTTCAGACGGTTAGGGTACTTGCTCCTATAAGAAAACAAGTCATGCAGCAGTTGACTCCTGGATTTCCTACAAAGAAATGGGGCAGCGATCACATTGCCTTAGCTACTG
AATTGGCATTTGTAAGGAGTCGTGAAGAGTGA
mRNA sequenceShow/hide mRNA sequence
CGTTGATCGTAATGCTTAAAGTGAAAGAATCGTCACCGAGTCTTCACATTCTCTCCCGCCATTCAATGAGGCGCGCTGCTACTCCTCCTCCTCCGCTCCATCAACTGTCC
GTCGCCGTCGCCACCAACACATCCGCTGCCATGTCTTCTCGACCTCCATACCGAGGTGGCCGGTACGGACGAAACCGAGGCTTCTCGTCGGAACGGCCATACTCTGGCGG
TAGAGGTCAATTCGTCACCGGGGATTCTCATTTTCAGTCTGTTCGGGAGTCCAACCTAGGGTTCCGGCAAGGAGAGAGGGGAGGCTTGGCGAACAATGCGGGGTCATATG
CTGCACCTCAAAATCCTAGACCTCCGTCTTTTGGTGGAAATCATCAATTCCGACAGGCTCCGCCTTCCACTCAGAGGCACCAGTATCGGGGACCTCATCCTCGTACTCAC
TATCAGCAGCCACCGTCGTTTAATCAAAATCAAGGTGTTCGTATGCCGCAGCAACTTCGACCCAGGCCTCCTAAGCCGTTAGATTTTCGTCATTGGGATTATGCAAAAAC
CCCACCTCCATCTACTTGCGAGCGGTTTTCAATTCTTTCATACAACATCTTAGCTGATTACCTTGCCATGGATCACAAGCAAAAGCTCTACCATCATATTCCCCGTTACA
TGTTGGATTGGGAGTGGAGGAAAAATAATATTTTATTTGAGCTTGGATTATGGTCTACTGACATAATGTGCTTTCAGATGCGTACTGGAATTCCAGTTGACGGCTGTGCG
ATCTTTTGGCGAGTGTCAAGGTTCAAGCTTCTGCATGAGGAGTCTATTGAGTTCAATAAGCTTGGACTACGGGACAATGTTGCTCAGATATGTGTTCTTGAGCAGAGGAG
TCAGGATAATGGTGACAATTCAGTTACTCCACCGATTAGCACATCAAATCACAATAAAGTTGTAATATGTAATATACATGTCCTCTATAACCCTAGAAGAGGAGAAATCA
AGCTTGGCCAGGTCAGGGTTCTCTTGGAGAAGGCTCACGCTATTTCAAAAATCTGGAACAATGCTCCTATCGTTCTCTGTGGGGATTTTAACTGTACACCAAAGAGTGCA
TTGTATAACTTTATTTCAGAACAGAAGCTAGATTTGTCTGGATTGGATAGAGACAAGGTATCGGGACAATCTTCTGCTGAGATTCATCAACCTTCAGTACTCTATCATAA
TCCTCGGCTTCAGACTGCCAATGGTTCAGTTCCCCTCCAGCTGAGGTCAGAATCTAGTGATATTGAAAGAAAGCCAGATAGTTCTCTGTCTGACATACAGAAGCAAGATT
GTTCACATAGCTGCATGGAAAATGAGAATCTTCCATCAGTGAACCACTTTTTGCCCCCTGATAGTTCTCACATTGTCTTTGATGCGCCTGATACTTCTTGTAACGATCTC
CAACTTGAAGTGAAGGGTACTATACATTCTGAAAGGCAAAAGGAAAGCCAGCATAGTGCTTTGATTGACCACAAAAATGTAGGGGAAACAACTTTCTGTGAGAAGACAGA
TAGCTTCAACAAAAGTTCAATCACGTGTGCTAAAGATGAGTTTACTGTTGGTCATACCAGTAAAAATGTCGGTGAACTAGTCTCTCCTTTAGGAACTGACCCTGAAGTAC
TTCATCTGAATGAAACTGAGAGACGACAGATGGAAGAAGCTGATACCTCTCGTTTAAACAATAAATCTTCTATAGATGATTTTGAGGATCACAATTTTGGCAAGGAAAGC
AGTGACACTGTCGATAATATTATCTTAGATGATGCACAGCTTTATTCTCAGACAGTTTTTTTGGATTCAAAGAATGTTTCTTCTACACCTGCTTGCAGAAACTCCATGGC
CGACACTGCTATAGACTCCGCTGACGTTGTAACTTTTGACCACTCAACAATTGCTGAATTTGAGAAGGAAAGCTCCTCTGCTAGGAATATTGAAGGTGGCTCATCAATGG
GTTTGCCCCGGATTGACTTGGCGGTGGATGAGAGACCAAAGATTTTATCTTCAGATGAGAAGGATGTAGCCCCGTTAAATGGAAGCTTAATTGAGGATGATCGTACATTT
CTCTCAGCTCTGCATGATGTTGAAGATCCCTTTTCATCCGAGATTCATCAGAGCTTGGTTGCACCACCCACTGGAGTTGATGATTTGTTGCCAGGATTGAATACCAAGTC
TTTTGAAGTCGAAAATATTACTCATGATCGCTCATTATGGACTCCAATGGAAATAGAAACAGCTACTGGCAATGCTGATAGTACTCTAATTGAACACTCTCTAAGGCTTA
GAAGCACGTACACAGAAGCTGAGGACCTTTCCGGGACCAGAGACTTGAATGGGGAACCCCTGGCGACTAGTTACAATAGGTGTTTTCTGGGCACTGTTGACTACATATGG
CGTTCAGAAGGTCTTCAGACGGTTAGGGTACTTGCTCCTATAAGAAAACAAGTCATGCAGCAGTTGACTCCTGGATTTCCTACAAAGAAATGGGGCAGCGATCACATTGC
CTTAGCTACTGAATTGGCATTTGTAAGGAGTCGTGAAGAGTGATACTATGAATCTTAGGAGCACATGATAATAAAGTAAATAATTAATTTCCCTTGTAATTGGAGGTTTT
CTATTTGAGGAATCCAACTGGTTCCGACGGCACCAGCCAGAAGGTGCAATGTGTTGAGTTAGAGGAGGTAACAGAGCCTAAG
Protein sequenceShow/hide protein sequence
MLKVKESSPSLHILSRHSMRRAATPPPPLHQLSVAVATNTSAAMSSRPPYRGGRYGRNRGFSSERPYSGGRGQFVTGDSHFQSVRESNLGFRQGERGGLANNAGSYAAPQ
NPRPPSFGGNHQFRQAPPSTQRHQYRGPHPRTHYQQPPSFNQNQGVRMPQQLRPRPPKPLDFRHWDYAKTPPPSTCERFSILSYNILADYLAMDHKQKLYHHIPRYMLDW
EWRKNNILFELGLWSTDIMCFQMRTGIPVDGCAIFWRVSRFKLLHEESIEFNKLGLRDNVAQICVLEQRSQDNGDNSVTPPISTSNHNKVVICNIHVLYNPRRGEIKLGQ
VRVLLEKAHAISKIWNNAPIVLCGDFNCTPKSALYNFISEQKLDLSGLDRDKVSGQSSAEIHQPSVLYHNPRLQTANGSVPLQLRSESSDIERKPDSSLSDIQKQDCSHS
CMENENLPSVNHFLPPDSSHIVFDAPDTSCNDLQLEVKGTIHSERQKESQHSALIDHKNVGETTFCEKTDSFNKSSITCAKDEFTVGHTSKNVGELVSPLGTDPEVLHLN
ETERRQMEEADTSRLNNKSSIDDFEDHNFGKESSDTVDNIILDDAQLYSQTVFLDSKNVSSTPACRNSMADTAIDSADVVTFDHSTIAEFEKESSSARNIEGGSSMGLPR
IDLAVDERPKILSSDEKDVAPLNGSLIEDDRTFLSALHDVEDPFSSEIHQSLVAPPTGVDDLLPGLNTKSFEVENITHDRSLWTPMEIETATGNADSTLIEHSLRLRSTY
TEAEDLSGTRDLNGEPLATSYNRCFLGTVDYIWRSEGLQTVRVLAPIRKQVMQQLTPGFPTKKWGSDHIALATELAFVRSREE