CuGenDBv2

HG10002855 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10002855
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: AT-rich interactive domain-containing protein 3-like
Genome location: Chr11:14760140..14778483
RNA-Seq Expression: HG10002855
Synteny: HG10002855
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0032991 - protein-containing complex (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0004144 - diacylglycerol O-acyltransferase activity (molecular function)
InterPro domains:
IPR001606 - ARID DNA-binding domain
IPR002068 - Alpha crystallin/Hsp20 domain
IPR007130 - Diacylglycerol acyltransferase
IPR008978 - HSP20-like chaperone
IPR036431 - ARID DNA-binding domain superfamily
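The genome location field uses a chromosome:start..end notation. As a minimal sketch of how such a string could be split into its parts and turned into a span length, the following Python snippet may help. The names `GenomeInterval` and `parse_location` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class GenomeInterval(NamedTuple):
    """Hypothetical container for a 'Chr:start..end' location."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Chromosome name, a colon, then two integers separated by '..'
_LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeInterval:
    """Parse a location string such as 'Chr11:14760140..14778483'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return GenomeInterval(match["chrom"], start, end)


loc = parse_location("Chr11:14760140..14778483")
print(loc.chrom, loc.length)  # prints: Chr11 18344
```

Under the inclusive-coordinate assumption, the gene locus spans 18,344 bp on Chr11.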


Homology
GenBank top hits (e value, % identity, alignment)
KAE8637399.1 hypothetical protein CSA_023376 [Cucumis sativus]  e value: 0.0e+00  identity: 72.45%
Query:  ISLFYLSLPKALLVFALLLILVLIPVDDKSKYGRVLARYICQNACSYFPVTLHVEDIYAFDPNRAYVFGYEPHSVLPIGVVALADLTGFMPLKKLKVLAS
        + LF +SL   L VFALLLILVLIPVD KSKYGRVLARYICQNACSYFPVTLHVEDI+AFD NRAYVFGYEPHSVLPIGVVALADLTGFMPLKKLKVLAS
Subjt:  ISLFYLSLPKALLVFALLLILVLIPVDDKSKYGRVLARYICQNACSYFPVTLHVEDIYAFDPNRAYVFGYEPHSVLPIGVVALADLTGFMPLKKLKVLAS

Query:  SAVFYTPFLRHIWTWMGLTPATKKNFISLLAAGCSCIIVPGGVQETFHMEHNSETVFLKTRRGFVRIAMEMGTPLVPVFCFGQ---SSVYQWWKPGGNFF
        SAVFYTPFLRHIWTWMGLTPAT+KNFISLLAAG SCIIVPGGVQETFHMEHNSETVFLKTRRGFVRIAMEMGTPLVPVFCFGQ   +S  Q    GGN  
Subjt:  SAVFYTPFLRHIWTWMGLTPATKKNFISLLAAGCSCIIVPGGVQETFHMEHNSETVFLKTRRGFVRIAMEMGTPLVPVFCFGQ---SSVYQWWKPGGNFF

Query:  LQFSRAIKFTPIVFWGVFGSPLPYRRRMHVVVGKPIEVKKNPNPTSDEVLDLHGQFVEALENMFERAHCKRPSQPNKRPSNHRCSTTTCGGPNPDGDCSP
        +  + +             SPLPYRR+MHVVVG+PIEVKKNPNPTSDEVLDLHG+FVEALE+MFER                                  
Subjt:  LQFSRAIKFTPIVFWGVFGSPLPYRRRMHVVVGKPIEVKKNPNPTSDEVLDLHGQFVEALENMFERAHCKRPSQPNKRPSNHRCSTTTCGGPNPDGDCSP

Query:  SRRTAPAREKVEIDNAEFKFQNHVAERTIGLEKKRNVAFFPSLLKLSSENLTSNSCRILPLELEMCDAKEKQESAQDVSVALSEGNQVYA-EDDLHDSLS
                                                                 ILPLEL+MCDAKEKQES QDVS ALSEGNQV A ED+LHDSLS
Subjt:  SRRTAPAREKVEIDNAEFKFQNHVAERTIGLEKKRNVAFFPSLLKLSSENLTSNSCRILPLELEMCDAKEKQESAQDVSVALSEGNQVYA-EDDLHDSLS

Query:  VPAEATPNSCEKEIYSPETHVQEVVPPEDNLDKPTIPQQSNQNSAHSLPFDHDQELDKPDAEPASDVKIVPNELPPRDLDAAISNSPLETVQSSMDTKSE
        VPAEATPNSCEKEI SP T VQE+VP EDNL+KPTI QQSNQ S HSLP D+DQELDK  AEPASDVK  P ELP RDLDAA+S+SPLETVQSS+D KSE
Subjt:  VPAEATPNSCEKEIYSPETHVQEVVPPEDNLDKPTIPQQSNQNSAHSLPFDHDQELDKPDAEPASDVKIVPNELPPRDLDAAISNSPLETVQSSMDTKSE

Query:  AFDMPETKTGCLDDASTASHDEPVTPHPVSSCVKAETENAIESKVNEDSVTTPHNGDSNMNHSFILDENHIAEGSESGTEEEQSAFMKELENFFRERSLE
        A  MPE KT  LDDAST SHDEPVTPHPVSSCVKAETENAIE KVNED VTTPHNGDSNMNHSF+LDENHIAEGSESGTEEEQSAFMKELENFFRERSLE
Subjt:  AFDMPETKTGCLDDASTASHDEPVTPHPVSSCVKAETENAIESKVNEDSVTTPHNGDSNMNHSFILDENHIAEGSESGTEEEQSAFMKELENFFRERSLE

Query:  FKPPKFYGEGLNCLKLWRAVTRLGGYD---------------------------------KALLDYERHKTNGGELSVPIASNSEPMSIENQGSGSGRAR
        FKPPKFYGEGLNCLKLWRAVTRLGGYD                                 KALLDYERHKTNGGELSVPIASNSEPMSIENQGSGSGRAR
Subjt:  FKPPKFYGEGLNCLKLWRAVTRLGGYD---------------------------------KALLDYERHKTNGGELSVPIASNSEPMSIENQGSGSGRAR

Query:  RDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGIGLLKRKKPAYMEHAMKSTRTKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNIL
        RDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLK IGLLKRKKP+YMEH+MKS RTKSPKP                              
Subjt:  RDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGIGLLKRKKPAYMEHAMKSTRTKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNIL

Query:  LISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISGEPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQ
               +LDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISG+PEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQ
Subjt:  LISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISGEPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQ

Query:  LFVRVPFEQLE
        LFVRVPFEQLE
Subjt:  LFVRVPFEQLE

KAG6577828.1 AT-rich interactive domain-containing protein 3, partial [Cucurbita argyrosperma subsp. sororia]  e value: 0.0e+00  identity: 71.7%
Query:  DEISGPSVTVFKGGGGPATKWRMMHIIVALGIWLGGIHLNFTLGLISLFYLSLPKALLVFALLLILVLIPVDDKSKYGRVLARYICQNACSYFPVTLHVE
        DEISG +VTVFK GGG  TKWR MHI+VALGIWLGGIHLN  L LISLFYLSLPKALLVFALLLILVLIPVDDKSKYGR+LARYICQNA SYFPVTLHVE
Subjt:  DEISGPSVTVFKGGGGPATKWRMMHIIVALGIWLGGIHLNFTLGLISLFYLSLPKALLVFALLLILVLIPVDDKSKYGRVLARYICQNACSYFPVTLHVE

Query:  DIYAFDPNRAYVFGYEPHSVLPIGVVALADLTGFMPLKKLKVLASSAVFYTPFLRHIWTWMGLTPATKKNFISLLAAGCSCIIVPGGVQETFHMEHNSET
        DI+AFDPNRAYVFGYEPHSVLPIGVVALADLTG MPL KLKVLASSAVFYTPFLRHIWTWMGL+PAT+KNF SLLAAG SCIIVPGGVQETFHME NSET
Subjt:  DIYAFDPNRAYVFGYEPHSVLPIGVVALADLTGFMPLKKLKVLASSAVFYTPFLRHIWTWMGLTPATKKNFISLLAAGCSCIIVPGGVQETFHMEHNSET

Query:  VFLKTRRGFVRIAMEMGTPLVPVFCFGQSSVYQWWKPGGNFFLQFSRAIKFTPIVFWGVFGSPLPYRRRMHVVVGKPIEVKKNPNPTSDEVLDLHGQFVE
        VFLKTRRGFVRIA+EMGTPLVPVFCFGQSSVYQWWKPGG  FLQFSRAIKFTPI+FWGV GSPLPYRRRMHVVVGKPIEVKKNPNP+SDEVLDLH QFVE
Subjt:  VFLKTRRGFVRIAMEMGTPLVPVFCFGQSSVYQWWKPGGNFFLQFSRAIKFTPIVFWGVFGSPLPYRRRMHVVVGKPIEVKKNPNPTSDEVLDLHGQFVE

Query:  ALENMFERAHCKRPSQPNKRPSNHRCSTTTCGGPNPDGDCSPSRRTAPAREKVEIDNAEFKFQNHVAERTIGLEKKRNVAFFPSLLKLSSENLTSNSCRI
        ALE++FER   +                    G  P G         P  ++   +      Q H         +  N  F  S L         +SCRI
Subjt:  ALENMFERAHCKRPSQPNKRPSNHRCSTTTCGGPNPDGDCSPSRRTAPAREKVEIDNAEFKFQNHVAERTIGLEKKRNVAFFPSLLKLSSENLTSNSCRI

Query:  LPLELEMCDAKEKQESAQDVSVALSEGNQVYAEDDLHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPEDNLDKPTIPQQSNQNSAHSLPFDHDQELDKP
        L LE EMCDAKEKQESAQDV VALSEGNQV  +D+LHDSL V AEATPNSCEKEI SPET                                        
Subjt:  LPLELEMCDAKEKQESAQDVSVALSEGNQVYAEDDLHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPEDNLDKPTIPQQSNQNSAHSLPFDHDQELDKP

Query:  DAEPASDVKIVPNELPPRD--LDAAISNSPLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHPVSSCVKAETENAIESKVNEDSVTTPHNGD
               VK  P+ELPPR+  LDAA SNS +ETVQ SMD KSEAF+M ETKTG LD A  A+HDEPVTP PV SC+K ETENAI  K+NEDSVTTPHNG 
Subjt:  DAEPASDVKIVPNELPPRD--LDAAISNSPLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHPVSSCVKAETENAIESKVNEDSVTTPHNGD

Query:  SNMNHSFILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD------------------------------
        +NM+HSFILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD                              
Subjt:  SNMNHSFILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD------------------------------

Query:  ---KALLDYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGIGLLKRKKPAYME
           KALLDYERHKT+GGELSVPIASNSEPMSIENQG GSGRARRDAAARAMQGWHS RLLGNGEVSDPIIKDKNSLSMQKREKQLK IGLLKRKK +YME
Subjt:  ---KALLDYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGIGLLKRKKPAYME

Query:  HAMKSTRTKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGR
        HAMKS RTKSPKP                                     +LDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGR
Subjt:  HAMKSTRTKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGR

Query:  LVISGEPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE
        LVISGEPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE
Subjt:  LVISGEPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE

KAG6596292.1 Kinesin-like protein KIN-14I, partial [Cucurbita argyrosperma subsp. sororia]  e value: 8.0e-235  identity: 69.71%
Query:  VEALENMFERAHCKRPSQPNKRPSNHRCSTTTCGGPNPDGDCSPSRRTAPAREKVEIDNAEFKFQNHVAERTIGLEKKRNVAFFPSLLKLSSENLTSNSC
        V+ LE + +  HCK   + + R S  +            GDCSPSR  A  R+K + DN EF+F+NH        ++  N  F+ S L         +SC
Subjt:  VEALENMFERAHCKRPSQPNKRPSNHRCSTTTCGGPNPDGDCSPSRRTAPAREKVEIDNAEFKFQNHVAERTIGLEKKRNVAFFPSLLKLSSENLTSNSC

Query:  RILPLELEMCDAKEKQESAQDVSVALSEGNQVYAEDDLHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPEDNLDKPTIPQQSNQNSAHSLPFDHDQELD
        RILPLE EMCDA E+QESAQD+ VAL EGNQV ++D+LH SL +PAEATPNSCE +I SPETHVQEVVPPEDNLDK TIPQQSNQ+SA+SLP +HDQ+LD
Subjt:  RILPLELEMCDAKEKQESAQDVSVALSEGNQVYAEDDLHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPEDNLDKPTIPQQSNQNSAHSLPFDHDQELD

Query:  KPDAEPASDVKIVPNELPPRD--LDAAISNSPLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHPVSSCVKAETENAIESKVNEDSVTTPHN
        KPDAEPASDVK  P+ELPPR+  LDAA SN+ LETV  S+D  +EAF MPETKTG LDDASTASHDEPVTPHPVSSC+KAETENAIE KVNED+V TPHN
Subjt:  KPDAEPASDVKIVPNELPPRD--LDAAISNSPLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHPVSSCVKAETENAIESKVNEDSVTTPHN

Query:  GDSNMNHSFILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD----------------------------
        G SNMNHSFILDEN IAEGSESGTEEEQSAFMKELENFFRER +EFKPPKFYGEGLNCLKLWRAVTRLGGYD                            
Subjt:  GDSNMNHSFILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD----------------------------

Query:  -----KALLDYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGI-GLLKRKKPA
             KA+LDYERHKT+GGEL VPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNS SMQK+EKQLKGI G+LKRKKP+
Subjt:  -----KALLDYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGI-GLLKRKKPA

Query:  YMEHAMKSTRTKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDP
        YMEH +KS RTKS KP                                     +LDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDP
Subjt:  YMEHAMKSTRTKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDP

Query:  AGRLVISGEPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE
        AGRLVISGEPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE
Subjt:  AGRLVISGEPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE

XP_038904744.1 AT-rich interactive domain-containing protein 6-like isoform X1 [Benincasa hispida]  e value: 5.5e-236  identity: 80.43%
Query:  NSCRILPLELEMCDAKEKQESAQDVSVALSEGNQVYAEDDLHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPEDNLDKPTIPQQSNQNSAHSLPFDHDQ
        +SCRILPLELEMCDA EK+ESAQDVSVALSEG+QV A+D+LH+SLSVPAEATPNSCEK   SPETHVQEV       DKPTIPQQSNQNSA SLP DHDQ
Subjt:  NSCRILPLELEMCDAKEKQESAQDVSVALSEGNQVYAEDDLHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPEDNLDKPTIPQQSNQNSAHSLPFDHDQ

Query:  ELDKPDAEPASDVKIVPNELPPRDLDAAISNSPLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHPVSSCVKAETENAIESKVNEDSVTTPH
        ELDKPDAEPASDVK  P+E+PPR+LDA ISNSPLETVQSSMD KSEA DMPETKTG LDDASTAS   PVTPHPVSSCVKAETENAIE KVNEDSVTTPH
Subjt:  ELDKPDAEPASDVKIVPNELPPRDLDAAISNSPLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHPVSSCVKAETENAIESKVNEDSVTTPH

Query:  NGDSNMNHSFILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD---------------------------
        NGDSNMNHSFI DENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD                           
Subjt:  NGDSNMNHSFILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD---------------------------

Query:  ------KALLDYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGIGLLKRKKPA
              KALLDYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLK IG  KRKKPA
Subjt:  ------KALLDYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGIGLLKRKKPA

Query:  YMEHAMKSTRTKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDP
        YMEHAMKSTRTKSPKP                                     +LDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDP
Subjt:  YMEHAMKSTRTKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDP

Query:  AGRLVISGEPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE
        AGRLVISGEPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQ E
Subjt:  AGRLVISGEPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE

XP_038904745.1 AT-rich interactive domain-containing protein 6-like isoform X2 [Benincasa hispida]  e value: 2.3e-234  identity: 78.6%
Query:  PSLLKLSSENLTSNSCRILPLELEMCDAKEKQESAQDVSVALSEGNQVYAEDDLHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPEDNLDKPTIPQQSN
        P  +K +     +   RILPLELEMCDA EK+ESAQDVSVALSEG+QV A+D+LH+SLSVPAEATPNSCEK   SPETHVQEV       DKPTIPQQSN
Subjt:  PSLLKLSSENLTSNSCRILPLELEMCDAKEKQESAQDVSVALSEGNQVYAEDDLHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPEDNLDKPTIPQQSN

Query:  QNSAHSLPFDHDQELDKPDAEPASDVKIVPNELPPRDLDAAISNSPLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHPVSSCVKAETENAI
        QNSA SLP DHDQELDKPDAEPASDVK  P+E+PPR+LDA ISNSPLETVQSSMD KSEA DMPETKTG LDDASTAS   PVTPHPVSSCVKAETENAI
Subjt:  QNSAHSLPFDHDQELDKPDAEPASDVKIVPNELPPRDLDAAISNSPLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHPVSSCVKAETENAI

Query:  ESKVNEDSVTTPHNGDSNMNHSFILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD--------------
        E KVNEDSVTTPHNGDSNMNHSFI DENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD              
Subjt:  ESKVNEDSVTTPHNGDSNMNHSFILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD--------------

Query:  -------------------KALLDYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQ
                           KALLDYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQ
Subjt:  -------------------KALLDYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQ

Query:  LKGIGLLKRKKPAYMEHAMKSTRTKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVP
        LK IG  KRKKPAYMEHAMKSTRTKSPKP                                     +LDVAVVDIGQPADWVKVNVQKTKDCYEVYALVP
Subjt:  LKGIGLLKRKKPAYMEHAMKSTRTKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVP

Query:  GLLREEVRVQSDPAGRLVISGEPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE
        GLLREEVRVQSDPAGRLVISGEPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQ E
Subjt:  GLLREEVRVQSDPAGRLVISGEPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE
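In the alignment blocks above, the unlabelled middle line carries a letter where the two sequences match exactly, a `+` for a conservative substitution, and a space otherwise. As a rough sketch only, the fraction of identical columns in one block could be estimated from that middle line as shown below. The function name `midline_identity` is hypothetical, and the result for a single block is not expected to reproduce the overall % identity reported for a hit, which is computed over the full alignment.

```python
def midline_identity(query: str, midline: str) -> float:
    """Fraction of alignment columns whose midline character is a letter.

    Assumption: `query` is the text after 'Query:' for one block and
    `midline` is the unlabelled line beneath it, aligned to the same column.
    """
    if not query:
        raise ValueError("query segment is empty")
    # Scraped text often loses trailing spaces, so pad (or trim) the
    # midline to the width of the query segment before counting.
    padded = midline.ljust(len(query))[: len(query)]
    # Letters mark exact matches; '+' and ' ' are not identities.
    matches = sum(1 for ch in padded if ch.isalpha())
    return matches / len(query)


# Final eleven columns of the last block above.
query = "LFVRVPFEQLE"
midline = "LFVRVPFEQ E"
print(f"{midline_identity(query, midline):.4f}")  # prints: 0.9091
```

Gap characters in the query segment count as columns here, so the denominator is the block's alignment width.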

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L619 Uncharacterized protein  e value: 3.7e-230  identity: 79.71%
Query:  MCDAKEKQESAQDVSVALSEGNQVYA-EDDLHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPEDNLDKPTIPQQSNQNSAHSLPFDHDQELDKPDAEPA
        MCDAKEKQES QDVS ALSEGNQV A ED+LHDSLSVPAEATPNSCEKEI SP T VQE+VP EDNL+KPTI QQSNQ S HSLP D+DQELDK  AEPA
Subjt:  MCDAKEKQESAQDVSVALSEGNQVYA-EDDLHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPEDNLDKPTIPQQSNQNSAHSLPFDHDQELDKPDAEPA

Query:  SDVKIVPNELPPRDLDAAISNSPLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHPVSSCVKAETENAIESKVNEDSVTTPHNGDSNMNHSF
        SDVK  P ELP RDLDAA+S+SPLETVQSS+D KSEA  MPE KT  LDDAST SHDEPVTPHPVSSCVKAETENAIE KVNED VTTPHNGDSNMNHSF
Subjt:  SDVKIVPNELPPRDLDAAISNSPLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHPVSSCVKAETENAIESKVNEDSVTTPHNGDSNMNHSF

Query:  ILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD---------------------------------KALL
        +LDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD                                 KALL
Subjt:  ILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD---------------------------------KALL

Query:  DYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGIGLLKRKKPAYMEHAMKSTR
        DYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLK IGLLKRKKP+YMEH+MKS R
Subjt:  DYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGIGLLKRKKPAYMEHAMKSTR

Query:  TKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISGEP
        TKSPKP                                     +LDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISG+P
Subjt:  TKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISGEP

Query:  EHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE
        EHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE
Subjt:  EHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE

A0A1S3BKB0 AT-rich interactive domain-containing protein 3-like  e value: 2.6e-231  identity: 80.07%
Query:  MCDAKEKQESAQDVSVALSEGNQVYA-EDDLHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPEDNLDKPTIPQQSNQNSAHSLPFDHDQELDKPDAEPA
        M DAKEKQE  QDVS  LSEGN V A ED+LHDSLSVPAEATPNSCEKEI SPET VQE+VP EDNLDK T  QQSNQ S HSLP ++DQELDKP AEPA
Subjt:  MCDAKEKQESAQDVSVALSEGNQVYA-EDDLHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPEDNLDKPTIPQQSNQNSAHSLPFDHDQELDKPDAEPA

Query:  SDVKIVPNELPPRDLDAAISNSPLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHPVSSCVKAETENAIESKVNEDSVTTPHNGDSNMNHSF
        SDVK  P ELPPRDLDAA+S+SPLETVQSSMD KSE F+MPE KT  LDDAS ASHDEPVTPHPVSSCVKAETENAIE KVNED VTTPHNGDSNMNHSF
Subjt:  SDVKIVPNELPPRDLDAAISNSPLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHPVSSCVKAETENAIESKVNEDSVTTPHNGDSNMNHSF

Query:  ILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD---------------------------------KALL
        ILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD                                 KALL
Subjt:  ILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD---------------------------------KALL

Query:  DYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGIGLLKRKKPAYMEHAMKSTR
        DYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLK IGLLKRKKP+YMEHAMKSTR
Subjt:  DYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGIGLLKRKKPAYMEHAMKSTR

Query:  TKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISGEP
        TKSPKP                                     +LDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISGEP
Subjt:  TKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISGEP

Query:  EHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE
        EHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE
Subjt:  EHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE

A0A5D3CLR0 AT-rich interactive domain-containing protein 3-like  e value: 2.6e-231  identity: 80.07%
Query:  MCDAKEKQESAQDVSVALSEGNQVYA-EDDLHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPEDNLDKPTIPQQSNQNSAHSLPFDHDQELDKPDAEPA
        M DAKEKQE  QDVS  LSEGN V A ED+LHDSLSVPAEATPNSCEKEI SPET VQE+VP EDNLDK T  QQSNQ S HSLP ++DQELDKP AEPA
Subjt:  MCDAKEKQESAQDVSVALSEGNQVYA-EDDLHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPEDNLDKPTIPQQSNQNSAHSLPFDHDQELDKPDAEPA

Query:  SDVKIVPNELPPRDLDAAISNSPLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHPVSSCVKAETENAIESKVNEDSVTTPHNGDSNMNHSF
        SDVK  P ELPPRDLDAA+S+SPLETVQSSMD KSE F+MPE KT  LDDAS ASHDEPVTPHPVSSCVKAETENAIE KVNED VTTPHNGDSNMNHSF
Subjt:  SDVKIVPNELPPRDLDAAISNSPLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHPVSSCVKAETENAIESKVNEDSVTTPHNGDSNMNHSF

Query:  ILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD---------------------------------KALL
        ILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD                                 KALL
Subjt:  ILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD---------------------------------KALL

Query:  DYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGIGLLKRKKPAYMEHAMKSTR
        DYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLK IGLLKRKKP+YMEHAMKSTR
Subjt:  DYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGIGLLKRKKPAYMEHAMKSTR

Query:  TKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISGEP
        TKSPKP                                     +LDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISGEP
Subjt:  TKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISGEP

Query:  EHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE
        EHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE
Subjt:  EHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE

A0A6J1HF99 AT-rich interactive domain-containing protein 3-like  e value: 7.1e-221  identity: 76.32%
Query:  MCDAKEKQESAQDVSVALSEGNQVYAEDDLHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPEDNLDKPTIPQQSNQNSAHSLPFDHDQELDKPDAEPAS
        MCDA E++ESAQD+ VAL EGNQV ++D+LHDSL +PAEATPNSCE +I SPETHVQEVVPPEDNLDK TIPQQSNQ+SA+SLP +HDQ+LDKPDAEPAS
Subjt:  MCDAKEKQESAQDVSVALSEGNQVYAEDDLHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPEDNLDKPTIPQQSNQNSAHSLPFDHDQELDKPDAEPAS

Query:  DVKIVPNELPPRD--LDAAISNSPLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHPVSSCVKAETENAIESKVNEDSVTTPHNGDSNMNHS
        DVK  P+ELPPR+  LDAA SN+ LETV  S+D  +EAF MPETKTG LDDASTASHDEPVTPHPVSS +KAETENAIE KVNED+V TPHNG SNMNHS
Subjt:  DVKIVPNELPPRD--LDAAISNSPLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHPVSSCVKAETENAIESKVNEDSVTTPHNGDSNMNHS

Query:  FILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD---------------------------------KAL
        FILDEN IAEGSESGTEEEQSAFMKELENFFRER +EFKPPKFYGEGLNCLKLWRAVTRLGGYD                                 KA+
Subjt:  FILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD---------------------------------KAL

Query:  LDYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGI-GLLKRKKPAYMEHAMKS
        LDYERHKT+GGEL VPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNS SMQK+EKQLKGI G+LKRKKP+YMEH +KS
Subjt:  LDYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGI-GLLKRKKPAYMEHAMKS

Query:  TRTKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISG
         RTKS KP                                     +LDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISG
Subjt:  TRTKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISG

Query:  EPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE
        EPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE
Subjt:  EPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE

A0A6J1I1R1 AT-rich interactive domain-containing protein 3-like  e value: 2.4e-221  identity: 76.5%
Query:  MCDAKEKQESAQDVSVALSEGNQVYAEDDLHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPEDNLDKPTIPQQSNQNSAHSLPFDHDQELDKPDAEPAS
        MCDA E+QESAQD+ VAL EGNQV ++D+LHDSL +PAEATPNSCE +I SPETHVQEVVPPEDNLDK TIPQQSNQ+SA+SLP +HDQEL KPDAEPAS
Subjt:  MCDAKEKQESAQDVSVALSEGNQVYAEDDLHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPEDNLDKPTIPQQSNQNSAHSLPFDHDQELDKPDAEPAS

Query:  DVKIVPNELPPRD--LDAAISNSPLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHPVSSCVKAETENAIESKVNEDSVTTPHNGDSNMNHS
        DVK  P+ELPPR+  LDAA SN+ LETV SS+D  +EAF MPETKTG LDDASTA+HDEPVTPHPVSSC+KAETENAIE KVNED+V TPHNG SN NHS
Subjt:  DVKIVPNELPPRD--LDAAISNSPLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHPVSSCVKAETENAIESKVNEDSVTTPHNGDSNMNHS

Query:  FILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD---------------------------------KAL
        FILDEN IAEGSESGTEEEQSAFMKELENFFRER +EFKPPKFYGEGLNCLKLWRAVTRLGGYD                                 KA+
Subjt:  FILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD---------------------------------KAL

Query:  LDYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGI-GLLKRKKPAYMEHAMKS
        LDYERHKT+GGEL VPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNS SMQK+EKQLKGI G+LKRKKP+YMEH +KS
Subjt:  LDYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGI-GLLKRKKPAYMEHAMKS

Query:  TRTKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISG
         RTKS KP                                     +LDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISG
Subjt:  TRTKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISG

Query:  EPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE
        EPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE
Subjt:  EPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE

SwissProt top hits (e value, % identity, alignment)
A1A442 Diacylglycerol O-acyltransferase 2  e value: 3.2e-109  identity: 64.69%
Query:  MMHIIVALGIWLGGIHLNFTLGLISLFYLSLPKALLVFALLLILVLIPVDDKSKYGRVLARYICQNACSYFPVTLHVEDIYAFDPNRAYVFGYEPHSVLP
        + H ++AL IW+G IH N  L  IS  +LS P  LL+    ++L+ IP+D+ SK GR L RY+C++ACS+FPVTLHVED+ AF  +RAYVFGYEPHSV P
Subjt:  MMHIIVALGIWLGGIHLNFTLGLISLFYLSLPKALLVFALLLILVLIPVDDKSKYGRVLARYICQNACSYFPVTLHVEDIYAFDPNRAYVFGYEPHSVLP

Query:  IGVVALADLTGFMPLKKLKVLASSAVFYTPFLRHIWTWMGLTPATKKNFISLLAAGCSCIIVPGGVQETFHMEHNSETVFLKTRRGFVRIAMEMGTPLVP
        +GV  L+D    +PL K+KVLAS+AVF TP LRHIWTW GLT ATKKNF +LLA+G SCI++PGGVQETF+M+H SE  FLK RRGFVR+AMEMG PLVP
Subjt:  IGVVALADLTGFMPLKKLKVLASSAVFYTPFLRHIWTWMGLTPATKKNFISLLAAGCSCIIVPGGVQETFHMEHNSETVFLKTRRGFVRIAMEMGTPLVP

Query:  VFCFGQSSVYQWWKPGGNFFLQFSRAIKFTPIVFWGVFGSPLPYRRRMHVVVGKPIEVKKNPNPTSDEVLDLHGQFVEALENMFER
        VFCFGQS+VY+WWKP G  F++ +RAIKF+PIVFWGV GS LP +R MHVVVGKPIEVK+NP PT +EV ++ GQFV AL+++FER
Subjt:  VFCFGQSSVYQWWKPGGNFFLQFSRAIKFTPIVFWGVFGSPLPYRRRMHVVVGKPIEVKKNPNPTSDEVLDLHGQFVEALENMFER

K7K424 Diacylglycerol O-acyltransferase 2D  e value: 1.6e-121  identity: 69.18%
Query:  IVALGIWLGGIHLNFTLGLISLFYLSLPKALLVFALLLILVLIPVDDKSKYGRVLARYICQNACSYFPVTLHVEDIYAFDPNRAYVFGYEPHSVLPIGVV
        I+A+ +WLG IH N  L L+++F+L L K+LLVF  L   +++P+++KS++GR L+R+IC++AC+YFP+TLHVED+ AFDPNRAYVFGYEPHSVLPIG+V
Subjt:  IVALGIWLGGIHLNFTLGLISLFYLSLPKALLVFALLLILVLIPVDDKSKYGRVLARYICQNACSYFPVTLHVEDIYAFDPNRAYVFGYEPHSVLPIGVV

Query:  ALADLTGFMPLKKLKVLASSAVFYTPFLRHIWTWMGLTPATKKNFISLLAAGCSCIIVPGGVQETFHMEHNSETVFLKTRRGFVRIAMEMGTPLVPVFCF
        ALAD TGFMPL K+KVLASS VFYTPFLRH+WTW+GLTPATKKNFISLLA+G SCI++PGGVQE FHM+H +E  FLK RRGFVR+AM  G PLVPVFCF
Subjt:  ALADLTGFMPLKKLKVLASSAVFYTPFLRHIWTWMGLTPATKKNFISLLAAGCSCIIVPGGVQETFHMEHNSETVFLKTRRGFVRIAMEMGTPLVPVFCF

Query:  GQSSVYQWWKPGGNFFLQFSRAIKFTPIVFWGVFGSPLPYRRRMHVVVGKPIEVKKNPNPTSDEVLDLHGQFVEALENMFERAHCKRPSQPN
        GQS+VY+WWKPGG  FL+F+RAIKFTPI FWG+FGSPLP+R  MHVVVG+PIEV KN  PT++EV  +HG FVEAL+++FER H  R   PN
Subjt:  GQSSVYQWWKPGGNFFLQFSRAIKFTPIVFWGVFGSPLPYRRRMHVVVGKPIEVKKNPNPTSDEVLDLHGQFVEALENMFERAHCKRPSQPN

Q0WNR6 AT-rich interactive domain-containing protein 5  e value: 1.5e-79  identity: 45.6%
Query:  ESKVNEDSVTTPHNGDSNMN--HSFILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD------------
        ES   +   ++PH  + ++    +++L +    E  E+G  ++Q AF+KE+E F +E  LEFK PKFYG+ LNCLKLWRAV +LGGYD            
Subjt:  ESKVNEDSVTTPHNGDSNMN--HSFILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD------------

Query:  ---------------------KALLDYERHKTNGGELSVPIASNSEPMSIE-----NQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLS
                             KALL+YE+H    GEL++P +++     IE     +Q SGSGR RRDAAARAMQGWHSQRLLG+GEV++PI+K+K  L+
Subjt:  ---------------------KALLDYERHKTNGGELSVPIASNSEPMSIE-----NQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLS

Query:  MQKREKQLKGIGLLKRKKPAYMEHAMKSTRTKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCY
           ++K LK IG+ K+K    M+                                         ++    S  +    V+D+G PADWVK+NV++TKDC+
Subjt:  MQKREKQLKGIGLLKRKKPAYMEHAMKSTRTKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCY

Query:  EVYALVPGLLREEVRVQSDPAGRLVISGEPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQ
        E++ALVPGLLREEVRVQSDPAGRLVI+G+PE  DNPWG+TPFKKVV+ P+RIDP  TSAVV+LHG+LFVRVPFEQ
Subjt:  EVYALVPGLLREEVRVQSDPAGRLVISGEPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQ

Q940Y3 AT-rich interactive domain-containing protein 3  e value: 5.9e-108  identity: 47.46%
Query:  LHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPED-NLDKPTIPQQSNQNS------AHSLPFDHDQELDKPDAEPASDVKI-VPNELPPRDLDAAISNS
        L D  S     T  + EKE+  P     E++P  D N D   +    + ++      A     D D+ +   DAEP  D+K+ VP+     D     +N+
Subjt:  LHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPED-NLDKPTIPQQSNQNS------AHSLPFDHDQELDKPDAEPASDVKI-VPNELPPRDLDAAISNS

Query:  PLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHP--VSSCVKAETENA--IESKVNEDSVTTP-HNGDSNMNHSFILDENHIAEGSESGTEE
            V+++ + +  +  +       L+DA+  S      P P   SS +K+E   +  + + V++   T P  +G      SF+LD+   ++G+ESGTEE
Subjt:  PLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHP--VSSCVKAETENA--IESKVNEDSVTTP-HNGDSNMNHSFILDENHIAEGSESGTEE

Query:  EQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD---------------------------------KALLDYERHKTNGGELSVPIA
        +QSAFMKEL++FFRER+++FKPPKFYGEGLNCLKLWRAVTRLGGYD                                 KALL+YERHK + GEL +P+ 
Subjt:  EQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD---------------------------------KALLDYERHKTNGGELSVPIA

Query:  SNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGI-GLLKRKKPAYMEHAMKSTRTKSPKPHYPFYDNWPI
           EPM+I+NQ SGSGRARRDAA+RAMQGWHSQRL GNGEVSDP IKDKN +  QKREKQ+    GLLKRK+ A  EH  K+                  
Subjt:  SNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGI-GLLKRKKPAYMEHAMKSTRTKSPKPHYPFYDNWPI

Query:  CIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISGEPEHPDNPWGVTPFKKVV
                             I +S + LDV VVD+G PADWVK+NVQ+T+DC+EVYALVPGL+REEVRVQSDPAGRLVISGEPE+P NPWG TPFKKVV
Subjt:  CIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISGEPEHPDNPWGVTPFKKVV

Query:  SLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE
        SLP+RIDPH TSAVVTL+GQLFVRVP EQLE
Subjt:  SLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE

Q9ASU1 Diacylglycerol O-acyltransferase 2  e value: 2.3e-115  identity: 68.31%
Query:  HIIVALGIWLGGIHLNFTLGLISLFYLSLPKALLVFALLLILVLIPVDDKSKYGRVLARYICQNACSYFPVTLHVEDIYAFDPNRAYVFGYEPHSVLPIG
        H I+A+ IWLG IH N  L L SL +L    +L+V  LL + + IP+D +SKYGR LARYIC++AC+YFPV+L+VED  AF PNRAYVFGYEPHSVLPIG
Subjt:  HIIVALGIWLGGIHLNFTLGLISLFYLSLPKALLVFALLLILVLIPVDDKSKYGRVLARYICQNACSYFPVTLHVEDIYAFDPNRAYVFGYEPHSVLPIG

Query:  VVALADLTGFMPLKKLKVLASSAVFYTPFLRHIWTWMGLTPATKKNFISLLAAGCSCIIVPGGVQETFHMEHNSETVFLKTRRGFVRIAMEMGTPLVPVF
        VVAL DLTGFMP+  +KVLASSA+FYTPFLRHIWTW+GLT A++KNF SLL +G SC++VPGGVQETFHM+H++E VFL  RRGFVRIAME G+PLVPVF
Subjt:  VVALADLTGFMPLKKLKVLASSAVFYTPFLRHIWTWMGLTPATKKNFISLLAAGCSCIIVPGGVQETFHMEHNSETVFLKTRRGFVRIAMEMGTPLVPVF

Query:  CFGQSSVYQWWKPGGNFFLQFSRAIKFTPIVFWGVFGSPLPYRRRMHVVVGKPIEVKKNPNPTSDEVLDLHGQFVEALENMFER
        CFGQ+ VY+WWKP  + +L+ SRAI+FTPI FWGVFGSPLP R+ MHVVVGKPIEV K   PT +E+   HGQ+VEAL ++FER
Subjt:  CFGQSSVYQWWKPGGNFFLQFSRAIKFTPIVFWGVFGSPLPYRRRMHVVVGKPIEVKKNPNPTSDEVLDLHGQFVEALENMFER

Arabidopsis top hits (e value, % identity, alignment)
AT1G76510.1 ARID/BRIGHT DNA-binding domain-containing protein  e value: 1.1e-80  identity: 45.6%
Query:  ESKVNEDSVTTPHNGDSNMN--HSFILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD------------
        ES   +   ++PH  + ++    +++L +    E  E+G  ++Q AF+KE+E F +E  LEFK PKFYG+ LNCLKLWRAV +LGGYD            
Subjt:  ESKVNEDSVTTPHNGDSNMN--HSFILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD------------

Query:  ---------------------KALLDYERHKTNGGELSVPIASNSEPMSIE-----NQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLS
                             KALL+YE+H    GEL++P +++     IE     +Q SGSGR RRDAAARAMQGWHSQRLLG+GEV++PI+K+K  L+
Subjt:  ---------------------KALLDYERHKTNGGELSVPIASNSEPMSIE-----NQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLS

Query:  MQKREKQLKGIGLLKRKKPAYMEHAMKSTRTKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCY
           ++K LK IG+ K+K    M+                                         ++    S  +    V+D+G PADWVK+NV++TKDC+
Subjt:  MQKREKQLKGIGLLKRKKPAYMEHAMKSTRTKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCY

Query:  EVYALVPGLLREEVRVQSDPAGRLVISGEPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQ
        E++ALVPGLLREEVRVQSDPAGRLVI+G+PE  DNPWG+TPFKKVV+ P+RIDP  TSAVV+LHG+LFVRVPFEQ
Subjt:  EVYALVPGLLREEVRVQSDPAGRLVISGEPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQ

AT1G76510.2 ARID/BRIGHT DNA-binding domain-containing protein  e value: 1.1e-80  identity: 45.6%
Query:  ESKVNEDSVTTPHNGDSNMN--HSFILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD------------
        ES   +   ++PH  + ++    +++L +    E  E+G  ++Q AF+KE+E F +E  LEFK PKFYG+ LNCLKLWRAV +LGGYD            
Subjt:  ESKVNEDSVTTPHNGDSNMN--HSFILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD------------

Query:  ---------------------KALLDYERHKTNGGELSVPIASNSEPMSIE-----NQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLS
                             KALL+YE+H    GEL++P +++     IE     +Q SGSGR RRDAAARAMQGWHSQRLLG+GEV++PI+K+K  L+
Subjt:  ---------------------KALLDYERHKTNGGELSVPIASNSEPMSIE-----NQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLS

Query:  MQKREKQLKGIGLLKRKKPAYMEHAMKSTRTKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCY
           ++K LK IG+ K+K    M+                                         ++    S  +    V+D+G PADWVK+NV++TKDC+
Subjt:  MQKREKQLKGIGLLKRKKPAYMEHAMKSTRTKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCY

Query:  EVYALVPGLLREEVRVQSDPAGRLVISGEPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQ
        E++ALVPGLLREEVRVQSDPAGRLVI+G+PE  DNPWG+TPFKKVV+ P+RIDP  TSAVV+LHG+LFVRVPFEQ
Subjt:  EVYALVPGLLREEVRVQSDPAGRLVISGEPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQ

AT2G17410.1 ARID/BRIGHT DNA-binding domain-containing protein   e value: 4.2e-109   %identity: 47.46
Query:  LHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPED-NLDKPTIPQQSNQNS------AHSLPFDHDQELDKPDAEPASDVKI-VPNELPPRDLDAAISNS
        L D  S     T  + EKE+  P     E++P  D N D   +    + ++      A     D D+ +   DAEP  D+K+ VP+     D     +N+
Subjt:  LHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPED-NLDKPTIPQQSNQNS------AHSLPFDHDQELDKPDAEPASDVKI-VPNELPPRDLDAAISNS

Query:  PLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHP--VSSCVKAETENA--IESKVNEDSVTTP-HNGDSNMNHSFILDENHIAEGSESGTEE
            V+++ + +  +  +       L+DA+  S      P P   SS +K+E   +  + + V++   T P  +G      SF+LD+   ++G+ESGTEE
Subjt:  PLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHP--VSSCVKAETENA--IESKVNEDSVTTP-HNGDSNMNHSFILDENHIAEGSESGTEE

Query:  EQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD---------------------------------KALLDYERHKTNGGELSVPIA
        +QSAFMKEL++FFRER+++FKPPKFYGEGLNCLKLWRAVTRLGGYD                                 KALL+YERHK + GEL +P+ 
Subjt:  EQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD---------------------------------KALLDYERHKTNGGELSVPIA

Query:  SNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGI-GLLKRKKPAYMEHAMKSTRTKSPKPHYPFYDNWPI
           EPM+I+NQ SGSGRARRDAA+RAMQGWHSQRL GNGEVSDP IKDKN +  QKREKQ+    GLLKRK+ A  EH  K+                  
Subjt:  SNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGI-GLLKRKKPAYMEHAMKSTRTKSPKPHYPFYDNWPI

Query:  CIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISGEPEHPDNPWGVTPFKKVV
                             I +S + LDV VVD+G PADWVK+NVQ+T+DC+EVYALVPGL+REEVRVQSDPAGRLVISGEPE+P NPWG TPFKKVV
Subjt:  CIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISGEPEHPDNPWGVTPFKKVV

Query:  SLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE
        SLP+RIDPH TSAVVTL+GQLFVRVP EQLE
Subjt:  SLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE

AT2G17410.2 ARID/BRIGHT DNA-binding domain-containing protein   e value: 3.0e-107   %identity: 47.27
Query:  LHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPED-NLDKPTIPQQSNQNS------AHSLPFDHDQELDKPDAEPASDVKI-VPNELPPRDLDAAISNS
        L D  S     T  + EKE+  P     E++P  D N D   +    + ++      A     D D+ +   DAEP  D+K+ VP+     D     +N+
Subjt:  LHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPED-NLDKPTIPQQSNQNS------AHSLPFDHDQELDKPDAEPASDVKI-VPNELPPRDLDAAISNS

Query:  PLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHP--VSSCVKAETENA--IESKVNEDSVTTP-HNGDSNMNHSFILDENHIAEGSESGTEE
            V+++ + +  +  +       L+DA+  S      P P   SS +K+E   +  + + V++   T P  +G      SF+LD+   ++G+ESGTEE
Subjt:  PLETVQSSMDTKSEAFDMPETKTGCLDDASTASHDEPVTPHP--VSSCVKAETENA--IESKVNEDSVTTP-HNGDSNMNHSFILDENHIAEGSESGTEE

Query:  EQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD---------------------------------KALLDYERHKTNGGELSVPIA
        +QSAFMKEL++FFRER+++FKPPKFYGEGLNCLKLWRAVTRLGGYD                                 KALL+YERHK + GEL +P+ 
Subjt:  EQSAFMKELENFFRERSLEFKPPKFYGEGLNCLKLWRAVTRLGGYD---------------------------------KALLDYERHKTNGGELSVPIA

Query:  SNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGI-GLLKRKKPAYMEHAMKSTRTKSPKPHYPFYDNWPI
           EPM+I+NQ SGSGRARRDAA+RAMQGWHSQRL GNGEVSDP IKDKN +  QKREKQ+    GLLKRK+ A  EH +           +P       
Subjt:  SNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGI-GLLKRKKPAYMEHAMKSTRTKSPKPHYPFYDNWPI

Query:  CIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISGEPEHPDNPWGVTPFKKVV
                                   RLDV VVD+G PADWVK+NVQ+T+DC+EVYALVPGL+REEVRVQSDPAGRLVISGEPE+P NPWG TPFKKVV
Subjt:  CIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVISGEPEHPDNPWGVTPFKKVV

Query:  SLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE
        SLP+RIDPH TSAVVTL+GQLFVRVP EQLE
Subjt:  SLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE

AT3G51520.1 diacylglycerol acyltransferase family   e value: 1.6e-116   %identity: 68.31
Query:  HIIVALGIWLGGIHLNFTLGLISLFYLSLPKALLVFALLLILVLIPVDDKSKYGRVLARYICQNACSYFPVTLHVEDIYAFDPNRAYVFGYEPHSVLPIG
        H I+A+ IWLG IH N  L L SL +L    +L+V  LL + + IP+D +SKYGR LARYIC++AC+YFPV+L+VED  AF PNRAYVFGYEPHSVLPIG
Subjt:  HIIVALGIWLGGIHLNFTLGLISLFYLSLPKALLVFALLLILVLIPVDDKSKYGRVLARYICQNACSYFPVTLHVEDIYAFDPNRAYVFGYEPHSVLPIG

Query:  VVALADLTGFMPLKKLKVLASSAVFYTPFLRHIWTWMGLTPATKKNFISLLAAGCSCIIVPGGVQETFHMEHNSETVFLKTRRGFVRIAMEMGTPLVPVF
        VVAL DLTGFMP+  +KVLASSA+FYTPFLRHIWTW+GLT A++KNF SLL +G SC++VPGGVQETFHM+H++E VFL  RRGFVRIAME G+PLVPVF
Subjt:  VVALADLTGFMPLKKLKVLASSAVFYTPFLRHIWTWMGLTPATKKNFISLLAAGCSCIIVPGGVQETFHMEHNSETVFLKTRRGFVRIAMEMGTPLVPVF

Query:  CFGQSSVYQWWKPGGNFFLQFSRAIKFTPIVFWGVFGSPLPYRRRMHVVVGKPIEVKKNPNPTSDEVLDLHGQFVEALENMFER
        CFGQ+ VY+WWKP  + +L+ SRAI+FTPI FWGVFGSPLP R+ MHVVVGKPIEV K   PT +E+   HGQ+VEAL ++FER
Subjt:  CFGQSSVYQWWKPGGNFFLQFSRAIKFTPIVFWGVFGSPLPYRRRMHVVVGKPIEVKKNPNPTSDEVLDLHGQFVEALENMFER


Sequences
CDS sequence
ATGGGGATGGGGTTGGGAATGGGGAAGGAGATTGATGAAATCTCAGGACCTTCCGTCACAGTTTTCAAGGGAGGCGGTGGACCGGCGACCAAATGGCGGATGATGCACAT
AATAGTAGCCCTCGGAATTTGGCTCGGCGGCATTCATCTCAATTTTACTCTAGGCCTCATCTCCCTCTTCTACCTCTCCCTTCCCAAAGCCCTCTTGGTTTTCGCTTTAC
TTTTGATATTAGTGTTAATTCCTGTCGACGATAAAAGCAAATACGGCCGCGTATTGGCCAGATATATATGTCAAAATGCTTGCAGCTATTTTCCTGTTACTCTACATGTT
GAGGATATATATGCCTTTGATCCCAATCGTGCTTATGTTTTTGGTTATGAGCCACATTCAGTTTTACCCATTGGTGTTGTTGCGTTGGCCGACCTTACTGGTTTCATGCC
TCTCAAAAAATTGAAAGTCCTTGCTAGTTCTGCTGTGTTTTATACTCCATTTTTGAGGCACATATGGACATGGATGGGTCTAACGCCAGCAACAAAGAAAAATTTCATCT
CCCTCTTGGCAGCTGGCTGTAGTTGCATCATTGTGCCTGGTGGAGTACAAGAGACATTTCATATGGAGCATAATTCGGAGACTGTCTTCCTCAAGACAAGACGAGGATTT
GTTCGTATAGCGATGGAGATGGGTACACCACTGGTCCCAGTTTTCTGCTTTGGTCAGTCAAGTGTTTATCAGTGGTGGAAACCTGGTGGTAACTTCTTCCTCCAATTTTC
TAGAGCGATCAAGTTCACACCAATTGTCTTCTGGGGAGTTTTTGGATCGCCCCTTCCTTATAGGCGGCGGATGCATGTGGTGGTAGGGAAACCCATTGAGGTGAAGAAAA
ATCCAAATCCAACAAGTGACGAGGTGCTTGATTTACACGGTCAGTTTGTTGAAGCACTCGAAAATATGTTTGAAAGGGCCCATTGCAAGCGTCCCTCCCAACCAAATAAA
AGGCCATCGAATCATCGATGTTCCACAACCACTTGTGGAGGTCCAAATCCTGATGGCGATTGCTCTCCTTCAAGGCGTACAGCTCCTGCGCGAGAGAAAGTAGAAATAGA
CAACGCCGAATTCAAATTCCAGAACCACGTTGCTGAACGAACAATCGGATTGGAGAAGAAGAGGAACGTTGCATTCTTTCCTTCTTTACTTAAACTTTCTTCTGAAAACC
TAACTTCCAATAGTTGTAGGATTCTCCCTTTGGAACTGGAGATGTGTGACGCGAAGGAGAAACAGGAATCTGCACAAGATGTTTCAGTTGCTTTAAGTGAAGGCAACCAA
GTTTATGCAGAAGATGATCTCCATGATTCGCTCTCTGTGCCAGCTGAGGCAACTCCAAACTCTTGTGAAAAAGAGATCTATTCACCTGAGACTCATGTACAGGAAGTTGT
ACCTCCTGAGGATAATTTAGATAAACCTACGATTCCTCAGCAATCAAATCAAAATTCTGCACATTCACTTCCCTTTGATCATGACCAAGAGTTGGACAAGCCTGATGCAG
AACCTGCTTCTGATGTCAAGATTGTCCCTAATGAGTTGCCCCCAAGGGATCTGGATGCTGCTATTTCTAACTCTCCCCTAGAAACTGTTCAGTCCTCCATGGATACCAAA
TCTGAGGCCTTTGATATGCCTGAAACTAAAACCGGTTGTCTTGATGATGCTTCAACAGCAAGTCACGATGAACCTGTAACTCCACATCCAGTGTCATCTTGTGTTAAAGC
TGAAACAGAGAATGCCATAGAATCGAAGGTCAATGAAGACAGTGTTACTACACCTCATAATGGGGATTCAAACATGAATCACTCGTTTATTTTGGATGAAAATCATATTG
CTGAAGGTAGTGAATCAGGAACAGAAGAAGAGCAATCTGCTTTCATGAAGGAGCTTGAAAACTTCTTCAGAGAAAGAAGCCTGGAATTTAAACCTCCTAAGTTCTATGGA
GAGGGTTTGAATTGCCTCAAGTTATGGAGGGCTGTTACTAGATTGGGAGGCTATGACAAGGCTCTCCTTGATTATGAGAGGCATAAAACCAATGGTGGCGAACTTAGTGT
ACCTATTGCTTCCAACTCAGAACCTATGAGTATTGAAAACCAGGGATCAGGATCAGGTAGAGCACGAAGAGATGCTGCAGCACGTGCCATGCAGGGTTGGCACTCGCAAC
GTCTTCTGGGAAATGGGGAAGTTAGTGACCCCATTATCAAGGATAAGAACTCACTTTCAATGCAGAAAAGGGAAAAACAACTTAAAGGCATTGGTCTTCTTAAACGTAAG
AAACCAGCTTACATGGAGCATGCCATGAAATCAACACGTACAAAATCACCAAAACCACACTACCCATTTTATGACAATTGGCCAATTTGCATCGAGACCCATGTAAAAAA
CATTGGATGGAATGTTGGAAAGGAATTTAACATTTTGCTGATTTCTCTGTCTGTGGCTAGGTTGGATGTAGCAGTAGTTGATATTGGACAACCAGCCGACTGGGTCAAGG
TCAATGTGCAGAAAACTAAAGATTGTTATGAGGTCTATGCATTAGTTCCCGGACTACTTCGTGAGGAGGTGCGTGTCCAGTCTGATCCGGCAGGACGCTTGGTCATTAGT
GGTGAACCTGAACATCCTGATAATCCATGGGGTGTCACACCCTTCAAAAAGGTGGTCAGCCTACCGTCAAGGATTGATCCACATCAGACTTCTGCCGTTGTCACCCTACA
TGGACAGTTGTTTGTTCGCGTTCCATTTGAACAGTTGGAATGA
mRNA sequence
ATGGGGATGGGGTTGGGAATGGGGAAGGAGATTGATGAAATCTCAGGACCTTCCGTCACAGTTTTCAAGGGAGGCGGTGGACCGGCGACCAAATGGCGGATGATGCACAT
AATAGTAGCCCTCGGAATTTGGCTCGGCGGCATTCATCTCAATTTTACTCTAGGCCTCATCTCCCTCTTCTACCTCTCCCTTCCCAAAGCCCTCTTGGTTTTCGCTTTAC
TTTTGATATTAGTGTTAATTCCTGTCGACGATAAAAGCAAATACGGCCGCGTATTGGCCAGATATATATGTCAAAATGCTTGCAGCTATTTTCCTGTTACTCTACATGTT
GAGGATATATATGCCTTTGATCCCAATCGTGCTTATGTTTTTGGTTATGAGCCACATTCAGTTTTACCCATTGGTGTTGTTGCGTTGGCCGACCTTACTGGTTTCATGCC
TCTCAAAAAATTGAAAGTCCTTGCTAGTTCTGCTGTGTTTTATACTCCATTTTTGAGGCACATATGGACATGGATGGGTCTAACGCCAGCAACAAAGAAAAATTTCATCT
CCCTCTTGGCAGCTGGCTGTAGTTGCATCATTGTGCCTGGTGGAGTACAAGAGACATTTCATATGGAGCATAATTCGGAGACTGTCTTCCTCAAGACAAGACGAGGATTT
GTTCGTATAGCGATGGAGATGGGTACACCACTGGTCCCAGTTTTCTGCTTTGGTCAGTCAAGTGTTTATCAGTGGTGGAAACCTGGTGGTAACTTCTTCCTCCAATTTTC
TAGAGCGATCAAGTTCACACCAATTGTCTTCTGGGGAGTTTTTGGATCGCCCCTTCCTTATAGGCGGCGGATGCATGTGGTGGTAGGGAAACCCATTGAGGTGAAGAAAA
ATCCAAATCCAACAAGTGACGAGGTGCTTGATTTACACGGTCAGTTTGTTGAAGCACTCGAAAATATGTTTGAAAGGGCCCATTGCAAGCGTCCCTCCCAACCAAATAAA
AGGCCATCGAATCATCGATGTTCCACAACCACTTGTGGAGGTCCAAATCCTGATGGCGATTGCTCTCCTTCAAGGCGTACAGCTCCTGCGCGAGAGAAAGTAGAAATAGA
CAACGCCGAATTCAAATTCCAGAACCACGTTGCTGAACGAACAATCGGATTGGAGAAGAAGAGGAACGTTGCATTCTTTCCTTCTTTACTTAAACTTTCTTCTGAAAACC
TAACTTCCAATAGTTGTAGGATTCTCCCTTTGGAACTGGAGATGTGTGACGCGAAGGAGAAACAGGAATCTGCACAAGATGTTTCAGTTGCTTTAAGTGAAGGCAACCAA
GTTTATGCAGAAGATGATCTCCATGATTCGCTCTCTGTGCCAGCTGAGGCAACTCCAAACTCTTGTGAAAAAGAGATCTATTCACCTGAGACTCATGTACAGGAAGTTGT
ACCTCCTGAGGATAATTTAGATAAACCTACGATTCCTCAGCAATCAAATCAAAATTCTGCACATTCACTTCCCTTTGATCATGACCAAGAGTTGGACAAGCCTGATGCAG
AACCTGCTTCTGATGTCAAGATTGTCCCTAATGAGTTGCCCCCAAGGGATCTGGATGCTGCTATTTCTAACTCTCCCCTAGAAACTGTTCAGTCCTCCATGGATACCAAA
TCTGAGGCCTTTGATATGCCTGAAACTAAAACCGGTTGTCTTGATGATGCTTCAACAGCAAGTCACGATGAACCTGTAACTCCACATCCAGTGTCATCTTGTGTTAAAGC
TGAAACAGAGAATGCCATAGAATCGAAGGTCAATGAAGACAGTGTTACTACACCTCATAATGGGGATTCAAACATGAATCACTCGTTTATTTTGGATGAAAATCATATTG
CTGAAGGTAGTGAATCAGGAACAGAAGAAGAGCAATCTGCTTTCATGAAGGAGCTTGAAAACTTCTTCAGAGAAAGAAGCCTGGAATTTAAACCTCCTAAGTTCTATGGA
GAGGGTTTGAATTGCCTCAAGTTATGGAGGGCTGTTACTAGATTGGGAGGCTATGACAAGGCTCTCCTTGATTATGAGAGGCATAAAACCAATGGTGGCGAACTTAGTGT
ACCTATTGCTTCCAACTCAGAACCTATGAGTATTGAAAACCAGGGATCAGGATCAGGTAGAGCACGAAGAGATGCTGCAGCACGTGCCATGCAGGGTTGGCACTCGCAAC
GTCTTCTGGGAAATGGGGAAGTTAGTGACCCCATTATCAAGGATAAGAACTCACTTTCAATGCAGAAAAGGGAAAAACAACTTAAAGGCATTGGTCTTCTTAAACGTAAG
AAACCAGCTTACATGGAGCATGCCATGAAATCAACACGTACAAAATCACCAAAACCACACTACCCATTTTATGACAATTGGCCAATTTGCATCGAGACCCATGTAAAAAA
CATTGGATGGAATGTTGGAAAGGAATTTAACATTTTGCTGATTTCTCTGTCTGTGGCTAGGTTGGATGTAGCAGTAGTTGATATTGGACAACCAGCCGACTGGGTCAAGG
TCAATGTGCAGAAAACTAAAGATTGTTATGAGGTCTATGCATTAGTTCCCGGACTACTTCGTGAGGAGGTGCGTGTCCAGTCTGATCCGGCAGGACGCTTGGTCATTAGT
GGTGAACCTGAACATCCTGATAATCCATGGGGTGTCACACCCTTCAAAAAGGTGGTCAGCCTACCGTCAAGGATTGATCCACATCAGACTTCTGCCGTTGTCACCCTACA
TGGACAGTTGTTTGTTCGCGTTCCATTTGAACAGTTGGAATGA
Protein sequence
MGMGLGMGKEIDEISGPSVTVFKGGGGPATKWRMMHIIVALGIWLGGIHLNFTLGLISLFYLSLPKALLVFALLLILVLIPVDDKSKYGRVLARYICQNACSYFPVTLHV
EDIYAFDPNRAYVFGYEPHSVLPIGVVALADLTGFMPLKKLKVLASSAVFYTPFLRHIWTWMGLTPATKKNFISLLAAGCSCIIVPGGVQETFHMEHNSETVFLKTRRGF
VRIAMEMGTPLVPVFCFGQSSVYQWWKPGGNFFLQFSRAIKFTPIVFWGVFGSPLPYRRRMHVVVGKPIEVKKNPNPTSDEVLDLHGQFVEALENMFERAHCKRPSQPNK
RPSNHRCSTTTCGGPNPDGDCSPSRRTAPAREKVEIDNAEFKFQNHVAERTIGLEKKRNVAFFPSLLKLSSENLTSNSCRILPLELEMCDAKEKQESAQDVSVALSEGNQ
VYAEDDLHDSLSVPAEATPNSCEKEIYSPETHVQEVVPPEDNLDKPTIPQQSNQNSAHSLPFDHDQELDKPDAEPASDVKIVPNELPPRDLDAAISNSPLETVQSSMDTK
SEAFDMPETKTGCLDDASTASHDEPVTPHPVSSCVKAETENAIESKVNEDSVTTPHNGDSNMNHSFILDENHIAEGSESGTEEEQSAFMKELENFFRERSLEFKPPKFYG
EGLNCLKLWRAVTRLGGYDKALLDYERHKTNGGELSVPIASNSEPMSIENQGSGSGRARRDAAARAMQGWHSQRLLGNGEVSDPIIKDKNSLSMQKREKQLKGIGLLKRK
KPAYMEHAMKSTRTKSPKPHYPFYDNWPICIETHVKNIGWNVGKEFNILLISLSVARLDVAVVDIGQPADWVKVNVQKTKDCYEVYALVPGLLREEVRVQSDPAGRLVIS
GEPEHPDNPWGVTPFKKVVSLPSRIDPHQTSAVVTLHGQLFVRVPFEQLE