; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10013438 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10013438
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionHTH myb-type domain-containing protein
Genome locationChr02:1506270..1515512
RNA-Seq ExpressionHG10013438
SyntenyHG10013438
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK31274.1 uncharacterized protein E5676_scaffold455G005670 [Cucumis melo var. makuwa]3.1e-21681.04Show/hide
Query:  MIERKEKQRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEP
        MI +KE +RTGTISMEDCS LL RYSVR IFTLLREVA+VSGVRIDWDKLVKNTSTGIS+  EYQLLWRHL YRHTLLE+M SVTD LDYDSDLDFE+EP
Subjt:  MIERKEKQRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEP

Query:  FPFVSSESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAAS
        FP V SES NEAAACVKVLIAN IP ESDVP+SSAVEAPLTI ISNSQ  TDN +N QSA L QG+SVTIPLSIQRQPIP+P A EVFDVNGAAGA+AAS
Subjt:  FPFVSSESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAAS

Query:  RKRKKPWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNIN
        RKR+KPWSKAEDLEL+AAVEKCGEGNW NILKGDFKGDRTASQLS             N+GAST+STTQKAQIDAAHRAL+FALDLPVNN+KT ANSNIN
Subjt:  RKRKKPWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNIN

Query:  SSIV-SSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCV
        SSIV SSAS +E+SVQMQNQSP IS+PSRPLLV+PLPSAVKSGINTSKNSLM+ STHNSDSIVRATAVAAGARIVSPSD  SL+KATQT+NAI IKSKC 
Subjt:  SSIV-SSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCV

Query:  SSIQSPVVGNAPMHLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITA
                   PMHLDARP+VHYISTGKTPTP SNYVSGKSTMVGNNSMKAVSPK+LH+R  AI TN PS++VSPTTESPLKQEVNSSEERK  + IITA
Subjt:  SSIQSPVVGNAPMHLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITA

Query:  KEEFRENSVANDVKIRG
        KEEFRENS  NDVKIRG
Subjt:  KEEFRENSVANDVKIRG

XP_008466159.1 PREDICTED: uncharacterized protein LOC103503656 isoform X1 [Cucumis melo]9.0e-21680.42Show/hide
Query:  MIERKEKQRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPL-----DYDSDLD
        MI +KE +RTGTISMEDCS LL RYSVR IFTLLREVA+VSGVRIDWDKLVKNTSTGIS+  EYQLLWRHL YRHTLLE+M SVTD L     DYDSDLD
Subjt:  MIERKEKQRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPL-----DYDSDLD

Query:  FEIEPFPFVSSESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAG
        FE+EPFP V SES NEAAACVKVLIAN IP ESDVP+SSAVEAPLTI ISNSQ  TDN +N QSA L QG+SVTIPLSIQRQPIP+P A EVFDVNGAAG
Subjt:  FEIEPFPFVSSESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAG

Query:  ANAASRKRKKPWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTAN
        A+AASRKR+KPWSKAEDLEL+AAVEKCGEGNW NILKGDFKGDRTASQLS             N+GAST+STTQKAQIDAAHRAL+FALDLPVNN+KTAN
Subjt:  ANAASRKRKKPWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTAN

Query:  SNINSSIV-SSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIK
        SNINSSIV SSAS +E+SVQMQNQSP IS+PSRPLLV+PLPSAVKSGINTSKNSLM+ STHNSDSIVRATAVAAGARIVSPSD  SL+KATQT+NAI IK
Subjt:  SNINSSIV-SSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIK

Query:  SKCVSSIQSPVVGNAPMHLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKP
        SKC            PMHLDARP+VHYISTGKTPTP SNYVSGKSTMVGNNSMKAVSPK+LH+R  AI TN PS++VSPTTESPLKQEVNSSEERK  + 
Subjt:  SKCVSSIQSPVVGNAPMHLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKP

Query:  IITAKEEFRENSVANDVKIRG
        IITAKEEFRENS  NDVKIRG
Subjt:  IITAKEEFRENSVANDVKIRG

XP_008466161.1 PREDICTED: uncharacterized protein LOC103503656 isoform X2 [Cucumis melo]1.3e-21781.2Show/hide
Query:  MIERKEKQRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEP
        MI +KE +RTGTISMEDCS LL RYSVR IFTLLREVA+VSGVRIDWDKLVKNTSTGIS+  EYQLLWRHL YRHTLLE+M SVTD LDYDSDLDFE+EP
Subjt:  MIERKEKQRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEP

Query:  FPFVSSESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAAS
        FP V SES NEAAACVKVLIAN IP ESDVP+SSAVEAPLTI ISNSQ  TDN +N QSA L QG+SVTIPLSIQRQPIP+P A EVFDVNGAAGA+AAS
Subjt:  FPFVSSESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAAS

Query:  RKRKKPWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINS
        RKR+KPWSKAEDLEL+AAVEKCGEGNW NILKGDFKGDRTASQLS             N+GAST+STTQKAQIDAAHRAL+FALDLPVNN+KTANSNINS
Subjt:  RKRKKPWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINS

Query:  SIV-SSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVS
        SIV SSAS +E+SVQMQNQSP IS+PSRPLLV+PLPSAVKSGINTSKNSLM+ STHNSDSIVRATAVAAGARIVSPSD  SL+KATQT+NAI IKSKC  
Subjt:  SIV-SSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVS

Query:  SIQSPVVGNAPMHLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAK
                  PMHLDARP+VHYISTGKTPTP SNYVSGKSTMVGNNSMKAVSPK+LH+R  AI TN PS++VSPTTESPLKQEVNSSEERK  + IITAK
Subjt:  SIQSPVVGNAPMHLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAK

Query:  EEFRENSVANDVKIRG
        EEFRENS  NDVKIRG
Subjt:  EEFRENSVANDVKIRG

XP_038898738.1 uncharacterized protein LOC120086263 isoform X1 [Benincasa hispida]2.4e-22485.21Show/hide
Query:  MIERKEKQRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEP
        MIERKE  RTGTISMEDCS LLERYSVR IFTLLREVA+VSGVRIDW  LVKNTSTGISNV EYQLLWRHL YRHT LENMDSVTDPLDYDSDLDFEIEP
Subjt:  MIERKEKQRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEP

Query:  FPFVSSESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAAS
        FP VSSES NEAAACVKVLIAN IP+ESDVPSSSAVEAPLTIGISNSQ+ST NLEN QSACLMQGMSVTIPLS+QRQPIP+P ATEV DVNGAA ANAAS
Subjt:  FPFVSSESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAAS

Query:  RKRKKPWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNIN
        RKR+KPWSKAEDLELMAAVEKCGEGNW NILKGDFKGDRTASQLS             NVGAST+STTQKAQIDAAHRALSFALDLPVNNSKT ANSNIN
Subjt:  RKRKKPWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNIN

Query:  SSIVSSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVS
        SS+VSSASGAEASVQMQNQSP IS+PSRPLLVEPLPS+VKSGI TSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSD  SLLKA Q +NAI IKSKC  
Subjt:  SSIVSSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVS

Query:  SIQSPVVGNAPMHLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAK
                 +PMHLDARPSVHYISTGKTPTPGSN+V GKSTM+GNNS+KAVSPKV HNR TAI TNPPSDRVSPTTESPLKQ+VNSSEERKI++ IITAK
Subjt:  SIQSPVVGNAPMHLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAK

Query:  EEFRENS
        EEFRE +
Subjt:  EEFRENS

XP_038898739.1 uncharacterized protein LOC120086263 isoform X2 [Benincasa hispida]9.6e-22685.38Show/hide
Query:  MIERKEKQRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEP
        MIERKE  RTGTISMEDCS LLERYSVR IFTLLREVA+VSGVRIDW  LVKNTSTGISNV EYQLLWRHL YRHT LENMDSVTDPLDYDSDLDFEIEP
Subjt:  MIERKEKQRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEP

Query:  FPFVSSESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAAS
        FP VSSES NEAAACVKVLIAN IP+ESDVPSSSAVEAPLTIGISNSQ+ST NLEN QSACLMQGMSVTIPLS+QRQPIP+P ATEV DVNGAA ANAAS
Subjt:  FPFVSSESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAAS

Query:  RKRKKPWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINS
        RKR+KPWSKAEDLELMAAVEKCGEGNW NILKGDFKGDRTASQLS             NVGAST+STTQKAQIDAAHRALSFALDLPVNNSKTANSNINS
Subjt:  RKRKKPWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINS

Query:  SIVSSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVSS
        S+VSSASGAEASVQMQNQSP IS+PSRPLLVEPLPS+VKSGI TSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSD  SLLKA Q +NAI IKSKC   
Subjt:  SIVSSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVSS

Query:  IQSPVVGNAPMHLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAKE
                +PMHLDARPSVHYISTGKTPTPGSN+V GKSTM+GNNS+KAVSPKV HNR TAI TNPPSDRVSPTTESPLKQ+VNSSEERKI++ IITAKE
Subjt:  IQSPVVGNAPMHLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAKE

Query:  EFRENS
        EFRE +
Subjt:  EFRENS

TrEMBL top hitse value%identityAlignment
A0A1S3CQJ6 uncharacterized protein LOC103503656 isoform X14.4e-21680.42Show/hide
Query:  MIERKEKQRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPL-----DYDSDLD
        MI +KE +RTGTISMEDCS LL RYSVR IFTLLREVA+VSGVRIDWDKLVKNTSTGIS+  EYQLLWRHL YRHTLLE+M SVTD L     DYDSDLD
Subjt:  MIERKEKQRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPL-----DYDSDLD

Query:  FEIEPFPFVSSESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAG
        FE+EPFP V SES NEAAACVKVLIAN IP ESDVP+SSAVEAPLTI ISNSQ  TDN +N QSA L QG+SVTIPLSIQRQPIP+P A EVFDVNGAAG
Subjt:  FEIEPFPFVSSESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAG

Query:  ANAASRKRKKPWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTAN
        A+AASRKR+KPWSKAEDLEL+AAVEKCGEGNW NILKGDFKGDRTASQLS             N+GAST+STTQKAQIDAAHRAL+FALDLPVNN+KTAN
Subjt:  ANAASRKRKKPWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTAN

Query:  SNINSSIV-SSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIK
        SNINSSIV SSAS +E+SVQMQNQSP IS+PSRPLLV+PLPSAVKSGINTSKNSLM+ STHNSDSIVRATAVAAGARIVSPSD  SL+KATQT+NAI IK
Subjt:  SNINSSIV-SSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIK

Query:  SKCVSSIQSPVVGNAPMHLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKP
        SKC            PMHLDARP+VHYISTGKTPTP SNYVSGKSTMVGNNSMKAVSPK+LH+R  AI TN PS++VSPTTESPLKQEVNSSEERK  + 
Subjt:  SKCVSSIQSPVVGNAPMHLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKP

Query:  IITAKEEFRENSVANDVKIRG
        IITAKEEFRENS  NDVKIRG
Subjt:  IITAKEEFRENSVANDVKIRG

A0A1S3CRZ4 uncharacterized protein LOC103503656 isoform X26.1e-21881.2Show/hide
Query:  MIERKEKQRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEP
        MI +KE +RTGTISMEDCS LL RYSVR IFTLLREVA+VSGVRIDWDKLVKNTSTGIS+  EYQLLWRHL YRHTLLE+M SVTD LDYDSDLDFE+EP
Subjt:  MIERKEKQRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEP

Query:  FPFVSSESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAAS
        FP V SES NEAAACVKVLIAN IP ESDVP+SSAVEAPLTI ISNSQ  TDN +N QSA L QG+SVTIPLSIQRQPIP+P A EVFDVNGAAGA+AAS
Subjt:  FPFVSSESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAAS

Query:  RKRKKPWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINS
        RKR+KPWSKAEDLEL+AAVEKCGEGNW NILKGDFKGDRTASQLS             N+GAST+STTQKAQIDAAHRAL+FALDLPVNN+KTANSNINS
Subjt:  RKRKKPWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINS

Query:  SIV-SSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVS
        SIV SSAS +E+SVQMQNQSP IS+PSRPLLV+PLPSAVKSGINTSKNSLM+ STHNSDSIVRATAVAAGARIVSPSD  SL+KATQT+NAI IKSKC  
Subjt:  SIV-SSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVS

Query:  SIQSPVVGNAPMHLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAK
                  PMHLDARP+VHYISTGKTPTP SNYVSGKSTMVGNNSMKAVSPK+LH+R  AI TN PS++VSPTTESPLKQEVNSSEERK  + IITAK
Subjt:  SIQSPVVGNAPMHLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAK

Query:  EEFRENSVANDVKIRG
        EEFRENS  NDVKIRG
Subjt:  EEFRENSVANDVKIRG

A0A5A7T5C8 HTH myb-type domain-containing protein1.1e-21480.27Show/hide
Query:  MIERKEKQRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPL-----DYDSDLD
        MI +KE +RTGTISMEDCS LL RYSVR IFTLLREVA+VSGVRIDWDKLVKNTSTGIS+  EYQLLWRHL YRHTLLE+M SVTD L     DYDSDLD
Subjt:  MIERKEKQRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPL-----DYDSDLD

Query:  FEIEPFPFVSSESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAG
        FE+EPFP V SES NEAAACVKVLIAN IP ESDVP+SSAVEAPLTI ISNSQ  TDN +N QSA L QG+SVTIPLSIQRQPIP+P A EVFDVNGAAG
Subjt:  FEIEPFPFVSSESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAG

Query:  ANAASRKRKKPWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-A
        A+AASRKR+KPWSKAEDLEL+AAVEKCGEGNW NILKGDFKGDRTASQLS             N+GAST+STTQKAQIDAAHRAL+FALDLPVNN+KT A
Subjt:  ANAASRKRKKPWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-A

Query:  NSNINSSIV-SSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRI
        NSNINSSIV SSAS +E+SVQMQNQSP IS+PSRPLLV+PLPSAVKSGINTSKNSLM+ STHNSDSIVRATAVAAGARIVSPSD  SL+KATQT+NAI I
Subjt:  NSNINSSIV-SSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRI

Query:  KSKCVSSIQSPVVGNAPMHLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAK
        KSKC            PMHLDARP+VHYISTGKTPTP SNYVSGKSTMVGNNSMKAVSPK+LH+R  AI TN PS++VSPTTESPLKQEVNSSEERK  +
Subjt:  KSKCVSSIQSPVVGNAPMHLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAK

Query:  PIITAKEEFRENSVANDVKIRG
         IITAKEEFRENS  NDVKIRG
Subjt:  PIITAKEEFRENSVANDVKIRG

A0A5D3E5P5 HTH myb-type domain-containing protein1.5e-21681.04Show/hide
Query:  MIERKEKQRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEP
        MI +KE +RTGTISMEDCS LL RYSVR IFTLLREVA+VSGVRIDWDKLVKNTSTGIS+  EYQLLWRHL YRHTLLE+M SVTD LDYDSDLDFE+EP
Subjt:  MIERKEKQRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEP

Query:  FPFVSSESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAAS
        FP V SES NEAAACVKVLIAN IP ESDVP+SSAVEAPLTI ISNSQ  TDN +N QSA L QG+SVTIPLSIQRQPIP+P A EVFDVNGAAGA+AAS
Subjt:  FPFVSSESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAAS

Query:  RKRKKPWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNIN
        RKR+KPWSKAEDLEL+AAVEKCGEGNW NILKGDFKGDRTASQLS             N+GAST+STTQKAQIDAAHRAL+FALDLPVNN+KT ANSNIN
Subjt:  RKRKKPWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNIN

Query:  SSIV-SSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCV
        SSIV SSAS +E+SVQMQNQSP IS+PSRPLLV+PLPSAVKSGINTSKNSLM+ STHNSDSIVRATAVAAGARIVSPSD  SL+KATQT+NAI IKSKC 
Subjt:  SSIV-SSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCV

Query:  SSIQSPVVGNAPMHLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITA
                   PMHLDARP+VHYISTGKTPTP SNYVSGKSTMVGNNSMKAVSPK+LH+R  AI TN PS++VSPTTESPLKQEVNSSEERK  + IITA
Subjt:  SSIQSPVVGNAPMHLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITA

Query:  KEEFRENSVANDVKIRG
        KEEFRENS  NDVKIRG
Subjt:  KEEFRENSVANDVKIRG

A0A6J1FAZ9 uncharacterized protein LOC111443670 isoform X25.0e-21279.84Show/hide
Query:  MIERKEKQRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEP
        MIE KEKQ+ GTIS ED S +LERYSVR IFTLLREVA VS VRIDWDKLVKNTSTGISNV EYQLLWRHL YRHTLLEN+DSVTDPLDYDSDLDFEIEP
Subjt:  MIERKEKQRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEP

Query:  FPFVSSESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGI-SNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAA
        FP VS+ESLNEAAACVKVLIAN IP+ESDVPSSS VEAPLTIGI SNS++   +LENPQSACLMQGM VT+P+SIQRQP+P P ATEVFDVNGAAG+NAA
Subjt:  FPFVSSESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGI-SNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAA

Query:  SRKRKKPWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH------------NVGASTTST-TQKAQIDAAHRALSFALDLPVNNSKTANSNI
        SRKR+KPWSK EDLELMAAVEK GEGNW NILK DFKGDRTASQLS             NVGA+TTST   KAQIDAAHRALS ALDLPVNNSK+ANSN+
Subjt:  SRKRKKPWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH------------NVGASTTST-TQKAQIDAAHRALSFALDLPVNNSKTANSNI

Query:  NSSIVSSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCV
        NSS VSS SGAEA VQ+QNQSP + +PSRPL V+PLPSA KSGINT+KN+LMMKSTHNSDSIVRATAVAAGARIVSPSD  SL+KA QT+NAI IKSKCV
Subjt:  NSSIVSSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCV

Query:  SSIQSPVVGNAPMHLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITA
        SSIQ P++GNA  HLDARPSVHYISTG+T TPG+NYV GKSTM GNNSMK VSPK  +N  TA+LTNPPS+++SPTTESPLKQEV SSEE KI+KPIIT 
Subjt:  SSIQSPVVGNAPMHLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITA

Query:  KEEFRE
        K +FRE
Subjt:  KEEFRE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G09710.1 Homeodomain-like superfamily protein2.0e-5139.13Show/hide
Query:  QRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSE
        +R   I+  D + LL RY +  I  +L+E++  S  ++DW+ LVK T+TGI+N  EYQLLWRHL YRH LL   D    PLD DSD++ E+E  P VS E
Subjt:  QRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSE

Query:  SLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNS--QASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKK
        +  EA A VKV+ A+ + +ESD+   S VEAPLTI I  +  + S +  E+P S+   +GM++  P+ +Q+       +TE  + NG+AG + A R+++K
Subjt:  SLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNS--QASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKK

Query:  PWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH-------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTAN--------SNINSSI
         WS  ED EL AAV++CGEGNW +I+KGDF+G+RTASQLS            ST+ +    Q   A  A++ AL L + N   +N        +  + +I
Subjt:  PWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH-------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTAN--------SNINSSI

Query:  VSSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGA
          + +   +S Q Q QS  I + + P     LP+A    +  +  S    ST  SD +V A +VAA A
Subjt:  VSSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGA

AT1G09710.2 Homeodomain-like superfamily protein5.4e-4939.68Show/hide
Query:  QRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSE
        +R   I+  D + LL RY +  I  +L+E++  S  ++DW+ LVK T+TGI+N  EYQLLWRHL YRH LL   D    PLD DSD++ E+E  P VS E
Subjt:  QRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSE

Query:  SLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNS--QASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKK
        +  EA A VKV+ A+ + +ESD+   S VEAPLTI I  +  + S +  E+P S+   +GM++  P+ +Q+       +TE  + NG+AG + A R+++K
Subjt:  SLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNS--QASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKK

Query:  PWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH-------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIVSSASGAE
         WS  ED EL AAV++CGEGNW +I+KGDF+G+RTASQLS            ST+ +    Q   A  A++ AL L + N   +N     +    +  A 
Subjt:  PWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSH-------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIVSSASGAE

Query:  ASVQMQNQSPHISIP
        +S+ +  +   + +P
Subjt:  ASVQMQNQSPHISIP

AT1G58220.1 Homeodomain-like superfamily protein5.0e-4738.78Show/hide
Query:  KQRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSS
        K+R   IS  D + LL+RY    I  LL+E+A  +  +++W++LVK TSTGI++  EYQLLWRHL YR +L+  + +    LD DSD++ E+E  P VS 
Subjt:  KQRTGTISMEDCSNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSS

Query:  ESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKP
        + + EA A VKV+ A+ +P+ESD+P  S VEAPLTI I  S       E   S    +GM++T        P+ LP A E  + NG A ++ A RKR+K 
Subjt:  ESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKP

Query:  WSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSHNVGA-------STTSTTQKAQIDAAHRALSFALDLPVNN---SKTANSNINSSIVS----
        WS  ED EL+AAV++ GEG+W  I K +F+G+RTASQLS   GA       S TST    Q   A  A + AL L V N   SK     +   + S    
Subjt:  WSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSHNVGA-------STTSTTQKAQIDAAHRALSFALDLPVNN---SKTANSNINSSIVS----

Query:  --SASGAEASVQMQNQ---SPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAI
           A+GA +   +Q Q    P I   SR     P+    KS +   K +    ST  +D +V A +VAA A +   +  V++ K    +NA+
Subjt:  --SASGAEASVQMQNQ---SPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATTGAGAATCTTTGCAATTTAACACGACAGGCAAAGATTCAGTCGGTTTGCACTTGCAGTGAGCTGCACTTCTGTGCCTGCGTGTCTATCATTGAGCAGTGCAC
TCTCACCACCGTCGCCGATAAGTCCGGATTTCTTTTGCAACCGCAACGCTTAACATTCGTTTTCCGGTTTCGTCTTCAAACAATTTCCGAGAAGCGATTTCAGTCTCATC
GGCGACGGTCACCGACGCTGCACCTCCTCTTATGCGATTCCATTACGGGCTCTATAATGATTGAGAGGAAAGAGAAGCAAAGGACAGGGACAATTAGTATGGAAGATTGT
TCCAATCTATTGGAAAGATATTCGGTAAGAGCAATTTTTACTTTGCTTCGAGAGGTGGCCCGGGTTTCGGGAGTGAGAATTGATTGGGACAAATTGGTGAAGAACACGTC
CACTGGGATTTCTAATGTTTGGGAGTATCAGTTGTTATGGCGGCATTTGCCTTATCGTCACACGTTACTGGAGAACATGGATTCTGTTACTGATCCACTGGATTATGATA
GTGACTTAGATTTTGAAATAGAACCTTTTCCATTTGTCAGCAGTGAGTCCTTGAATGAAGCTGCAGCATGTGTAAAGGTACTGATTGCTAATTGCATACCAACTGAGTCA
GATGTTCCAAGTAGTTCTGCAGTTGAGGCCCCATTGACTATAGGTATATCCAATAGTCAAGCATCTACAGACAATCTTGAAAATCCTCAATCTGCTTGTTTGATGCAAGG
GATGTCTGTTACAATTCCACTTTCCATTCAGAGACAGCCGATTCCATTGCCAGTAGCAACTGAAGTATTTGATGTGAATGGAGCAGCTGGTGCTAATGCAGCTTCTCGAA
AAAGAAAAAAACCTTGGTCGAAGGCAGAGGATTTGGAATTGATGGCTGCTGTGGAGAAGTGTGGTGAAGGAAACTGGGGGAATATCTTGAAAGGAGACTTCAAGGGGGAT
AGAACTGCTTCACAGCTATCTCATAATGTGGGAGCTAGCACCACAAGTACTACCCAGAAAGCTCAGATTGATGCTGCACACCGTGCATTGTCCTTTGCCCTTGATTTGCC
TGTGAATAACTCAAAAACAGCAAATTCAAACATAAACAGTAGCATTGTTTCTTCTGCAAGTGGTGCCGAAGCTTCAGTTCAAATGCAGAATCAGTCTCCACATATTTCCA
TTCCTTCAAGGCCGCTGCTGGTAGAGCCTTTGCCTTCAGCAGTGAAATCTGGGATCAACACTTCCAAAAATTCATTGATGATGAAGTCTACTCACAATTCTGATTCTATA
GTTAGAGCAACTGCAGTAGCTGCAGGGGCCCGAATCGTTTCTCCGTCTGATACTGTATCTCTACTGAAAGCGACGCAAACGAGAAATGCCATCCGCATAAAGTCCAAATG
TGTTTCATCAATCCAATCACCCGTGGTTGGGAATGCACCAATGCACTTGGATGCACGCCCCAGTGTACATTATATTTCCACAGGAAAAACACCAACTCCAGGCTCAAACT
ATGTGAGTGGTAAATCTACTATGGTAGGCAATAACTCAATGAAGGCTGTCTCACCAAAAGTTCTGCATAATCGTTGTACTGCTATTTTGACAAACCCGCCATCAGACCGA
GTAAGCCCAACAACTGAGTCTCCATTGAAGCAAGAGGTTAACAGTTCAGAAGAACGCAAAATTGCCAAGCCAATCATTACTGCAAAAGAGGAGTTTCGAGAAAACAGCGT
GGCAAATGATGTCAAGATTAGGGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGATTGAGAATCTTTGCAATTTAACACGACAGGCAAAGATTCAGTCGGTTTGCACTTGCAGTGAGCTGCACTTCTGTGCCTGCGTGTCTATCATTGAGCAGTGCAC
TCTCACCACCGTCGCCGATAAGTCCGGATTTCTTTTGCAACCGCAACGCTTAACATTCGTTTTCCGGTTTCGTCTTCAAACAATTTCCGAGAAGCGATTTCAGTCTCATC
GGCGACGGTCACCGACGCTGCACCTCCTCTTATGCGATTCCATTACGGGCTCTATAATGATTGAGAGGAAAGAGAAGCAAAGGACAGGGACAATTAGTATGGAAGATTGT
TCCAATCTATTGGAAAGATATTCGGTAAGAGCAATTTTTACTTTGCTTCGAGAGGTGGCCCGGGTTTCGGGAGTGAGAATTGATTGGGACAAATTGGTGAAGAACACGTC
CACTGGGATTTCTAATGTTTGGGAGTATCAGTTGTTATGGCGGCATTTGCCTTATCGTCACACGTTACTGGAGAACATGGATTCTGTTACTGATCCACTGGATTATGATA
GTGACTTAGATTTTGAAATAGAACCTTTTCCATTTGTCAGCAGTGAGTCCTTGAATGAAGCTGCAGCATGTGTAAAGGTACTGATTGCTAATTGCATACCAACTGAGTCA
GATGTTCCAAGTAGTTCTGCAGTTGAGGCCCCATTGACTATAGGTATATCCAATAGTCAAGCATCTACAGACAATCTTGAAAATCCTCAATCTGCTTGTTTGATGCAAGG
GATGTCTGTTACAATTCCACTTTCCATTCAGAGACAGCCGATTCCATTGCCAGTAGCAACTGAAGTATTTGATGTGAATGGAGCAGCTGGTGCTAATGCAGCTTCTCGAA
AAAGAAAAAAACCTTGGTCGAAGGCAGAGGATTTGGAATTGATGGCTGCTGTGGAGAAGTGTGGTGAAGGAAACTGGGGGAATATCTTGAAAGGAGACTTCAAGGGGGAT
AGAACTGCTTCACAGCTATCTCATAATGTGGGAGCTAGCACCACAAGTACTACCCAGAAAGCTCAGATTGATGCTGCACACCGTGCATTGTCCTTTGCCCTTGATTTGCC
TGTGAATAACTCAAAAACAGCAAATTCAAACATAAACAGTAGCATTGTTTCTTCTGCAAGTGGTGCCGAAGCTTCAGTTCAAATGCAGAATCAGTCTCCACATATTTCCA
TTCCTTCAAGGCCGCTGCTGGTAGAGCCTTTGCCTTCAGCAGTGAAATCTGGGATCAACACTTCCAAAAATTCATTGATGATGAAGTCTACTCACAATTCTGATTCTATA
GTTAGAGCAACTGCAGTAGCTGCAGGGGCCCGAATCGTTTCTCCGTCTGATACTGTATCTCTACTGAAAGCGACGCAAACGAGAAATGCCATCCGCATAAAGTCCAAATG
TGTTTCATCAATCCAATCACCCGTGGTTGGGAATGCACCAATGCACTTGGATGCACGCCCCAGTGTACATTATATTTCCACAGGAAAAACACCAACTCCAGGCTCAAACT
ATGTGAGTGGTAAATCTACTATGGTAGGCAATAACTCAATGAAGGCTGTCTCACCAAAAGTTCTGCATAATCGTTGTACTGCTATTTTGACAAACCCGCCATCAGACCGA
GTAAGCCCAACAACTGAGTCTCCATTGAAGCAAGAGGTTAACAGTTCAGAAGAACGCAAAATTGCCAAGCCAATCATTACTGCAAAAGAGGAGTTTCGAGAAAACAGCGT
GGCAAATGATGTCAAGATTAGGGGCTGA
Protein sequenceShow/hide protein sequence
MEIENLCNLTRQAKIQSVCTCSELHFCACVSIIEQCTLTTVADKSGFLLQPQRLTFVFRFRLQTISEKRFQSHRRRSPTLHLLLCDSITGSIMIERKEKQRTGTISMEDC
SNLLERYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSESLNEAAACVKVLIANCIPTES
DVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAEDLELMAAVEKCGEGNWGNILKGDFKGD
RTASQLSHNVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIVSSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSI
VRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVSSIQSPVVGNAPMHLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDR
VSPTTESPLKQEVNSSEERKIAKPIITAKEEFRENSVANDVKIRG