; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G017070 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G017070
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionU-box domain-containing protein 1-like
Genome locationCG_Chr05:29363500..29368405
RNA-Seq ExpressionClCG05G017070
SyntenyClCG05G017070
Gene Ontology termsNA
InterPro domainsIPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136473.1 uncharacterized protein LOC101217803 [Cucumis sativus]2.2e-19281.43Show/hide
Query:  LSTLFFISRLKHLKNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYESASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLIG
        L  LFFISRLKHL NTR GA QSNSMLYH AE SSA QEVLPSEWYE A  KIKKLSC L+NVDL+DGR+VN +DDSTI DERIEQ+M TFK+LVR+LIG
Subjt:  LSTLFFISRLKHLKNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYESASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLIG

Query:  SPSTQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELTSELDPLFHQSTNKGIKMG
        SPS QRR+TE+A SSSINCQP AWFRNSSERE MVVDSLTKV N L V+ QQRKLVRHTICPQVTQHHIWTGALD +LKEL  EL PL H+ST+KGIKM 
Subjt:  SPSTQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELTSELDPLFHQSTNKGIKMG

Query:  NQIVSSCLKFLDDVTNSNAHYTSWMRPAPLRAVADSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLV
         QIVSSCLKFLD  TNSN H++SW+RPAP R V  SS PPRWEDMLEMF DLI YLKDEK LVHYVTKLEVMKEGLSQIKDV +D+SIG++EA  QESLV
Subjt:  NQIVSSCLKFLDDVTNSNAHYTSWMRPAPLRAVADSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLV

Query:  QKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVG
        QKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVD CGGLLK DGNDKFLLFMGRVLS DEEKIVWNGVR L R MG+FK VWETAGMKGEL L+GHLFCVG
Subjt:  QKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVG

Query:  AEDRQLSYKENAYLLHEINL
         E RQLSYK NAYLLHEI L
Subjt:  AEDRQLSYKENAYLLHEINL

XP_008466389.1 PREDICTED: uncharacterized protein LOC103503810 [Cucumis melo]2.2e-18980.76Show/hide
Query:  LSTLFFISRLKHLKNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYESASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLIG
        L   FFISRLKHL +TR GA QSNSMLYH  EQSS DQEVLPSEWYE A  KIKKLSC L+NVDL+DGR+VN +DDSTIIDERIEQKM TFK+LVR+LIG
Subjt:  LSTLFFISRLKHLKNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYESASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLIG

Query:  SPSTQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELTSELDPLFHQSTNKGIKMG
        SPS QRR+TEMA SSSIN Q  AWFRNSSERE MVVDSLTK  NFL V+ QQRKL+RHTICPQ+TQHHIWTGALD +LKEL  EL PL ++STNKGI M 
Subjt:  SPSTQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELTSELDPLFHQSTNKGIKMG

Query:  NQIVSSCLKFLDDVTNSNAHYTS-WMRPAPLRAVADSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESL
         QIVSSCLKFLDD TNSN H+TS W+RPAP R + +SS PPRWEDMLEMF DLI YLKDEK LVHYVTKLEVMKEGLSQIKDV +D+SIG+KEA  QESL
Subjt:  NQIVSSCLKFLDDVTNSNAHYTS-WMRPAPLRAVADSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESL

Query:  VQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCV
        VQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVD CGGLLK DGNDKFLLFMGRVLS DEEKIVWNGVR L R MG+FK VWETAGMKGEL LQGHLFCV
Subjt:  VQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCV

Query:  GAEDRQLSYKENAYLLHEINL
          E RQLSYK NAYLLHEI L
Subjt:  GAEDRQLSYKENAYLLHEINL

XP_022940414.1 uncharacterized protein LOC111446029 [Cucurbita moschata]3.6e-18779.43Show/hide
Query:  LFFISRLKHLKNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYESASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLIGSPS
        L F+ R+ HL  TR  A  SN MLYHC E S  D E LP++WYE A  KIKKLSCSLKNVDLIDGRLVNVNDDSTI+DERIEQ+M  FK+LVRV IGSPS
Subjt:  LFFISRLKHLKNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYESASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLIGSPS

Query:  TQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELTSELDPLFHQSTNKGIKMGNQI
         QRR+TEMA S++ N QPQ  FRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQ TQHHIWTGALDH+LKEL  ELDPL H S NKGIKMG QI
Subjt:  TQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELTSELDPLFHQSTNKGIKMGNQI

Query:  VSSCLKFLDDVTNSNAHYTSWMRPAPLRAVADSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQKK
        VSSCL FL+D TNSNAH TSWMRPAPL+   DSS  P+WEDMLEMFTDLIS LKDEK L  YVTKLEVMKEGL+QI+DVLTDKSIG+KEA HQESLVQKK
Subjt:  VSSCLKFLDDVTNSNAHYTSWMRPAPLRAVADSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQKK

Query:  LSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKA-DGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVGAE
        LSKTLGHSSRCLFTLLLYYLFGHFRD+EVDLCGGLLKA +  +K+L+FMGR+LS DEE++VWNGVR L R MGLFKFVWETAGMKG+L LQGHLFCVGAE
Subjt:  LSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKA-DGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVGAE

Query:  DRQLSYKENAYLLHEINL
        DRQLSYK N YLLH+I+L
Subjt:  DRQLSYKENAYLLHEINL

XP_022982191.1 uncharacterized protein LOC111481093 [Cucurbita maxima]1.2e-18779.62Show/hide
Query:  LFFISRLKHLKNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYESASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLIGSPS
        L F+ R+ HL  TR  AL SN MLYHC+E S  DQE LP++WYE A  KIKKLSCSLKNVDLIDGRLVNVNDDSTI+DERIEQ+M  FK+LVRV IGS S
Subjt:  LFFISRLKHLKNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYESASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLIGSPS

Query:  TQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELTSELDPLFHQSTNKGIKMGNQI
         QRR+TEMA S++IN QPQA FRNSSEREPMVVDS TKVSNFLNVSAQQRKLVRHTICPQ TQHHIWTGALDH+LKEL  ELDPL H S NKGIKMG QI
Subjt:  TQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELTSELDPLFHQSTNKGIKMGNQI

Query:  VSSCLKFLDDVTNSNAHYTSWMRPAPLRAVADSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQKK
        VSSCLKFL+D TNSNAH TSWMRPAPL+   DSS  P+WEDMLEMFTDLI  LKDEK L  YVTKLEVMKEGL+QI+DVL DKSIG+KEA HQESLVQKK
Subjt:  VSSCLKFLDDVTNSNAHYTSWMRPAPLRAVADSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQKK

Query:  LSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVGAED
        LSKTLGHSSRCLFTLLLYYLFGHFRD+EVDLCGGLLKA   +K+L+FMGR+LS DEE+ VWNGVR L R MGLFKFVWETAGMKG+L L+GHLFCVGAED
Subjt:  LSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVGAED

Query:  RQLSYKENAYLLHEINL
        RQLSYK N YL+HEI+L
Subjt:  RQLSYKENAYLLHEINL

XP_038896888.1 uncharacterized protein LOC120085101 isoform X1 [Benincasa hispida]1.8e-21589.81Show/hide
Query:  DLSTLFFISRLKHLKNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYESASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLI
        +L  LFFISRLKHL NTRFGALQSNSMLYHCAE SSADQEVLPSEWYE+A RKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDE IEQ+M TFK+LV VLI
Subjt:  DLSTLFFISRLKHLKNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYESASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLI

Query:  GSPSTQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELTSELDPLFHQSTNKGIKM
        GSP+ +RR+TEMAVSSSI CQP AWFRN SEREPM+VDSLTK+SNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKEL  EL PL  QSTNKGIKM
Subjt:  GSPSTQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELTSELDPLFHQSTNKGIKM

Query:  GNQIVSSCLKFLDDVTNSNAHYTSWMRPAPLR-AVADSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQES
        G+QIVSSCLKFLDD TNSNAH+TSWMRPAPLR AV DSSAPPRWEDMLEMFTDLI  LK+EK LVHYVTKL+VMKEGLSQIKDVLTDKSIGYKEASHQES
Subjt:  GNQIVSSCLKFLDDVTNSNAHYTSWMRPAPLR-AVADSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQES

Query:  LVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFC
        LVQKKLSKTLGHSSRCLFTLLLYY+FGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNG+R L RVMGLFKFVWETAGMKG+LELQGHLFC
Subjt:  LVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFC

Query:  VGAEDRQLSYKENAYLLHEINL
        VG EDRQLSYK NAYLLHEINL
Subjt:  VGAEDRQLSYKENAYLLHEINL

TrEMBL top hitse value%identityAlignment
A0A0A0LED2 Uncharacterized protein1.0e-19281.43Show/hide
Query:  LSTLFFISRLKHLKNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYESASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLIG
        L  LFFISRLKHL NTR GA QSNSMLYH AE SSA QEVLPSEWYE A  KIKKLSC L+NVDL+DGR+VN +DDSTI DERIEQ+M TFK+LVR+LIG
Subjt:  LSTLFFISRLKHLKNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYESASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLIG

Query:  SPSTQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELTSELDPLFHQSTNKGIKMG
        SPS QRR+TE+A SSSINCQP AWFRNSSERE MVVDSLTKV N L V+ QQRKLVRHTICPQVTQHHIWTGALD +LKEL  EL PL H+ST+KGIKM 
Subjt:  SPSTQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELTSELDPLFHQSTNKGIKMG

Query:  NQIVSSCLKFLDDVTNSNAHYTSWMRPAPLRAVADSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLV
         QIVSSCLKFLD  TNSN H++SW+RPAP R V  SS PPRWEDMLEMF DLI YLKDEK LVHYVTKLEVMKEGLSQIKDV +D+SIG++EA  QESLV
Subjt:  NQIVSSCLKFLDDVTNSNAHYTSWMRPAPLRAVADSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLV

Query:  QKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVG
        QKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVD CGGLLK DGNDKFLLFMGRVLS DEEKIVWNGVR L R MG+FK VWETAGMKGEL L+GHLFCVG
Subjt:  QKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVG

Query:  AEDRQLSYKENAYLLHEINL
         E RQLSYK NAYLLHEI L
Subjt:  AEDRQLSYKENAYLLHEINL

A0A1S3CR45 uncharacterized protein LOC1035038101.1e-18980.76Show/hide
Query:  LSTLFFISRLKHLKNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYESASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLIG
        L   FFISRLKHL +TR GA QSNSMLYH  EQSS DQEVLPSEWYE A  KIKKLSC L+NVDL+DGR+VN +DDSTIIDERIEQKM TFK+LVR+LIG
Subjt:  LSTLFFISRLKHLKNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYESASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLIG

Query:  SPSTQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELTSELDPLFHQSTNKGIKMG
        SPS QRR+TEMA SSSIN Q  AWFRNSSERE MVVDSLTK  NFL V+ QQRKL+RHTICPQ+TQHHIWTGALD +LKEL  EL PL ++STNKGI M 
Subjt:  SPSTQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELTSELDPLFHQSTNKGIKMG

Query:  NQIVSSCLKFLDDVTNSNAHYTS-WMRPAPLRAVADSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESL
         QIVSSCLKFLDD TNSN H+TS W+RPAP R + +SS PPRWEDMLEMF DLI YLKDEK LVHYVTKLEVMKEGLSQIKDV +D+SIG+KEA  QESL
Subjt:  NQIVSSCLKFLDDVTNSNAHYTS-WMRPAPLRAVADSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESL

Query:  VQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCV
        VQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVD CGGLLK DGNDKFLLFMGRVLS DEEKIVWNGVR L R MG+FK VWETAGMKGEL LQGHLFCV
Subjt:  VQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCV

Query:  GAEDRQLSYKENAYLLHEINL
          E RQLSYK NAYLLHEI L
Subjt:  GAEDRQLSYKENAYLLHEINL

A0A6J1CBU2 uncharacterized protein LOC111010055 isoform X14.4e-18378.07Show/hide
Query:  LSTLFFISRLKHLKNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYESASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLIG
        LS   FI RL HL NTR+GAL SN MLYH AE SSADQE+LPSEWYE+A RKI+KLSCSLKNVDLIDGRLVNV DDSTI DERIEQ+M  FK+LVRV +G
Subjt:  LSTLFFISRLKHLKNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYESASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLIG

Query:  SPSTQRRLTE--MAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELTSELDPLFHQST-NKGI
        SPS +RR+TE  MA SS+ NCQP   F NSSEREPMVVDSLTK+SNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKEL  ELDPL HQST NKGI
Subjt:  SPSTQRRLTE--MAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELTSELDPLFHQST-NKGI

Query:  KMGNQIVSSCLKFLDDVTNSNAHYTSWMRPAPLRAVADSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTD-KSIGYKEASHQ
        KMG QIVSSCLKFLDD TNSNAH+TSWMRPAP + V D SA PRWEDMLEMF DLI  LK EK L+ +V KLEVMKEGLSQIKDVL+D KSIG+KE+ HQ
Subjt:  KMGNQIVSSCLKFLDDVTNSNAHYTSWMRPAPLRAVADSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTD-KSIGYKEASHQ

Query:  ESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHL
        ESLVQ+KLSKTLGHSSRCLFTLL++YL+GH RDIEVD CGG+LK   N+KF L MGR+LS DEEK+VWNGV+ L R MG+FKFVWETAGMKG LELQGHL
Subjt:  ESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHL

Query:  FCVGAEDRQLSYKENAYLLHEINL
        + VGA+ RQLSYK NAY+LH+I L
Subjt:  FCVGAEDRQLSYKENAYLLHEINL

A0A6J1FK17 uncharacterized protein LOC1114460291.7e-18779.43Show/hide
Query:  LFFISRLKHLKNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYESASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLIGSPS
        L F+ R+ HL  TR  A  SN MLYHC E S  D E LP++WYE A  KIKKLSCSLKNVDLIDGRLVNVNDDSTI+DERIEQ+M  FK+LVRV IGSPS
Subjt:  LFFISRLKHLKNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYESASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLIGSPS

Query:  TQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELTSELDPLFHQSTNKGIKMGNQI
         QRR+TEMA S++ N QPQ  FRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQ TQHHIWTGALDH+LKEL  ELDPL H S NKGIKMG QI
Subjt:  TQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELTSELDPLFHQSTNKGIKMGNQI

Query:  VSSCLKFLDDVTNSNAHYTSWMRPAPLRAVADSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQKK
        VSSCL FL+D TNSNAH TSWMRPAPL+   DSS  P+WEDMLEMFTDLIS LKDEK L  YVTKLEVMKEGL+QI+DVLTDKSIG+KEA HQESLVQKK
Subjt:  VSSCLKFLDDVTNSNAHYTSWMRPAPLRAVADSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQKK

Query:  LSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKA-DGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVGAE
        LSKTLGHSSRCLFTLLLYYLFGHFRD+EVDLCGGLLKA +  +K+L+FMGR+LS DEE++VWNGVR L R MGLFKFVWETAGMKG+L LQGHLFCVGAE
Subjt:  LSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKA-DGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVGAE

Query:  DRQLSYKENAYLLHEINL
        DRQLSYK N YLLH+I+L
Subjt:  DRQLSYKENAYLLHEINL

A0A6J1IW03 uncharacterized protein LOC1114810935.9e-18879.62Show/hide
Query:  LFFISRLKHLKNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYESASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLIGSPS
        L F+ R+ HL  TR  AL SN MLYHC+E S  DQE LP++WYE A  KIKKLSCSLKNVDLIDGRLVNVNDDSTI+DERIEQ+M  FK+LVRV IGS S
Subjt:  LFFISRLKHLKNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYESASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLIGSPS

Query:  TQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELTSELDPLFHQSTNKGIKMGNQI
         QRR+TEMA S++IN QPQA FRNSSEREPMVVDS TKVSNFLNVSAQQRKLVRHTICPQ TQHHIWTGALDH+LKEL  ELDPL H S NKGIKMG QI
Subjt:  TQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELTSELDPLFHQSTNKGIKMGNQI

Query:  VSSCLKFLDDVTNSNAHYTSWMRPAPLRAVADSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQKK
        VSSCLKFL+D TNSNAH TSWMRPAPL+   DSS  P+WEDMLEMFTDLI  LKDEK L  YVTKLEVMKEGL+QI+DVL DKSIG+KEA HQESLVQKK
Subjt:  VSSCLKFLDDVTNSNAHYTSWMRPAPLRAVADSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQKK

Query:  LSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVGAED
        LSKTLGHSSRCLFTLLLYYLFGHFRD+EVDLCGGLLKA   +K+L+FMGR+LS DEE+ VWNGVR L R MGLFKFVWETAGMKG+L L+GHLFCVGAED
Subjt:  LSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVGAED

Query:  RQLSYKENAYLLHEINL
        RQLSYK N YL+HEI+L
Subjt:  RQLSYKENAYLLHEINL

SwissProt top hitse value%identityAlignment
O22193 U-box domain-containing protein 41.1e-1028.35Show/hide
Query:  VASFHSSSSMRSMS-VTTAHKTITECIAGARSDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKL
        + S  S+ + R +S V T  K + E +   +S +L+ Q +A   L ++ + + +   +++   G+I +L+ L  S+ S  Q  +++ L NLS+N + KK 
Subjt:  VASFHSSSSMRSMS-VTTAHKTITECIAGARSDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKL

Query:  LASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTS
        +A    I  L  ++  GS E  + +++ + SL+++++NK K G +G I  LV  L   T         +L  L     N  + V+SGA+R LID+++  +
Subjt:  LASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTS

Query:  GEDLAGTALVVLGLLARFEEGLRALIKTDQIVISMFNVLKGRCMLSKEGATEILLRLFDES
        G  +   A+ VL  LA   EG  A+ +   I + +  V++      KE A   LL+L   S
Subjt:  GEDLAGTALVVLGLLARFEEGLRALIKTDQIVISMFNVLKGRCMLSKEGATEILLRLFDES

Q681N2 U-box domain-containing protein 154.8e-0925.94Show/hide
Query:  SDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKLLASMETIYHLNTLVSLGSPETVKLASSLICS
        S  LE Q ++++ + ++ + +P  R L+    G+I +L+ L     S IQ  +++ L NLS++   KKL+++   I ++  ++  G+ E  + +++ + S
Subjt:  SDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKLLASMETIYHLNTLVSLGSPETVKLASSLICS

Query:  LAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDLAGTALVVLGLLARFEEGLRALIKTDQI
        L+MLD+NK   G++  I  LV  L   T+      L +L  L     N   A+ +G ++ L+++++      +   AL +L LLA   EG +A+ +    
Subjt:  LAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDLAGTALVVLGLLARFEEGLRALIKTDQI

Query:  VISMFNVLKGRCMLSKEGATEILLRLFDESEGCLSDALR
        + ++   ++     +KE AT +LL L   +   +  AL+
Subjt:  VISMFNVLKGRCMLSKEGATEILLRLFDESEGCLSDALR

Q8GUG9 U-box domain-containing protein 111.6e-0925.5Show/hide
Query:  LLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKLLASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTV
        +L+   G+I +L++L  S     Q  +++ + NLS+  + K+L+     +  +  ++  G+ E  + A++ + SL++ D+NK   G +G I  LV  L  
Subjt:  LLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKLLASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTV

Query:  PTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDLAGTALVVLGLLARFEEGLRALIKTDQIVISMFNVLKGRCMLSKEGATEILLRL
         T         +L  L  +HGN   AVR+G +  L+ ++  ++   +   AL +L +LA  ++   A++K + +  ++  +L+     ++E A  ILL L
Subjt:  PTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDLAGTALVVLGLLARFEEGLRALIKTDQIVISMFNVLKGRCMLSKEGATEILLRL

Q8VZ40 U-box domain-containing protein 142.8e-0926.48Show/hide
Query:  GSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKLLASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAA
        G+I +L+ L  S     Q  S++ L NLS+N   K  +     I  +  ++  GS E  + A++ + SL+++D+NK   G AG IQ L+  L   T    
Subjt:  GSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKLLASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAA

Query:  HHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDLAGTALVVLGLLARFEEGLRALIKTDQIVISMFNVLKGRCMLSKEGATEILLRLFDESEG
             ++  L  + GN + AV+ G +  L  +++   G  +   AL +L +L+  +EG  A+ + + I + +  +++     ++E A  IL  L      
Subjt:  HHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDLAGTALVVLGLLARFEEGLRALIKTDQIVISMFNVLKGRCMLSKEGATEILLRLFDESEG

Query:  CLSDALRLPEFLGVVADLS
        C+ +  RL     V AD++
Subjt:  CLSDALRLPEFLGVVADLS

Q9C9A6 U-box domain-containing protein 102.4e-0825.89Show/hide
Query:  HSSSSMRSMSVTTAHKTITECIAGARSDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSST-IQSLSLSILFNLSLNHDMKKLLASM
        +S  S R +S   +   I   +    S ++E +  A+  +  +++ S   R L+ +  G+I +L+ L  S   T  Q  +++ + NLS+    K+L+   
Subjt:  HSSSSMRSMSVTTAHKTITECIAGARSDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSST-IQSLSLSILFNLSLNHDMKKLLASM

Query:  ETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDL
          +  +  ++  GS E  + A++ + SL++ D+NK   G +G I  LV  L   +V        +L  L  + GN   AVR+G ++ L+ ++  +S E +
Subjt:  ETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDL

Query:  AGTALVVLGLLARFEEGLRALIKTDQIVISMFNVLKGRCMLSKEGATEILLRLFDESEGCLSDALRLPEF--LGVVADLSTL
        A  AL +L +LA  +    A+++ + I   + + L+     ++E A  ILL L      C  D  +L     LG V  L  L
Subjt:  AGTALVVLGLLARFEEGLRALIKTDQIVISMFNVLKGRCMLSKEGATEILLRLFDESEGCLSDALRLPEF--LGVVADLSTL

Arabidopsis top hitse value%identityAlignment
AT1G23030.1 ARM repeat superfamily protein1.2e-1025.5Show/hide
Query:  LLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKLLASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTV
        +L+   G+I +L++L  S     Q  +++ + NLS+  + K+L+     +  +  ++  G+ E  + A++ + SL++ D+NK   G +G I  LV  L  
Subjt:  LLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKLLASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTV

Query:  PTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDLAGTALVVLGLLARFEEGLRALIKTDQIVISMFNVLKGRCMLSKEGATEILLRL
         T         +L  L  +HGN   AVR+G +  L+ ++  ++   +   AL +L +LA  ++   A++K + +  ++  +L+     ++E A  ILL L
Subjt:  PTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDLAGTALVVLGLLARFEEGLRALIKTDQIVISMFNVLKGRCMLSKEGATEILLRL

AT2G23140.1 RING/U-box superfamily protein with ARM repeat domain8.1e-1228.35Show/hide
Query:  VASFHSSSSMRSMS-VTTAHKTITECIAGARSDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKL
        + S  S+ + R +S V T  K + E +   +S +L+ Q +A   L ++ + + +   +++   G+I +L+ L  S+ S  Q  +++ L NLS+N + KK 
Subjt:  VASFHSSSSMRSMS-VTTAHKTITECIAGARSDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKL

Query:  LASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTS
        +A    I  L  ++  GS E  + +++ + SL+++++NK K G +G I  LV  L   T         +L  L     N  + V+SGA+R LID+++  +
Subjt:  LASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTS

Query:  GEDLAGTALVVLGLLARFEEGLRALIKTDQIVISMFNVLKGRCMLSKEGATEILLRLFDES
        G  +   A+ VL  LA   EG  A+ +   I + +  V++      KE A   LL+L   S
Subjt:  GEDLAGTALVVLGLLARFEEGLRALIKTDQIVISMFNVLKGRCMLSKEGATEILLRLFDES

AT2G23140.2 RING/U-box superfamily protein with ARM repeat domain8.1e-1228.35Show/hide
Query:  VASFHSSSSMRSMS-VTTAHKTITECIAGARSDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKL
        + S  S+ + R +S V T  K + E +   +S +L+ Q +A   L ++ + + +   +++   G+I +L+ L  S+ S  Q  +++ L NLS+N + KK 
Subjt:  VASFHSSSSMRSMS-VTTAHKTITECIAGARSDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKL

Query:  LASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTS
        +A    I  L  ++  GS E  + +++ + SL+++++NK K G +G I  LV  L   T         +L  L     N  + V+SGA+R LID+++  +
Subjt:  LASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTS

Query:  GEDLAGTALVVLGLLARFEEGLRALIKTDQIVISMFNVLKGRCMLSKEGATEILLRLFDES
        G  +   A+ VL  LA   EG  A+ +   I + +  V++      KE A   LL+L   S
Subjt:  GEDLAGTALVVLGLLARFEEGLRALIKTDQIVISMFNVLKGRCMLSKEGATEILLRLFDES

AT3G54850.1 plant U-box 142.0e-1026.48Show/hide
Query:  GSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKLLASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAA
        G+I +L+ L  S     Q  S++ L NLS+N   K  +     I  +  ++  GS E  + A++ + SL+++D+NK   G AG IQ L+  L   T    
Subjt:  GSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKLLASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAA

Query:  HHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDLAGTALVVLGLLARFEEGLRALIKTDQIVISMFNVLKGRCMLSKEGATEILLRLFDESEG
             ++  L  + GN + AV+ G +  L  +++   G  +   AL +L +L+  +EG  A+ + + I + +  +++     ++E A  IL  L      
Subjt:  HHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDLAGTALVVLGLLARFEEGLRALIKTDQIVISMFNVLKGRCMLSKEGATEILLRLFDESEG

Query:  CLSDALRLPEFLGVVADLS
        C+ +  RL     V AD++
Subjt:  CLSDALRLPEFLGVVADLS

AT5G25500.1 unknown protein6.7e-9948.02Show/hide
Query:  QSNSMLYHCAEQSSADQEVLPSEWYESASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLIGSPSTQRRLTEMAVSSSINCQP
        +S  +LYH +  S  D  VLP EWYE+    +KKL+ +L++VDL+DG+L ++N    + D+ I +KM  FK+L R+ IGSPS Q++L E           
Subjt:  QSNSMLYHCAEQSSADQEVLPSEWYESASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLIGSPSTQRRLTEMAVSSSINCQP

Query:  QAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELTSELDPLF-HQSTNKGIKMGNQIVSSCLKFLDD--VTNSN
          +F + SEREP+VV+SLTKV NFLNVSAQQRKLVR T+C QVTQ+ IW G L+ +L  L  E+D L  H+  ++G  +  Q++ SCL+FL +  V+   
Subjt:  QAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELTSELDPLF-HQSTNKGIKMGNQIVSSCLKFLDD--VTNSN

Query:  AHYTSWMRPAPLRAVADSSAPPRWEDMLEMFTDLISYLK--DEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQKKLSKTLGHSSRCLF
           TSWMRP P R  A ++A  +WED+L+M  DL  YL+  +E  +++++ KL  MKEGL QIKDV  D +IG++E  HQE LV +KLSK LG  S CLF
Subjt:  AHYTSWMRPAPLRAVADSSAPPRWEDMLEMFTDLISYLK--DEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQKKLSKTLGHSSRCLF

Query:  TLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVGAEDRQLSYKENAYLLH
         L++Y+L+G  RDIEVDLCGG  K + ++   L MGR+L+S +EK++  G++ L R +GLF+FVWETAGMK  L LQGHL+C+GAE+R ++Y+   + +H
Subjt:  TLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVGAEDRQLSYKENAYLLH

Query:  EINL
        +++L
Subjt:  EINL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGTTGCATCTTTCCACTCATCATCTTCCATGAGATCCATGAGTGTTACTACTGCACATAAAACTATAACAGAGTGTATAGCTGGCGCTCGGTCAGATGCTCTCGA
AGTTCAGGAAAAGGCTCTTCAAAACTTGGTCATCATTACTCAGGTTAGTCCTCTATACAGGAACTTGCTTGTACAAGTAGATGGATCAATATCAATTCTTATTTCTCTCT
CCAAATCATCTTCTTCAACCATCCAATCTCTTTCATTGTCAATTCTTTTCAATCTCTCTCTGAACCATGACATGAAAAAGCTTCTTGCATCAATGGAAACCATCTACCAT
CTCAATACACTTGTATCCCTAGGCTCACCGGAAACCGTCAAGCTAGCATCCTCGTTGATTTGCAGCCTTGCAATGCTAGACAAAAACAAAGCTAAGTTTGGTGTAGCAGG
GACAATACAGTTATTGGTTAGAGCACTTACGGTCCCTACTGTTCCTGCTGCCCATCACCTTCTCTGTTCTTTAGCCGAACTAGGCCAGTTTCATGGAAACTGCACTGTGG
CAGTTCGATCGGGAGCCATCCGAGTTCTTATCGATGTAGTGGAGAGTACTAGTGGGGAGGATCTTGCCGGCACTGCTCTTGTTGTTCTCGGTCTCTTGGCTAGATTTGAG
GAAGGGTTGAGAGCTTTGATAAAAACTGATCAGATTGTAATTTCGATGTTTAATGTGCTGAAAGGAAGGTGTATGCTCAGCAAAGAAGGTGCAACCGAGATCCTTTTGCG
ATTGTTCGACGAAAGTGAAGGTTGTCTGAGTGATGCTTTGAGGTTGCCAGAGTTTTTGGGTGTTGTTGCTGATCTTTCGACATTGTTCTTCATCTCTCGCCTGAAGCATC
TCAAGAACACGAGATTTGGAGCTTTGCAATCAAATTCTATGTTGTATCATTGCGCAGAGCAATCCTCCGCCGATCAAGAGGTGTTACCATCTGAATGGTACGAGAGTGCT
TCTCGGAAGATAAAGAAATTGAGCTGCTCGTTGAAGAATGTGGATTTGATCGATGGACGACTTGTTAATGTTAATGATGATTCGACCATTATCGACGAGCGAATTGAACA
AAAAATGCACACTTTCAAGGCCCTTGTAAGAGTCTTGATTGGTTCTCCATCGACTCAGAGGAGATTAACAGAGATGGCTGTATCGAGTTCAATAAATTGCCAGCCTCAGG
CATGGTTTAGAAATTCAAGTGAACGAGAGCCGATGGTTGTTGATTCACTCACCAAGGTCAGCAACTTCCTCAACGTCTCTGCCCAACAAAGGAAACTGGTGCGCCATACC
ATATGCCCACAGGTTACACAACATCACATTTGGACTGGTGCATTGGATCATATGCTGAAAGAGTTAACTTCAGAGTTGGATCCATTATTTCATCAGTCAACCAACAAAGG
GATCAAAATGGGCAATCAGATTGTTTCAAGTTGCCTAAAGTTTTTGGATGATGTTACCAATTCAAATGCTCACTACACTTCGTGGATGCGGCCAGCACCATTACGAGCTG
TTGCAGATTCATCTGCACCGCCAAGATGGGAAGACATGCTTGAGATGTTCACCGATCTGATCAGCTATCTGAAAGATGAGAAATATTTGGTCCATTATGTGACAAAGCTT
GAAGTTATGAAAGAGGGGCTTTCCCAGATCAAAGATGTATTGACTGATAAAAGCATTGGATACAAGGAAGCCAGTCATCAAGAAAGCTTGGTGCAGAAGAAGCTTTCAAA
GACACTGGGCCACTCATCCAGGTGCTTGTTCACTCTTTTACTATACTATCTTTTTGGGCATTTTAGGGATATTGAAGTGGATCTTTGTGGTGGGTTGTTGAAGGCTGATG
GGAATGACAAGTTTTTGTTGTTCATGGGGAGGGTCTTGAGTTCTGATGAAGAGAAAATTGTTTGGAATGGGGTGAGGCATCTTCATAGAGTTATGGGGCTTTTTAAATTT
GTTTGGGAAACAGCTGGAATGAAGGGAGAATTGGAATTGCAAGGCCATTTATTTTGTGTTGGGGCTGAGGATAGGCAGCTTAGTTATAAAGAAAATGCTTATTTATTACA
TGAGATCAATTTATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCAGTTGCATCTTTCCACTCATCATCTTCCATGAGATCCATGAGTGTTACTACTGCACATAAAACTATAACAGAGTGTATAGCTGGCGCTCGGTCAGATGCTCTCGA
AGTTCAGGAAAAGGCTCTTCAAAACTTGGTCATCATTACTCAGGTTAGTCCTCTATACAGGAACTTGCTTGTACAAGTAGATGGATCAATATCAATTCTTATTTCTCTCT
CCAAATCATCTTCTTCAACCATCCAATCTCTTTCATTGTCAATTCTTTTCAATCTCTCTCTGAACCATGACATGAAAAAGCTTCTTGCATCAATGGAAACCATCTACCAT
CTCAATACACTTGTATCCCTAGGCTCACCGGAAACCGTCAAGCTAGCATCCTCGTTGATTTGCAGCCTTGCAATGCTAGACAAAAACAAAGCTAAGTTTGGTGTAGCAGG
GACAATACAGTTATTGGTTAGAGCACTTACGGTCCCTACTGTTCCTGCTGCCCATCACCTTCTCTGTTCTTTAGCCGAACTAGGCCAGTTTCATGGAAACTGCACTGTGG
CAGTTCGATCGGGAGCCATCCGAGTTCTTATCGATGTAGTGGAGAGTACTAGTGGGGAGGATCTTGCCGGCACTGCTCTTGTTGTTCTCGGTCTCTTGGCTAGATTTGAG
GAAGGGTTGAGAGCTTTGATAAAAACTGATCAGATTGTAATTTCGATGTTTAATGTGCTGAAAGGAAGGTGTATGCTCAGCAAAGAAGGTGCAACCGAGATCCTTTTGCG
ATTGTTCGACGAAAGTGAAGGTTGTCTGAGTGATGCTTTGAGGTTGCCAGAGTTTTTGGGTGTTGTTGCTGATCTTTCGACATTGTTCTTCATCTCTCGCCTGAAGCATC
TCAAGAACACGAGATTTGGAGCTTTGCAATCAAATTCTATGTTGTATCATTGCGCAGAGCAATCCTCCGCCGATCAAGAGGTGTTACCATCTGAATGGTACGAGAGTGCT
TCTCGGAAGATAAAGAAATTGAGCTGCTCGTTGAAGAATGTGGATTTGATCGATGGACGACTTGTTAATGTTAATGATGATTCGACCATTATCGACGAGCGAATTGAACA
AAAAATGCACACTTTCAAGGCCCTTGTAAGAGTCTTGATTGGTTCTCCATCGACTCAGAGGAGATTAACAGAGATGGCTGTATCGAGTTCAATAAATTGCCAGCCTCAGG
CATGGTTTAGAAATTCAAGTGAACGAGAGCCGATGGTTGTTGATTCACTCACCAAGGTCAGCAACTTCCTCAACGTCTCTGCCCAACAAAGGAAACTGGTGCGCCATACC
ATATGCCCACAGGTTACACAACATCACATTTGGACTGGTGCATTGGATCATATGCTGAAAGAGTTAACTTCAGAGTTGGATCCATTATTTCATCAGTCAACCAACAAAGG
GATCAAAATGGGCAATCAGATTGTTTCAAGTTGCCTAAAGTTTTTGGATGATGTTACCAATTCAAATGCTCACTACACTTCGTGGATGCGGCCAGCACCATTACGAGCTG
TTGCAGATTCATCTGCACCGCCAAGATGGGAAGACATGCTTGAGATGTTCACCGATCTGATCAGCTATCTGAAAGATGAGAAATATTTGGTCCATTATGTGACAAAGCTT
GAAGTTATGAAAGAGGGGCTTTCCCAGATCAAAGATGTATTGACTGATAAAAGCATTGGATACAAGGAAGCCAGTCATCAAGAAAGCTTGGTGCAGAAGAAGCTTTCAAA
GACACTGGGCCACTCATCCAGGTGCTTGTTCACTCTTTTACTATACTATCTTTTTGGGCATTTTAGGGATATTGAAGTGGATCTTTGTGGTGGGTTGTTGAAGGCTGATG
GGAATGACAAGTTTTTGTTGTTCATGGGGAGGGTCTTGAGTTCTGATGAAGAGAAAATTGTTTGGAATGGGGTGAGGCATCTTCATAGAGTTATGGGGCTTTTTAAATTT
GTTTGGGAAACAGCTGGAATGAAGGGAGAATTGGAATTGCAAGGCCATTTATTTTGTGTTGGGGCTGAGGATAGGCAGCTTAGTTATAAAGAAAATGCTTATTTATTACA
TGAGATCAATTTATAA
Protein sequenceShow/hide protein sequence
MSVASFHSSSSMRSMSVTTAHKTITECIAGARSDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKLLASMETIYH
LNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDLAGTALVVLGLLARFE
EGLRALIKTDQIVISMFNVLKGRCMLSKEGATEILLRLFDESEGCLSDALRLPEFLGVVADLSTLFFISRLKHLKNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYESA
SRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERIEQKMHTFKALVRVLIGSPSTQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHT
ICPQVTQHHIWTGALDHMLKELTSELDPLFHQSTNKGIKMGNQIVSSCLKFLDDVTNSNAHYTSWMRPAPLRAVADSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKL
EVMKEGLSQIKDVLTDKSIGYKEASHQESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKF
VWETAGMKGELELQGHLFCVGAEDRQLSYKENAYLLHEINL