CuGenDBv2

CaUC05G096480 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC05G096480
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: U-box domain-containing protein 1-like
Genome location: Ciama_Chr05:27699646..27704854
RNA-Seq Expression: CaUC05G096480
Synteny: CaUC05G096480
Gene Ontology terms: NA
InterPro domains: IPR011989 - Armadillo-like helical; IPR016024 - Armadillo-type fold
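
The genome location string above can be split into a sequence name and a coordinate pair. The following is a minimal Python sketch, not part of the database record; the names `GenomeLocation` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A location such as 'Ciama_Chr05:27699646..27704854'."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# seqid, a colon, then start..end as plain integers
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse 'seqid:start..end' into a GenomeLocation."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return GenomeLocation(match["seqid"], int(match["start"]), int(match["end"]))


loc = parse_location("Ciama_Chr05:27699646..27704854")
print(loc.seqid, loc.start, loc.end, loc.length)
# prints: Ciama_Chr05 27699646 27704854 5209
```

Under that assumption the gene spans 5209 bp on Ciama_Chr05.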


Homology
GenBank top hits (e value, % identity, alignment)
XP_004136473.1 uncharacterized protein LOC101217803 [Cucumis sativus] | e value: 1.4e-194 | % identity: 82.14
Query:  LSRLFFISRLKHLNNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLIG
        L RLFFISRLKHL NTR GA QSNSMLYH AE SSA QEVLPSEWYE A  KIKKLSC L+NVDL+DGR+VN +DDSTI DERIEQ+M TFK+LVR+LIG
Subjt:  LSRLFFISRLKHLNNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLIG

Query:  SPSTQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELNSELDPLFHQSTNKGIKMG
        SPS QRR+TE+A SSSINCQP AWFRNSSERE MVVDSLTKV N L V+ QQRKLVRHTICPQVTQHHIWTGALD +LKELN EL PL H+ST+KGIKM 
Subjt:  SPSTQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELNSELDPLFHQSTNKGIKMG

Query:  NQIVSSCLKFLDDVTNSNAHYTSWMRPAPLRAVVDSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLV
         QIVSSCLKFLD  TNSN H++SW+RPAP R VV SS PPRWEDMLEMF DLI YLKDEK LVHYVTKLEVMKEGLSQIKDV +D+SIG++EA  QESLV
Subjt:  NQIVSSCLKFLDDVTNSNAHYTSWMRPAPLRAVVDSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLV

Query:  QKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVG
        QKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVD CGGLLK DGNDKFLLFMGRVLS DEEKIVWNGVR L R MG+FK VWETAGMKGEL L+GHLFCVG
Subjt:  QKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVG

Query:  AEDRQLSYKENAYLLHEINL
         E RQLSYK NAYLLHEI L
Subjt:  AEDRQLSYKENAYLLHEINL

XP_008466389.1 PREDICTED: uncharacterized protein LOC103503810 [Cucumis melo] | e value: 4.1e-191 | % identity: 81.24
Query:  LSRLFFISRLKHLNNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLIG
        L R FFISRLKHL +TR GA QSNSMLYH  EQSS DQEVLPSEWYE A  KIKKLSC L+NVDL+DGR+VN +DDSTI+DERIEQKM TFK+LVR+LIG
Subjt:  LSRLFFISRLKHLNNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLIG

Query:  SPSTQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELNSELDPLFHQSTNKGIKMG
        SPS QRR+TEMA SSSIN Q  AWFRNSSERE MVVDSLTK  NFL V+ QQRKL+RHTICPQ+TQHHIWTGALD +LKELN EL PL ++STNKGI M 
Subjt:  SPSTQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELNSELDPLFHQSTNKGIKMG

Query:  NQIVSSCLKFLDDVTNSNAHYTS-WMRPAPLRAVVDSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESL
         QIVSSCLKFLDD TNSN H+TS W+RPAP R +V+SS PPRWEDMLEMF DLI YLKDEK LVHYVTKLEVMKEGLSQIKDV +D+SIG+KEA  QESL
Subjt:  NQIVSSCLKFLDDVTNSNAHYTS-WMRPAPLRAVVDSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESL

Query:  VQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCV
        VQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVD CGGLLK DGNDKFLLFMGRVLS DEEKIVWNGVR L R MG+FK VWETAGMKGEL LQGHLFCV
Subjt:  VQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCV

Query:  GAEDRQLSYKENAYLLHEINL
          E RQLSYK NAYLLHEI L
Subjt:  GAEDRQLSYKENAYLLHEINL

XP_022940414.1 uncharacterized protein LOC111446029 [Cucurbita moschata] | e value: 3.8e-189 | % identity: 79.95
Query:  RLFFISRLKHLNNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLIGSP
        RL F+ R+ HLN TR  A  SN MLYHC E S  D E LP++WYE A  KIKKLSCSLKNVDLIDGRLVNVNDDSTI+DERIEQ+M  FK+LVRV IGSP
Subjt:  RLFFISRLKHLNNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLIGSP

Query:  STQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELNSELDPLFHQSTNKGIKMGNQ
        S QRR+TEMA S++ N QPQ  FRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQ TQHHIWTGALDH+LKEL  ELDPL H S NKGIKMG Q
Subjt:  STQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELNSELDPLFHQSTNKGIKMGNQ

Query:  IVSSCLKFLDDVTNSNAHYTSWMRPAPLRAVVDSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQK
        IVSSCL FL+D TNSNAH TSWMRPAPL+  VDSS  P+WEDMLEMFTDLIS LKDEK L  YVTKLEVMKEGL+QI+DVLTDKSIG+KEA HQESLVQK
Subjt:  IVSSCLKFLDDVTNSNAHYTSWMRPAPLRAVVDSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQK

Query:  KLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKA-DGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVGA
        KLSKTLGHSSRCLFTLLLYYLFGHFRD+EVDLCGGLLKA +  +K+L+FMGR+LS DEE++VWNGVR L R MGLFKFVWETAGMKG+L LQGHLFCVGA
Subjt:  KLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKA-DGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVGA

Query:  EDRQLSYKENAYLLHEINL
        EDRQLSYK N YLLH+I+L
Subjt:  EDRQLSYKENAYLLHEINL

XP_022982191.1 uncharacterized protein LOC111481093 [Cucurbita maxima] | e value: 1.3e-189 | % identity: 80.14
Query:  RLFFISRLKHLNNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLIGSP
        RL F+ R+ HLN TR  AL SN MLYHC+E S  DQE LP++WYE A  KIKKLSCSLKNVDLIDGRLVNVNDDSTI+DERIEQ+M  FK+LVRV IGS 
Subjt:  RLFFISRLKHLNNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLIGSP

Query:  STQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELNSELDPLFHQSTNKGIKMGNQ
        S QRR+TEMA S++IN QPQA FRNSSEREPMVVDS TKVSNFLNVSAQQRKLVRHTICPQ TQHHIWTGALDH+LKEL  ELDPL H S NKGIKMG Q
Subjt:  STQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELNSELDPLFHQSTNKGIKMGNQ

Query:  IVSSCLKFLDDVTNSNAHYTSWMRPAPLRAVVDSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQK
        IVSSCLKFL+D TNSNAH TSWMRPAPL+  VDSS  P+WEDMLEMFTDLI  LKDEK L  YVTKLEVMKEGL+QI+DVL DKSIG+KEA HQESLVQK
Subjt:  IVSSCLKFLDDVTNSNAHYTSWMRPAPLRAVVDSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQK

Query:  KLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVGAE
        KLSKTLGHSSRCLFTLLLYYLFGHFRD+EVDLCGGLLKA   +K+L+FMGR+LS DEE+ VWNGVR L R MGLFKFVWETAGMKG+L L+GHLFCVGAE
Subjt:  KLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVGAE

Query:  DRQLSYKENAYLLHEINL
        DRQLSYK N YL+HEI+L
Subjt:  DRQLSYKENAYLLHEINL

XP_038896888.1 uncharacterized protein LOC120085101 isoform X1 [Benincasa hispida] | e value: 2.3e-218 | % identity: 90.76
Query:  DLSRLFFISRLKHLNNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLI
        +L RLFFISRLKHLNNTRFGALQSNSMLYHCAE SSADQEVLPSEWYENA RKIKKLSCSLKNVDLIDGRLVNVNDDSTI+DE IEQ+M TFK+LV VLI
Subjt:  DLSRLFFISRLKHLNNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLI

Query:  GSPSTQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELNSELDPLFHQSTNKGIKM
        GSP+ +RR+TEMAVSSSI CQP AWFRN SEREPM+VDSLTK+SNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELN EL PL  QSTNKGIKM
Subjt:  GSPSTQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELNSELDPLFHQSTNKGIKM

Query:  GNQIVSSCLKFLDDVTNSNAHYTSWMRPAPLR-AVVDSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQES
        G+QIVSSCLKFLDD TNSNAH+TSWMRPAPLR AVVDSSAPPRWEDMLEMFTDLI  LK+EK LVHYVTKL+VMKEGLSQIKDVLTDKSIGYKEASHQES
Subjt:  GNQIVSSCLKFLDDVTNSNAHYTSWMRPAPLR-AVVDSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQES

Query:  LVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFC
        LVQKKLSKTLGHSSRCLFTLLLYY+FGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNG+R L RVMGLFKFVWETAGMKG+LELQGHLFC
Subjt:  LVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFC

Query:  VGAEDRQLSYKENAYLLHEINL
        VG EDRQLSYK NAYLLHEINL
Subjt:  VGAEDRQLSYKENAYLLHEINL
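
The middle line of each alignment block follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough illustration of how a per-block identity can be read off that line, here is a small Python sketch. The function name `identity_stats` is hypothetical, and the result covers only the one block passed in, so it will not in general equal the whole-alignment % identity reported in the table.

```python
from typing import Tuple


def identity_stats(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identical, positives, aligned_length) for one alignment block.

    The midline is padded to the query length because trailing spaces
    (mismatches at the end of a block) are easily lost when text is copied.
    """
    length = len(query)
    midline = midline.ljust(length)[:length]
    identical = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    positives = identical + midline.count("+")        # '+' = conservative substitution
    return identical, positives, length


# Final block of the Benincasa hispida hit (XP_038896888.1) shown above.
query = "VGAEDRQLSYKENAYLLHEINL"
midline = "VG EDRQLSYK NAYLLHEINL"

identical, positives, length = identity_stats(query, midline)
print(f"{identical}/{length} identical ({100 * identical / length:.1f}%)")
# prints: 20/22 identical (90.9%)
```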

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LED2 Uncharacterized protein | e value: 6.6e-195 | % identity: 82.14
Query:  LSRLFFISRLKHLNNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLIG
        L RLFFISRLKHL NTR GA QSNSMLYH AE SSA QEVLPSEWYE A  KIKKLSC L+NVDL+DGR+VN +DDSTI DERIEQ+M TFK+LVR+LIG
Subjt:  LSRLFFISRLKHLNNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLIG

Query:  SPSTQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELNSELDPLFHQSTNKGIKMG
        SPS QRR+TE+A SSSINCQP AWFRNSSERE MVVDSLTKV N L V+ QQRKLVRHTICPQVTQHHIWTGALD +LKELN EL PL H+ST+KGIKM 
Subjt:  SPSTQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELNSELDPLFHQSTNKGIKMG

Query:  NQIVSSCLKFLDDVTNSNAHYTSWMRPAPLRAVVDSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLV
         QIVSSCLKFLD  TNSN H++SW+RPAP R VV SS PPRWEDMLEMF DLI YLKDEK LVHYVTKLEVMKEGLSQIKDV +D+SIG++EA  QESLV
Subjt:  NQIVSSCLKFLDDVTNSNAHYTSWMRPAPLRAVVDSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLV

Query:  QKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVG
        QKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVD CGGLLK DGNDKFLLFMGRVLS DEEKIVWNGVR L R MG+FK VWETAGMKGEL L+GHLFCVG
Subjt:  QKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVG

Query:  AEDRQLSYKENAYLLHEINL
         E RQLSYK NAYLLHEI L
Subjt:  AEDRQLSYKENAYLLHEINL

A0A1S3CR45 uncharacterized protein LOC103503810 | e value: 2.0e-191 | % identity: 81.24
Query:  LSRLFFISRLKHLNNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLIG
        L R FFISRLKHL +TR GA QSNSMLYH  EQSS DQEVLPSEWYE A  KIKKLSC L+NVDL+DGR+VN +DDSTI+DERIEQKM TFK+LVR+LIG
Subjt:  LSRLFFISRLKHLNNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLIG

Query:  SPSTQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELNSELDPLFHQSTNKGIKMG
        SPS QRR+TEMA SSSIN Q  AWFRNSSERE MVVDSLTK  NFL V+ QQRKL+RHTICPQ+TQHHIWTGALD +LKELN EL PL ++STNKGI M 
Subjt:  SPSTQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELNSELDPLFHQSTNKGIKMG

Query:  NQIVSSCLKFLDDVTNSNAHYTS-WMRPAPLRAVVDSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESL
         QIVSSCLKFLDD TNSN H+TS W+RPAP R +V+SS PPRWEDMLEMF DLI YLKDEK LVHYVTKLEVMKEGLSQIKDV +D+SIG+KEA  QESL
Subjt:  NQIVSSCLKFLDDVTNSNAHYTS-WMRPAPLRAVVDSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESL

Query:  VQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCV
        VQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVD CGGLLK DGNDKFLLFMGRVLS DEEKIVWNGVR L R MG+FK VWETAGMKGEL LQGHLFCV
Subjt:  VQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCV

Query:  GAEDRQLSYKENAYLLHEINL
          E RQLSYK NAYLLHEI L
Subjt:  GAEDRQLSYKENAYLLHEINL

A0A6J1CBU2 uncharacterized protein LOC111010055 isoform X1 | e value: 2.8e-185 | % identity: 78.77
Query:  LSRLFFISRLKHLNNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLIG
        LS+  FI RL HLNNTR+GAL SN MLYH AE SSADQE+LPSEWYENA RKI+KLSCSLKNVDLIDGRLVNV DDSTI DERIEQ+M  FK+LVRV +G
Subjt:  LSRLFFISRLKHLNNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLIG

Query:  SPSTQRRLTE--MAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELNSELDPLFHQST-NKGI
        SPS +RR+TE  MA SS+ NCQP   F NSSEREPMVVDSLTK+SNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKEL  ELDPL HQST NKGI
Subjt:  SPSTQRRLTE--MAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELNSELDPLFHQST-NKGI

Query:  KMGNQIVSSCLKFLDDVTNSNAHYTSWMRPAPLRAVVDSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTD-KSIGYKEASHQ
        KMG QIVSSCLKFLDD TNSNAH+TSWMRPAP + VVD SA PRWEDMLEMF DLI  LK EK L+ +V KLEVMKEGLSQIKDVL+D KSIG+KE+ HQ
Subjt:  KMGNQIVSSCLKFLDDVTNSNAHYTSWMRPAPLRAVVDSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTD-KSIGYKEASHQ

Query:  ESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHL
        ESLVQ+KLSKTLGHSSRCLFTLL++YL+GH RDIEVD CGG+LK   N+KF L MGR+LS DEEK+VWNGV+ L R MG+FKFVWETAGMKG LELQGHL
Subjt:  ESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHL

Query:  FCVGAEDRQLSYKENAYLLHEINL
        + VGA+ RQLSYK NAY+LH+I L
Subjt:  FCVGAEDRQLSYKENAYLLHEINL

A0A6J1FK17 uncharacterized protein LOC111446029 | e value: 1.8e-189 | % identity: 79.95
Query:  RLFFISRLKHLNNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLIGSP
        RL F+ R+ HLN TR  A  SN MLYHC E S  D E LP++WYE A  KIKKLSCSLKNVDLIDGRLVNVNDDSTI+DERIEQ+M  FK+LVRV IGSP
Subjt:  RLFFISRLKHLNNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLIGSP

Query:  STQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELNSELDPLFHQSTNKGIKMGNQ
        S QRR+TEMA S++ N QPQ  FRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQ TQHHIWTGALDH+LKEL  ELDPL H S NKGIKMG Q
Subjt:  STQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELNSELDPLFHQSTNKGIKMGNQ

Query:  IVSSCLKFLDDVTNSNAHYTSWMRPAPLRAVVDSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQK
        IVSSCL FL+D TNSNAH TSWMRPAPL+  VDSS  P+WEDMLEMFTDLIS LKDEK L  YVTKLEVMKEGL+QI+DVLTDKSIG+KEA HQESLVQK
Subjt:  IVSSCLKFLDDVTNSNAHYTSWMRPAPLRAVVDSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQK

Query:  KLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKA-DGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVGA
        KLSKTLGHSSRCLFTLLLYYLFGHFRD+EVDLCGGLLKA +  +K+L+FMGR+LS DEE++VWNGVR L R MGLFKFVWETAGMKG+L LQGHLFCVGA
Subjt:  KLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKA-DGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVGA

Query:  EDRQLSYKENAYLLHEINL
        EDRQLSYK N YLLH+I+L
Subjt:  EDRQLSYKENAYLLHEINL

A0A6J1IW03 uncharacterized protein LOC111481093 | e value: 6.3e-190 | % identity: 80.14
Query:  RLFFISRLKHLNNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLIGSP
        RL F+ R+ HLN TR  AL SN MLYHC+E S  DQE LP++WYE A  KIKKLSCSLKNVDLIDGRLVNVNDDSTI+DERIEQ+M  FK+LVRV IGS 
Subjt:  RLFFISRLKHLNNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLIGSP

Query:  STQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELNSELDPLFHQSTNKGIKMGNQ
        S QRR+TEMA S++IN QPQA FRNSSEREPMVVDS TKVSNFLNVSAQQRKLVRHTICPQ TQHHIWTGALDH+LKEL  ELDPL H S NKGIKMG Q
Subjt:  STQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELNSELDPLFHQSTNKGIKMGNQ

Query:  IVSSCLKFLDDVTNSNAHYTSWMRPAPLRAVVDSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQK
        IVSSCLKFL+D TNSNAH TSWMRPAPL+  VDSS  P+WEDMLEMFTDLI  LKDEK L  YVTKLEVMKEGL+QI+DVL DKSIG+KEA HQESLVQK
Subjt:  IVSSCLKFLDDVTNSNAHYTSWMRPAPLRAVVDSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQK

Query:  KLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVGAE
        KLSKTLGHSSRCLFTLLLYYLFGHFRD+EVDLCGGLLKA   +K+L+FMGR+LS DEE+ VWNGVR L R MGLFKFVWETAGMKG+L L+GHLFCVGAE
Subjt:  KLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVGAE

Query:  DRQLSYKENAYLLHEINL
        DRQLSYK N YL+HEI+L
Subjt:  DRQLSYKENAYLLHEINL

SwissProt top hits (e value, % identity, alignment)
O22193 U-box domain-containing protein 4 | e value: 1.1e-10 | % identity: 28.35
Query:  VASFHSSSSMRSMS-VTTAHKTITECIAGARSDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKL
        + S  S+ + R +S V T  K + E +   +S +L+ Q +A   L ++ + + +   +++   G+I +L+ L  S+ S  Q  +++ L NLS+N + KK 
Subjt:  VASFHSSSSMRSMS-VTTAHKTITECIAGARSDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKL

Query:  LASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTS
        +A    I  L  ++  GS E  + +++ + SL+++++NK K G +G I  LV  L   T         +L  L     N  + V+SGA+R LID+++  +
Subjt:  LASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTS

Query:  GEDLAGTALVVLGLLARFEEGLRALIKTDRIVISMFNVLKGRCMLSKEGATEILLRLFDES
        G  +   A+ VL  LA   EG  A+ +   I + +  V++      KE A   LL+L   S
Subjt:  GEDLAGTALVVLGLLARFEEGLRALIKTDRIVISMFNVLKGRCMLSKEGATEILLRLFDES

Q681N2 U-box domain-containing protein 15 | e value: 3.7e-09 | % identity: 25.94
Query:  SDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKLLASMETIYHLNTLVSLGSPETVKLASSLICS
        S  LE Q ++++ + ++ + +P  R L+    G+I +L+ L     S IQ  +++ L NLS++   KKL+++   I ++  ++  G+ E  + +++ + S
Subjt:  SDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKLLASMETIYHLNTLVSLGSPETVKLASSLICS

Query:  LAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDLAGTALVVLGLLARFEEGLRALIKTDRI
        L+MLD+NK   G++  I  LV  L   T+      L +L  L     N   A+ +G ++ L+++++      +   AL +L LLA   EG +A+ +    
Subjt:  LAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDLAGTALVVLGLLARFEEGLRALIKTDRI

Query:  VISMFNVLKGRCMLSKEGATEILLRLFDESEGCLSDALR
        + ++   ++     +KE AT +LL L   +   +  AL+
Subjt:  VISMFNVLKGRCMLSKEGATEILLRLFDESEGCLSDALR

Q8GUG9 U-box domain-containing protein 11 | e value: 2.1e-09 | % identity: 25.5
Query:  LLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKLLASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTV
        +L+   G+I +L++L  S     Q  +++ + NLS+  + K+L+     +  +  ++  G+ E  + A++ + SL++ D+NK   G +G I  LV  L  
Subjt:  LLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKLLASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTV

Query:  PTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDLAGTALVVLGLLARFEEGLRALIKTDRIVISMFNVLKGRCMLSKEGATEILLRL
         T         +L  L  +HGN   AVR+G +  L+ ++  ++   +   AL +L +LA  ++   A++K + +  ++  +L+     ++E A  ILL L
Subjt:  PTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDLAGTALVVLGLLARFEEGLRALIKTDRIVISMFNVLKGRCMLSKEGATEILLRL

Q8VZ40 U-box domain-containing protein 14 | e value: 3.7e-09 | % identity: 26.48
Query:  GSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKLLASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAA
        G+I +L+ L  S     Q  S++ L NLS+N   K  +     I  +  ++  GS E  + A++ + SL+++D+NK   G AG IQ L+  L   T    
Subjt:  GSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKLLASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAA

Query:  HHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDLAGTALVVLGLLARFEEGLRALIKTDRIVISMFNVLKGRCMLSKEGATEILLRLFDESEG
             ++  L  + GN + AV+ G +  L  +++   G  +   AL +L +L+  +EG  A+ + + I + +  +++     ++E A  IL  L      
Subjt:  HHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDLAGTALVVLGLLARFEEGLRALIKTDRIVISMFNVLKGRCMLSKEGATEILLRLFDESEG

Query:  CLSDALRLPEFLGVVADLS
        C+ +  RL     V AD++
Subjt:  CLSDALRLPEFLGVVADLS

Q9C9A6 U-box domain-containing protein 10 | e value: 1.8e-08 | % identity: 25.89
Query:  HSSSSMRSMSVTTAHKTITECIAGARSDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSST-IQSLSLSILFNLSLNHDMKKLLASM
        +S  S R +S   +   I   +    S ++E +  A+  +  +++ S   R L+ +  G+I +L+ L  S   T  Q  +++ + NLS+    K+L+   
Subjt:  HSSSSMRSMSVTTAHKTITECIAGARSDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSST-IQSLSLSILFNLSLNHDMKKLLASM

Query:  ETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDL
          +  +  ++  GS E  + A++ + SL++ D+NK   G +G I  LV  L   +V        +L  L  + GN   AVR+G ++ L+ ++  +S E +
Subjt:  ETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDL

Query:  AGTALVVLGLLARFEEGLRALIKTDRIVISMFNVLKGRCMLSKEGATEILLRLFDESEGCLSDALRLPEF--LGVVADLSRL
        A  AL +L +LA  +    A+++ + I   + + L+     ++E A  ILL L      C  D  +L     LG V  L  L
Subjt:  AGTALVVLGLLARFEEGLRALIKTDRIVISMFNVLKGRCMLSKEGATEILLRLFDESEGCLSDALRLPEF--LGVVADLSRL

Arabidopsis top hits (e value, % identity, alignment)
AT1G23030.1 ARM repeat superfamily protein | e value: 1.5e-10 | % identity: 25.5
Query:  LLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKLLASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTV
        +L+   G+I +L++L  S     Q  +++ + NLS+  + K+L+     +  +  ++  G+ E  + A++ + SL++ D+NK   G +G I  LV  L  
Subjt:  LLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKLLASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTV

Query:  PTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDLAGTALVVLGLLARFEEGLRALIKTDRIVISMFNVLKGRCMLSKEGATEILLRL
         T         +L  L  +HGN   AVR+G +  L+ ++  ++   +   AL +L +LA  ++   A++K + +  ++  +L+     ++E A  ILL L
Subjt:  PTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDLAGTALVVLGLLARFEEGLRALIKTDRIVISMFNVLKGRCMLSKEGATEILLRL

AT2G23140.1 RING/U-box superfamily protein with ARM repeat domain | e value: 8.1e-12 | % identity: 28.35
Query:  VASFHSSSSMRSMS-VTTAHKTITECIAGARSDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKL
        + S  S+ + R +S V T  K + E +   +S +L+ Q +A   L ++ + + +   +++   G+I +L+ L  S+ S  Q  +++ L NLS+N + KK 
Subjt:  VASFHSSSSMRSMS-VTTAHKTITECIAGARSDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKL

Query:  LASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTS
        +A    I  L  ++  GS E  + +++ + SL+++++NK K G +G I  LV  L   T         +L  L     N  + V+SGA+R LID+++  +
Subjt:  LASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTS

Query:  GEDLAGTALVVLGLLARFEEGLRALIKTDRIVISMFNVLKGRCMLSKEGATEILLRLFDES
        G  +   A+ VL  LA   EG  A+ +   I + +  V++      KE A   LL+L   S
Subjt:  GEDLAGTALVVLGLLARFEEGLRALIKTDRIVISMFNVLKGRCMLSKEGATEILLRLFDES

AT2G23140.2 RING/U-box superfamily protein with ARM repeat domain | e value: 8.1e-12 | % identity: 28.35
Query:  VASFHSSSSMRSMS-VTTAHKTITECIAGARSDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKL
        + S  S+ + R +S V T  K + E +   +S +L+ Q +A   L ++ + + +   +++   G+I +L+ L  S+ S  Q  +++ L NLS+N + KK 
Subjt:  VASFHSSSSMRSMS-VTTAHKTITECIAGARSDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKL

Query:  LASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTS
        +A    I  L  ++  GS E  + +++ + SL+++++NK K G +G I  LV  L   T         +L  L     N  + V+SGA+R LID+++  +
Subjt:  LASMETIYHLNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTS

Query:  GEDLAGTALVVLGLLARFEEGLRALIKTDRIVISMFNVLKGRCMLSKEGATEILLRLFDES
        G  +   A+ VL  LA   EG  A+ +   I + +  V++      KE A   LL+L   S
Subjt:  GEDLAGTALVVLGLLARFEEGLRALIKTDRIVISMFNVLKGRCMLSKEGATEILLRLFDES

AT5G25500.1 unknown protein | e value: 8.7e-99 | % identity: 47.57
Query:  NNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLIGSPSTQRRLTEMAV
        N +RF   +S  +LYH +  S  D  VLP EWYE     +KKL+ +L++VDL+DG+L ++N    + D+ I +KM  FK+L R+ IGSPS Q++L E   
Subjt:  NNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENASRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLIGSPSTQRRLTEMAV

Query:  SSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELNSELDPLF-HQSTNKGIKMGNQIVSSCLKFLD
                  +F + SEREP+VV+SLTKV NFLNVSAQQRKLVR T+C QVTQ+ IW G L+ +L  L  E+D L  H+  ++G  +  Q++ SCL+FL 
Subjt:  SSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELNSELDPLF-HQSTNKGIKMGNQIVSSCLKFLD

Query:  D--VTNSNAHYTSWMRPAPLRAVVDSSAPPRWEDMLEMFTDLISYLK--DEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQKKLSKTL
        +  V+      TSWMRP P R    ++A  +WED+L+M  DL  YL+  +E  +++++ KL  MKEGL QIKDV  D +IG++E  HQE LV +KLSK L
Subjt:  D--VTNSNAHYTSWMRPAPLRAVVDSSAPPRWEDMLEMFTDLISYLK--DEKYLVHYVTKLEVMKEGLSQIKDVLTDKSIGYKEASHQESLVQKKLSKTL

Query:  GHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVGAEDRQLSY
        G  S CLF L++Y+L+G  RDIEVDLCGG  K + ++   L MGR+L+S +EK++  G++ L R +GLF+FVWETAGMK  L LQGHL+C+GAE+R ++Y
Subjt:  GHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKFVWETAGMKGELELQGHLFCVGAEDRQLSY

Query:  KENAYLLHEINL
        +   + +H+++L
Subjt:  KENAYLLHEINL

AT5G42340.1 Plant U-Box 15 | e value: 2.6e-10 | % identity: 25.94
Query:  SDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKLLASMETIYHLNTLVSLGSPETVKLASSLICS
        S  LE Q ++++ + ++ + +P  R L+    G+I +L+ L     S IQ  +++ L NLS++   KKL+++   I ++  ++  G+ E  + +++ + S
Subjt:  SDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKLLASMETIYHLNTLVSLGSPETVKLASSLICS

Query:  LAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDLAGTALVVLGLLARFEEGLRALIKTDRI
        L+MLD+NK   G++  I  LV  L   T+      L +L  L     N   A+ +G ++ L+++++      +   AL +L LLA   EG +A+ +    
Subjt:  LAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDLAGTALVVLGLLARFEEGLRALIKTDRI

Query:  VISMFNVLKGRCMLSKEGATEILLRLFDESEGCLSDALR
        + ++   ++     +KE AT +LL L   +   +  AL+
Subjt:  VISMFNVLKGRCMLSKEGATEILLRLFDESEGCLSDALR


Sequences
CDS sequence
ATGTCAGTTGCATCTTTCCACTCATCATCTTCCATGAGATCCATGAGTGTTACTACTGCACATAAAACTATAACAGAGTGTATAGCTGGCGCTCGGTCGGATGCTCTCGA
AGTTCAGGAAAAGGCTCTTCAAAACTTGGTCATCATTACTCAGGTTAGTCCTCTATACAGGAACTTGCTTGTACAAGTAGATGGATCAATATCAATTCTTATTTCTCTCT
CCAAATCATCTTCTTCAACCATCCAATCTCTTTCATTGTCAATTCTTTTCAATCTCTCTCTGAACCATGACATGAAAAAGCTTCTTGCATCAATGGAAACCATCTACCAT
CTCAATACACTTGTATCCCTAGGCTCACCGGAAACCGTCAAGCTAGCATCCTCGTTGATTTGCAGCCTTGCAATGCTAGACAAAAACAAAGCTAAGTTTGGTGTAGCAGG
GACAATACAGTTATTGGTTAGAGCACTTACGGTCCCTACTGTTCCTGCTGCCCATCACCTTCTCTGTTCTTTAGCCGAACTAGGCCAGTTTCATGGAAACTGCACTGTGG
CAGTTCGATCGGGAGCCATCCGAGTTCTTATCGATGTAGTGGAGAGTACTAGTGGGGAGGATCTTGCCGGCACTGCTCTTGTTGTTCTCGGTCTCTTGGCTAGATTTGAG
GAAGGGTTGAGAGCTTTGATAAAAACTGATCGGATTGTAATTTCGATGTTTAATGTGCTGAAAGGAAGGTGTATGCTCAGCAAAGAAGGTGCAACCGAGATCCTTTTGCG
ATTGTTCGACGAAAGTGAAGGTTGTCTGAGTGATGCTTTGAGGTTGCCAGAGTTTTTGGGTGTTGTTGCTGATCTTTCGAGATTGTTCTTCATCTCTCGCCTGAAGCATC
TCAATAACACGAGATTTGGAGCTTTGCAATCAAATTCTATGTTGTATCATTGCGCAGAGCAATCCTCCGCCGATCAAGAGGTGTTACCATCTGAATGGTACGAGAATGCT
TCTCGGAAGATAAAGAAATTGAGTTGCTCGTTGAAGAATGTGGATTTGATCGATGGACGACTTGTTAATGTTAATGATGATTCGACCATTATGGACGAGCGAATTGAACA
AAAAATGCACACTTTCAAGGCCCTTGTAAGAGTCTTGATTGGTTCTCCATCGACTCAGAGGAGATTAACAGAGATGGCTGTATCGAGTTCAATAAATTGCCAGCCTCAGG
CATGGTTTAGAAATTCAAGTGAACGAGAGCCGATGGTTGTTGATTCACTCACCAAGGTCAGCAACTTCCTCAACGTCTCTGCCCAACAAAGGAAACTGGTGCGCCATACC
ATATGCCCACAGGTTACACAACATCACATTTGGACTGGTGCATTGGATCATATGCTGAAAGAGTTAAATTCGGAGTTGGATCCATTATTTCATCAGTCAACCAACAAAGG
GATCAAAATGGGCAATCAGATTGTTTCAAGTTGCCTAAAGTTTTTGGATGATGTTACCAATTCAAATGCTCACTACACTTCGTGGATGCGGCCAGCACCATTACGAGCTG
TTGTAGATTCATCTGCACCGCCAAGATGGGAAGACATGCTCGAGATGTTCACCGATCTGATCAGCTATCTGAAAGATGAGAAATATTTGGTCCATTATGTGACAAAGCTT
GAAGTTATGAAAGAGGGGCTTTCCCAGATCAAAGATGTATTGACTGATAAAAGCATTGGATACAAGGAAGCCAGTCATCAAGAAAGCTTGGTGCAGAAGAAGCTTTCAAA
GACACTGGGCCACTCATCCAGGTGCTTGTTCACTCTTTTACTATACTATCTTTTTGGGCATTTTAGGGATATTGAAGTGGATCTTTGTGGTGGGTTGTTGAAGGCTGATG
GGAATGACAAGTTTTTGTTGTTCATGGGGAGGGTCTTGAGTTCTGATGAAGAGAAAATTGTTTGGAATGGGGTGAGGCATCTTCATAGAGTTATGGGGCTTTTTAAATTT
GTTTGGGAAACAGCTGGAATGAAGGGAGAATTGGAATTGCAAGGCCATTTATTTTGTGTTGGGGCTGAGGATAGGCAGCTTAGTTATAAAGAAAATGCTTATTTATTACA
TGAGATCAATTTATAA
mRNA sequence
ATGTCAGTTGCATCTTTCCACTCATCATCTTCCATGAGATCCATGAGTGTTACTACTGCACATAAAACTATAACAGAGTGTATAGCTGGCGCTCGGTCGGATGCTCTCGA
AGTTCAGGAAAAGGCTCTTCAAAACTTGGTCATCATTACTCAGGTTAGTCCTCTATACAGGAACTTGCTTGTACAAGTAGATGGATCAATATCAATTCTTATTTCTCTCT
CCAAATCATCTTCTTCAACCATCCAATCTCTTTCATTGTCAATTCTTTTCAATCTCTCTCTGAACCATGACATGAAAAAGCTTCTTGCATCAATGGAAACCATCTACCAT
CTCAATACACTTGTATCCCTAGGCTCACCGGAAACCGTCAAGCTAGCATCCTCGTTGATTTGCAGCCTTGCAATGCTAGACAAAAACAAAGCTAAGTTTGGTGTAGCAGG
GACAATACAGTTATTGGTTAGAGCACTTACGGTCCCTACTGTTCCTGCTGCCCATCACCTTCTCTGTTCTTTAGCCGAACTAGGCCAGTTTCATGGAAACTGCACTGTGG
CAGTTCGATCGGGAGCCATCCGAGTTCTTATCGATGTAGTGGAGAGTACTAGTGGGGAGGATCTTGCCGGCACTGCTCTTGTTGTTCTCGGTCTCTTGGCTAGATTTGAG
GAAGGGTTGAGAGCTTTGATAAAAACTGATCGGATTGTAATTTCGATGTTTAATGTGCTGAAAGGAAGGTGTATGCTCAGCAAAGAAGGTGCAACCGAGATCCTTTTGCG
ATTGTTCGACGAAAGTGAAGGTTGTCTGAGTGATGCTTTGAGGTTGCCAGAGTTTTTGGGTGTTGTTGCTGATCTTTCGAGATTGTTCTTCATCTCTCGCCTGAAGCATC
TCAATAACACGAGATTTGGAGCTTTGCAATCAAATTCTATGTTGTATCATTGCGCAGAGCAATCCTCCGCCGATCAAGAGGTGTTACCATCTGAATGGTACGAGAATGCT
TCTCGGAAGATAAAGAAATTGAGTTGCTCGTTGAAGAATGTGGATTTGATCGATGGACGACTTGTTAATGTTAATGATGATTCGACCATTATGGACGAGCGAATTGAACA
AAAAATGCACACTTTCAAGGCCCTTGTAAGAGTCTTGATTGGTTCTCCATCGACTCAGAGGAGATTAACAGAGATGGCTGTATCGAGTTCAATAAATTGCCAGCCTCAGG
CATGGTTTAGAAATTCAAGTGAACGAGAGCCGATGGTTGTTGATTCACTCACCAAGGTCAGCAACTTCCTCAACGTCTCTGCCCAACAAAGGAAACTGGTGCGCCATACC
ATATGCCCACAGGTTACACAACATCACATTTGGACTGGTGCATTGGATCATATGCTGAAAGAGTTAAATTCGGAGTTGGATCCATTATTTCATCAGTCAACCAACAAAGG
GATCAAAATGGGCAATCAGATTGTTTCAAGTTGCCTAAAGTTTTTGGATGATGTTACCAATTCAAATGCTCACTACACTTCGTGGATGCGGCCAGCACCATTACGAGCTG
TTGTAGATTCATCTGCACCGCCAAGATGGGAAGACATGCTCGAGATGTTCACCGATCTGATCAGCTATCTGAAAGATGAGAAATATTTGGTCCATTATGTGACAAAGCTT
GAAGTTATGAAAGAGGGGCTTTCCCAGATCAAAGATGTATTGACTGATAAAAGCATTGGATACAAGGAAGCCAGTCATCAAGAAAGCTTGGTGCAGAAGAAGCTTTCAAA
GACACTGGGCCACTCATCCAGGTGCTTGTTCACTCTTTTACTATACTATCTTTTTGGGCATTTTAGGGATATTGAAGTGGATCTTTGTGGTGGGTTGTTGAAGGCTGATG
GGAATGACAAGTTTTTGTTGTTCATGGGGAGGGTCTTGAGTTCTGATGAAGAGAAAATTGTTTGGAATGGGGTGAGGCATCTTCATAGAGTTATGGGGCTTTTTAAATTT
GTTTGGGAAACAGCTGGAATGAAGGGAGAATTGGAATTGCAAGGCCATTTATTTTGTGTTGGGGCTGAGGATAGGCAGCTTAGTTATAAAGAAAATGCTTATTTATTACA
TGAGATCAATTTATAA
Protein sequence
MSVASFHSSSSMRSMSVTTAHKTITECIAGARSDALEVQEKALQNLVIITQVSPLYRNLLVQVDGSISILISLSKSSSSTIQSLSLSILFNLSLNHDMKKLLASMETIYH
LNTLVSLGSPETVKLASSLICSLAMLDKNKAKFGVAGTIQLLVRALTVPTVPAAHHLLCSLAELGQFHGNCTVAVRSGAIRVLIDVVESTSGEDLAGTALVVLGLLARFE
EGLRALIKTDRIVISMFNVLKGRCMLSKEGATEILLRLFDESEGCLSDALRLPEFLGVVADLSRLFFISRLKHLNNTRFGALQSNSMLYHCAEQSSADQEVLPSEWYENA
SRKIKKLSCSLKNVDLIDGRLVNVNDDSTIMDERIEQKMHTFKALVRVLIGSPSTQRRLTEMAVSSSINCQPQAWFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHT
ICPQVTQHHIWTGALDHMLKELNSELDPLFHQSTNKGIKMGNQIVSSCLKFLDDVTNSNAHYTSWMRPAPLRAVVDSSAPPRWEDMLEMFTDLISYLKDEKYLVHYVTKL
EVMKEGLSQIKDVLTDKSIGYKEASHQESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDLCGGLLKADGNDKFLLFMGRVLSSDEEKIVWNGVRHLHRVMGLFKF
VWETAGMKGELELQGHLFCVGAEDRQLSYKENAYLLHEINL
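
The protein sequence corresponds to the CDS read codon by codon. The sketch below is illustrative only and assumes the standard genetic code (NCBI translation table 1); the function name `translate` is hypothetical. To keep it short, it translates just the first 21 and last 24 nucleotides of the CDS listed above rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Codons with ambiguous bases (e.g. N) are reported as 'X'.
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


cds_start = "ATGTCAGTTGCATCTTTCCAC"    # first 7 codons of the CDS
cds_end = "TTATTACATGAGATCAATTTATAA"   # last 8 codons, including the stop

print(translate(cds_start))  # prints: MSVASFH
print(translate(cds_end))    # prints: LLHEINL*
```

Both fragments agree with the start (MSVASFH...) and end (...LLHEINL) of the protein sequence shown on this page.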