; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G25820 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G25820
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description18S pre-ribosomal assembly protein gar2-related, putative isoform 2
Genome locationClcChr05:33554938..33559488
RNA-Seq ExpressionClc05G25820
SyntenyClc05G25820
Gene Ontology termsGO:0009786 - regulation of asymmetric cell division (biological process)
GO:0005886 - plasma membrane (cellular component)
InterPro domainsIPR040378 - Protein BREAKING OF ASYMMETRY IN THE STOMATAL LINEAGE


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008446468.1 PREDICTED: uncharacterized protein LOC103489197 isoform X1 [Cucumis melo]9.9e-20779.96Show/hide
Query:  MKVEGEPIVCRSNASPKFVPKSFECDNDALDSGGMKLEDQKEFTSLLKGNEDADHNNTAADGWVAMKHECLDLDNFNEYDDVKAFVSPLTNSSKVDLFEE
        MKVEGEPIVC SNASPKFVPKSFECDND L+SGGMKLEDQKEFTS+LKGN DA HNNTAADGWVA K ECLDLD+FN+YDDVKAFVSPL NS KVDL EE
Subjt:  MKVEGEPIVCRSNASPKFVPKSFECDNDALDSGGMKLEDQKEFTSLLKGNEDADHNNTAADGWVAMKHECLDLDNFNEYDDVKAFVSPLTNSSKVDLFEE

Query:  DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPKQS
        DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDG P RDKLFC SSLDE+D CSI PP KDWKDE VGEL++ DMFASDDSEHSESFG+KDSP Q 
Subjt:  DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPKQS

Query:  DSNNLARTPEAEYDVAYFTDNDMPNLPMTDLVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAPEEPKNSDSAN
        DS +LA TPEAEYDVAYFTDNDM   PMTDLVTESLKPLT+NK +PHPQSEQV IETT  EVPVLA VA+ESF NTRE  SE+ TSA   E+PKNSDSAN
Subjt:  DSNNLARTPEAEYDVAYFTDNDMPNLPMTDLVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAPEEPKNSDSAN

Query:  DTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNFD
          SYNSKVDKGNITFDFNSLA TASDGLERCDNGD+NSSAPSTSASVGC+ET SSNPLASADK + QCH+ SSNPK VEYEDLPRVEYED+ KT+VGNFD
Subjt:  DTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNFD

Query:  SDTVSSQVQHG--------------------IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRVM--AILCC
        S TVSS+VQ G                    IG+SGSIS RSDSSTTSTRSFAFPILQ+EWNSSPVRMAK DRKHL+KHR     ILCC
Subjt:  SDTVSSQVQHG--------------------IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRVM--AILCC

XP_011655720.1 uncharacterized protein LOC101218906 isoform X1 [Cucumis sativus]1.2e-20779.96Show/hide
Query:  MKVEGEPIVCRSNASPKFVPKSFECDNDALDSGGMKLEDQKEFTSLLKGNEDADHNNTAADGWVAMKHECLDLDNFNEYDDVKAFVSPLTNSSKVDLFEE
        MKVEGEPIVC SNASPKFVPKSFECDNDAL+SGGMKLEDQKEFT+ LKGN DADHNNT ADGWVA K ECLDLD+FN+YDDVKAFVSPL NS K DL EE
Subjt:  MKVEGEPIVCRSNASPKFVPKSFECDNDALDSGGMKLEDQKEFTSLLKGNEDADHNNTAADGWVAMKHECLDLDNFNEYDDVKAFVSPLTNSSKVDLFEE

Query:  DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPKQS
        DSELYMEKS+VECQLPELIVCYKENICNIVKDICIDDG P RDKLFC SSLDEKD CSILPP KDWKDE VGEL++ DMFASDDSEHSESFG+KDSP + 
Subjt:  DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPKQS

Query:  DSNNLARTPEAEYDVAYFTDNDMPNLPMTDLVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAPEEPKNSDSAN
        DS +LA TPEAEYDVAYFTDNDM   PMTDLVTESLKPLTNN+ +PHPQSEQVFIETT  EVPVL  VAEESFSNTREA SE+ TSA   E+PK+ DSAN
Subjt:  DSNNLARTPEAEYDVAYFTDNDMPNLPMTDLVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAPEEPKNSDSAN

Query:  DTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNFD
          SYNSKVDKGNITFDFNSLASTASDGLER DNGD+NSSAPSTSASVGC+ET SSNPLASADKC+A+CH  SSNPKHVEYEDL RVEYED+PKT+VGNFD
Subjt:  DTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNFD

Query:  SDTVSSQVQHG--------------------IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRVM--AILCC
        S TVSS+V  G                    IG+SGSIS RSDSSTTSTRSFAFPILQSEW SSPVRM K DRKHL+KHR     ILCC
Subjt:  SDTVSSQVQHG--------------------IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRVM--AILCC

XP_038892052.1 uncharacterized protein LOC120081347 isoform X1 [Benincasa hispida]6.8e-22482.8Show/hide
Query:  MKVEGEPIVCRSNASPKFVPKSFECDNDALDSGGMKLEDQKEFTSLLKGNEDADHNNTAADGWVAMKHECLDLDNFNEYDDVKAFVSPLTNSSKVDLFEE
        MKVEGEPIV  SNASP+FVPKSFECDNDA++SGGMKLEDQKEFTSLLKGNEDA+HNNTAADGWVAMK ECLDLD+FNEYD+VKAFVSPLTNSSKVDLFEE
Subjt:  MKVEGEPIVCRSNASPKFVPKSFECDNDALDSGGMKLEDQKEFTSLLKGNEDADHNNTAADGWVAMKHECLDLDNFNEYDDVKAFVSPLTNSSKVDLFEE

Query:  DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPKQS
        DSELYMEKS VECQLPELIVCYKENICNIVKDICIDDGVPSRDKL C SSLDEKD CSILPP   WKD+LV ELEK D++ASDDSEHSESFGNKDSPKQS
Subjt:  DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPKQS

Query:  DSNNLARTPEAEYDVAYFTDNDMPNLPMTDLVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAPEEPKNSDSAN
        DSN+L RTPEAEYDVAYFTDNDM   PMTD VTESLKPLTNNK EPHP+SEQVFIETTSLEVPVLACVAEESFS++RE ISE+TTSA+APEE KNSDSAN
Subjt:  DSNNLARTPEAEYDVAYFTDNDMPNLPMTDLVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAPEEPKNSDSAN

Query:  DTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNFD
        D SYNSKVDKGNITFDFNSLASTASDGLE CDN D+N+SAPSTSASVGC+ET SSNPLASADKCQAQCHD S+NPK VEYEDLPR+EYEDLPKT+VGNFD
Subjt:  DTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNFD

Query:  SDTVSSQVQHG-------------------------------IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHR--VMAILCC
        S TVSSQVQHG                               IG+SGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADR+HLRKHR     ILCC
Subjt:  SDTVSSQVQHG-------------------------------IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHR--VMAILCC

XP_038892056.1 uncharacterized protein LOC120081347 isoform X2 [Benincasa hispida]7.8e-22888.06Show/hide
Query:  MKVEGEPIVCRSNASPKFVPKSFECDNDALDSGGMKLEDQKEFTSLLKGNEDADHNNTAADGWVAMKHECLDLDNFNEYDDVKAFVSPLTNSSKVDLFEE
        MKVEGEPIV  SNASP+FVPKSFECDNDA++SGGMKLEDQKEFTSLLKGNEDA+HNNTAADGWVAMK ECLDLD+FNEYD+VKAFVSPLTNSSKVDLFEE
Subjt:  MKVEGEPIVCRSNASPKFVPKSFECDNDALDSGGMKLEDQKEFTSLLKGNEDADHNNTAADGWVAMKHECLDLDNFNEYDDVKAFVSPLTNSSKVDLFEE

Query:  DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPKQS
        DSELYMEKS VECQLPELIVCYKENICNIVKDICIDDGVPSRDKL C SSLDEKD CSILPP   WKD+LV ELEK D++ASDDSEHSESFGNKDSPKQS
Subjt:  DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPKQS

Query:  DSNNLARTPEAEYDVAYFTDNDMPNLPMTDLVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAPEEPKNSDSAN
        DSN+L RTPEAEYDVAYFTDNDM   PMTD VTESLKPLTNNK EPHP+SEQVFIETTSLEVPVLACVAEESFS++RE ISE+TTSA+APEE KNSDSAN
Subjt:  DSNNLARTPEAEYDVAYFTDNDMPNLPMTDLVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAPEEPKNSDSAN

Query:  DTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNFD
        D SYNSKVDKGNITFDFNSLASTASDGLE CDN D+N+SAPSTSASVGC+ET SSNPLASADKCQAQCHD S+NPK VEYEDLPR+EYEDLPKT+VGNFD
Subjt:  DTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNFD

Query:  SDTVSSQVQHGIGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHR--VMAILCC
        S TVSSQVQHG G+SGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADR+HLRKHR     ILCC
Subjt:  SDTVSSQVQHGIGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHR--VMAILCC

XP_038892057.1 uncharacterized protein LOC120081347 isoform X3 [Benincasa hispida]1.1e-20582.62Show/hide
Query:  MKLEDQKEFTSLLKGNEDADHNNTAADGWVAMKHECLDLDNFNEYDDVKAFVSPLTNSSKVDLFEEDSELYMEKSIVECQLPELIVCYKENICNIVKDIC
        MKLEDQKEFTSLLKGNEDA+HNNTAADGWVAMK ECLDLD+FNEYD+VKAFVSPLTNSSKVDLFEEDSELYMEKS VECQLPELIVCYKENICNIVKDIC
Subjt:  MKLEDQKEFTSLLKGNEDADHNNTAADGWVAMKHECLDLDNFNEYDDVKAFVSPLTNSSKVDLFEEDSELYMEKSIVECQLPELIVCYKENICNIVKDIC

Query:  IDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPKQSDSNNLARTPEAEYDVAYFTDNDMPNLPMTDLVTE
        IDDGVPSRDKL C SSLDEKD CSILPP   WKD+LV ELEK D++ASDDSEHSESFGNKDSPKQSDSN+L RTPEAEYDVAYFTDNDM   PMTD VTE
Subjt:  IDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPKQSDSNNLARTPEAEYDVAYFTDNDMPNLPMTDLVTE

Query:  SLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAPEEPKNSDSANDTSYNSKVDKGNITFDFNSLASTASDGLERCDNG
        SLKPLTNNK EPHP+SEQVFIETTSLEVPVLACVAEESFS++RE ISE+TTSA+APEE KNSDSAND SYNSKVDKGNITFDFNSLASTASDGLE CDN 
Subjt:  SLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAPEEPKNSDSANDTSYNSKVDKGNITFDFNSLASTASDGLERCDNG

Query:  DVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNFDSDTVSSQVQHG-----------------------
        D+N+SAPSTSASVGC+ET SSNPLASADKCQAQCHD S+NPK VEYEDLPR+EYEDLPKT+VGNFDS TVSSQVQHG                       
Subjt:  DVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNFDSDTVSSQVQHG-----------------------

Query:  --------IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHR--VMAILCC
                IG+SGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADR+HLRKHR     ILCC
Subjt:  --------IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHR--VMAILCC

TrEMBL top hitse value%identityAlignment
A0A1S3BFZ0 uncharacterized protein LOC103489197 isoform X14.8e-20779.96Show/hide
Query:  MKVEGEPIVCRSNASPKFVPKSFECDNDALDSGGMKLEDQKEFTSLLKGNEDADHNNTAADGWVAMKHECLDLDNFNEYDDVKAFVSPLTNSSKVDLFEE
        MKVEGEPIVC SNASPKFVPKSFECDND L+SGGMKLEDQKEFTS+LKGN DA HNNTAADGWVA K ECLDLD+FN+YDDVKAFVSPL NS KVDL EE
Subjt:  MKVEGEPIVCRSNASPKFVPKSFECDNDALDSGGMKLEDQKEFTSLLKGNEDADHNNTAADGWVAMKHECLDLDNFNEYDDVKAFVSPLTNSSKVDLFEE

Query:  DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPKQS
        DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDG P RDKLFC SSLDE+D CSI PP KDWKDE VGEL++ DMFASDDSEHSESFG+KDSP Q 
Subjt:  DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPKQS

Query:  DSNNLARTPEAEYDVAYFTDNDMPNLPMTDLVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAPEEPKNSDSAN
        DS +LA TPEAEYDVAYFTDNDM   PMTDLVTESLKPLT+NK +PHPQSEQV IETT  EVPVLA VA+ESF NTRE  SE+ TSA   E+PKNSDSAN
Subjt:  DSNNLARTPEAEYDVAYFTDNDMPNLPMTDLVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAPEEPKNSDSAN

Query:  DTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNFD
          SYNSKVDKGNITFDFNSLA TASDGLERCDNGD+NSSAPSTSASVGC+ET SSNPLASADK + QCH+ SSNPK VEYEDLPRVEYED+ KT+VGNFD
Subjt:  DTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNFD

Query:  SDTVSSQVQHG--------------------IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRVM--AILCC
        S TVSS+VQ G                    IG+SGSIS RSDSSTTSTRSFAFPILQ+EWNSSPVRMAK DRKHL+KHR     ILCC
Subjt:  SDTVSSQVQHG--------------------IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRVM--AILCC

A0A1S4DWE7 uncharacterized protein LOC103489197 isoform X21.1e-18779.12Show/hide
Query:  MKLEDQKEFTSLLKGNEDADHNNTAADGWVAMKHECLDLDNFNEYDDVKAFVSPLTNSSKVDLFEEDSELYMEKSIVECQLPELIVCYKENICNIVKDIC
        MKLEDQKEFTS+LKGN DA HNNTAADGWVA K ECLDLD+FN+YDDVKAFVSPL NS KVDL EEDSELYMEKSIVECQLPELIVCYKENICNIVKDIC
Subjt:  MKLEDQKEFTSLLKGNEDADHNNTAADGWVAMKHECLDLDNFNEYDDVKAFVSPLTNSSKVDLFEEDSELYMEKSIVECQLPELIVCYKENICNIVKDIC

Query:  IDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPKQSDSNNLARTPEAEYDVAYFTDNDMPNLPMTDLVTE
        IDDG P RDKLFC SSLDE+D CSI PP KDWKDE VGEL++ DMFASDDSEHSESFG+KDSP Q DS +LA TPEAEYDVAYFTDNDM   PMTDLVTE
Subjt:  IDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPKQSDSNNLARTPEAEYDVAYFTDNDMPNLPMTDLVTE

Query:  SLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAPEEPKNSDSANDTSYNSKVDKGNITFDFNSLASTASDGLERCDNG
        SLKPLT+NK +PHPQSEQV IETT  EVPVLA VA+ESF NTRE  SE+ TSA   E+PKNSDSAN  SYNSKVDKGNITFDFNSLA TASDGLERCDNG
Subjt:  SLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAPEEPKNSDSANDTSYNSKVDKGNITFDFNSLASTASDGLERCDNG

Query:  DVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNFDSDTVSSQVQHG--------------------IGF
        D+NSSAPSTSASVGC+ET SSNPLASADK + QCH+ SSNPK VEYEDLPRVEYED+ KT+VGNFDS TVSS+VQ G                    IG+
Subjt:  DVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNFDSDTVSSQVQHG--------------------IGF

Query:  SGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRVM--AILCC
        SGSIS RSDSSTTSTRSFAFPILQ+EWNSSPVRMAK DRKHL+KHR     ILCC
Subjt:  SGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRVM--AILCC

A0A6J1GWB6 uncharacterized protein LOC111458170 isoform X22.9e-18874.13Show/hide
Query:  MKVEGEPIVCRSNASPKFVPKSFECDNDALDSGGMKLEDQKEFTSLLKGNEDADHNNTAADGWVAMKHECLDLDNFNEYDDVKAFVSPLTNSSKVDLFEE
        MKVEGEP+V  S+ASPKF+PKSFECDNDALDSGGMKLED K+FT LLK NEDA+H N+            + LD+ NE+D+VKAFV  +TNSSKVDLFEE
Subjt:  MKVEGEPIVCRSNASPKFVPKSFECDNDALDSGGMKLEDQKEFTSLLKGNEDADHNNTAADGWVAMKHECLDLDNFNEYDDVKAFVSPLTNSSKVDLFEE

Query:  DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFC-SSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPKQ
        DSELYMEKSIVECQLPELIVCYKEN CNIVKDICIDDGVPSRDKL C SSSLDEK  C ILPP++DWKDEL   LE+ DMFASDDSEHSESFG KDSPKQ
Subjt:  DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFC-SSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPKQ

Query:  SDSNNLARTPEAEYDVAYFTDNDMPNLPMTDLVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAPEEPKNSDSA
        SD  +LARTPEAEYDV YFTDND+ NLPMTDL TES+KPLTNNKNEP+PQSEQV      LEVPVLACVAEES+S+TRE ISE  +S +A EEPKNSDSA
Subjt:  SDSNNLARTPEAEYDVAYFTDNDMPNLPMTDLVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAPEEPKNSDSA

Query:  NDTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNF
         D SYNSK+DKGNITFDFNS ASTASDGLE CDNG +NSSAPSTSASV C ++ SSN LASADKCQA C+D SSNPK VEYEDL RVEYEDL K +VGN 
Subjt:  NDTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNF

Query:  DSDTVSSQVQHG---------------------IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHR--VMAILCC
        DS +VSSQVQHG                     IG+SGSIS RSDSSTTST SFAFPILQSEWNSSPVRMAKAD+KHLRK R     +LCC
Subjt:  DSDTVSSQVQHG---------------------IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHR--VMAILCC

A0A6J1GWT7 uncharacterized protein LOC111458170 isoform X11.4e-19375.36Show/hide
Query:  MKVEGEPIVCRSNASPKFVPKSFECDNDALDSGGMKLEDQKEFTSLLKGNEDADHNNTAADGWVAMKHECLDLDNFNEYDDVKAFVSPLTNSSKVDLFEE
        MKVEGEP+V  S+ASPKF+PKSFECDNDALDSGGMKLED K+FT LLK NEDA+H N+            + LD+ NE+D+VKAFV  +TNSSKVDLFEE
Subjt:  MKVEGEPIVCRSNASPKFVPKSFECDNDALDSGGMKLEDQKEFTSLLKGNEDADHNNTAADGWVAMKHECLDLDNFNEYDDVKAFVSPLTNSSKVDLFEE

Query:  DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFC-SSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPKQ
        DSELYMEKSIVECQLPELIVCYKEN CNIVKDICIDDGVPSRDKL C SSSLDEK  C ILPP++DWKDEL   LE+ DMFASDDSEHSESFG KDSPKQ
Subjt:  DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFC-SSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPKQ

Query:  SDSNNLARTPEAEYDVAYFTDNDMPNLPMTDLVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAPEEPKNSDSA
        SD  +LARTPEAEYDV YFTDND+ NLPMTDL TES+KPLTNNKNEP+PQSEQVFIETTSLEVPVLACVAEES+S+TRE ISE  +S +A EEPKNSDSA
Subjt:  SDSNNLARTPEAEYDVAYFTDNDMPNLPMTDLVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAPEEPKNSDSA

Query:  NDTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNF
         D SYNSK+DKGNITFDFNS ASTASDGLE CDNG +NSSAPSTSASV C ++ SSN LASADKCQA C+D SSNPK VEYEDL RVEYEDL K +VGN 
Subjt:  NDTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNF

Query:  DSDTVSSQVQHG---------------------IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHR--VMAILCC
        DS +VSSQVQHG                     IG+SGSIS RSDSSTTST SFAFPILQSEWNSSPVRMAKAD+KHLRK R     +LCC
Subjt:  DSDTVSSQVQHG---------------------IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHR--VMAILCC

A0A6J1GWU3 uncharacterized protein LOC111458170 isoform X35.2e-18573.12Show/hide
Query:  MKVEGEPIVCRSNASPKFVPKSFECDNDALDSGGMKLEDQKEFTSLLKGNEDADHNNTAADGWVAMKHECLDLDNFNEYDDVKAFVSPLTNSSKVDLFEE
        MKVEGEP+V  S+ASPKF+PKSFECDNDALDSGGMKLED K+FT LLK NEDA+H N+            + LD+ NE+D+VKAFV  +TNSSKVDLFEE
Subjt:  MKVEGEPIVCRSNASPKFVPKSFECDNDALDSGGMKLEDQKEFTSLLKGNEDADHNNTAADGWVAMKHECLDLDNFNEYDDVKAFVSPLTNSSKVDLFEE

Query:  DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFC-SSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPKQ
        DSELYMEKSIVECQLPELIVCYKEN CNIVKDICIDDGVPSRDKL C SSSLDEK  C ILPP++DWKDEL   LE+ DMFASDDSEHSESFG KDSPKQ
Subjt:  DSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFC-SSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPKQ

Query:  SDSNNLARTPEAEYDVAYFTDNDMPNLPMTDLVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAPEEPKNSDSA
        SD  +LARTPEAEYDV YFTDND+ NLPMTDL TES+KPLTNNKNEP+PQSEQ           VLACVAEES+S+TRE ISE  +S +A EEPKNSDSA
Subjt:  SDSNNLARTPEAEYDVAYFTDNDMPNLPMTDLVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAPEEPKNSDSA

Query:  NDTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNF
         D SYNSK+DKGNITFDFNS ASTASDGLE CDNG +NSSAPSTSASV C ++ SSN LASADKCQA C+D SSNPK VEYEDL RVEYEDL K +VGN 
Subjt:  NDTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNF

Query:  DSDTVSSQVQHG---------------------IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHR--VMAILCC
        DS +VSSQVQHG                     IG+SGSIS RSDSSTTST SFAFPILQSEWNSSPVRMAKAD+KHLRK R     +LCC
Subjt:  DSDTVSSQVQHG---------------------IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHR--VMAILCC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13650.2 BEST Arabidopsis thaliana protein match is: 18S pre-ribosomal assembly protein gar2-related (TAIR:AT2G03810.4)8.0e-0530.37Show/hide
Query:  STSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNFDSDTVSSQVQHGIG-------------------FSGSISLRS
        +TS SV  +     + L+  D       + ++ P+       P +E  +  + D    D++ +   + +G G                    SGS +L  
Subjt:  STSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNFDSDTVSSQVQHGIG-------------------FSGSISLRS

Query:  DSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLR
         S  TS  SFA PILQSEWNSSPVRM KA+   LR
Subjt:  DSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLR

AT2G03810.1 18S pre-ribosomal assembly protein gar2-related1.6e-2429.82Show/hide
Query:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPK
        ++D   YM+K++  C LPE++VCYKEN  +IVKDIC+D+GVP ++K        EKD+          K     +L K D     +   SE+   +DS  
Subjt:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPK

Query:  QSDSNNLARTPEAEYDVAYFTDNDMPNLPMTD--------LVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAP
        + D +      + + DV   +  D  +   T         +VTE +K    +   P   SE    E +  EV +      +      + +S         
Subjt:  QSDSNNLARTPEAEYDVAYFTDNDMPNLPMTD--------LVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAP

Query:  EEPKNSDSANDTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYED
        ++  +SDS  + S +   DK   + +  ++  T  +  E    G+   S+ ST+ S    +T +       +K + + H   +      YED  +     
Subjt:  EEPKNSDSANDTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYED

Query:  LPKTDVGNFDSDTVSSQVQHG--IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRVMAILCC
          +T     DS ++S  + +   I +SGS+S+RSD+STTS RSFAFPILQSEWNSSPVRMAKAD++  +      +LCC
Subjt:  LPKTDVGNFDSDTVSSQVQHG--IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRVMAILCC

AT2G03810.2 18S pre-ribosomal assembly protein gar2-related1.6e-2429.82Show/hide
Query:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPK
        ++D   YM+K++  C LPE++VCYKEN  +IVKDIC+D+GVP ++K        EKD+          K     +L K D     +   SE+   +DS  
Subjt:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPK

Query:  QSDSNNLARTPEAEYDVAYFTDNDMPNLPMTD--------LVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAP
        + D +      + + DV   +  D  +   T         +VTE +K    +   P   SE    E +  EV +      +      + +S         
Subjt:  QSDSNNLARTPEAEYDVAYFTDNDMPNLPMTD--------LVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAP

Query:  EEPKNSDSANDTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYED
        ++  +SDS  + S +   DK   + +  ++  T  +  E    G+   S+ ST+ S    +T +       +K + + H   +      YED  +     
Subjt:  EEPKNSDSANDTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYED

Query:  LPKTDVGNFDSDTVSSQVQHG--IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRVMAILCC
          +T     DS ++S  + +   I +SGS+S+RSD+STTS RSFAFPILQSEWNSSPVRMAKAD++  +      +LCC
Subjt:  LPKTDVGNFDSDTVSSQVQHG--IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRVMAILCC

AT2G03810.3 18S pre-ribosomal assembly protein gar2-related1.6e-2429.82Show/hide
Query:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPK
        ++D   YM+K++  C LPE++VCYKEN  +IVKDIC+D+GVP ++K        EKD+          K     +L K D     +   SE+   +DS  
Subjt:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPK

Query:  QSDSNNLARTPEAEYDVAYFTDNDMPNLPMTD--------LVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAP
        + D +      + + DV   +  D  +   T         +VTE +K    +   P   SE    E +  EV +      +      + +S         
Subjt:  QSDSNNLARTPEAEYDVAYFTDNDMPNLPMTD--------LVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAP

Query:  EEPKNSDSANDTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYED
        ++  +SDS  + S +   DK   + +  ++  T  +  E    G+   S+ ST+ S    +T +       +K + + H   +      YED  +     
Subjt:  EEPKNSDSANDTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYED

Query:  LPKTDVGNFDSDTVSSQVQHG--IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRVMAILCC
          +T     DS ++S  + +   I +SGS+S+RSD+STTS RSFAFPILQSEWNSSPVRMAKAD++  +      +LCC
Subjt:  LPKTDVGNFDSDTVSSQVQHG--IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRVMAILCC

AT2G03810.4 18S pre-ribosomal assembly protein gar2-related1.6e-2429.82Show/hide
Query:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPK
        ++D   YM+K++  C LPE++VCYKEN  +IVKDIC+D+GVP ++K        EKD+          K     +L K D     +   SE+   +DS  
Subjt:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPK

Query:  QSDSNNLARTPEAEYDVAYFTDNDMPNLPMTD--------LVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAP
        + D +      + + DV   +  D  +   T         +VTE +K    +   P   SE    E +  EV +      +      + +S         
Subjt:  QSDSNNLARTPEAEYDVAYFTDNDMPNLPMTD--------LVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAP

Query:  EEPKNSDSANDTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYED
        ++  +SDS  + S +   DK   + +  ++  T  +  E    G+   S+ ST+ S    +T +       +K + + H   +      YED  +     
Subjt:  EEPKNSDSANDTSYNSKVDKGNITFDFNSLASTASDGLERCDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYED

Query:  LPKTDVGNFDSDTVSSQVQHG--IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRVMAILCC
          +T     DS ++S  + +   I +SGS+S+RSD+STTS RSFAFPILQSEWNSSPVRMAKAD++  +      +LCC
Subjt:  LPKTDVGNFDSDTVSSQVQHG--IGFSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRVMAILCC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGTTGAGGGTGAGCCCATAGTTTGCCGTTCAAATGCTTCCCCCAAGTTTGTTCCCAAGTCTTTTGAATGCGATAATGATGCTCTTGATTCTGGCGGGATGAAGCT
TGAAGATCAGAAAGAATTTACAAGCCTTCTCAAAGGTAATGAGGATGCCGATCACAATAATACTGCTGCTGATGGTTGGGTTGCAATGAAGCATGAATGTTTAGATCTTG
ATAATTTTAATGAGTATGACGATGTTAAAGCCTTTGTGTCACCACTCACTAACTCTTCTAAAGTAGACTTGTTTGAGGAAGACTCGGAATTATACATGGAAAAGAGTATT
GTTGAATGTCAACTTCCCGAACTAATAGTTTGTTACAAAGAAAATATTTGCAATATTGTGAAGGATATTTGTATTGACGACGGAGTACCTTCTCGGGATAAGCTCTTTTG
TAGTAGTAGTTTGGACGAGAAGGATGCCTGTTCCATTCTCCCTCCTAAGAAAGATTGGAAGGATGAGTTGGTAGGAGAACTAGAGAAAACAGATATGTTTGCTTCAGATG
ATTCAGAGCATTCAGAATCTTTTGGCAATAAGGATTCACCCAAACAAAGTGATTCCAACAATTTGGCTAGAACGCCTGAAGCAGAATATGATGTGGCATATTTTACTGAC
AATGATATGCCTAATCTTCCAATGACAGACTTGGTTACAGAGAGCTTAAAGCCACTGACCAACAATAAGAATGAGCCTCACCCTCAATCTGAACAGGTTTTTATTGAAAC
TACGAGTTTGGAAGTCCCTGTTTTGGCATGTGTAGCTGAAGAATCTTTTAGTAACACCAGAGAAGCAATATCGGAGGCCACTACTTCAGCCATAGCACCTGAAGAGCCCA
AAAATAGTGATTCTGCGAATGATACATCATACAATAGTAAAGTGGACAAAGGAAACATTACTTTTGATTTCAATTCCTTAGCATCAACAGCTAGTGATGGACTGGAGCGT
TGTGATAACGGTGACGTGAACTCTTCAGCTCCTTCGACCAGTGCCTCGGTGGGCTGCGAAGAGACTGGCAGCTCCAACCCTTTAGCTTCTGCCGACAAATGTCAAGCACA
ATGTCACGATATTAGTAGCAACCCTAAACATGTGGAATATGAGGACTTACCACGAGTGGAATATGAAGACTTACCTAAGACAGATGTTGGGAATTTTGATAGTGATACAG
TTTCAAGTCAAGTCCAACATGGCATAGGTTTCTCGGGTAGCATCTCTCTTCGGTCTGATAGCAGCACGACCAGCACCCGTTCGTTTGCCTTTCCCATATTACAATCTGAG
TGGAATAGCAGCCCTGTTAGAATGGCTAAAGCTGATCGAAAGCATCTACGGAAGCATCGGGTCATGGCCATACTTTGCTGTGTTTCTCTCTTGATTCTTTTCATACCTTC
TGCTTCCTCACATCATCTGGACCAGGCCTCTCCCTTGCAAGCAACTAAAGCAGTGCATTCTCCTCGTCTTGTTTTGGGAAGGAAGTTCAAAATGTTAGAAGAAGCAAGAG
TGGATGAGAAGGTTGAAGCGCATGGATTCCAAGTTGCCAAACAAAAAGATAACCACAAGGAAAATGTTTCAGGTCATTCTTATAAAAGAGAGGAAAAAGGAATGAACTTG
AACAGCGGAAAGGGGAGAAAGTGGGTGAATGTGGCCGACATGTCGTCACAGGTTTTCACAATGGACTATGCTCACCTAAAACGGCGTCGTCCTGTTCATAACATGTCGTT
CAAGCGTCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGTTGAGGGTGAGCCCATAGTTTGCCGTTCAAATGCTTCCCCCAAGTTTGTTCCCAAGTCTTTTGAATGCGATAATGATGCTCTTGATTCTGGCGGGATGAAGCT
TGAAGATCAGAAAGAATTTACAAGCCTTCTCAAAGGTAATGAGGATGCCGATCACAATAATACTGCTGCTGATGGTTGGGTTGCAATGAAGCATGAATGTTTAGATCTTG
ATAATTTTAATGAGTATGACGATGTTAAAGCCTTTGTGTCACCACTCACTAACTCTTCTAAAGTAGACTTGTTTGAGGAAGACTCGGAATTATACATGGAAAAGAGTATT
GTTGAATGTCAACTTCCCGAACTAATAGTTTGTTACAAAGAAAATATTTGCAATATTGTGAAGGATATTTGTATTGACGACGGAGTACCTTCTCGGGATAAGCTCTTTTG
TAGTAGTAGTTTGGACGAGAAGGATGCCTGTTCCATTCTCCCTCCTAAGAAAGATTGGAAGGATGAGTTGGTAGGAGAACTAGAGAAAACAGATATGTTTGCTTCAGATG
ATTCAGAGCATTCAGAATCTTTTGGCAATAAGGATTCACCCAAACAAAGTGATTCCAACAATTTGGCTAGAACGCCTGAAGCAGAATATGATGTGGCATATTTTACTGAC
AATGATATGCCTAATCTTCCAATGACAGACTTGGTTACAGAGAGCTTAAAGCCACTGACCAACAATAAGAATGAGCCTCACCCTCAATCTGAACAGGTTTTTATTGAAAC
TACGAGTTTGGAAGTCCCTGTTTTGGCATGTGTAGCTGAAGAATCTTTTAGTAACACCAGAGAAGCAATATCGGAGGCCACTACTTCAGCCATAGCACCTGAAGAGCCCA
AAAATAGTGATTCTGCGAATGATACATCATACAATAGTAAAGTGGACAAAGGAAACATTACTTTTGATTTCAATTCCTTAGCATCAACAGCTAGTGATGGACTGGAGCGT
TGTGATAACGGTGACGTGAACTCTTCAGCTCCTTCGACCAGTGCCTCGGTGGGCTGCGAAGAGACTGGCAGCTCCAACCCTTTAGCTTCTGCCGACAAATGTCAAGCACA
ATGTCACGATATTAGTAGCAACCCTAAACATGTGGAATATGAGGACTTACCACGAGTGGAATATGAAGACTTACCTAAGACAGATGTTGGGAATTTTGATAGTGATACAG
TTTCAAGTCAAGTCCAACATGGCATAGGTTTCTCGGGTAGCATCTCTCTTCGGTCTGATAGCAGCACGACCAGCACCCGTTCGTTTGCCTTTCCCATATTACAATCTGAG
TGGAATAGCAGCCCTGTTAGAATGGCTAAAGCTGATCGAAAGCATCTACGGAAGCATCGGGTCATGGCCATACTTTGCTGTGTTTCTCTCTTGATTCTTTTCATACCTTC
TGCTTCCTCACATCATCTGGACCAGGCCTCTCCCTTGCAAGCAACTAAAGCAGTGCATTCTCCTCGTCTTGTTTTGGGAAGGAAGTTCAAAATGTTAGAAGAAGCAAGAG
TGGATGAGAAGGTTGAAGCGCATGGATTCCAAGTTGCCAAACAAAAAGATAACCACAAGGAAAATGTTTCAGGTCATTCTTATAAAAGAGAGGAAAAAGGAATGAACTTG
AACAGCGGAAAGGGGAGAAAGTGGGTGAATGTGGCCGACATGTCGTCACAGGTTTTCACAATGGACTATGCTCACCTAAAACGGCGTCGTCCTGTTCATAACATGTCGTT
CAAGCGTCCTTGA
Protein sequenceShow/hide protein sequence
MKVEGEPIVCRSNASPKFVPKSFECDNDALDSGGMKLEDQKEFTSLLKGNEDADHNNTAADGWVAMKHECLDLDNFNEYDDVKAFVSPLTNSSKVDLFEEDSELYMEKSI
VECQLPELIVCYKENICNIVKDICIDDGVPSRDKLFCSSSLDEKDACSILPPKKDWKDELVGELEKTDMFASDDSEHSESFGNKDSPKQSDSNNLARTPEAEYDVAYFTD
NDMPNLPMTDLVTESLKPLTNNKNEPHPQSEQVFIETTSLEVPVLACVAEESFSNTREAISEATTSAIAPEEPKNSDSANDTSYNSKVDKGNITFDFNSLASTASDGLER
CDNGDVNSSAPSTSASVGCEETGSSNPLASADKCQAQCHDISSNPKHVEYEDLPRVEYEDLPKTDVGNFDSDTVSSQVQHGIGFSGSISLRSDSSTTSTRSFAFPILQSE
WNSSPVRMAKADRKHLRKHRVMAILCCVSLLILFIPSASSHHLDQASPLQATKAVHSPRLVLGRKFKMLEEARVDEKVEAHGFQVAKQKDNHKENVSGHSYKREEKGMNL
NSGKGRKWVNVADMSSQVFTMDYAHLKRRRPVHNMSFKRP