; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg03661 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg03661
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionGATA zinc finger domain-containing protein isoform 1
Genome locationCarg_Chr14:1399967..1405444
RNA-Seq ExpressionCarg03661
SyntenyCarg03661
Gene Ontology termsNA
InterPro domainsIPR023213 - Chloramphenicol acetyltransferase-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580708.1 hypothetical protein SDJN03_20710, partial [Cucurbita argyrosperma subsp. sororia]4.7e-27099.79Show/hide
Query:  MSDEIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHLQILDLAAAARAIASHPDA
        MSDEIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHLQILDLAAAARAIASHPDA
Subjt:  MSDEIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHLQILDLAAAARAIASHPDA

Query:  DNPSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAGGKIEGGGFEIGDNGEIGSGIEDL
        D+PSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAGGKIEGGGFEIGDNGEIGSGIEDL
Subjt:  DNPSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAGGKIEGGGFEIGDNGEIGSGIEDL

Query:  IPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPYQTEKYAVVTLID
        IPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPYQTEKYAVVTLID
Subjt:  IPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPYQTEKYAVVTLID

Query:  CRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENPSLTPSSSMRTALISAFEDPIIYTSDPAQQ
        CRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENPSLTPSSSMRTALISAFEDPIIYTSDPAQQ
Subjt:  CRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENPSLTPSSSMRTALISAFEDPIIYTSDPAQQ

Query:  HLGISDYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGAMEVVE
        HLGISDYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGAMEVVE
Subjt:  HLGISDYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGAMEVVE

KAG7017467.1 hypothetical protein SDJN02_19332, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MEGSISRRRVAACTETAWCRAVPGGTGTAVIALSSSDAPNLQLLQNALQELQNSHPILKSKLHFNPISSAFSFITSPTPFVQIKTYELPETSKILNDQNV
        MEGSISRRRVAACTETAWCRAVPGGTGTAVIALSSSDAPNLQLLQNALQELQNSHPILKSKLHFNPISSAFSFITSPTPFVQIKTYELPETSKILNDQNV
Subjt:  MEGSISRRRVAACTETAWCRAVPGGTGTAVIALSSSDAPNLQLLQNALQELQNSHPILKSKLHFNPISSAFSFITSPTPFVQIKTYELPETSKILNDQNV

Query:  LNYHKTPHDISISPLQIILEHELNENSPWQTLHYSDTAATSAADMLFVSLYEVGSGKWIVVFRLHVAACDRTTAVSLLEELLLLMNGGGVDKTGEVELGM
        LNYHKTPHDISISPLQIILEHELNENSPWQTLHYSDTAATSAADMLFVSLYEVGSGKWIVVFRLHVAACDRTTAVSLLEELLLLMNGGGVDKTGEVELGM
Subjt:  LNYHKTPHDISISPLQIILEHELNENSPWQTLHYSDTAATSAADMLFVSLYEVGSGKWIVVFRLHVAACDRTTAVSLLEELLLLMNGGGVDKTGEVELGM

Query:  EDLVPRKLAKKSMLSRGLNVISYSVNSLRLTNLKFKDVKSARRSQVARLQMNRTETHKILSECKSRGIKLSSAMVAAGLVATHSSGSHGLDRHQRKYGII
        EDLVPRKLAKKSMLSRGLNVISYSVNSLRLTNLKFKDVKSARRSQVARLQMNRTETHKILSECKSRGIKLSSAMVAAGLVATHSSGSHGLDRHQRKYGII
Subjt:  EDLVPRKLAKKSMLSRGLNVISYSVNSLRLTNLKFKDVKSARRSQVARLQMNRTETHKILSECKSRGIKLSSAMVAAGLVATHSSGSHGLDRHQRKYGII

Query:  TLIDCRRFLEPPLRSHHFGFYHAAILNSYTIRGGEELWELAKKISTTLEASKNSNKHFTDMSDLNFLLCRAVENPSLTESGAMRTSLMTVFEDTVVDNSG
        TLIDCRRFLEPPLRSHHFGFYHAAILNSYTIRGGEELWELAKKISTTLEASKNSNKHFTDMSDLNFLLCRAVENPSLTESGAMRTSLMTVFEDTVVDNSG
Subjt:  TLIDCRRFLEPPLRSHHFGFYHAAILNSYTIRGGEELWELAKKISTTLEASKNSNKHFTDMSDLNFLLCRAVENPSLTESGAMRTSLMTVFEDTVVDNSG

Query:  AMQAEIGIKDYMGCASTHGIGPSVAVFDTIRDGRLDCACVYPAPLHSREQMEALVDNMKALLSDVVLAFQIVSPPLGPSSSSYYLLLKGSDFLRLQYPLC
        AMQAEIGIKDYMGCASTHGIGPSVAVFDTIRDGRLDCACVYPAPLHSREQMEALVDNMKALLSDVVLAFQIVSPPLGPSSSSYYLLLKGSDFLRLQYPLC
Subjt:  AMQAEIGIKDYMGCASTHGIGPSVAVFDTIRDGRLDCACVYPAPLHSREQMEALVDNMKALLSDVVLAFQIVSPPLGPSSSSYYLLLKGSDFLRLQYPLC

Query:  TPTVTASKMSDEIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHLQILDLAAAAR
        TPTVTASKMSDEIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHLQILDLAAAAR
Subjt:  TPTVTASKMSDEIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHLQILDLAAAAR

Query:  AIASHPDADNPSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAGGKIEGGGFEIGDNGE
        AIASHPDADNPSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAGGKIEGGGFEIGDNGE
Subjt:  AIASHPDADNPSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAGGKIEGGGFEIGDNGE

Query:  IGSGIEDLIPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPYQTEK
        IGSGIEDLIPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPYQTEK
Subjt:  IGSGIEDLIPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPYQTEK

Query:  YAVVTLIDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENPSLTPSSSMRTALISAFEDPII
        YAVVTLIDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENPSLTPSSSMRTALISAFEDPII
Subjt:  YAVVTLIDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENPSLTPSSSMRTALISAFEDPII

Query:  YTSDPAQQHLGISDYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGAMEVVES
        YTSDPAQQHLGISDYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGAMEVVES
Subjt:  YTSDPAQQHLGISDYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGAMEVVES

KAG7028117.1 hypothetical protein SDJN02_09297, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0063.15Show/hide
Query:  MEGSISRRRVAACTETAWCRAVPGGTGTAVIALSSSDA-----PNLQLLQNALQELQNSHPILKSKLHFNPISSAFSFITSPTPFVQIKTYELPETSKIL
        ME S SRRRVA  TETAWCRAVPGGTGTAV+ALSSS +     PNLQLLQNAL +LQN+HP+LKSKLH++PISS  SF+TSPTP VQ+KT++ PETSKI+
Subjt:  MEGSISRRRVAACTETAWCRAVPGGTGTAVIALSSSDA-----PNLQLLQNALQELQNSHPILKSKLHFNPISSAFSFITSPTPFVQIKTYELPETSKIL

Query:  NDQNVLNYHKTPHDISISPLQIILEHELNENSPWQTLHYSDTAATSAADMLFVSLYEVGSGKWIVVFRLHVAACDRTTAVSLLEELLLLMN-GGGVDKTG
        NDQN L  +   H ISISPLQI+LEHELNEN+ W+ L+ SDT    AADMLFV+LYEVGS  W+ VFRLHVAACDRTTAVSLLEELL+LMN GGG DK  
Subjt:  NDQNVLNYHKTPHDISISPLQIILEHELNENSPWQTLHYSDTAATSAADMLFVSLYEVGSGKWIVVFRLHVAACDRTTAVSLLEELLLLMN-GGGVDKTG

Query:  EVELGMEDLVPRKLAKKSMLSRGLNVISYSVNSLRLTNLKFKDVKSARRSQVARLQMNRTETHKILSECKSRGIKLSSAMVAAGLVATHSSGSHGLDRHQ
        E+ELGME+LVPRKLAKK +L+RGL++ISYS+NSLRLTNLKFKDVKS RRSQV RLQ+N  +T KIL                                  
Subjt:  EVELGMEDLVPRKLAKKSMLSRGLNVISYSVNSLRLTNLKFKDVKSARRSQVARLQMNRTETHKILSECKSRGIKLSSAMVAAGLVATHSSGSHGLDRHQ

Query:  RKYGIITLIDCRRFLEPPLRSHHFGFYHAAILNSYTIRGGEELWELAKKISTTLEASKNSNKHFTDMSDLNFLLCRAVENPSLTESGAMRTSLMTVFEDT
                                GFYHAAILNSYT+RGGE+LWELA KIS+TLEASKN NKHFTD+    FL+                          
Subjt:  RKYGIITLIDCRRFLEPPLRSHHFGFYHAAILNSYTIRGGEELWELAKKISTTLEASKNSNKHFTDMSDLNFLLCRAVENPSLTESGAMRTSLMTVFEDT

Query:  VVDNSGAMQAEIGIKDYMGCASTHGIGPSVAVFDTIRDGRLDCACVYPAPLHSREQMEALVDNMKALLSDVVLAFQIVSPPLGPSSSSYYLLLKGSDFLR
                                                                                                            
Subjt:  VVDNSGAMQAEIGIKDYMGCASTHGIGPSVAVFDTIRDGRLDCACVYPAPLHSREQMEALVDNMKALLSDVVLAFQIVSPPLGPSSSSYYLLLKGSDFLR

Query:  LQYPLCTPTVTASKMSD----EIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHL
            LCT T  AS+MSD    E K RPVGGTE+SWCRA PGGTGTTVLGLLLSKPPD+ +LQ++LH+LQNLHPILRSKI +DPSRRDFS L PPSP +HL
Subjt:  LQYPLCTPTVTASKMSD----EIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHL

Query:  QILDLAAAARAIASHPDADNPSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAGGKIEG
        QILDL AAARAIASHPDAD+PS+SDFHKILEHEIN   WL+P+HPSYSDTDVMFA+VY ++DGQWAVFL LHTA CDR AA +LLRELL +  A G+ EG
Subjt:  QILDLAAAARAIASHPDADNPSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAGGKIEG

Query:  GGFEIGDNGEIGSGIEDLIPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCS
        G FEIGD+GEIG GIEDLIP+GKA KPLWARGLDMLGYSLNSFR ANLEFKDA+S RFSQMIRLK+NSD T+KLLAGCK RGIK+CGAL AAGLIATRCS
Subjt:  GGFEIGDNGEIGSGIEDLIPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCS

Query:  KDLPPYQTEKYAVVTLIDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENPSLTPSSSMRTA
        KDLPPY  EKY VVTL DCRSLL+PPLTTHHLGFYHSAILNTHDISAEDTLW+VA+RCYF+FSN K+NNKHF+DMSDLNFLM KAIENP LTPSSSMRTA
Subjt:  KDLPPYQTEKYAVVTLIDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENPSLTPSSSMRTA

Query:  LISAFEDPIIYTSDPAQQHLGISDYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGAMEVVE
        LIS FEDPI     PAQ++LG+ DYIGCASAHGVGPSIA FDMIRDGQLDCACVYP PLFSRDQMN + DEMKKILV GA++VVE
Subjt:  LISAFEDPIIYTSDPAQQHLGISDYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGAMEVVE

XP_022935261.1 uncharacterized protein LOC111442199 [Cucurbita moschata]1.8e-26999.57Show/hide
Query:  MSDEIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHLQILDLAAAARAIASHPDA
        MSDEIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHLQILDLAAAARAIASHPDA
Subjt:  MSDEIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHLQILDLAAAARAIASHPDA

Query:  DNPSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAGGKIEGGGFEIGDNGEIGSGIEDL
        DNPSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAGGKIEGGGFEIGDNGEIGSGIEDL
Subjt:  DNPSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAGGKIEGGGFEIGDNGEIGSGIEDL

Query:  IPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPYQTEKYAVVTLID
        IPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPYQTEKYAVVTLID
Subjt:  IPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPYQTEKYAVVTLID

Query:  CRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENPSLTPSSSMRTALISAFEDPIIYTSDPAQQ
        CRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENP LTPSSSMRTALISAFEDPIIYTSDPAQQ
Subjt:  CRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENPSLTPSSSMRTALISAFEDPIIYTSDPAQQ

Query:  HLGISDYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGAMEVVE
        HLGI DYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGAMEVVE
Subjt:  HLGISDYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGAMEVVE

XP_022982938.1 uncharacterized protein LOC111481632 [Cucurbita maxima]4.7e-26296.57Show/hide
Query:  MSDEIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHLQILDLAAAARAIASHPDA
        MSDEI+ RPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFS LIPPSPSIHLQILDLAAAARAIASHPDA
Subjt:  MSDEIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHLQILDLAAAARAIASHPDA

Query:  DNPSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAGGKIEGGGFEIGDNGEIGSGIEDL
        D+PSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYA+SDGQWAVFLTLHTAACDRVAAASLLRELLVLTAA GKIEGGGF+IGDNGEIGSGIEDL
Subjt:  DNPSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAGGKIEGGGFEIGDNGEIGSGIEDL

Query:  IPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPYQTEKYAVVTLID
        IPSGKA+KPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPP+QTEKYAVVTLID
Subjt:  IPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPYQTEKYAVVTLID

Query:  CRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENPSLTPSSSMRTALISAFEDPIIYTSDPAQQ
        CRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEV+ERCYFSFSN K+NNKHFTDMSDLNFLMGKAIENP LTPSSSMRTALISAFEDPIIYTSDPAQQ
Subjt:  CRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENPSLTPSSSMRTALISAFEDPIIYTSDPAQQ

Query:  HLGISDYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGAMEVVE
        HLGISDYIGCASAHGVGPSIALFD+IRDGQLDCACVYPSPLFSRDQMN LFDEMKKILVS AMEVVE
Subjt:  HLGISDYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGAMEVVE

TrEMBL top hitse value%identityAlignment
A0A4D8YPF4 Uncharacterized protein5.8e-24247.59Show/hide
Query:  EGSISRRRVAACTETAWCRAVPGGTGTAVIALSSSDAPN--LQLLQNALQELQNSHPILKSKLHFNPISSAFSFI-TSPTPFVQIKTYELPETSKILNDQ
        E   S  R    TE  WCRAV  GTG  V+AL  +  P      L   L++LQ  HP+L +KLH+N  S AFSF+ T+  P   ++ ++ P T+KIL   
Subjt:  EGSISRRRVAACTETAWCRAVPGGTGTAVIALSSSDAPN--LQLLQNALQELQNSHPILKSKLHFNPISSAFSFI-TSPTPFVQIKTYELPETSKILNDQ

Query:  NVLNYHKTPHDISISPLQIILEHELNENSPWQTLHYSDTAATSAADMLFVSLYEVGSGKWIVVFRLHVAACDRTTAVSLLEELLLLMNGGGVD----KTG
                    + SPLQ ILEHELN NS W    +  +   S  ++L V++Y       +V  R H + CDR TAVSLL EL+ ++ G GV+      G
Subjt:  NVLNYHKTPHDISISPLQIILEHELNENSPWQTLHYSDTAATSAADMLFVSLYEVGSGKWIVVFRLHVAACDRTTAVSLLEELLLLMNGGGVD----KTG

Query:  EVELGMEDLVPRKLAKKSMLSRGLNVISYSVNSLRLTNLKFKDVKSARRSQVARLQMNRTETHKILSECKSRGIKLSSAMVAAGLVATHSSGSHGLDRHQ
        E ++G+E L+   +AKK++ +  L+ + YSVNSLRLTNL F++ K  RRS+V RLQ+    T  IL  CKSRGIK+   + AA ++A +S+  H  D   
Subjt:  EVELGMEDLVPRKLAKKSMLSRGLNVISYSVNSLRLTNLKFKDVKSARRSQVARLQMNRTETHKILSECKSRGIKLSSAMVAAGLVATHSSGSHGLDRHQ

Query:  RKYGIITLIDCRRFLEPPLRSHHFGFYHAAILNSYTIRGGEELWELAKKISTTLEASKNSNKHFTDMSDLNFLLCRAVENPSLTESGAMRTSLMTVFEDT
        +KYG++TL DCR   +PPL  HH+GFYH+AILN + ++G E LW+LAK         K SNKH +DM+DLNFL+ +A++NP+LT S ++RTSL+TVFED 
Subjt:  RKYGIITLIDCRRFLEPPLRSHHFGFYHAAILNSYTIRGGEELWELAKKISTTLEASKNSNKHFTDMSDLNFLLCRAVENPSLTESGAMRTSLMTVFEDT

Query:  VVDNSGAMQAEIGIKDYMGCASTHGIGPSVAVFDTIRDGRLDCACVYPAPLHSREQMEALVDNMKALLSDVVLAFQIVSPPLGPSSSSYYLLLKGSDFLR
        V+D+S  MQ  IG +D++GCAS HG+GPS+A+FDT+R+GRLDC CVYPAPLHSREQM  LV+ M                                    
Subjt:  VVDNSGAMQAEIGIKDYMGCASTHGIGPSVAVFDTIRDGRLDCACVYPAPLHSREQMEALVDNMKALLSDVVLAFQIVSPPLGPSSSSYYLLLKGSDFLR

Query:  LQYPLCTPTVTASKMSDEIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHLQILD
                    SK S   K R V  TE SWCRAVPGGTG TVL LL SKPPD+  LQ++L   Q+ HPIL SK+  D S   FS +   SP + ++  D
Subjt:  LQYPLCTPTVTASKMSDEIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHLQILD

Query:  LAAAARAIASHPDADNPSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAGGKIEGGGFE
          + A+ + SH  + + SI     ILEHE+N   W NP  PS SD D+  A++Y L    W + L +HT+ CDR AAA+L+ EL+ L            E
Subjt:  LAAAARAIASHPDADNPSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAGGKIEGGGFE

Query:  IGDNGEIGSGIEDLIPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLP
             E+  GIED IP+G   K  WARG+DMLGYSLNSFR ANL F D +S R S ++R+++N+++T ++L+ C S  IKL  AL AA LIA   SK  P
Subjt:  IGDNGEIGSGIEDLIPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLP

Query:  PYQTEKYAVVTLIDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENPSLTPSSSMRTALISA
          Q EKYA VTL DCRS LDP L++HH+GFYHSAIL+THDI   + LWE+A+R + SF N K  NKHFTDM+DLNFLM KAIENP LTPSSS+RT+L+S 
Subjt:  PYQTEKYAVVTLIDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENPSLTPSSSMRTALISA

Query:  FEDPIIYTSDPAQQHLGISDYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSG
        FEDPI    +  ++ LG  D+IGCAS HGVGPS+A FD IR+G++DCA VYP PLFSR+QMN   D +K IL+ G
Subjt:  FEDPIIYTSDPAQQHLGISDYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSG

A0A6J1F424 uncharacterized protein LOC1114421998.6e-27099.57Show/hide
Query:  MSDEIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHLQILDLAAAARAIASHPDA
        MSDEIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHLQILDLAAAARAIASHPDA
Subjt:  MSDEIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHLQILDLAAAARAIASHPDA

Query:  DNPSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAGGKIEGGGFEIGDNGEIGSGIEDL
        DNPSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAGGKIEGGGFEIGDNGEIGSGIEDL
Subjt:  DNPSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAGGKIEGGGFEIGDNGEIGSGIEDL

Query:  IPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPYQTEKYAVVTLID
        IPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPYQTEKYAVVTLID
Subjt:  IPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPYQTEKYAVVTLID

Query:  CRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENPSLTPSSSMRTALISAFEDPIIYTSDPAQQ
        CRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENP LTPSSSMRTALISAFEDPIIYTSDPAQQ
Subjt:  CRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENPSLTPSSSMRTALISAFEDPIIYTSDPAQQ

Query:  HLGISDYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGAMEVVE
        HLGI DYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGAMEVVE
Subjt:  HLGISDYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGAMEVVE

A0A6J1F529 uncharacterized protein LOC1114422005.6e-261100Show/hide
Query:  MEGSISRRRVAACTETAWCRAVPGGTGTAVIALSSSDAPNLQLLQNALQELQNSHPILKSKLHFNPISSAFSFITSPTPFVQIKTYELPETSKILNDQNV
        MEGSISRRRVAACTETAWCRAVPGGTGTAVIALSSSDAPNLQLLQNALQELQNSHPILKSKLHFNPISSAFSFITSPTPFVQIKTYELPETSKILNDQNV
Subjt:  MEGSISRRRVAACTETAWCRAVPGGTGTAVIALSSSDAPNLQLLQNALQELQNSHPILKSKLHFNPISSAFSFITSPTPFVQIKTYELPETSKILNDQNV

Query:  LNYHKTPHDISISPLQIILEHELNENSPWQTLHYSDTAATSAADMLFVSLYEVGSGKWIVVFRLHVAACDRTTAVSLLEELLLLMNGGGVDKTGEVELGM
        LNYHKTPHDISISPLQIILEHELNENSPWQTLHYSDTAATSAADMLFVSLYEVGSGKWIVVFRLHVAACDRTTAVSLLEELLLLMNGGGVDKTGEVELGM
Subjt:  LNYHKTPHDISISPLQIILEHELNENSPWQTLHYSDTAATSAADMLFVSLYEVGSGKWIVVFRLHVAACDRTTAVSLLEELLLLMNGGGVDKTGEVELGM

Query:  EDLVPRKLAKKSMLSRGLNVISYSVNSLRLTNLKFKDVKSARRSQVARLQMNRTETHKILSECKSRGIKLSSAMVAAGLVATHSSGSHGLDRHQRKYGII
        EDLVPRKLAKKSMLSRGLNVISYSVNSLRLTNLKFKDVKSARRSQVARLQMNRTETHKILSECKSRGIKLSSAMVAAGLVATHSSGSHGLDRHQRKYGII
Subjt:  EDLVPRKLAKKSMLSRGLNVISYSVNSLRLTNLKFKDVKSARRSQVARLQMNRTETHKILSECKSRGIKLSSAMVAAGLVATHSSGSHGLDRHQRKYGII

Query:  TLIDCRRFLEPPLRSHHFGFYHAAILNSYTIRGGEELWELAKKISTTLEASKNSNKHFTDMSDLNFLLCRAVENPSLTESGAMRTSLMTVFEDTVVDNSG
        TLIDCRRFLEPPLRSHHFGFYHAAILNSYTIRGGEELWELAKKISTTLEASKNSNKHFTDMSDLNFLLCRAVENPSLTESGAMRTSLMTVFEDTVVDNSG
Subjt:  TLIDCRRFLEPPLRSHHFGFYHAAILNSYTIRGGEELWELAKKISTTLEASKNSNKHFTDMSDLNFLLCRAVENPSLTESGAMRTSLMTVFEDTVVDNSG

Query:  AMQAEIGIKDYMGCASTHGIGPSVAVFDTIRDGRLDCACVYPAPLHSREQMEALVDNMKALL
        AMQAEIGIKDYMGCASTHGIGPSVAVFDTIRDGRLDCACVYPAPLHSREQMEALVDNMKALL
Subjt:  AMQAEIGIKDYMGCASTHGIGPSVAVFDTIRDGRLDCACVYPAPLHSREQMEALVDNMKALL

A0A6J1IXX9 uncharacterized protein LOC1114816362.4e-25697.84Show/hide
Query:  MEGSISRRRVAACTETAWCRAVPGGTGTAVIALSSSDAPNLQLLQNALQELQNSHPILKSKLHFNPISSAFSFITSPTPFVQIKTYELPETSKILNDQNV
        MEGSISRRRVAACTETAWCRAVPGGTGTAVIALSSSD PNLQLLQNALQELQNSHPILKSKLHFNPISS FSF+TSPTPFVQIKTYELPETSKILNDQNV
Subjt:  MEGSISRRRVAACTETAWCRAVPGGTGTAVIALSSSDAPNLQLLQNALQELQNSHPILKSKLHFNPISSAFSFITSPTPFVQIKTYELPETSKILNDQNV

Query:  LNYHKTPHDISISPLQIILEHELNENSPWQTLHYSDTAATSAADMLFVSLYEVGSGKWIVVFRLHVAACDRTTAVSLLEELLLLMNGGGVDKTGEVELGM
        LNYHKTPHDISISPLQIILEHELNENSPWQTLHYSDTAA S+ADMLFVSLYEVGSGKWIVVFRLHVAACDRTTAVSLLEELLLLMNGGGVDKTGEVELGM
Subjt:  LNYHKTPHDISISPLQIILEHELNENSPWQTLHYSDTAATSAADMLFVSLYEVGSGKWIVVFRLHVAACDRTTAVSLLEELLLLMNGGGVDKTGEVELGM

Query:  EDLVPRKLAKKSMLSRGLNVISYSVNSLRLTNLKFKDVKSARRSQVARLQMNRTETHKILSECKSRGIKLSSAMVAAGLVATHSSGSHGLDRHQRKYGII
        EDLVPR LAKKSMLSRGLNVISYSVNSLRLTNLKFKDVKSARRSQVARLQMNRTETHKILSECKSRGIKLSS MVAAGLVATHSSGSHGLDRHQRKYGII
Subjt:  EDLVPRKLAKKSMLSRGLNVISYSVNSLRLTNLKFKDVKSARRSQVARLQMNRTETHKILSECKSRGIKLSSAMVAAGLVATHSSGSHGLDRHQRKYGII

Query:  TLIDCRRFLEPPLRSHHFGFYHAAILNSYTIRGGEELWELAKKISTTLEASKNSNKHFTDMSDLNFLLCRAVENPSLTESGAMRTSLMTVFEDTVVDNSG
        TLIDCRRFLEPPLRS+HFGFYHAAILNSYTIRGGEELWELAKKISTTLEASKNSNKHFTDMSDLNFLLCRAVENPSLTESGAMRTSLMTVFEDTVVDNSG
Subjt:  TLIDCRRFLEPPLRSHHFGFYHAAILNSYTIRGGEELWELAKKISTTLEASKNSNKHFTDMSDLNFLLCRAVENPSLTESGAMRTSLMTVFEDTVVDNSG

Query:  AMQAEIGIKDYMGCASTHGIGPSVAVFDTIRDGRLDCACVYPAPLHSREQMEALVDNMKALL
        AMQAEIG+KDY+GCASTHGIGPSVAVFDTIRDGRLDCACVYPAPLHSREQMEALVDNMKALL
Subjt:  AMQAEIGIKDYMGCASTHGIGPSVAVFDTIRDGRLDCACVYPAPLHSREQMEALVDNMKALL

A0A6J1J6C5 uncharacterized protein LOC1114816322.3e-26296.57Show/hide
Query:  MSDEIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHLQILDLAAAARAIASHPDA
        MSDEI+ RPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFS LIPPSPSIHLQILDLAAAARAIASHPDA
Subjt:  MSDEIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHLQILDLAAAARAIASHPDA

Query:  DNPSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAGGKIEGGGFEIGDNGEIGSGIEDL
        D+PSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYA+SDGQWAVFLTLHTAACDRVAAASLLRELLVLTAA GKIEGGGF+IGDNGEIGSGIEDL
Subjt:  DNPSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAGGKIEGGGFEIGDNGEIGSGIEDL

Query:  IPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPYQTEKYAVVTLID
        IPSGKA+KPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPP+QTEKYAVVTLID
Subjt:  IPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPYQTEKYAVVTLID

Query:  CRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENPSLTPSSSMRTALISAFEDPIIYTSDPAQQ
        CRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEV+ERCYFSFSN K+NNKHFTDMSDLNFLMGKAIENP LTPSSSMRTALISAFEDPIIYTSDPAQQ
Subjt:  CRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENPSLTPSSSMRTALISAFEDPIIYTSDPAQQ

Query:  HLGISDYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGAMEVVE
        HLGISDYIGCASAHGVGPSIALFD+IRDGQLDCACVYPSPLFSRDQMN LFDEMKKILVS AMEVVE
Subjt:  HLGISDYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGAMEVVE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G52610.1 unknown protein1.2e-13554.98Show/hide
Query:  RPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHLQI--LDLAAAARAIASHPDADNPSI
        RPVGGTE+SWCRA+ GGTG  V+ LLLS+ P + +LQ +L  LQ  HP LRS I  D S   FS ++  +   H++I   D  + A+ I    D+D+P  
Subjt:  RPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHLQI--LDLAAAARAIASHPDADNPSI

Query:  SDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLT--LHTAACDRVAAASLLRELLVLTAAGGKIEGGGFEIGDNG-EIGSGIEDLIP
             ILEHE+N+  W+NP     S++ V   ++Y L+D      LT  L+TAA DR AA +LLRE +  TAA G   G      +    +G  IE+LIP
Subjt:  SDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLT--LHTAACDRVAAASLLRELLVLTAAGGKIEGGGFEIGDNG-EIGSGIEDLIP

Query:  SGKAYKPLWARGLDMLGYSLNSFRFANLEFKDA-SSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPYQTEKYAVVTLIDC
        SGK  KP WARG+D+LGYSLN+FRF+NL F DA +S R SQ++RLKL+ D+T KL+AGCK+RG+KL  AL ++ LIA   SK+LPPYQ EKYAVVTL DC
Subjt:  SGKAYKPLWARGLDMLGYSLNSFRFANLEFKDA-SSERFSQMIRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPYQTEKYAVVTLIDC

Query:  RSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENPSLTPSSSMRTALISAFEDPIIYTS-DPAQQ
        RS+L+PPLT++  GFYH+ IL+THD++ E+ LW++A+RCY SF++ K +NK FTDMSDLNFLM KAIENP+LTPSSS+RTA IS FEDP+I  S +P   
Subjt:  RSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENPSLTPSSSMRTALISAFEDPIIYTS-DPAQQ

Query:  HLGISDYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGA
         LG+ DYIGCAS HGVGPS+A+FD +RDG+LDCA VYPSPL SR+QM+ L   MK IL+ G+
Subjt:  HLGISDYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGTTCCATTAGCCGACGGCGAGTCGCTGCCTGCACTGAAACCGCCTGGTGCCGCGCCGTCCCGGGCGGAACCGGCACGGCGGTGATAGCTCTATCCTCCTCCGA
CGCACCAAATCTCCAACTTCTCCAAAATGCCCTTCAGGAACTCCAAAATTCCCACCCTATCCTCAAATCCAAACTCCACTTCAATCCCATTTCTTCCGCTTTCTCCTTCA
TCACTTCACCGACTCCTTTCGTTCAAATCAAGACGTACGAACTTCCAGAAACTTCTAAAATCCTAAATGACCAAAATGTCCTCAACTACCACAAAACCCCTCACGACATT
TCCATTTCCCCTCTCCAGATTATCCTCGAACACGAACTCAACGAGAACTCCCCTTGGCAGACCCTACACTACTCCGACACGGCGGCGACTTCCGCAGCCGACATGTTGTT
TGTTAGCTTGTACGAGGTTGGCTCAGGTAAATGGATCGTCGTGTTCCGGCTCCACGTGGCAGCGTGTGACCGGACCACAGCGGTGTCGTTGCTGGAGGAGCTGCTCCTTT
TGATGAACGGCGGCGGCGTTGATAAAACAGGGGAAGTGGAGCTGGGCATGGAGGATCTTGTTCCTAGAAAGTTGGCGAAGAAATCAATGTTGTCCCGAGGACTGAATGTG
ATCAGCTACTCCGTTAACTCTTTGAGATTAACGAATCTGAAATTCAAAGATGTAAAGTCCGCGAGACGATCGCAGGTGGCGCGGCTCCAGATGAACCGGACTGAAACCCA
CAAGATTCTCTCTGAATGCAAGTCGAGAGGGATAAAGTTGAGTTCGGCAATGGTGGCGGCGGGGCTGGTGGCAACGCATAGCTCCGGCAGCCATGGGTTGGACCGGCATC
AACGGAAGTACGGAATCATAACGCTGATAGATTGCCGGCGGTTTCTGGAGCCTCCGCTGAGATCCCACCATTTCGGATTTTACCATGCTGCCATCTTGAACTCTTACACC
ATAAGAGGAGGAGAAGAGCTGTGGGAGTTAGCAAAGAAAATCTCGACGACATTGGAGGCTTCCAAGAACTCAAACAAGCACTTCACCGACATGTCGGACCTAAACTTTCT
GCTATGTCGTGCCGTTGAGAACCCAAGCCTCACAGAATCAGGAGCAATGAGGACATCGCTAATGACGGTGTTTGAAGATACGGTGGTGGACAATTCAGGTGCAATGCAGG
CTGAGATCGGCATTAAGGACTACATGGGCTGTGCGTCTACTCACGGCATTGGCCCGTCCGTCGCCGTGTTCGACACGATTAGAGACGGGCGGCTCGACTGCGCGTGTGTT
TATCCAGCTCCGTTGCACTCCCGGGAGCAAATGGAAGCTCTGGTTGATAACATGAAGGCTCTTCTCTCCGACGTCGTTTTAGCGTTCCAAATCGTTTCTCCTCCCCTTGG
ACCATCGTCTTCTTCCTATTATCTTCTTTTGAAAGGATCAGATTTCCTACGGCTGCAGTACCCTCTCTGTACTCCCACCGTCACCGCCTCCAAAATGTCCGATGAGATCA
AGTTCCGTCCCGTCGGCGGCACCGAGCACAGCTGGTGCCGCGCCGTTCCCGGTGGCACCGGCACCACTGTCCTTGGCCTCCTCCTCTCAAAGCCTCCCGATATTTCCCAT
CTCCAAGCCTCCCTCCACAATCTCCAAAACCTCCACCCAATTCTCCGCTCCAAAATCCACCACGATCCTTCCCGACGAGATTTCTCCTTACTTATTCCCCCTTCTCCGTC
GATCCATCTCCAGATCCTCGATCTCGCCGCTGCCGCACGTGCCATCGCCTCTCATCCCGACGCCGACAATCCTTCCATCTCTGATTTTCACAAGATCCTCGAGCACGAAA
TCAACCGCGCCAAGTGGCTCAATCCGAGCCATCCGTCCTACTCCGATACCGATGTGATGTTCGCCACCGTCTACGCCCTCAGCGACGGCCAATGGGCGGTGTTTCTCACC
CTCCACACAGCGGCGTGCGATCGGGTTGCAGCGGCGTCTCTGTTGCGAGAGCTGCTCGTGCTTACGGCGGCGGGAGGAAAAATCGAGGGCGGAGGATTTGAAATTGGGGA
TAATGGTGAGATTGGATCAGGGATTGAGGATCTAATCCCTAGCGGTAAAGCGTATAAGCCTCTGTGGGCGCGTGGATTGGACATGCTTGGTTACTCGTTGAATTCTTTCC
GATTCGCGAATTTGGAATTCAAAGACGCGAGTTCTGAGAGATTTTCTCAGATGATTAGGTTGAAATTGAACTCCGATGAGACCCAGAAACTTCTCGCTGGCTGCAAATCG
AGAGGCATTAAGCTGTGTGGAGCTCTTGAAGCTGCTGGATTGATAGCTACCCGTTGTTCTAAAGACCTTCCTCCTTATCAGACCGAGAAATATGCTGTGGTTACACTCAT
CGATTGTCGTTCCCTACTTGATCCTCCCCTCACGACCCACCATTTAGGATTCTATCACTCTGCCATCCTCAACACACACGACATATCAGCTGAAGATACACTATGGGAAG
TGGCAGAGCGATGCTATTTTTCCTTCTCAAATGGCAAAGAAAACAACAAGCATTTCACAGACATGTCTGACTTGAACTTTCTCATGGGCAAAGCAATCGAAAATCCCAGC
CTCACTCCATCCTCATCCATGAGAACGGCCCTAATCTCGGCGTTCGAAGACCCAATCATCTACACATCTGATCCTGCTCAGCAACACCTAGGCATATCTGATTACATCGG
TTGTGCCTCCGCGCACGGTGTCGGACCATCGATCGCCCTCTTCGACATGATTCGTGACGGTCAGTTGGATTGTGCTTGTGTGTACCCGTCGCCTTTGTTTTCCCGAGATC
AAATGAACCTACTTTTCGATGAGATGAAGAAAATTTTGGTAAGTGGTGCCATGGAAGTAGTTGAAAGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGTTCCATTAGCCGACGGCGAGTCGCTGCCTGCACTGAAACCGCCTGGTGCCGCGCCGTCCCGGGCGGAACCGGCACGGCGGTGATAGCTCTATCCTCCTCCGA
CGCACCAAATCTCCAACTTCTCCAAAATGCCCTTCAGGAACTCCAAAATTCCCACCCTATCCTCAAATCCAAACTCCACTTCAATCCCATTTCTTCCGCTTTCTCCTTCA
TCACTTCACCGACTCCTTTCGTTCAAATCAAGACGTACGAACTTCCAGAAACTTCTAAAATCCTAAATGACCAAAATGTCCTCAACTACCACAAAACCCCTCACGACATT
TCCATTTCCCCTCTCCAGATTATCCTCGAACACGAACTCAACGAGAACTCCCCTTGGCAGACCCTACACTACTCCGACACGGCGGCGACTTCCGCAGCCGACATGTTGTT
TGTTAGCTTGTACGAGGTTGGCTCAGGTAAATGGATCGTCGTGTTCCGGCTCCACGTGGCAGCGTGTGACCGGACCACAGCGGTGTCGTTGCTGGAGGAGCTGCTCCTTT
TGATGAACGGCGGCGGCGTTGATAAAACAGGGGAAGTGGAGCTGGGCATGGAGGATCTTGTTCCTAGAAAGTTGGCGAAGAAATCAATGTTGTCCCGAGGACTGAATGTG
ATCAGCTACTCCGTTAACTCTTTGAGATTAACGAATCTGAAATTCAAAGATGTAAAGTCCGCGAGACGATCGCAGGTGGCGCGGCTCCAGATGAACCGGACTGAAACCCA
CAAGATTCTCTCTGAATGCAAGTCGAGAGGGATAAAGTTGAGTTCGGCAATGGTGGCGGCGGGGCTGGTGGCAACGCATAGCTCCGGCAGCCATGGGTTGGACCGGCATC
AACGGAAGTACGGAATCATAACGCTGATAGATTGCCGGCGGTTTCTGGAGCCTCCGCTGAGATCCCACCATTTCGGATTTTACCATGCTGCCATCTTGAACTCTTACACC
ATAAGAGGAGGAGAAGAGCTGTGGGAGTTAGCAAAGAAAATCTCGACGACATTGGAGGCTTCCAAGAACTCAAACAAGCACTTCACCGACATGTCGGACCTAAACTTTCT
GCTATGTCGTGCCGTTGAGAACCCAAGCCTCACAGAATCAGGAGCAATGAGGACATCGCTAATGACGGTGTTTGAAGATACGGTGGTGGACAATTCAGGTGCAATGCAGG
CTGAGATCGGCATTAAGGACTACATGGGCTGTGCGTCTACTCACGGCATTGGCCCGTCCGTCGCCGTGTTCGACACGATTAGAGACGGGCGGCTCGACTGCGCGTGTGTT
TATCCAGCTCCGTTGCACTCCCGGGAGCAAATGGAAGCTCTGGTTGATAACATGAAGGCTCTTCTCTCCGACGTCGTTTTAGCGTTCCAAATCGTTTCTCCTCCCCTTGG
ACCATCGTCTTCTTCCTATTATCTTCTTTTGAAAGGATCAGATTTCCTACGGCTGCAGTACCCTCTCTGTACTCCCACCGTCACCGCCTCCAAAATGTCCGATGAGATCA
AGTTCCGTCCCGTCGGCGGCACCGAGCACAGCTGGTGCCGCGCCGTTCCCGGTGGCACCGGCACCACTGTCCTTGGCCTCCTCCTCTCAAAGCCTCCCGATATTTCCCAT
CTCCAAGCCTCCCTCCACAATCTCCAAAACCTCCACCCAATTCTCCGCTCCAAAATCCACCACGATCCTTCCCGACGAGATTTCTCCTTACTTATTCCCCCTTCTCCGTC
GATCCATCTCCAGATCCTCGATCTCGCCGCTGCCGCACGTGCCATCGCCTCTCATCCCGACGCCGACAATCCTTCCATCTCTGATTTTCACAAGATCCTCGAGCACGAAA
TCAACCGCGCCAAGTGGCTCAATCCGAGCCATCCGTCCTACTCCGATACCGATGTGATGTTCGCCACCGTCTACGCCCTCAGCGACGGCCAATGGGCGGTGTTTCTCACC
CTCCACACAGCGGCGTGCGATCGGGTTGCAGCGGCGTCTCTGTTGCGAGAGCTGCTCGTGCTTACGGCGGCGGGAGGAAAAATCGAGGGCGGAGGATTTGAAATTGGGGA
TAATGGTGAGATTGGATCAGGGATTGAGGATCTAATCCCTAGCGGTAAAGCGTATAAGCCTCTGTGGGCGCGTGGATTGGACATGCTTGGTTACTCGTTGAATTCTTTCC
GATTCGCGAATTTGGAATTCAAAGACGCGAGTTCTGAGAGATTTTCTCAGATGATTAGGTTGAAATTGAACTCCGATGAGACCCAGAAACTTCTCGCTGGCTGCAAATCG
AGAGGCATTAAGCTGTGTGGAGCTCTTGAAGCTGCTGGATTGATAGCTACCCGTTGTTCTAAAGACCTTCCTCCTTATCAGACCGAGAAATATGCTGTGGTTACACTCAT
CGATTGTCGTTCCCTACTTGATCCTCCCCTCACGACCCACCATTTAGGATTCTATCACTCTGCCATCCTCAACACACACGACATATCAGCTGAAGATACACTATGGGAAG
TGGCAGAGCGATGCTATTTTTCCTTCTCAAATGGCAAAGAAAACAACAAGCATTTCACAGACATGTCTGACTTGAACTTTCTCATGGGCAAAGCAATCGAAAATCCCAGC
CTCACTCCATCCTCATCCATGAGAACGGCCCTAATCTCGGCGTTCGAAGACCCAATCATCTACACATCTGATCCTGCTCAGCAACACCTAGGCATATCTGATTACATCGG
TTGTGCCTCCGCGCACGGTGTCGGACCATCGATCGCCCTCTTCGACATGATTCGTGACGGTCAGTTGGATTGTGCTTGTGTGTACCCGTCGCCTTTGTTTTCCCGAGATC
AAATGAACCTACTTTTCGATGAGATGAAGAAAATTTTGGTAAGTGGTGCCATGGAAGTAGTTGAAAGCTAA
Protein sequenceShow/hide protein sequence
MEGSISRRRVAACTETAWCRAVPGGTGTAVIALSSSDAPNLQLLQNALQELQNSHPILKSKLHFNPISSAFSFITSPTPFVQIKTYELPETSKILNDQNVLNYHKTPHDI
SISPLQIILEHELNENSPWQTLHYSDTAATSAADMLFVSLYEVGSGKWIVVFRLHVAACDRTTAVSLLEELLLLMNGGGVDKTGEVELGMEDLVPRKLAKKSMLSRGLNV
ISYSVNSLRLTNLKFKDVKSARRSQVARLQMNRTETHKILSECKSRGIKLSSAMVAAGLVATHSSGSHGLDRHQRKYGIITLIDCRRFLEPPLRSHHFGFYHAAILNSYT
IRGGEELWELAKKISTTLEASKNSNKHFTDMSDLNFLLCRAVENPSLTESGAMRTSLMTVFEDTVVDNSGAMQAEIGIKDYMGCASTHGIGPSVAVFDTIRDGRLDCACV
YPAPLHSREQMEALVDNMKALLSDVVLAFQIVSPPLGPSSSSYYLLLKGSDFLRLQYPLCTPTVTASKMSDEIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISH
LQASLHNLQNLHPILRSKIHHDPSRRDFSLLIPPSPSIHLQILDLAAAARAIASHPDADNPSISDFHKILEHEINRAKWLNPSHPSYSDTDVMFATVYALSDGQWAVFLT
LHTAACDRVAAASLLRELLVLTAAGGKIEGGGFEIGDNGEIGSGIEDLIPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQMIRLKLNSDETQKLLAGCKS
RGIKLCGALEAAGLIATRCSKDLPPYQTEKYAVVTLIDCRSLLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFLMGKAIENPS
LTPSSSMRTALISAFEDPIIYTSDPAQQHLGISDYIGCASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGAMEVVES