; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10012506 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10012506
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionNucleic acid-binding proteins superfamily isoform 1
Genome locationChr01:21929199..21937540
RNA-Seq ExpressionHG10012506
SyntenyHG10012506
Gene Ontology termsNA
InterPro domainsIPR012340 - Nucleic acid-binding, OB-fold
IPR035200 - Cell division control protein 24, OB domain 2
IPR035201 - Cell division control protein 24, OB domain 1
IPR035203 - Cell division control protein 24, OB domain 3


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049545.1 Nucleic acid-binding proteins superfamily isoform 1 [Cucumis melo var. makuwa]0.0e+0086.26Show/hide
Query:  MELDDRRRLQEEGDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ
        MELDD R+LQEEGDDDPFLKFVDYARSVLAFED+EDFDPNVNGTET+TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ
Subjt:  MELDDRRRLQEEGDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ

Query:  LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP-----------------------------VDGILKKGRQIFVTGCYLRAASGGSGHPR
        LKKKNRRKKLPKTVTIDSIYEKNFLS+SSVLEAVILDEFILP                             VDGILKKGRQIFVTGCYLRAASGGSG+PR
Subjt:  LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP-----------------------------VDGILKKGRQIFVTGCYLRAASGGSGHPR

Query:  LLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSL---IESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGS
        LLPTEYL+ILLDEEEDDDVMLLGAQFCSD+FSSVSLDSVNEG TYSL   IESIGPLEI+EKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGS
Subjt:  LLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSL---IESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGS

Query:  VLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK
        VLALDRPY+ATVNENG+GTS+ELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRT+S SYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK
Subjt:  VLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK

Query:  MTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFARS------------------C------LEALWTENHVGASFVNVSCLPALLTSSCLHK
        MTGISLYG VLDIANERNTTEA FS+RIEDNTGEILAKL F RS                  C      LEALW ENHVGASFVN+SCLPALLTSSCLHK
Subjt:  MTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFARS------------------C------LEALWTENHVGASFVNVSCLPALLTSSCLHK

Query:  LSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQISPDEFY
        LSRLSDLTSNTHGTKVC+VRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFC CECKSELV TFDLKITLADDSAKIFAWC GQTAAELLQISPDEF 
Subjt:  LSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQISPDEFY

Query:  ELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALKCE
        ELPEEEQVMYPSSLENENFVVAIVNCRRQ+ + GNN+ F NDPLSWEITRALKCE
Subjt:  ELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALKCE

XP_004134503.1 uncharacterized protein LOC101215087 [Cucumis sativus]0.0e+0085.5Show/hide
Query:  MELDDRRRLQEEGDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ
        MELDD ++LQEEGDDDPFLKFVDYARSVLAFED+EDFDPN+NGTET+TPGW+WIASRVLRTC+AYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ
Subjt:  MELDDRRRLQEEGDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ

Query:  LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP-----------------------------VDGILKKGRQIFVTGCYLRAASGGSGHPR
        LKKKNRRKKLPKTVTIDSIYEKNFL+LSSVLEAVILDEFILP                             V+GILKKGRQIFVTGCYLRAASGGSG+PR
Subjt:  LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP-----------------------------VDGILKKGRQIFVTGCYLRAASGGSGHPR

Query:  LLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSL---IESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGS
        LLPTEYLIILLDEEEDDDVMLLGAQFCSD+FSSVSLDSVNEG TYSL   IESIGPLEI+E +NGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGS
Subjt:  LLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSL---IESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGS

Query:  VLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK
        VLALDRPY+ATVNENG+GTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRT+S SYPTQ PQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK
Subjt:  VLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK

Query:  MTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFARS------------------C------LEALWTENHVGASFVNVSCLPALLTSSCLHK
        MTGISLYG VLDIANERNTTEA FS+RIEDNTGE+LAKL F RS                  C      LEALW ENHVGASFVN+SCLPALLTSSCLHK
Subjt:  MTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFARS------------------C------LEALWTENHVGASFVNVSCLPALLTSSCLHK

Query:  LSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQISPDEFY
        LSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSEL+ TFDLKITLADDSAKIFAWCTGQTAAELLQISPDEF 
Subjt:  LSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQISPDEFY

Query:  ELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALKCE
        ELPEEEQVMYPSSLENENFVVAIVNCRR++S  GNN+ F NDPLSWEITRALKCE
Subjt:  ELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALKCE

XP_008438949.1 PREDICTED: uncharacterized protein LOC103483891 isoform X2 [Cucumis melo]0.0e+0086.26Show/hide
Query:  MELDDRRRLQEEGDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ
        MELDD R+LQEEGDDDPFLKFVDYARSVLAFED+EDFDPNVNGTET+TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ
Subjt:  MELDDRRRLQEEGDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ

Query:  LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP-----------------------------VDGILKKGRQIFVTGCYLRAASGGSGHPR
        LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP                             VDGILKKGRQIFVTGCYLRAASGGSG+PR
Subjt:  LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP-----------------------------VDGILKKGRQIFVTGCYLRAASGGSGHPR

Query:  LLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSL---IESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGS
        LLPTEYL+ILLDEEEDDDVMLLGAQFCSD+FSSVSLDSVNEG TYSL   IESIGPLEI+EKINGLRMIQIILVDNDGFKLKFLLWGEQVLLA LLSVGS
Subjt:  LLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSL---IESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGS

Query:  VLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK
        VLALDRPY+ATVNENG+GTS+ELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRT+S SYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK
Subjt:  VLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK

Query:  MTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFARS------------------C------LEALWTENHVGASFVNVSCLPALLTSSCLHK
        MTGISLYG VLDIANERNTTEA FS+RIEDNTGEILAKL F RS                  C      LEALW ENHVGASFVN+SCLPALLTSSCLHK
Subjt:  MTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFARS------------------C------LEALWTENHVGASFVNVSCLPALLTSSCLHK

Query:  LSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQISPDEFY
        LSRLSDLTSNTHGTKVC+VRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFC CECKSELV TFDLKITLADDSAKIFAWC GQTAAELLQISPDEF 
Subjt:  LSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQISPDEFY

Query:  ELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALKCE
        ELPEEEQVMYPSSLENENFVVAIVNCRRQ+ + GNN+ F NDPLSWEITRALKCE
Subjt:  ELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALKCE

XP_016898971.1 PREDICTED: uncharacterized protein LOC103483891 isoform X1 [Cucumis melo]0.0e+0085.61Show/hide
Query:  MELDDRRRLQEEGDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ
        MELDD R+LQEEGDDDPFLKFVDYARSVLAFED+EDFDPNVNGTET+TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ
Subjt:  MELDDRRRLQEEGDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ

Query:  LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP-----------------------------VDGILKKGRQIFVTGCYLRAASGGSGHPR
        LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP                             VDGILKKGRQIFVTGCYLRAASGGSG+PR
Subjt:  LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP-----------------------------VDGILKKGRQIFVTGCYLRAASGGSGHPR

Query:  LLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSL---IESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLL----
        LLPTEYL+ILLDEEEDDDVMLLGAQFCSD+FSSVSLDSVNEG TYSL   IESIGPLEI+EKINGLRMIQIILVDNDGFKLKFLLWGEQVLLA LL    
Subjt:  LLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSL---IESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLL----

Query:  -SVGSVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVI
         SVGSVLALDRPY+ATVNENG+GTS+ELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRT+S SYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVI
Subjt:  -SVGSVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVI

Query:  DLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFARS------------------C------LEALWTENHVGASFVNVSCLPALLTS
        DLQDKMTGISLYG VLDIANERNTTEA FS+RIEDNTGEILAKL F RS                  C      LEALW ENHVGASFVN+SCLPALLTS
Subjt:  DLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFARS------------------C------LEALWTENHVGASFVNVSCLPALLTS

Query:  SCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQIS
        SCLHKLSRLSDLTSNTHGTKVC+VRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFC CECKSELV TFDLKITLADDSAKIFAWC GQTAAELLQIS
Subjt:  SCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQIS

Query:  PDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALKCE
        PDEF ELPEEEQVMYPSSLENENFVVAIVNCRRQ+ + GNN+ F NDPLSWEITRALKCE
Subjt:  PDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALKCE

XP_038880460.1 uncharacterized protein LOC120072117 [Benincasa hispida]0.0e+0086.26Show/hide
Query:  MELDDRRRLQEEGDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ
        MELDDR RLQ+EGDDDPFLKFVDYARSVLAFEDEE FDP+VNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ
Subjt:  MELDDRRRLQEEGDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ

Query:  LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP-----------------------------VDGILKKGRQIFVTGCYLRAASGGSGHPR
        LKKKNRRKKLPKTVTIDSI+EKNFLSLSSVLEAVILDEFILP                             VDGILKKGRQIFVTGCYLRAASGGSGHPR
Subjt:  LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP-----------------------------VDGILKKGRQIFVTGCYLRAASGGSGHPR

Query:  LLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSL---IESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGS
        LLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVN+G TYSL   IESIGPLEIYEKINGLRM+Q++LVDN GFKLKFLLWGEQVLLANLLSVGS
Subjt:  LLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSL---IESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGS

Query:  VLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK
        VLALDRPYIAT NENG+GTSDELCLEYGSATQLYLVPCIQHEEQVCV+TQNINQA RTLSTSYPTQGPQVSQVSLPCD HGAIDF NYPFRSFVIDLQDK
Subjt:  VLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK

Query:  MTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFARS------------------C------LEALWTENHVGASFVNVSCLPALLTSSCLHK
        MTGISLYGIVLDIA+ERNTTEAVFS+RIEDNTGE+LAKLHF RS                  C      LEALW ENHVGASFVN+SCLPALLTSSCLHK
Subjt:  MTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFARS------------------C------LEALWTENHVGASFVNVSCLPALLTSSCLHK

Query:  LSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQISPDEFY
        LSRLSDLT NT GTKVC+VRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSE V TFDLKITLAD+SAKIFAWCTGQTAAELLQISPDEF 
Subjt:  LSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQISPDEFY

Query:  ELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALKCE
        ELPEEEQVMYPSSLENENFVVAIVNCRRQ+S+CGNN+YFV DPLSWEITRALKCE
Subjt:  ELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALKCE

TrEMBL top hitse value%identityAlignment
A0A0A0L5D2 Uncharacterized protein0.0e+0085.5Show/hide
Query:  MELDDRRRLQEEGDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ
        MELDD ++LQEEGDDDPFLKFVDYARSVLAFED+EDFDPN+NGTET+TPGW+WIASRVLRTC+AYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ
Subjt:  MELDDRRRLQEEGDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ

Query:  LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP-----------------------------VDGILKKGRQIFVTGCYLRAASGGSGHPR
        LKKKNRRKKLPKTVTIDSIYEKNFL+LSSVLEAVILDEFILP                             V+GILKKGRQIFVTGCYLRAASGGSG+PR
Subjt:  LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP-----------------------------VDGILKKGRQIFVTGCYLRAASGGSGHPR

Query:  LLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSL---IESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGS
        LLPTEYLIILLDEEEDDDVMLLGAQFCSD+FSSVSLDSVNEG TYSL   IESIGPLEI+E +NGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGS
Subjt:  LLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSL---IESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGS

Query:  VLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK
        VLALDRPY+ATVNENG+GTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRT+S SYPTQ PQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK
Subjt:  VLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK

Query:  MTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFARS------------------C------LEALWTENHVGASFVNVSCLPALLTSSCLHK
        MTGISLYG VLDIANERNTTEA FS+RIEDNTGE+LAKL F RS                  C      LEALW ENHVGASFVN+SCLPALLTSSCLHK
Subjt:  MTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFARS------------------C------LEALWTENHVGASFVNVSCLPALLTSSCLHK

Query:  LSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQISPDEFY
        LSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSEL+ TFDLKITLADDSAKIFAWCTGQTAAELLQISPDEF 
Subjt:  LSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQISPDEFY

Query:  ELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALKCE
        ELPEEEQVMYPSSLENENFVVAIVNCRR++S  GNN+ F NDPLSWEITRALKCE
Subjt:  ELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALKCE

A0A1S3AX73 uncharacterized protein LOC103483891 isoform X20.0e+0086.26Show/hide
Query:  MELDDRRRLQEEGDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ
        MELDD R+LQEEGDDDPFLKFVDYARSVLAFED+EDFDPNVNGTET+TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ
Subjt:  MELDDRRRLQEEGDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ

Query:  LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP-----------------------------VDGILKKGRQIFVTGCYLRAASGGSGHPR
        LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP                             VDGILKKGRQIFVTGCYLRAASGGSG+PR
Subjt:  LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP-----------------------------VDGILKKGRQIFVTGCYLRAASGGSGHPR

Query:  LLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSL---IESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGS
        LLPTEYL+ILLDEEEDDDVMLLGAQFCSD+FSSVSLDSVNEG TYSL   IESIGPLEI+EKINGLRMIQIILVDNDGFKLKFLLWGEQVLLA LLSVGS
Subjt:  LLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSL---IESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGS

Query:  VLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK
        VLALDRPY+ATVNENG+GTS+ELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRT+S SYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK
Subjt:  VLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK

Query:  MTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFARS------------------C------LEALWTENHVGASFVNVSCLPALLTSSCLHK
        MTGISLYG VLDIANERNTTEA FS+RIEDNTGEILAKL F RS                  C      LEALW ENHVGASFVN+SCLPALLTSSCLHK
Subjt:  MTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFARS------------------C------LEALWTENHVGASFVNVSCLPALLTSSCLHK

Query:  LSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQISPDEFY
        LSRLSDLTSNTHGTKVC+VRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFC CECKSELV TFDLKITLADDSAKIFAWC GQTAAELLQISPDEF 
Subjt:  LSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQISPDEFY

Query:  ELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALKCE
        ELPEEEQVMYPSSLENENFVVAIVNCRRQ+ + GNN+ F NDPLSWEITRALKCE
Subjt:  ELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALKCE

A0A1S4DSK5 uncharacterized protein LOC103483891 isoform X10.0e+0085.61Show/hide
Query:  MELDDRRRLQEEGDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ
        MELDD R+LQEEGDDDPFLKFVDYARSVLAFED+EDFDPNVNGTET+TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ
Subjt:  MELDDRRRLQEEGDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ

Query:  LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP-----------------------------VDGILKKGRQIFVTGCYLRAASGGSGHPR
        LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP                             VDGILKKGRQIFVTGCYLRAASGGSG+PR
Subjt:  LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP-----------------------------VDGILKKGRQIFVTGCYLRAASGGSGHPR

Query:  LLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSL---IESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLL----
        LLPTEYL+ILLDEEEDDDVMLLGAQFCSD+FSSVSLDSVNEG TYSL   IESIGPLEI+EKINGLRMIQIILVDNDGFKLKFLLWGEQVLLA LL    
Subjt:  LLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSL---IESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLL----

Query:  -SVGSVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVI
         SVGSVLALDRPY+ATVNENG+GTS+ELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRT+S SYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVI
Subjt:  -SVGSVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVI

Query:  DLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFARS------------------C------LEALWTENHVGASFVNVSCLPALLTS
        DLQDKMTGISLYG VLDIANERNTTEA FS+RIEDNTGEILAKL F RS                  C      LEALW ENHVGASFVN+SCLPALLTS
Subjt:  DLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFARS------------------C------LEALWTENHVGASFVNVSCLPALLTS

Query:  SCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQIS
        SCLHKLSRLSDLTSNTHGTKVC+VRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFC CECKSELV TFDLKITLADDSAKIFAWC GQTAAELLQIS
Subjt:  SCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQIS

Query:  PDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALKCE
        PDEF ELPEEEQVMYPSSLENENFVVAIVNCRRQ+ + GNN+ F NDPLSWEITRALKCE
Subjt:  PDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALKCE

A0A5A7U7H0 Nucleic acid-binding proteins superfamily isoform 10.0e+0086.26Show/hide
Query:  MELDDRRRLQEEGDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ
        MELDD R+LQEEGDDDPFLKFVDYARSVLAFED+EDFDPNVNGTET+TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ
Subjt:  MELDDRRRLQEEGDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ

Query:  LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP-----------------------------VDGILKKGRQIFVTGCYLRAASGGSGHPR
        LKKKNRRKKLPKTVTIDSIYEKNFLS+SSVLEAVILDEFILP                             VDGILKKGRQIFVTGCYLRAASGGSG+PR
Subjt:  LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP-----------------------------VDGILKKGRQIFVTGCYLRAASGGSGHPR

Query:  LLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSL---IESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGS
        LLPTEYL+ILLDEEEDDDVMLLGAQFCSD+FSSVSLDSVNEG TYSL   IESIGPLEI+EKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGS
Subjt:  LLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSL---IESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGS

Query:  VLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK
        VLALDRPY+ATVNENG+GTS+ELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRT+S SYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK
Subjt:  VLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK

Query:  MTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFARS------------------C------LEALWTENHVGASFVNVSCLPALLTSSCLHK
        MTGISLYG VLDIANERNTTEA FS+RIEDNTGEILAKL F RS                  C      LEALW ENHVGASFVN+SCLPALLTSSCLHK
Subjt:  MTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFARS------------------C------LEALWTENHVGASFVNVSCLPALLTSSCLHK

Query:  LSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQISPDEFY
        LSRLSDLTSNTHGTKVC+VRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFC CECKSELV TFDLKITLADDSAKIFAWC GQTAAELLQISPDEF 
Subjt:  LSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQISPDEFY

Query:  ELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALKCE
        ELPEEEQVMYPSSLENENFVVAIVNCRRQ+ + GNN+ F NDPLSWEITRALKCE
Subjt:  ELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALKCE

A0A6J1IB36 uncharacterized protein LOC111470879 isoform X18.9e-31083.21Show/hide
Query:  MELDDRRRLQEEGDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ
        MEL+DRRRLQEE DDDPFLKF+DYARSVLAFEDEEDFDPNV GT+T TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAW EQHR+GAPKKIPECINQ
Subjt:  MELDDRRRLQEEGDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQ

Query:  LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP-----------------------------VDGILKKGRQIFVTGCYLRAASGGSGHPR
        LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVI++EFILP                             V GILKKGRQIF+TGCYLRAASGGSGHPR
Subjt:  LKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP-----------------------------VDGILKKGRQIFVTGCYLRAASGGSGHPR

Query:  LLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSL---IESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGS
        LLPTEYLIILLDEEEDDDV+LLGAQFCSDSFSSVSLD+V++G TYSL   IESIGP EI+EK NGL+MIQI+L+DNDGFKLKFLLWGEQV+LANLLSVGS
Subjt:  LLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSL---IESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGS

Query:  VLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK
        +LALDRPYIATVNENGIG+SDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTL TSYPTQ P+VSQVSLPCDSHG IDFGNYPFRSFV+DLQDK
Subjt:  VLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK

Query:  MTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFARS------------------C------LEALWTENHVGASFVNVSCLPALLTSSCLHK
        MTGISLYGIV+DI NERNTTEAVFS+RIEDNTG+I AKLHF RS                  C      LEALW ENHVGASFVN+SCLPALLTSSCLHK
Subjt:  MTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFARS------------------C------LEALWTENHVGASFVNVSCLPALLTSSCLHK

Query:  LSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQISPDEFY
        +SRLSDLTSN+HGTKVC+ RLDQVSHCHVSTKFLHA CGHFVEETP RIEC FCR ECKSELV TFDLKITLADD+AKIFAWCTGQTAAELLQISPDEF 
Subjt:  LSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQISPDEFY

Query:  ELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALKCE
        ELPEEEQVMYPSSLENE+FVVAIVNCRRQTS+CGNN+Y VNDPLSWEITRALKCE
Subjt:  ELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALKCE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G17030.1 Nucleic acid-binding proteins superfamily1.0e-17750.68Show/hide
Query:  EEGDDDPFLKFVDYARSVLAFEDEED------FDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQLKKK
        +E  +DPFL F+DYAR+V++ ED+ED        P    TE + PGW W+ASR+L+TC AYSS VT AILLS+LSQAW+EQ++ G  KK PE I+QLKK 
Subjt:  EEGDDDPFLKFVDYARSVLAFEDEED------FDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQLKKK

Query:  NRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP--------------------------------VDGILKKGRQIFVTGCYLRAASGGSGHPRL
        +RR++L  TVTIDSIYEKNFLS++SVLEAVI++  +LP                                 +GIL+KGR++ +TGCYLR A  G G PRL
Subjt:  NRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP--------------------------------VDGILKKGRQIFVTGCYLRAASGGSGHPRL

Query:  LPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSL---IESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGSV
        LPTEYL++LLDE++DDD +L+ AQFCSD+FSSVSLD+ N+G +YSL   IESIGPLE     +  R  QI LVD DG +LKF+LWGEQV++ANLLSVGSV
Subjt:  LPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSL---IESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGSV

Query:  LALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCV-LTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK
        L ++RPYI+++ E+ +  + E CLEYGSAT LYLVP    EE+VCV L+Q+  Q S+ L +        VSQV+LP D+ G++DF NYPFR+ + D++DK
Subjt:  LALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCV-LTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDK

Query:  MTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFA-------------------------RSCLEALWTENHVGASFVNVSCLPALLTSSCLH
         TGISLYG+V DI+ + N T  VFSL+IED TG I AKLHF                           +C+E LW E    A+FVN+SCLPA LTSSC+H
Subjt:  MTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFA-------------------------RSCLEALWTENHVGASFVNVSCLPALLTSSCLH

Query:  KLSRLSDLTSNTH-GTKVCQVRLDQVSHCH-VSTKFLHAICGHFVEETP-----ARIECSFCRCECKS--ELVLTFDLKITLADDSAKIFAWCTGQTAAE
         +S LS ++        +C+V+LD++  CH ++T+  H++CGHF++E       A + CSFCR  C S  E+V TF + ITLAD+  K++AWCTGQ+A+ 
Subjt:  KLSRLSDLTSNTH-GTKVCQVRLDQVSHCH-VSTKFLHAICGHFVEETP-----ARIECSFCRCECKS--ELVLTFDLKITLADDSAKIFAWCTGQTAAE

Query:  LLQISPDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK
        +LQISPDEF +LPE++Q+MYPSSLENE F+V + N   +    G+      D   WEITRALK
Subjt:  LLQISPDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTAGATGATCGCCGAAGGCTTCAGGAAGAAGGTGATGATGATCCGTTTCTTAAATTTGTCGATTACGCGAGGTCCGTGCTAGCATTCGAAGACGAAGAAGACTT
CGATCCTAATGTTAATGGAACGGAGACAAATACCCCGGGTTGGAGTTGGATCGCCTCTCGGGTCCTCAGAACTTGTATCGCCTACTCCAGTTCTGTTACCCCTGCGATTT
TGCTATCCGAGCTCTCGCAGGCCTGGTATGAGCAACACAGAGTTGGTGCTCCCAAGAAAATACCTGAATGTATCAACCAGTTGAAGAAGAAGAATAGGAGAAAGAAACTC
CCAAAAACAGTTACTATTGACTCCATATATGAGAAGAATTTCCTATCTTTAAGTAGCGTTTTGGAAGCTGTAATTCTTGATGAGTTTATTCTTCCAGTGGATGGAATTCT
GAAGAAAGGGAGGCAAATATTTGTAACTGGATGCTATCTTCGTGCTGCCAGTGGTGGCTCTGGTCATCCACGACTTCTACCAACTGAATACCTTATCATATTGTTAGACG
AGGAAGAGGACGATGATGTAATGCTTCTAGGAGCTCAATTTTGTTCTGATTCCTTTTCTTCTGTTTCTCTTGATTCCGTCAATGAAGGGATTACGTATTCATTGATTGAG
TCCATTGGTCCACTGGAAATTTATGAGAAGATTAATGGTTTACGGATGATACAAATCATTCTTGTTGATAATGATGGTTTCAAGCTAAAGTTTCTCTTATGGGGTGAACA
GGTGCTACTGGCCAATCTTTTAAGTGTTGGTAGTGTGCTTGCGCTTGATAGACCATATATTGCAACCGTTAACGAGAATGGCATTGGAACAAGTGATGAACTTTGTCTTG
AATATGGTAGTGCAACACAGTTATATTTGGTGCCTTGCATTCAGCATGAGGAGCAAGTCTGTGTTTTAACTCAGAATATAAACCAAGCTTCAAGGACACTCAGTACATCG
TATCCTACTCAGGGTCCCCAAGTTTCTCAAGTTTCCTTGCCCTGTGATTCACATGGGGCAATTGATTTTGGTAATTATCCTTTTCGGTCTTTTGTGATCGACCTTCAAGA
CAAGATGACTGGCATTAGCTTATATGGTATCGTTTTAGATATAGCTAATGAAAGAAATACCACAGAAGCTGTTTTCTCTTTGAGAATTGAAGATAACACGGGAGAAATTT
TGGCCAAGTTACACTTCGCGAGATCTTGCTTAGAGGCGCTATGGACTGAGAATCATGTTGGAGCTTCTTTTGTCAACGTTAGCTGCTTGCCAGCATTGTTAACTTCATCT
TGTCTTCATAAACTTTCACGACTTTCTGATCTTACCAGCAACACTCATGGTACAAAGGTCTGTCAAGTTCGGCTCGACCAAGTTTCACATTGTCATGTCAGTACGAAATT
TTTGCATGCAATTTGTGGTCATTTCGTCGAGGAGACACCTGCCAGAATTGAGTGCAGCTTCTGTCGTTGTGAATGCAAGTCTGAGCTTGTGCTTACATTCGACCTCAAAA
TCACCCTTGCAGACGACAGTGCCAAAATCTTTGCATGGTGTACAGGTCAAACTGCTGCAGAGTTGTTGCAAATATCTCCTGATGAATTCTATGAACTACCTGAGGAAGAA
CAAGTAATGTACCCATCTTCACTCGAGAACGAGAATTTTGTGGTTGCAATAGTGAATTGCAGAAGGCAGACCAGCAGATGTGGAAATAATATCTATTTTGTTAATGATCC
ACTTTCATGGGAGATTACTCGTGCACTCAAGTGTGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTTAGATGATCGCCGAAGGCTTCAGGAAGAAGGTGATGATGATCCGTTTCTTAAATTTGTCGATTACGCGAGGTCCGTGCTAGCATTCGAAGACGAAGAAGACTT
CGATCCTAATGTTAATGGAACGGAGACAAATACCCCGGGTTGGAGTTGGATCGCCTCTCGGGTCCTCAGAACTTGTATCGCCTACTCCAGTTCTGTTACCCCTGCGATTT
TGCTATCCGAGCTCTCGCAGGCCTGGTATGAGCAACACAGAGTTGGTGCTCCCAAGAAAATACCTGAATGTATCAACCAGTTGAAGAAGAAGAATAGGAGAAAGAAACTC
CCAAAAACAGTTACTATTGACTCCATATATGAGAAGAATTTCCTATCTTTAAGTAGCGTTTTGGAAGCTGTAATTCTTGATGAGTTTATTCTTCCAGTGGATGGAATTCT
GAAGAAAGGGAGGCAAATATTTGTAACTGGATGCTATCTTCGTGCTGCCAGTGGTGGCTCTGGTCATCCACGACTTCTACCAACTGAATACCTTATCATATTGTTAGACG
AGGAAGAGGACGATGATGTAATGCTTCTAGGAGCTCAATTTTGTTCTGATTCCTTTTCTTCTGTTTCTCTTGATTCCGTCAATGAAGGGATTACGTATTCATTGATTGAG
TCCATTGGTCCACTGGAAATTTATGAGAAGATTAATGGTTTACGGATGATACAAATCATTCTTGTTGATAATGATGGTTTCAAGCTAAAGTTTCTCTTATGGGGTGAACA
GGTGCTACTGGCCAATCTTTTAAGTGTTGGTAGTGTGCTTGCGCTTGATAGACCATATATTGCAACCGTTAACGAGAATGGCATTGGAACAAGTGATGAACTTTGTCTTG
AATATGGTAGTGCAACACAGTTATATTTGGTGCCTTGCATTCAGCATGAGGAGCAAGTCTGTGTTTTAACTCAGAATATAAACCAAGCTTCAAGGACACTCAGTACATCG
TATCCTACTCAGGGTCCCCAAGTTTCTCAAGTTTCCTTGCCCTGTGATTCACATGGGGCAATTGATTTTGGTAATTATCCTTTTCGGTCTTTTGTGATCGACCTTCAAGA
CAAGATGACTGGCATTAGCTTATATGGTATCGTTTTAGATATAGCTAATGAAAGAAATACCACAGAAGCTGTTTTCTCTTTGAGAATTGAAGATAACACGGGAGAAATTT
TGGCCAAGTTACACTTCGCGAGATCTTGCTTAGAGGCGCTATGGACTGAGAATCATGTTGGAGCTTCTTTTGTCAACGTTAGCTGCTTGCCAGCATTGTTAACTTCATCT
TGTCTTCATAAACTTTCACGACTTTCTGATCTTACCAGCAACACTCATGGTACAAAGGTCTGTCAAGTTCGGCTCGACCAAGTTTCACATTGTCATGTCAGTACGAAATT
TTTGCATGCAATTTGTGGTCATTTCGTCGAGGAGACACCTGCCAGAATTGAGTGCAGCTTCTGTCGTTGTGAATGCAAGTCTGAGCTTGTGCTTACATTCGACCTCAAAA
TCACCCTTGCAGACGACAGTGCCAAAATCTTTGCATGGTGTACAGGTCAAACTGCTGCAGAGTTGTTGCAAATATCTCCTGATGAATTCTATGAACTACCTGAGGAAGAA
CAAGTAATGTACCCATCTTCACTCGAGAACGAGAATTTTGTGGTTGCAATAGTGAATTGCAGAAGGCAGACCAGCAGATGTGGAAATAATATCTATTTTGTTAATGATCC
ACTTTCATGGGAGATTACTCGTGCACTCAAGTGTGAATGA
Protein sequenceShow/hide protein sequence
MELDDRRRLQEEGDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQLKKKNRRKKL
PKTVTIDSIYEKNFLSLSSVLEAVILDEFILPVDGILKKGRQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITYSLIE
SIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVGSVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTS
YPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAKLHFARSCLEALWTENHVGASFVNVSCLPALLTSS
CLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQISPDEFYELPEEE
QVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALKCE