; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G015900 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G015900
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionNucleic acid-binding proteins superfamily isoform 1
Genome locationchr05:23666410..23677834
RNA-Seq ExpressionLsi05G015900
SyntenyLsi05G015900
Gene Ontology termsNA
InterPro domainsIPR012340 - Nucleic acid-binding, OB-fold
IPR035200 - Cell division control protein 24, OB domain 2
IPR035201 - Cell division control protein 24, OB domain 1
IPR035203 - Cell division control protein 24, OB domain 3


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049545.1 Nucleic acid-binding proteins superfamily isoform 1 [Cucumis melo var. makuwa]1.6e-26784.32Show/hide
Query:  KAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYN--QLPIYRLRPAETKD-----RFYDLVDGILKKG
        +AWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLS+SSVLEAVILDEFILP  N   L +     + T D     RFYDLVDGILKKG
Subjt:  KAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYN--QLPIYRLRPAETKD-----RFYDLVDGILKKG

Query:  RQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGIT------IESIGPLEIYEKINGLRMIQIILVDNDGF
        RQIFVTGCYLRAASGGSG+PRLLPTEYL+ILLDEEEDDDVMLLGAQFCSD+FSSVSLDSVNEG T      IESIGPLEI+EKINGLRMIQIILVDNDGF
Subjt:  RQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGIT------IESIGPLEIYEKINGLRMIQIILVDNDGF

Query:  KLKFLLWGEQVLLANLL---SVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDS
        KLKFLLWGEQVLLANLL   SVLALDRPY+ATVNENG+GTS+ELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRT+S SYPTQGPQVSQVSLPCDS
Subjt:  KLKFLLWGEQVLLANLL---SVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDS

Query:  HGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILA-------------------------------NLEALWTENHV
        HGAIDFGNYPFRSFVIDLQDKMTGISLYG VLDIANERNTTEA FS+RIEDNTGEILA                                LEALW ENHV
Subjt:  HGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILA-------------------------------NLEALWTENHV

Query:  GASFVNVSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKI
        GASFVN+SCLPALLTSSCLHKLSRLSDLTSNTHGTKVC+VRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFC CECKSELV TFDLKITLADDSAKI
Subjt:  GASFVNVSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKI

Query:  FAWCTGQTAAELLQISPDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK
        FAWC GQTAAELLQISPDEF ELPEEEQVMYPSSLENENFVVAIVNCRRQ+ + GNN+ F NDPLSWEITRALK
Subjt:  FAWCTGQTAAELLQISPDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK

TYK16222.1 Nucleic acid-binding proteins superfamily isoform 1 [Cucumis melo var. makuwa]3.4e-27086.85Show/hide
Query:  MKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYNQ-------LPIYR--LRPAETKDRFYDLVDGIL
        ++AWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLS+SSVLEAVILDEFILP+ +          +YR     ++T   FYDLVDGIL
Subjt:  MKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYNQ-------LPIYR--LRPAETKDRFYDLVDGIL

Query:  KKGRQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGIT---IESIGPLEIYEKINGLRMIQIILVDNDGF
        KKGRQIFVTGCYLRAASGGSG+PRLLPTEYL+ILLDEEEDDDVMLLGAQFCSD+FSSVSLDSVNEG T   IESIGPLEI+EKINGLRMIQIILVDNDGF
Subjt:  KKGRQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGIT---IESIGPLEIYEKINGLRMIQIILVDNDGF

Query:  KLKFLLWGEQVLLANLL--------SVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVS
        KLKFLLWGEQVLLANLL        SVLALDRPY+ATVNENG+GTS+ELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRT+S SYPTQGPQVSQVS
Subjt:  KLKFLLWGEQVLLANLL--------SVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVS

Query:  LPCDSHGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAN-------LEALWTENHVGASFVNVSCLPALLTSSCL
        LPCDSHGAIDFGNYPFRSFVIDLQDKMTGISLYG VLDIANERNTTEA FS+RIEDNTGEILA        LEALW ENHVGASFVN+SCLPALLTSSCL
Subjt:  LPCDSHGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAN-------LEALWTENHVGASFVNVSCLPALLTSSCL

Query:  HKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQISPDE
        HKLSRLSDLTSNTHGTKVC+VRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFC CECKSELV TFDLKITLADDSAKIFAWC GQTAAELLQISPDE
Subjt:  HKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQISPDE

Query:  FYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK
        F ELPEEEQVMYPSSLENENFVVAIVNCRRQ+ + GNN+ F NDPLSWEITRALK
Subjt:  FYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK

XP_004134503.1 uncharacterized protein LOC101215087 [Cucumis sativus]6.0e-26784.15Show/hide
Query:  KAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYN--QLPIYRLRPAETKD-----RFYDLVDGILKKG
        +AWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFL+LSSVLEAVILDEFILP  N   L +     + T D     RFYDLV+GILKKG
Subjt:  KAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYN--QLPIYRLRPAETKD-----RFYDLVDGILKKG

Query:  RQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGIT------IESIGPLEIYEKINGLRMIQIILVDNDGF
        RQIFVTGCYLRAASGGSG+PRLLPTEYLIILLDEEEDDDVMLLGAQFCSD+FSSVSLDSVNEG T      IESIGPLEI+E +NGLRMIQIILVDNDGF
Subjt:  RQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGIT------IESIGPLEIYEKINGLRMIQIILVDNDGF

Query:  KLKFLLWGEQVLLANLL---SVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDS
        KLKFLLWGEQVLLANLL   SVLALDRPY+ATVNENG+GTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRT+S SYPTQ PQVSQVSLPCDS
Subjt:  KLKFLLWGEQVLLANLL---SVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDS

Query:  HGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILA-------------------------------NLEALWTENHV
        HGAIDFGNYPFRSFVIDLQDKMTGISLYG VLDIANERNTTEA FS+RIEDNTGE+LA                                LEALW ENHV
Subjt:  HGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILA-------------------------------NLEALWTENHV

Query:  GASFVNVSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKI
        GASFVN+SCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSEL+ TFDLKITLADDSAKI
Subjt:  GASFVNVSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKI

Query:  FAWCTGQTAAELLQISPDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK
        FAWCTGQTAAELLQISPDEF ELPEEEQVMYPSSLENENFVVAIVNCRR++S  GNN+ F NDPLSWEITRALK
Subjt:  FAWCTGQTAAELLQISPDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK

XP_008438949.1 PREDICTED: uncharacterized protein LOC103483891 isoform X2 [Cucumis melo]4.6e-26784.32Show/hide
Query:  KAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYN--QLPIYRLRPAETKD-----RFYDLVDGILKKG
        +AWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP  N   L +     + T D     RFYDLVDGILKKG
Subjt:  KAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYN--QLPIYRLRPAETKD-----RFYDLVDGILKKG

Query:  RQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGIT------IESIGPLEIYEKINGLRMIQIILVDNDGF
        RQIFVTGCYLRAASGGSG+PRLLPTEYL+ILLDEEEDDDVMLLGAQFCSD+FSSVSLDSVNEG T      IESIGPLEI+EKINGLRMIQIILVDNDGF
Subjt:  RQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGIT------IESIGPLEIYEKINGLRMIQIILVDNDGF

Query:  KLKFLLWGEQVLLANLL---SVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDS
        KLKFLLWGEQVLLA LL   SVLALDRPY+ATVNENG+GTS+ELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRT+S SYPTQGPQVSQVSLPCDS
Subjt:  KLKFLLWGEQVLLANLL---SVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDS

Query:  HGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILA-------------------------------NLEALWTENHV
        HGAIDFGNYPFRSFVIDLQDKMTGISLYG VLDIANERNTTEA FS+RIEDNTGEILA                                LEALW ENHV
Subjt:  HGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILA-------------------------------NLEALWTENHV

Query:  GASFVNVSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKI
        GASFVN+SCLPALLTSSCLHKLSRLSDLTSNTHGTKVC+VRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFC CECKSELV TFDLKITLADDSAKI
Subjt:  GASFVNVSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKI

Query:  FAWCTGQTAAELLQISPDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK
        FAWC GQTAAELLQISPDEF ELPEEEQVMYPSSLENENFVVAIVNCRRQ+ + GNN+ F NDPLSWEITRALK
Subjt:  FAWCTGQTAAELLQISPDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK

XP_038880460.1 uncharacterized protein LOC120072117 [Benincasa hispida]5.5e-26884.15Show/hide
Query:  KAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYN--QLPIYRLRPAETKD-----RFYDLVDGILKKG
        +AWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSI+EKNFLSLSSVLEAVILDEFILP  N   L +     + T D     RFYDLVDGILKKG
Subjt:  KAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYN--QLPIYRLRPAETKD-----RFYDLVDGILKKG

Query:  RQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGIT------IESIGPLEIYEKINGLRMIQIILVDNDGF
        RQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVN+G T      IESIGPLEIYEKINGLRM+Q++LVDN GF
Subjt:  RQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGIT------IESIGPLEIYEKINGLRMIQIILVDNDGF

Query:  KLKFLLWGEQVLLANLL---SVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDS
        KLKFLLWGEQVLLANLL   SVLALDRPYIAT NENG+GTSDELCLEYGSATQLYLVPCIQHEEQVCV+TQNINQA RTLSTSYPTQGPQVSQVSLPCD 
Subjt:  KLKFLLWGEQVLLANLL---SVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDS

Query:  HGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILA-------------------------------NLEALWTENHV
        HGAIDF NYPFRSFVIDLQDKMTGISLYGIVLDIA+ERNTTEAVFS+RIEDNTGE+LA                                LEALW ENHV
Subjt:  HGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILA-------------------------------NLEALWTENHV

Query:  GASFVNVSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKI
        GASFVN+SCLPALLTSSCLHKLSRLSDLT NT GTKVC+VRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSE V TFDLKITLAD+SAKI
Subjt:  GASFVNVSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKI

Query:  FAWCTGQTAAELLQISPDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK
        FAWCTGQTAAELLQISPDEF ELPEEEQVMYPSSLENENFVVAIVNCRRQ+S+CGNN+YFV DPLSWEITRALK
Subjt:  FAWCTGQTAAELLQISPDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK

TrEMBL top hitse value%identityAlignment
A0A0A0L5D2 Uncharacterized protein2.9e-26784.15Show/hide
Query:  KAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYN--QLPIYRLRPAETKD-----RFYDLVDGILKKG
        +AWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFL+LSSVLEAVILDEFILP  N   L +     + T D     RFYDLV+GILKKG
Subjt:  KAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYN--QLPIYRLRPAETKD-----RFYDLVDGILKKG

Query:  RQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGIT------IESIGPLEIYEKINGLRMIQIILVDNDGF
        RQIFVTGCYLRAASGGSG+PRLLPTEYLIILLDEEEDDDVMLLGAQFCSD+FSSVSLDSVNEG T      IESIGPLEI+E +NGLRMIQIILVDNDGF
Subjt:  RQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGIT------IESIGPLEIYEKINGLRMIQIILVDNDGF

Query:  KLKFLLWGEQVLLANLL---SVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDS
        KLKFLLWGEQVLLANLL   SVLALDRPY+ATVNENG+GTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRT+S SYPTQ PQVSQVSLPCDS
Subjt:  KLKFLLWGEQVLLANLL---SVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDS

Query:  HGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILA-------------------------------NLEALWTENHV
        HGAIDFGNYPFRSFVIDLQDKMTGISLYG VLDIANERNTTEA FS+RIEDNTGE+LA                                LEALW ENHV
Subjt:  HGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILA-------------------------------NLEALWTENHV

Query:  GASFVNVSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKI
        GASFVN+SCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSEL+ TFDLKITLADDSAKI
Subjt:  GASFVNVSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKI

Query:  FAWCTGQTAAELLQISPDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK
        FAWCTGQTAAELLQISPDEF ELPEEEQVMYPSSLENENFVVAIVNCRR++S  GNN+ F NDPLSWEITRALK
Subjt:  FAWCTGQTAAELLQISPDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK

A0A1S3AX73 uncharacterized protein LOC103483891 isoform X22.2e-26784.32Show/hide
Query:  KAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYN--QLPIYRLRPAETKD-----RFYDLVDGILKKG
        +AWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP  N   L +     + T D     RFYDLVDGILKKG
Subjt:  KAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYN--QLPIYRLRPAETKD-----RFYDLVDGILKKG

Query:  RQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGIT------IESIGPLEIYEKINGLRMIQIILVDNDGF
        RQIFVTGCYLRAASGGSG+PRLLPTEYL+ILLDEEEDDDVMLLGAQFCSD+FSSVSLDSVNEG T      IESIGPLEI+EKINGLRMIQIILVDNDGF
Subjt:  RQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGIT------IESIGPLEIYEKINGLRMIQIILVDNDGF

Query:  KLKFLLWGEQVLLANLL---SVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDS
        KLKFLLWGEQVLLA LL   SVLALDRPY+ATVNENG+GTS+ELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRT+S SYPTQGPQVSQVSLPCDS
Subjt:  KLKFLLWGEQVLLANLL---SVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDS

Query:  HGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILA-------------------------------NLEALWTENHV
        HGAIDFGNYPFRSFVIDLQDKMTGISLYG VLDIANERNTTEA FS+RIEDNTGEILA                                LEALW ENHV
Subjt:  HGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILA-------------------------------NLEALWTENHV

Query:  GASFVNVSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKI
        GASFVN+SCLPALLTSSCLHKLSRLSDLTSNTHGTKVC+VRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFC CECKSELV TFDLKITLADDSAKI
Subjt:  GASFVNVSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKI

Query:  FAWCTGQTAAELLQISPDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK
        FAWC GQTAAELLQISPDEF ELPEEEQVMYPSSLENENFVVAIVNCRRQ+ + GNN+ F NDPLSWEITRALK
Subjt:  FAWCTGQTAAELLQISPDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK

A0A1S4DSK5 uncharacterized protein LOC103483891 isoform X18.5e-26783.59Show/hide
Query:  KAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYN--QLPIYRLRPAETKD-----RFYDLVDGILKKG
        +AWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILP  N   L +     + T D     RFYDLVDGILKKG
Subjt:  KAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYN--QLPIYRLRPAETKD-----RFYDLVDGILKKG

Query:  RQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGIT------IESIGPLEIYEKINGLRMIQIILVDNDGF
        RQIFVTGCYLRAASGGSG+PRLLPTEYL+ILLDEEEDDDVMLLGAQFCSD+FSSVSLDSVNEG T      IESIGPLEI+EKINGLRMIQIILVDNDGF
Subjt:  RQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGIT------IESIGPLEIYEKINGLRMIQIILVDNDGF

Query:  KLKFLLWGEQVLLANLL--------SVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVS
        KLKFLLWGEQVLLA LL        SVLALDRPY+ATVNENG+GTS+ELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRT+S SYPTQGPQVSQVS
Subjt:  KLKFLLWGEQVLLANLL--------SVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVS

Query:  LPCDSHGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILA-------------------------------NLEALW
        LPCDSHGAIDFGNYPFRSFVIDLQDKMTGISLYG VLDIANERNTTEA FS+RIEDNTGEILA                                LEALW
Subjt:  LPCDSHGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILA-------------------------------NLEALW

Query:  TENHVGASFVNVSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLAD
         ENHVGASFVN+SCLPALLTSSCLHKLSRLSDLTSNTHGTKVC+VRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFC CECKSELV TFDLKITLAD
Subjt:  TENHVGASFVNVSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLAD

Query:  DSAKIFAWCTGQTAAELLQISPDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK
        DSAKIFAWC GQTAAELLQISPDEF ELPEEEQVMYPSSLENENFVVAIVNCRRQ+ + GNN+ F NDPLSWEITRALK
Subjt:  DSAKIFAWCTGQTAAELLQISPDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK

A0A5A7U7H0 Nucleic acid-binding proteins superfamily isoform 17.7e-26884.32Show/hide
Query:  KAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYN--QLPIYRLRPAETKD-----RFYDLVDGILKKG
        +AWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLS+SSVLEAVILDEFILP  N   L +     + T D     RFYDLVDGILKKG
Subjt:  KAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYN--QLPIYRLRPAETKD-----RFYDLVDGILKKG

Query:  RQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGIT------IESIGPLEIYEKINGLRMIQIILVDNDGF
        RQIFVTGCYLRAASGGSG+PRLLPTEYL+ILLDEEEDDDVMLLGAQFCSD+FSSVSLDSVNEG T      IESIGPLEI+EKINGLRMIQIILVDNDGF
Subjt:  RQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGIT------IESIGPLEIYEKINGLRMIQIILVDNDGF

Query:  KLKFLLWGEQVLLANLL---SVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDS
        KLKFLLWGEQVLLANLL   SVLALDRPY+ATVNENG+GTS+ELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRT+S SYPTQGPQVSQVSLPCDS
Subjt:  KLKFLLWGEQVLLANLL---SVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDS

Query:  HGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILA-------------------------------NLEALWTENHV
        HGAIDFGNYPFRSFVIDLQDKMTGISLYG VLDIANERNTTEA FS+RIEDNTGEILA                                LEALW ENHV
Subjt:  HGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILA-------------------------------NLEALWTENHV

Query:  GASFVNVSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKI
        GASFVN+SCLPALLTSSCLHKLSRLSDLTSNTHGTKVC+VRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFC CECKSELV TFDLKITLADDSAKI
Subjt:  GASFVNVSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKI

Query:  FAWCTGQTAAELLQISPDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK
        FAWC GQTAAELLQISPDEF ELPEEEQVMYPSSLENENFVVAIVNCRRQ+ + GNN+ F NDPLSWEITRALK
Subjt:  FAWCTGQTAAELLQISPDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK

A0A5D3CXI2 Nucleic acid-binding proteins superfamily isoform 11.7e-27086.85Show/hide
Query:  MKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYNQ-------LPIYR--LRPAETKDRFYDLVDGIL
        ++AWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLS+SSVLEAVILDEFILP+ +          +YR     ++T   FYDLVDGIL
Subjt:  MKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYNQ-------LPIYR--LRPAETKDRFYDLVDGIL

Query:  KKGRQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGIT---IESIGPLEIYEKINGLRMIQIILVDNDGF
        KKGRQIFVTGCYLRAASGGSG+PRLLPTEYL+ILLDEEEDDDVMLLGAQFCSD+FSSVSLDSVNEG T   IESIGPLEI+EKINGLRMIQIILVDNDGF
Subjt:  KKGRQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGIT---IESIGPLEIYEKINGLRMIQIILVDNDGF

Query:  KLKFLLWGEQVLLANLL--------SVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVS
        KLKFLLWGEQVLLANLL        SVLALDRPY+ATVNENG+GTS+ELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRT+S SYPTQGPQVSQVS
Subjt:  KLKFLLWGEQVLLANLL--------SVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVS

Query:  LPCDSHGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAN-------LEALWTENHVGASFVNVSCLPALLTSSCL
        LPCDSHGAIDFGNYPFRSFVIDLQDKMTGISLYG VLDIANERNTTEA FS+RIEDNTGEILA        LEALW ENHVGASFVN+SCLPALLTSSCL
Subjt:  LPCDSHGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILAN-------LEALWTENHVGASFVNVSCLPALLTSSCL

Query:  HKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQISPDE
        HKLSRLSDLTSNTHGTKVC+VRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFC CECKSELV TFDLKITLADDSAKIFAWC GQTAAELLQISPDE
Subjt:  HKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLTFDLKITLADDSAKIFAWCTGQTAAELLQISPDE

Query:  FYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK
        F ELPEEEQVMYPSSLENENFVVAIVNCRRQ+ + GNN+ F NDPLSWEITRALK
Subjt:  FYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G17030.1 Nucleic acid-binding proteins superfamily1.6e-14849.66Show/hide
Query:  KAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYN--QLPIYRLRPAETKD-----RFYDLVD---GIL
        +AW+EQ++ G  KK PE I+QLKK +RR++L  TVTIDSIYEKNFLS++SVLEAVI++  +LP  N   L +     + T D     R+Y+LV+   GIL
Subjt:  KAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYN--QLPIYRLRPAETKD-----RFYDLVD---GIL

Query:  KKGRQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGIT------IESIGPLEIYEKINGLRMIQIILVDN
        +KGR++ +TGCYLR A  G G PRLLPTEYL++LLDE++DDD +L+ AQFCSD+FSSVSLD+ N+G +      IESIGPLE     +  R  QI LVD 
Subjt:  KKGRQIFVTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGIT------IESIGPLEIYEKINGLRMIQIILVDN

Query:  DGFKLKFLLWGEQVLLANLL---SVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCV-LTQNINQASRTLSTSYPTQGPQVSQVSL
        DG +LKF+LWGEQV++ANLL   SVL ++RPYI+++ E+ +  + E CLEYGSAT LYLVP    EE+VCV L+Q+  Q S+ L +        VSQV+L
Subjt:  DGFKLKFLLWGEQVLLANLL---SVLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCV-LTQNINQASRTLSTSYPTQGPQVSQVSL

Query:  PCDSHGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILANL--------------------------------EALW
        P D+ G++DF NYPFR+ + D++DK TGISLYG+V DI+ + N T  VFSL+IED TG I A L                                E LW
Subjt:  PCDSHGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVFSLRIEDNTGEILANL--------------------------------EALW

Query:  TENHVGASFVNVSCLPALLTSSCLHKLSRLSDLTSNTH-GTKVCQVRLDQVSHCH-VSTKFLHAICGHFVEETP-----ARIECSFCRCECKS--ELVLT
         E    A+FVN+SCLPA LTSSC+H +S LS ++        +C+V+LD++  CH ++T+  H++CGHF++E       A + CSFCR  C S  E+V T
Subjt:  TENHVGASFVNVSCLPALLTSSCLHKLSRLSDLTSNTH-GTKVCQVRLDQVSHCH-VSTKFLHAICGHFVEETP-----ARIECSFCRCECKS--ELVLT

Query:  FDLKITLADDSAKIFAWCTGQTAAELLQISPDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK
        F + ITLAD+  K++AWCTGQ+A+ +LQISPDEF +LPE++Q+MYPSSLENE F+V + N   +    G+      D   WEITRALK
Subjt:  FDLKITLADDSAKIFAWCTGQTAAELLQISPDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGCCTGGTATGAGCAACACAGAGTTGGTGCTCCCAAGAAAATACCTGAATGTATCAACCAGTTGAAGAAGAAGAATAGGAGAAAGAAACTCCCAAAAACAGTTAC
TATTGACTCCATATATGAGAAGAATTTCCTATCTTTAAGTAGCGTTTTGGAAGCTGTAATTCTTGATGAGTTTATTCTTCCAGAATACAATCAGCTGCCAATTTATAGAC
TAAGGCCAGCTGAGACCAAAGACCGATTCTATGATTTAGTGGATGGAATTCTGAAGAAAGGGAGGCAAATATTTGTAACTGGATGCTATCTTCGTGCTGCCAGTGGTGGC
TCTGGTCATCCACGACTTCTACCAACTGAATACCTTATCATATTGTTAGACGAGGAAGAGGACGATGATGTAATGCTTCTAGGAGCTCAATTTTGTTCTGATTCCTTTTC
TTCTGTTTCTCTTGATTCCGTCAATGAAGGGATTACGATTGAGTCCATTGGTCCACTGGAAATTTATGAGAAGATTAATGGTTTACGGATGATACAAATCATTCTTGTTG
ATAATGATGGTTTCAAGCTAAAGTTTCTCTTATGGGGTGAACAGGTGCTACTGGCCAATCTTTTAAGTGTGCTTGCGCTTGATAGACCATATATTGCAACCGTTAACGAG
AATGGCATTGGAACAAGTGATGAACTTTGTCTTGAATATGGTAGTGCAACACAGTTATATTTGGTGCCTTGCATTCAGCATGAGGAGCAAGTCTGTGTTTTAACTCAGAA
TATAAACCAAGCTTCAAGGACACTCAGTACATCGTATCCTACTCAGGGTCCCCAAGTTTCTCAAGTTTCCTTGCCCTGTGATTCACATGGGGCAATTGATTTTGGTAATT
ATCCTTTTCGGTCTTTTGTGATCGACCTTCAAGACAAGATGACTGGCATTAGCTTATATGGTATCGTTTTAGATATAGCTAATGAAAGAAATACCACAGAAGCTGTTTTC
TCTTTGAGAATTGAAGATAACACGGGAGAAATTTTGGCCAACTTAGAGGCGCTATGGACTGAGAATCATGTTGGAGCTTCTTTTGTCAACGTTAGCTGCTTGCCAGCATT
GTTAACTTCATCTTGTCTTCATAAACTTTCACGACTTTCTGATCTTACCAGCAACACTCATGGTACAAAGGTCTGTCAAGTTCGGCTCGACCAAGTTTCACATTGTCATG
TCAGTACGAAATTTTTGCATGCAATTTGTGGTCATTTCGTCGAGGAGACACCTGCCAGAATTGAGTGCAGCTTCTGTCGTTGTGAATGCAAGTCTGAGCTTGTGCTTACA
TTCGACCTCAAAATCACCCTTGCAGACGACAGTGCCAAAATCTTTGCATGGTGTACAGGTCAAACTGCTGCAGAGTTGTTGCAAATATCTCCTGATGAATTCTATGAACT
ACCTGAGGAAGAACAAGTAATGTACCCATCTTCACTCGAGAACGAGAATTTTGTGGTTGCAATAGTGAATTGCAGAAGGCAGACCAGCAGATGTGGAAATAATATCTATT
TTGTTAATGATCCACTTTCATGGGAGATTACTCGTGCACTCAAATCCTTAAAGAACAATATTTTACGAATTACCTTAACACTGTCCCGGAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGCCTGGTATGAGCAACACAGAGTTGGTGCTCCCAAGAAAATACCTGAATGTATCAACCAGTTGAAGAAGAAGAATAGGAGAAAGAAACTCCCAAAAACAGTTAC
TATTGACTCCATATATGAGAAGAATTTCCTATCTTTAAGTAGCGTTTTGGAAGCTGTAATTCTTGATGAGTTTATTCTTCCAGAATACAATCAGCTGCCAATTTATAGAC
TAAGGCCAGCTGAGACCAAAGACCGATTCTATGATTTAGTGGATGGAATTCTGAAGAAAGGGAGGCAAATATTTGTAACTGGATGCTATCTTCGTGCTGCCAGTGGTGGC
TCTGGTCATCCACGACTTCTACCAACTGAATACCTTATCATATTGTTAGACGAGGAAGAGGACGATGATGTAATGCTTCTAGGAGCTCAATTTTGTTCTGATTCCTTTTC
TTCTGTTTCTCTTGATTCCGTCAATGAAGGGATTACGATTGAGTCCATTGGTCCACTGGAAATTTATGAGAAGATTAATGGTTTACGGATGATACAAATCATTCTTGTTG
ATAATGATGGTTTCAAGCTAAAGTTTCTCTTATGGGGTGAACAGGTGCTACTGGCCAATCTTTTAAGTGTGCTTGCGCTTGATAGACCATATATTGCAACCGTTAACGAG
AATGGCATTGGAACAAGTGATGAACTTTGTCTTGAATATGGTAGTGCAACACAGTTATATTTGGTGCCTTGCATTCAGCATGAGGAGCAAGTCTGTGTTTTAACTCAGAA
TATAAACCAAGCTTCAAGGACACTCAGTACATCGTATCCTACTCAGGGTCCCCAAGTTTCTCAAGTTTCCTTGCCCTGTGATTCACATGGGGCAATTGATTTTGGTAATT
ATCCTTTTCGGTCTTTTGTGATCGACCTTCAAGACAAGATGACTGGCATTAGCTTATATGGTATCGTTTTAGATATAGCTAATGAAAGAAATACCACAGAAGCTGTTTTC
TCTTTGAGAATTGAAGATAACACGGGAGAAATTTTGGCCAACTTAGAGGCGCTATGGACTGAGAATCATGTTGGAGCTTCTTTTGTCAACGTTAGCTGCTTGCCAGCATT
GTTAACTTCATCTTGTCTTCATAAACTTTCACGACTTTCTGATCTTACCAGCAACACTCATGGTACAAAGGTCTGTCAAGTTCGGCTCGACCAAGTTTCACATTGTCATG
TCAGTACGAAATTTTTGCATGCAATTTGTGGTCATTTCGTCGAGGAGACACCTGCCAGAATTGAGTGCAGCTTCTGTCGTTGTGAATGCAAGTCTGAGCTTGTGCTTACA
TTCGACCTCAAAATCACCCTTGCAGACGACAGTGCCAAAATCTTTGCATGGTGTACAGGTCAAACTGCTGCAGAGTTGTTGCAAATATCTCCTGATGAATTCTATGAACT
ACCTGAGGAAGAACAAGTAATGTACCCATCTTCACTCGAGAACGAGAATTTTGTGGTTGCAATAGTGAATTGCAGAAGGCAGACCAGCAGATGTGGAAATAATATCTATT
TTGTTAATGATCCACTTTCATGGGAGATTACTCGTGCACTCAAATCCTTAAAGAACAATATTTTACGAATTACCTTAACACTGTCCCGGAAATAA
Protein sequenceShow/hide protein sequence
MKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPEYNQLPIYRLRPAETKDRFYDLVDGILKKGRQIFVTGCYLRAASGG
SGHPRLLPTEYLIILLDEEEDDDVMLLGAQFCSDSFSSVSLDSVNEGITIESIGPLEIYEKINGLRMIQIILVDNDGFKLKFLLWGEQVLLANLLSVLALDRPYIATVNE
NGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRTLSTSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDKMTGISLYGIVLDIANERNTTEAVF
SLRIEDNTGEILANLEALWTENHVGASFVNVSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLDQVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELVLT
FDLKITLADDSAKIFAWCTGQTAAELLQISPDEFYELPEEEQVMYPSSLENENFVVAIVNCRRQTSRCGNNIYFVNDPLSWEITRALKSLKNNILRITLTLSRK