; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS013120 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS013120
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function (DUF630 and DUF632)
Genome locationscaffold459:362321..364443
RNA-Seq ExpressionMS013120
SyntenyMS013120
Gene Ontology termsNA
InterPro domainsIPR006867 - Domain of unknown function DUF632


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050775.1 uncharacterized protein E6C27_scaffold404G00250 [Cucumis melo var. makuwa]1.2e-16488.66Show/hide
Query:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET
        TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDST+SEI+RLRDEQLYPKLVQLVNGMA MW T
Subjt:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET

Query:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW
        MR HHE QLKIV+ LR+++ SQ PKETS HH+ERTVQLCGVV EWHSQFEKLVR QKDYIK+LNSWLKLNL+PIESSLKEKVSSPPRVQNPPIQKLLLAW
Subjt:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW

Query:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE
        HDQLERLPDEHLRTAIFTFGAVINTI+LQQDEE KL++KWEET KEL+RK+RHF DWHYKYQQRR+PD+MDP++SEE  QD AVTEK I VE+LKKRLEE
Subjt:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE

Query:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS
        EKETHAKQCLHVREKSLVSLKNQLPELFRALS+FS
Subjt:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS

XP_008447501.1 PREDICTED: uncharacterized protein LOC103489933 [Cucumis melo]1.2e-16488.66Show/hide
Query:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET
        TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDST+SEI+RLRDEQLYPKLVQLVNGMA MW T
Subjt:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET

Query:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW
        MR HHE QLKIV+ LR+++ SQ PKETS HH+ERTVQLCGVV EWHSQFEKLVR QKDYIK+LNSWLKLNL+PIESSLKEKVSSPPRVQNPPIQKLLLAW
Subjt:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW

Query:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE
        HDQLERLPDEHLRTAIFTFGAVINTI+LQQDEE KL++KWEET KEL+RK+RHF DWHYKYQQRR+PD+MDP++SEE  QD AVTEK I VE+LKKRLEE
Subjt:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE

Query:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS
        EKETHAKQCLHVREKSLVSLKNQLPELFRALS+FS
Subjt:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS

XP_022153603.1 uncharacterized protein LOC111021069 [Momordica charantia]5.1e-18499.4Show/hide
Query:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET
        TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRD+QLYPKLVQLVNGMATMWET
Subjt:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET

Query:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW
        MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEK+SSPPRVQNPPIQKLLLAW
Subjt:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW

Query:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE
        HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE
Subjt:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE

Query:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS
        EKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS
Subjt:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS

XP_023526711.1 nitrate regulatory gene2 protein-like [Cucurbita pepo subsp. pepo]3.8e-16388.06Show/hide
Query:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET
        TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDST+SEI+RLRDEQLYPKLVQLVNGM+ MW+T
Subjt:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET

Query:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW
        MR HHE QLKIV+ LRS++ SQ PKETS HH+ERTVQLCGVV EWHSQFEKLVR QKDYI+ALNSWLKLNL+PIESSL+EKVSSPPRVQ+PPIQKLLLAW
Subjt:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW

Query:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE
        HDQLERLPDEHLRTAIFTFGAVINTI+LQQDEE KL+ KWEET KELERK+RHF++WH KYQQRRMPDE+DPD+SEEN QD AVTEKL+ VE LKKRLEE
Subjt:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE

Query:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS
        E ETHAKQCLHVREKSLVSLKNQLP+LFRALS+FS
Subjt:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS

XP_038891518.1 protein ROLLING AND ERECT LEAF 2-like [Benincasa hispida]3.7e-16689.25Show/hide
Query:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET
        TVLDKLLAWEKKLYDEVK GELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDST+SEI+RLRDEQLYPKLVQLVNGMA MW T
Subjt:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET

Query:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW
        MR HHE QLKIV+ LRS++ SQ PKETS HH+ERTVQLCGVV EWHSQFEKLVR QKDYIKALNSWLKLNL+PIESSLKEKVSSPPRVQNPPIQKLLLAW
Subjt:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW

Query:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE
        HDQLERLPDEHLRTAIFTFGAVINTI+LQQDEE KL++KWEET KELERK+RHF+DWHYKYQQRRMPDE+DP++SEEN QD AVTEKLI VE++++RLEE
Subjt:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE

Query:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS
        EKETHAKQCLHVREKSLVSLKNQLPELFRALS+FS
Subjt:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS

TrEMBL top hitse value%identityAlignment
A0A0A0LBP3 Uncharacterized protein2.7e-16287.54Show/hide
Query:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET
        TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDST+SEI+RLRDEQLYPKLVQLVNGMA MW T
Subjt:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET

Query:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKV--SSPPRVQNPPIQKLLL
        MR HHE QLKIV+ LR+++ SQ PKETS HH+ERTVQLCGVV EWHSQFEKLVR QK+YIK+LNSWLKLNL+PIESSLKEKV  SSPPRVQNPPIQKLLL
Subjt:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKV--SSPPRVQNPPIQKLLL

Query:  AWHDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRL
        AWHDQLERLPDEHLRTAIFTFGAVINTI+LQQDEE KL++KWEET KELERK+RHF +WHYKYQQRRMPD++DP++SE   QD AVTEKLI VE+LKKRL
Subjt:  AWHDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRL

Query:  EEEKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS
        EEEKETH KQCLHVREKSLVSLKNQLPELFRALS+FS
Subjt:  EEEKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS

A0A1S3BI69 uncharacterized protein LOC1034899335.8e-16588.66Show/hide
Query:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET
        TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDST+SEI+RLRDEQLYPKLVQLVNGMA MW T
Subjt:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET

Query:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW
        MR HHE QLKIV+ LR+++ SQ PKETS HH+ERTVQLCGVV EWHSQFEKLVR QKDYIK+LNSWLKLNL+PIESSLKEKVSSPPRVQNPPIQKLLLAW
Subjt:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW

Query:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE
        HDQLERLPDEHLRTAIFTFGAVINTI+LQQDEE KL++KWEET KEL+RK+RHF DWHYKYQQRR+PD+MDP++SEE  QD AVTEK I VE+LKKRLEE
Subjt:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE

Query:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS
        EKETHAKQCLHVREKSLVSLKNQLPELFRALS+FS
Subjt:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS

A0A5D3C9M7 Uncharacterized protein5.8e-16588.66Show/hide
Query:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET
        TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDST+SEI+RLRDEQLYPKLVQLVNGMA MW T
Subjt:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET

Query:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW
        MR HHE QLKIV+ LR+++ SQ PKETS HH+ERTVQLCGVV EWHSQFEKLVR QKDYIK+LNSWLKLNL+PIESSLKEKVSSPPRVQNPPIQKLLLAW
Subjt:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW

Query:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE
        HDQLERLPDEHLRTAIFTFGAVINTI+LQQDEE KL++KWEET KEL+RK+RHF DWHYKYQQRR+PD+MDP++SEE  QD AVTEK I VE+LKKRLEE
Subjt:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE

Query:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS
        EKETHAKQCLHVREKSLVSLKNQLPELFRALS+FS
Subjt:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS

A0A6J1DJJ9 uncharacterized protein LOC1110210692.5e-18499.4Show/hide
Query:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET
        TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRD+QLYPKLVQLVNGMATMWET
Subjt:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET

Query:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW
        MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEK+SSPPRVQNPPIQKLLLAW
Subjt:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW

Query:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE
        HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE
Subjt:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE

Query:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS
        EKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS
Subjt:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS

A0A6J1ISF0 nitrate regulatory gene2 protein-like2.0e-16287.16Show/hide
Query:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET
        TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDST+SEI+RLRDEQLYPKLVQLVNGM+ MW+T
Subjt:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET

Query:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW
        MR HHE QLKIV+ LRS++ SQ PKETS HH+ERTVQLCGVV EWHSQFEKLVR QKDYI+ALNSWLKLNL+PIESSL+EKVSSPPRVQ+PPIQKLL+AW
Subjt:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW

Query:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE
        HDQLERLPDEHLRTAIFTFGAVINTI+LQQDEE KL+ KWEET KELERK+RHF++WH KYQQRRMPDE+DP++SEEN QD AVT+KL+ VE LKKRLEE
Subjt:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE

Query:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS
        E ETHAKQCLHVREKSLVSLKNQLP+LFRALS+FS
Subjt:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS

SwissProt top hitse value%identityAlignment
A0A178VBJ0 Protein ALTERED PHOSPHATE STARVATION RESPONSE 16.4e-4434.33Show/hide
Query:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET
        + +D+L AWEKKLY EVK  E +K +++KKV  + RL+ + +     EKAK  V  L ++  V  Q++ S  +EI +LR+ +LYP+LV+LV G+  MW +
Subjt:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET

Query:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW
        M   H+ Q  IV  L+ +N     + TSE H + T+QL   V +WH  F  LV+ Q+DYI++L  WL+L+L     +   + S   +     I      W
Subjt:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW

Query:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE
        H  ++R+PD+     I +F   ++ IV QQ +E K + + E   K+ E+K         KY    +P         E+ +   V EK + VE LK + EE
Subjt:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE

Query:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS
        EK  H K     R  +L +L+   P +F+A+  FS
Subjt:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS

Q93YU8 Nitrate regulatory gene2 protein1.5e-4032.94Show/hide
Query:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET
        + LD+LLAWEKKLY+E+KA E  K E++KK++ L   + +  +   L+K KA+++ L +  IV  Q++ +T + I RLRD  L P+LV+L +G   MW++
Subjt:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET

Query:  MRVHHEGQLKIVNVLRS-VNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLA
        M  +HE Q  IV  +R  +N S   + TSE H + T  L   V  WHS F  L++ Q+D+I ++++W KL L+P+    +E  ++       P+      
Subjt:  MRVHHEGQLKIVNVLRS-VNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLA

Query:  --WHDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMP-----DEMDPDKSEENAQDTAVTEKLIMVE
          W   L+R+PD     AI +F  V++ I  +Q +E K++ + E  +KELE+K     +   KY Q          E  PD          +++K   + 
Subjt:  --WHDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMP-----DEMDPDKSEENAQDTAVTEKLIMVE

Query:  TLKKRLEEEKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS
          ++R+EEE   ++K     R  +L +L+  LP +F++L+ FS
Subjt:  TLKKRLEEEKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS

Q9AQW1 Protein ROLLING AND ERECT LEAF 29.8e-4534.21Show/hide
Query:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET
        + L++LLAWEKKLY EVKA E +K E++KK++TL  L+ R  ++  L+K KA+++ L +  IV  Q+  +T S I R+RD +L P+LV+L   + +MW +
Subjt:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET

Query:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW
        M   HE Q +IV  +R +  +   + TS+ H   T  L   V  WHS F +L++ Q+DYI+AL  WLKL L  ++S++ ++  +   + +  +      W
Subjt:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW

Query:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQR------RMPDE-MDPDKSEENAQDTAVTEKLIMVET
           L+RLPD     AI +F  V++ I  +Q EEMK++ + E  +KELE+K         KY Q        +P    D  +S        + EK   +  
Subjt:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQR------RMPDE-MDPDKSEENAQDTAVTEKLIMVET

Query:  LKKRLEEEKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS
         ++++E+E   HAK     R  +L +++  LP +F+A++ FS
Subjt:  LKKRLEEEKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS

Arabidopsis top hitse value%identityAlignment
AT1G52320.1 unknown protein2.2e-11663.96Show/hide
Query:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET
        TVLDKLLAWEKKLYDEVKAGELMK EYQKKVA LNR+KKR  ++++LE+AKAAVSHLHTRYIVDMQS+DST+SEINRLRDEQLY KLV LV  M  MWE 
Subjt:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET

Query:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW
        M++HH+ Q +I  VLRS++ SQ  KET++HHHERT+QL  VV EWH+QF +++  QK+YIKAL  WLKLNL+PIES+LKEKVSSPPRV NP IQKLL AW
Subjt:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW

Query:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE
        +D+L+++PDE  ++AI  F AV++TI+ QQ++E+ LR K EET KEL RK R F DW++KY Q+R P+ M+PD+++ +  D  V  +   VE +KKRLEE
Subjt:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE

Query:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQ
        E+E + +Q   VREKSL SL+ +LPELF+A+S+
Subjt:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQ

AT1G52320.2 unknown protein2.2e-11663.96Show/hide
Query:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET
        TVLDKLLAWEKKLYDEVKAGELMK EYQKKVA LNR+KKR  ++++LE+AKAAVSHLHTRYIVDMQS+DST+SEINRLRDEQLY KLV LV  M  MWE 
Subjt:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET

Query:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW
        M++HH+ Q +I  VLRS++ SQ  KET++HHHERT+QL  VV EWH+QF +++  QK+YIKAL  WLKLNL+PIES+LKEKVSSPPRV NP IQKLL AW
Subjt:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW

Query:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE
        +D+L+++PDE  ++AI  F AV++TI+ QQ++E+ LR K EET KEL RK R F DW++KY Q+R P+ M+PD+++ +  D  V  +   VE +KKRLEE
Subjt:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE

Query:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQ
        E+E + +Q   VREKSL SL+ +LPELF+A+S+
Subjt:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQ

AT1G52320.3 unknown protein2.2e-11663.96Show/hide
Query:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET
        TVLDKLLAWEKKLYDEVKAGELMK EYQKKVA LNR+KKR  ++++LE+AKAAVSHLHTRYIVDMQS+DST+SEINRLRDEQLY KLV LV  M  MWE 
Subjt:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET

Query:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW
        M++HH+ Q +I  VLRS++ SQ  KET++HHHERT+QL  VV EWH+QF +++  QK+YIKAL  WLKLNL+PIES+LKEKVSSPPRV NP IQKLL AW
Subjt:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW

Query:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE
        +D+L+++PDE  ++AI  F AV++TI+ QQ++E+ LR K EET KEL RK R F DW++KY Q+R P+ M+PD+++ +  D  V  +   VE +KKRLEE
Subjt:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE

Query:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQ
        E+E + +Q   VREKSL SL+ +LPELF+A+S+
Subjt:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQ

AT1G52320.4 unknown protein2.2e-11663.96Show/hide
Query:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET
        TVLDKLLAWEKKLYDEVKAGELMK EYQKKVA LNR+KKR  ++++LE+AKAAVSHLHTRYIVDMQS+DST+SEINRLRDEQLY KLV LV  M  MWE 
Subjt:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET

Query:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW
        M++HH+ Q +I  VLRS++ SQ  KET++HHHERT+QL  VV EWH+QF +++  QK+YIKAL  WLKLNL+PIES+LKEKVSSPPRV NP IQKLL AW
Subjt:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW

Query:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE
        +D+L+++PDE  ++AI  F AV++TI+ QQ++E+ LR K EET KEL RK R F DW++KY Q+R P+ M+PD+++ +  D  V  +   VE +KKRLEE
Subjt:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEE

Query:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQ
        E+E + +Q   VREKSL SL+ +LPELF+A+S+
Subjt:  EKETHAKQCLHVREKSLVSLKNQLPELFRALSQ

AT5G25590.1 Protein of unknown function (DUF630 and DUF632)3.9e-11363.1Show/hide
Query:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET
        TVLDKLLAWEKKLYDEVK GELMK EYQKKV+ LNR KKR ++AE +EK KAAVSHLHTRYIVDMQS+DST+SE+NRLRD+QLYP+LV LV GMA MW  
Subjt:  TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWET

Query:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW
        M +HH+ QL IV  L+++  S   KET++ HH +T Q C V+ EWH QF+ LV  QK YI +LN+WLKLNL+PIESSLKEKVSSPPR Q PPIQ LL +W
Subjt:  MRVHHEGQLKIVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAW

Query:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMP-DEMDPDKSEENAQDTAVTEKLIMVETLKKRLE
        HD+LE+LPDE  ++AI +F AVI TI+L Q+EEMKL+ K EET +E  RK++ F DW+ K+ Q+R P +E +       +    VTE+ I VETLKKRLE
Subjt:  HDQLERLPDEHLRTAIFTFGAVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMP-DEMDPDKSEENAQDTAVTEKLIMVETLKKRLE

Query:  EEKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS
        EE+E H + C+ VREKSL SLK +LPE+FRALS ++
Subjt:  EEKETHAKQCLHVREKSLVSLKNQLPELFRALSQFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ACCGTGTTAGACAAACTACTGGCATGGGAGAAAAAGCTATATGATGAAGTGAAGGCAGGTGAACTCATGAAGTTTGAGTACCAAAAGAAGGTTGCTACATTGAATAGGCT
AAAGAAACGAGATTCTAATGCAGAAGCTTTGGAGAAAGCAAAAGCAGCAGTAAGTCATCTGCACACTAGATATATTGTTGACATGCAATCCTTGGATTCAACTATCTCAG
AGATTAATCGTCTCCGAGACGAACAGTTATACCCAAAACTTGTTCAGCTTGTTAATGGGATGGCAACAATGTGGGAAACAATGCGAGTTCACCATGAAGGGCAATTGAAG
ATTGTAAATGTACTGAGATCGGTGAATCCCTCTCAATTCCCAAAAGAAACTAGTGAGCATCATCATGAGCGCACGGTTCAGCTCTGTGGTGTCGTGGGAGAGTGGCATTC
ACAGTTTGAAAAACTTGTGCGTCGTCAGAAAGACTACATTAAAGCCTTAAACAGCTGGTTGAAACTGAATCTAGTTCCTATAGAGAGTAGCTTGAAAGAGAAGGTTTCTT
CTCCACCAAGGGTTCAAAATCCTCCAATTCAAAAACTCCTCCTTGCTTGGCATGACCAACTCGAGAGACTCCCAGACGAGCATCTCAGAACTGCCATATTCACTTTTGGT
GCTGTTATTAATACTATTGTGCTGCAGCAAGATGAAGAGATGAAATTGAGGATAAAGTGGGAGGAGACTGCGAAAGAACTCGAGCGCAAGGAGAGGCATTTCAGTGACTG
GCATTACAAATACCAGCAACGAAGAATGCCTGATGAGATGGACCCTGACAAGTCTGAAGAGAACGCTCAGGACACTGCAGTTACAGAGAAGTTAATTATGGTAGAGACGT
TGAAAAAGAGATTGGAGGAGGAAAAGGAAACTCATGCGAAGCAATGCCTTCACGTGAGGGAGAAGTCATTGGTAAGCCTTAAGAATCAGTTGCCAGAACTCTTCAGGGCA
TTGTCACAGTTCTCT
mRNA sequenceShow/hide mRNA sequence
ACCGTGTTAGACAAACTACTGGCATGGGAGAAAAAGCTATATGATGAAGTGAAGGCAGGTGAACTCATGAAGTTTGAGTACCAAAAGAAGGTTGCTACATTGAATAGGCT
AAAGAAACGAGATTCTAATGCAGAAGCTTTGGAGAAAGCAAAAGCAGCAGTAAGTCATCTGCACACTAGATATATTGTTGACATGCAATCCTTGGATTCAACTATCTCAG
AGATTAATCGTCTCCGAGACGAACAGTTATACCCAAAACTTGTTCAGCTTGTTAATGGGATGGCAACAATGTGGGAAACAATGCGAGTTCACCATGAAGGGCAATTGAAG
ATTGTAAATGTACTGAGATCGGTGAATCCCTCTCAATTCCCAAAAGAAACTAGTGAGCATCATCATGAGCGCACGGTTCAGCTCTGTGGTGTCGTGGGAGAGTGGCATTC
ACAGTTTGAAAAACTTGTGCGTCGTCAGAAAGACTACATTAAAGCCTTAAACAGCTGGTTGAAACTGAATCTAGTTCCTATAGAGAGTAGCTTGAAAGAGAAGGTTTCTT
CTCCACCAAGGGTTCAAAATCCTCCAATTCAAAAACTCCTCCTTGCTTGGCATGACCAACTCGAGAGACTCCCAGACGAGCATCTCAGAACTGCCATATTCACTTTTGGT
GCTGTTATTAATACTATTGTGCTGCAGCAAGATGAAGAGATGAAATTGAGGATAAAGTGGGAGGAGACTGCGAAAGAACTCGAGCGCAAGGAGAGGCATTTCAGTGACTG
GCATTACAAATACCAGCAACGAAGAATGCCTGATGAGATGGACCCTGACAAGTCTGAAGAGAACGCTCAGGACACTGCAGTTACAGAGAAGTTAATTATGGTAGAGACGT
TGAAAAAGAGATTGGAGGAGGAAAAGGAAACTCATGCGAAGCAATGCCTTCACGTGAGGGAGAAGTCATTGGTAAGCCTTAAGAATCAGTTGCCAGAACTCTTCAGGGCA
TTGTCACAGTTCTCT
Protein sequenceShow/hide protein sequence
TVLDKLLAWEKKLYDEVKAGELMKFEYQKKVATLNRLKKRDSNAEALEKAKAAVSHLHTRYIVDMQSLDSTISEINRLRDEQLYPKLVQLVNGMATMWETMRVHHEGQLK
IVNVLRSVNPSQFPKETSEHHHERTVQLCGVVGEWHSQFEKLVRRQKDYIKALNSWLKLNLVPIESSLKEKVSSPPRVQNPPIQKLLLAWHDQLERLPDEHLRTAIFTFG
AVINTIVLQQDEEMKLRIKWEETAKELERKERHFSDWHYKYQQRRMPDEMDPDKSEENAQDTAVTEKLIMVETLKKRLEEEKETHAKQCLHVREKSLVSLKNQLPELFRA
LSQFS