; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS009120 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS009120
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function (DUF668)
Genome locationscaffold220:34362..36191
RNA-Seq ExpressionMS009120
SyntenyMS009120
Gene Ontology termsGO:0045927 - positive regulation of growth (biological process)
InterPro domainsIPR007700 - Domain of unknown function DUF668
IPR021864 - Domain of unknown function DUF3475


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138676.1 uncharacterized protein LOC101204446 [Cucumis sativus]1.9e-30990.16Show/hide
Query:  MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK
        MVSESWFRSLWKPPRKRE STQKVVIGVLAFEIA+LMSKLVHLWQSLSDKQV RLREEIN+SLGIKKLVSD+DE+IVRLI AEMTE+L HVAKSVARLGK
Subjt:  MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK

Query:  KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS
        KCSDPSLKNFEH+FD LIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQT  RMK NE+SDVINLVEF+KKVAWKQQEVKNLREMS
Subjt:  KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS

Query:  LWKRTYDYTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFY
        LWKRTYDYTILLLARSL TIFSRIK VF  EQS DN   +DSRDM+SDYIARSQSVSSLMQSMVHPSE +GLTKFASGPLKRFTTKSGPISKTAK NNFY
Subjt:  LWKRTYDYTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFY

Query:  SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISS
        SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSS +RSG  SG+ NKTNQKNWLVGGYSS+FNGKKSHQKPNRLTQVGPFKGCMISG SS VANCHISS
Subjt:  SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISS

Query:  NGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAK
        NG H++L+NG KD G IV+ CN+  PCKQLL  K RLLDAPPETLG AALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRA+LKPYAK
Subjt:  NGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAK

Query:  SLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSR
        SL+SS YDTGLAGEW+EAI GILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLW FGRELNAKALNECASSR
Subjt:  SLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSR

Query:  IHDEYLDIVG
        IHDEYLDIVG
Subjt:  IHDEYLDIVG

XP_008456574.1 PREDICTED: uncharacterized protein LOC103496487 [Cucumis melo]2.1e-30890Show/hide
Query:  MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK
        MVSESWFRSLWKPPRKRE S QKVVIGVLAFEIA+LMSKLVHLWQSLSDKQV RLREEIN+SLGIKKLVSD+DE+IVRLI AEMTE+LAHVAKSVARLGK
Subjt:  MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK

Query:  KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS
        KCSDPSLKNFEH+FD LIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQT  RMK NE+SDVINLVEF+KKVAWKQQEVKNLREMS
Subjt:  KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS

Query:  LWKRTYDYTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFY
        LWKRTYDYTILLLARSL TIFSRIK VF IEQS DN   +DSRDM+SDYIARSQSVSSLMQSMVHPSE +GLTKFASGPLKRFTTKSGPISKTA+ +NFY
Subjt:  LWKRTYDYTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFY

Query:  SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISS
        SGPLG SITKSGPISGPVSGTNRNFNSYSGPLTSS +RSG  SG+ NKTNQKNWLVGGYSS+FNGKKSHQKPNRLTQVGPFKGCMISG SS VANCHISS
Subjt:  SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISS

Query:  NGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAK
        NG H++L+NG KD GKIV+ CN+  PCKQLL  K RLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRA+LKPYAK
Subjt:  NGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAK

Query:  SLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSR
        SL+SS YDTGLAGEW+EAI GILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLW FGRELNAKALNECASSR
Subjt:  SLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSR

Query:  IHDEYLDIVG
        +HDEYLDIVG
Subjt:  IHDEYLDIVG

XP_022133904.1 uncharacterized protein LOC111006336 [Momordica charantia]0.0e+0099.51Show/hide
Query:  MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK
        MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK
Subjt:  MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK

Query:  KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS
        KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKME KVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS
Subjt:  KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS

Query:  LWKRTYDYTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFY
        LWKRTYDY+ILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFY
Subjt:  LWKRTYDYTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFY

Query:  SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISS
        SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSG ISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISS
Subjt:  SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISS

Query:  NGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAK
        NGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAK
Subjt:  NGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAK

Query:  SLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSR
        SLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSR
Subjt:  SLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSR

Query:  IHDEYLDIVG
        IHDEYLDIVG
Subjt:  IHDEYLDIVG

XP_022990285.1 uncharacterized protein LOC111487207 [Cucurbita maxima]2.9e-30286.56Show/hide
Query:  MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK
        MVSESWFRSLWKP RKR+SSTQKVVIGVLAFEIA+LMS+LVHLW SLSDKQ+RRLRE+I +SLGIKKLVSD+DE+IVRLI AE+TE+LAHVAKSVARLGK
Subjt:  MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK

Query:  KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS
        KCSDPSLK FEH+FD+L+QIGADPY WIYSWKKMEKKVKKMETFISVNANLYQEMEML +LEQT RRMKVN+ +  +NL +F+KKV WKQQEVKNLREMS
Subjt:  KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS

Query:  LWKRTYDYTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFY
        LW+RTYDYTILLLARS+ TIFSRIKHVFGIEQS    E NDSRDMNSDYIARSQSVSSLMQSMVHPSE NGLTKFASGPLKRFTTKSGPISKTAK NNFY
Subjt:  LWKRTYDYTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFY

Query:  SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISS
        SGPLGSS+TKSGPI GPVSGTNRNFNSYSGPLTSST+RSG +SG++NKTN KNW  GGYSS+FNGKKSHQKPNRLTQVGPFKGCM+SG SS V NCHISS
Subjt:  SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISS

Query:  NGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAK
        NG H++L+NG KD GKIVDPCNK  PC++LLG K +LLDAPPETLG AALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRA+LKPYAK
Subjt:  NGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAK

Query:  SLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSR
         L+SS YDTGLA EW++AI GILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTE IITELLVGLNYLWRFGRE NAKALNECASSR
Subjt:  SLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSR

Query:  IHDEYLDIVG
        IHDE+LDIVG
Subjt:  IHDEYLDIVG

XP_038885046.1 protein PSK SIMULATOR 1 [Benincasa hispida]0.0e+0090.33Show/hide
Query:  MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK
        MVSESWFRSLWKPPRKR+ STQKVVIGVLAFEIA+LMSKLVHLWQSLSDKQV RLREEIN+SLG+KKLVSD+DE+IVRLI AEMTESLAHVAKSVARLGK
Subjt:  MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK

Query:  KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS
        KCSDPSLKNFEH+FD LIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVN++SDVINLVEFQKKVAWKQQEVKNLREMS
Subjt:  KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS

Query:  LWKRTYDYTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFY
        LWKRTYDYTIL LARSL TIFSRIKHVF IEQS D    NDSRDMNSDYIARSQSVSSLMQSMVHPSE NGLTKFASGPLKRFTTKSGPISKT K NNFY
Subjt:  LWKRTYDYTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFY

Query:  SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISS
        SGPLG SITKSGPISGP SGTNRNFNSYSGPLTSS +RSG ISG+ NKT+QKNWLVGGYSS+FNGKKSHQKPNRLTQVGPFKGCMISG SS VANCHISS
Subjt:  SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISS

Query:  NGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAK
        NG H++L++G KD G  VD C++  PCKQLL  KRRLLDAPPETLG AALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRA+LRA+LKPYAK
Subjt:  NGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAK

Query:  SLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSR
        SL+SS YDTGLAGEW+EAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEA+ITELLVGLNYLW+FGRELNAKALNECASSR
Subjt:  SLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSR

Query:  IHDEYLDIVG
        IHDEYLDIVG
Subjt:  IHDEYLDIVG

TrEMBL top hitse value%identityAlignment
A0A0A0LPX2 Uncharacterized protein9.1e-31090.16Show/hide
Query:  MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK
        MVSESWFRSLWKPPRKRE STQKVVIGVLAFEIA+LMSKLVHLWQSLSDKQV RLREEIN+SLGIKKLVSD+DE+IVRLI AEMTE+L HVAKSVARLGK
Subjt:  MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK

Query:  KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS
        KCSDPSLKNFEH+FD LIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQT  RMK NE+SDVINLVEF+KKVAWKQQEVKNLREMS
Subjt:  KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS

Query:  LWKRTYDYTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFY
        LWKRTYDYTILLLARSL TIFSRIK VF  EQS DN   +DSRDM+SDYIARSQSVSSLMQSMVHPSE +GLTKFASGPLKRFTTKSGPISKTAK NNFY
Subjt:  LWKRTYDYTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFY

Query:  SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISS
        SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSS +RSG  SG+ NKTNQKNWLVGGYSS+FNGKKSHQKPNRLTQVGPFKGCMISG SS VANCHISS
Subjt:  SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISS

Query:  NGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAK
        NG H++L+NG KD G IV+ CN+  PCKQLL  K RLLDAPPETLG AALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRA+LKPYAK
Subjt:  NGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAK

Query:  SLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSR
        SL+SS YDTGLAGEW+EAI GILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLW FGRELNAKALNECASSR
Subjt:  SLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSR

Query:  IHDEYLDIVG
        IHDEYLDIVG
Subjt:  IHDEYLDIVG

A0A1S3C3N3 uncharacterized protein LOC1034964871.0e-30890Show/hide
Query:  MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK
        MVSESWFRSLWKPPRKRE S QKVVIGVLAFEIA+LMSKLVHLWQSLSDKQV RLREEIN+SLGIKKLVSD+DE+IVRLI AEMTE+LAHVAKSVARLGK
Subjt:  MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK

Query:  KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS
        KCSDPSLKNFEH+FD LIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQT  RMK NE+SDVINLVEF+KKVAWKQQEVKNLREMS
Subjt:  KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS

Query:  LWKRTYDYTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFY
        LWKRTYDYTILLLARSL TIFSRIK VF IEQS DN   +DSRDM+SDYIARSQSVSSLMQSMVHPSE +GLTKFASGPLKRFTTKSGPISKTA+ +NFY
Subjt:  LWKRTYDYTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFY

Query:  SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISS
        SGPLG SITKSGPISGPVSGTNRNFNSYSGPLTSS +RSG  SG+ NKTNQKNWLVGGYSS+FNGKKSHQKPNRLTQVGPFKGCMISG SS VANCHISS
Subjt:  SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISS

Query:  NGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAK
        NG H++L+NG KD GKIV+ CN+  PCKQLL  K RLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRA+LKPYAK
Subjt:  NGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAK

Query:  SLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSR
        SL+SS YDTGLAGEW+EAI GILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLW FGRELNAKALNECASSR
Subjt:  SLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSR

Query:  IHDEYLDIVG
        +HDEYLDIVG
Subjt:  IHDEYLDIVG

A0A5D3CIT4 Uncharacterized protein1.0e-30890Show/hide
Query:  MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK
        MVSESWFRSLWKPPRKRE S QKVVIGVLAFEIA+LMSKLVHLWQSLSDKQV RLREEIN+SLGIKKLVSD+DE+IVRLI AEMTE+LAHVAKSVARLGK
Subjt:  MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK

Query:  KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS
        KCSDPSLKNFEH+FD LIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQT  RMK NE+SDVINLVEF+KKVAWKQQEVKNLREMS
Subjt:  KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS

Query:  LWKRTYDYTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFY
        LWKRTYDYTILLLARSL TIFSRIK VF IEQS DN   +DSRDM+SDYIARSQSVSSLMQSMVHPSE +GLTKFASGPLKRFTTKSGPISKTA+ +NFY
Subjt:  LWKRTYDYTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFY

Query:  SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISS
        SGPLG SITKSGPISGPVSGTNRNFNSYSGPLTSS +RSG  SG+ NKTNQKNWLVGGYSS+FNGKKSHQKPNRLTQVGPFKGCMISG SS VANCHISS
Subjt:  SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISS

Query:  NGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAK
        NG H++L+NG KD GKIV+ CN+  PCKQLL  K RLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRA+LKPYAK
Subjt:  NGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAK

Query:  SLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSR
        SL+SS YDTGLAGEW+EAI GILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLW FGRELNAKALNECASSR
Subjt:  SLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSR

Query:  IHDEYLDIVG
        +HDEYLDIVG
Subjt:  IHDEYLDIVG

A0A6J1BWJ5 uncharacterized protein LOC1110063360.0e+0099.51Show/hide
Query:  MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK
        MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK
Subjt:  MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK

Query:  KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS
        KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKME KVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS
Subjt:  KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS

Query:  LWKRTYDYTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFY
        LWKRTYDY+ILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFY
Subjt:  LWKRTYDYTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFY

Query:  SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISS
        SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSG ISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISS
Subjt:  SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISS

Query:  NGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAK
        NGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAK
Subjt:  NGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAK

Query:  SLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSR
        SLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSR
Subjt:  SLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSR

Query:  IHDEYLDIVG
        IHDEYLDIVG
Subjt:  IHDEYLDIVG

A0A6J1JMI9 uncharacterized protein LOC1114872071.4e-30286.56Show/hide
Query:  MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK
        MVSESWFRSLWKP RKR+SSTQKVVIGVLAFEIA+LMS+LVHLW SLSDKQ+RRLRE+I +SLGIKKLVSD+DE+IVRLI AE+TE+LAHVAKSVARLGK
Subjt:  MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK

Query:  KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS
        KCSDPSLK FEH+FD+L+QIGADPY WIYSWKKMEKKVKKMETFISVNANLYQEMEML +LEQT RRMKVN+ +  +NL +F+KKV WKQQEVKNLREMS
Subjt:  KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS

Query:  LWKRTYDYTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFY
        LW+RTYDYTILLLARS+ TIFSRIKHVFGIEQS    E NDSRDMNSDYIARSQSVSSLMQSMVHPSE NGLTKFASGPLKRFTTKSGPISKTAK NNFY
Subjt:  LWKRTYDYTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFY

Query:  SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISS
        SGPLGSS+TKSGPI GPVSGTNRNFNSYSGPLTSST+RSG +SG++NKTN KNW  GGYSS+FNGKKSHQKPNRLTQVGPFKGCM+SG SS V NCHISS
Subjt:  SGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISS

Query:  NGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAK
        NG H++L+NG KD GKIVDPCNK  PC++LLG K +LLDAPPETLG AALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRA+LKPYAK
Subjt:  NGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAK

Query:  SLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSR
         L+SS YDTGLA EW++AI GILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTE IITELLVGLNYLWRFGRE NAKALNECASSR
Subjt:  SLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSR

Query:  IHDEYLDIVG
        IHDE+LDIVG
Subjt:  IHDEYLDIVG

SwissProt top hitse value%identityAlignment
P0DO24 Protein PSK SIMULATOR 31.5e-1125.36Show/hide
Query:  IGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGKKCSDPSLKNFEHIFDDLIQIGADPY
        +G+LAFE+A  + K  +L +SLS + +  L+  I  S G++ LVS++ + ++RL+ A+  + L   +  V R G +  D    N +  FD   +I  +  
Subjt:  IGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGKKCSDPSLKNFEHIFDDLIQIGADPY

Query:  GWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVIN-----LVEFQKKVAWKQQEVKNLREMSLWKRTYDYTILLLARSLATI
              +     V ++   +   A LYQE+++L  LE+   + +  EE+   +     L   + ++  +++ VK+L++ SLW R ++  +  L   +  +
Subjt:  GWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVIN-----LVEFQKKVAWKQQEVKNLREMSLWKRTYDYTILLLARSLATI

Query:  FSRIKHVFG
           I ++FG
Subjt:  FSRIKHVFG

P0DO24 Protein PSK SIMULATOR 39.9e-1133.11Show/hide
Query:  LGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPY--AKSLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVR------W
        LG A LALHYAN+I+ I+ L A    I  +ARD LY  LP  ++ +LR+ +K +   K LS +        +  + +   L WL P+A N  +      W
Subjt:  LGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPY--AKSLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVR------W

Query:  QSERSFEQQNFVSRT---NMLLVQTLFFANQEKTEAIITELLVGLNYL
          E +    +F S+    ++L ++TL+ A++EKTE  I   ++ L +L
Subjt:  QSERSFEQQNFVSRT---NMLLVQTLFFANQEKTEAIITELLVGLNYL

Q9SA91 Protein PSK SIMULATOR 22.9e-1026.34Show/hide
Query:  IGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGKKCSDPSLKNFEHIFDDLIQIGADPY
        + +LAFE+A  ++K   L QSLS++ ++ +++++  S  +KKLVS +   +  L  ++  E L   +  V R G  C D    N +  F   +++  +  
Subjt:  IGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGKKCSDPSLKNFEHIFDDLIQIGADPY

Query:  GWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVI-----NLVEFQKKVAWKQQEVKNLREMSLWKRTYDYTILLLARSLATI
                 E +++++ T   + + LY E++ L   EQ  RR     ES  +      +V  Q ++  +++ VK+L++ SLW +     I  L   ++ I
Subjt:  GWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVI-----NLVEFQKKVAWKQQEVKNLREMSLWKRTYDYTILLLARSLATI

Query:  FSRIKHVFGIEQSEDN-GEMNDSR
           I  VFG     DN GE    R
Subjt:  FSRIKHVFGIEQSEDN-GEMNDSR

Q9XID5 Protein PSK SIMULATOR 11.4e-0931.97Show/hide
Query:  LGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAKSLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVR---------
        LG+A LALHYAN+I  I+ L +    +    RD LY  LP  ++++LR+ ++ +   +        +  E  +     L+WL P+A N  +         
Subjt:  LGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAKSLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVR---------

Query:  -WQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYL
         W S  S   Q    +T +L + TL  A++EKTEA I +L+V L++L
Subjt:  -WQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYL

Arabidopsis top hitse value%identityAlignment
AT1G30755.1 Protein of unknown function (DUF668)2.0e-1126.34Show/hide
Query:  IGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGKKCSDPSLKNFEHIFDDLIQIGADPY
        + +LAFE+A  ++K   L QSLS++ ++ +++++  S  +KKLVS +   +  L  ++  E L   +  V R G  C D    N +  F   +++  +  
Subjt:  IGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGKKCSDPSLKNFEHIFDDLIQIGADPY

Query:  GWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVI-----NLVEFQKKVAWKQQEVKNLREMSLWKRTYDYTILLLARSLATI
                 E +++++ T   + + LY E++ L   EQ  RR     ES  +      +V  Q ++  +++ VK+L++ SLW +     I  L   ++ I
Subjt:  GWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVI-----NLVEFQKKVAWKQQEVKNLREMSLWKRTYDYTILLLARSLATI

Query:  FSRIKHVFGIEQSEDN-GEMNDSR
           I  VFG     DN GE    R
Subjt:  FSRIKHVFGIEQSEDN-GEMNDSR

AT3G23160.1 Protein of unknown function (DUF668)2.3e-8735.84Show/hide
Query:  STQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGKKCSDPSLKNFEHIFDDLIQ
        S  K  IG+L+FE+A +MSK +HL +SLSD ++ +L+ E+  S G++KLVS ++ H++ L  +E  + L+ VA  V+RLGKKC++P+L+ FEH+++D++ 
Subjt:  STQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGKKCSDPSLKNFEHIFDDLIQ

Query:  IGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMSLWKRTYDYTILLLARSLAT
           D     +  K ME  VKKME F++   +LY EME++ +LEQ + +++ +++    ++  F++K+ W++Q+VK+LR+ SLW +TYD  + +LAR++ T
Subjt:  IGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMSLWKRTYDYTILLLARSLAT

Query:  IFSRIKHVF---GIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFYSGPLGSSITKSGPISG
        I+ RI+ VF   G+   +D     D     +     S+SV+                                 S+ ++A+ F         T++G  + 
Subjt:  IFSRIKHVF---GIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFYSGPLGSSITKSGPISG

Query:  PVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISSNGIHTELLNGTKDAGK
        P  GTN                                                 P R+                    C   +  I  +  +  +D G+
Subjt:  PVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISSNGIHTELLNGTKDAGK

Query:  IVDPCNKTIPCK-QLLGAKRRLLD-APPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAKSLSSSAYDTGLAGE
        I  P +     +    G K RL   A   T+G +AL+LHYANV+IV+EKL   PHLIG +ARDDLY MLP  ++ +L+ASL+ Y K++  S YD  LA +
Subjt:  IVDPCNKTIPCK-QLLGAKRRLLD-APPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAKSLSSSAYDTGLAGE

Query:  WSEAITGILEWLAPLAHNMVRWQSERSFEQQN-FVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASS
        W E I GIL WLAPLAHNM+RWQSER+FEQQN  V RTN+LL+QTL+FA++EKTEA I +LLVGLNY+  + ++ N  AL +CASS
Subjt:  WSEAITGILEWLAPLAHNMVRWQSERSFEQQN-FVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASS

AT5G04550.1 Protein of unknown function (DUF668)1.5e-18457.33Show/hide
Query:  MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK
        MVSE+WFR+LW+ P+K +    K V+GVLAFE+A+L+SKLVHLWQSLSDK V RLR+EI  S GIKKLVS++D+ IVRLI  EM E++ +VAK+VARL +
Subjt:  MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGK

Query:  KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS
        KC+DP LK FE+ F D+++ GADPYGW + WKKM+KK KKME FIS NA+LYQE E+LADLEQT +RMK N ES   NL+E+QKKV WK+ EVKNLR++S
Subjt:  KCSDPSLKNFEHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMS

Query:  LWKRTYDYTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFT--TKSGPISKTAKANN
        LW RTYDYT++LL RS+ TI SR KHVFGI       E +D    +SD+I RS SVS+++  + H SE +GL +FASGPL RFT        +++ K ++
Subjt:  LWKRTYDYTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFT--TKSGPISKTAKANN

Query:  FYSGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHI
        F SG L +   KSGP+   V+  ++ F  YSG L   T +SG + G+  K N+K        SI + KK   K NRLTQVGPFKGCM+S         H 
Subjt:  FYSGPLGSSITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHI

Query:  SSNGIHTELLNGTKDAG---KIVDPCNKTIPCKQL-LGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRAS
            + T   NG +++     I++  + ++  + L L ++ +L DA P TLG A LALHYANVIIVIE+  ASPHLIG DARDDLYNMLPA VR SLR  
Subjt:  SSNGIHTELLNGTKDAG---KIVDPCNKTIPCKQL-LGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRAS

Query:  LKPYAKSLSSS-AYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKAL
        LKPY+K+LSSS  YD GLA EW++A+ GILEWL PLAHNM++WQSERS+E Q+ VSRT+++L QTLFFANQ+KTEAIITELLVGLNY+WRFGRELNAKAL
Subjt:  LKPYAKSLSSS-AYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKAL

Query:  NECASSRIHDEYLD
         EC SS+  ++ LD
Subjt:  NECASSRIHDEYLD

AT5G08660.1 Protein of unknown function (DUF668)1.1e-1225.36Show/hide
Query:  IGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGKKCSDPSLKNFEHIFDDLIQIGADPY
        +G+LAFE+A  + K  +L +SLS + +  L+  I  S G++ LVS++ + ++RL+ A+  + L   +  V R G +  D    N +  FD   +I  +  
Subjt:  IGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGKKCSDPSLKNFEHIFDDLIQIGADPY

Query:  GWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVIN-----LVEFQKKVAWKQQEVKNLREMSLWKRTYDYTILLLARSLATI
              +     V ++   +   A LYQE+++L  LE+   + +  EE+   +     L   + ++  +++ VK+L++ SLW R ++  +  L   +  +
Subjt:  GWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVIN-----LVEFQKKVAWKQQEVKNLREMSLWKRTYDYTILLLARSLATI

Query:  FSRIKHVFG
           I ++FG
Subjt:  FSRIKHVFG

AT5G08660.1 Protein of unknown function (DUF668)7.0e-1233.11Show/hide
Query:  LGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPY--AKSLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVR------W
        LG A LALHYAN+I+ I+ L A    I  +ARD LY  LP  ++ +LR+ +K +   K LS +        +  + +   L WL P+A N  +      W
Subjt:  LGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPY--AKSLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVR------W

Query:  QSERSFEQQNFVSRT---NMLLVQTLFFANQEKTEAIITELLVGLNYL
          E +    +F S+    ++L ++TL+ A++EKTE  I   ++ L +L
Subjt:  QSERSFEQQNFVSRT---NMLLVQTLFFANQEKTEAIITELLVGLNYL

AT5G51670.1 Protein of unknown function (DUF668)1.5e-7833.56Show/hide
Query:  STQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGKKCSDPSLKNFEHIFDDLIQ
        ST    +GVL+FE+A +M+KL+HL  SL+D  +   R+   S  G+ K+V+ ++   + L+ AE+ +SLAH A SV+RL  +C+  SL++F  +F +   
Subjt:  STQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGKKCSDPSLKNFEHIFDDLIQ

Query:  IGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKV------------NEESDVINLVEFQKKVAWKQQEVKNLREMSLWKRTYD
        +G DP+GW+ + K  E K KK+E ++SV   LY+EME +A LE +LR+  +              + DV+ +++ Q K+  ++Q VK L++ SLW +++D
Subjt:  IGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKV------------NEESDVINLVEFQKKVAWKQQEVKNLREMSLWKRTYD

Query:  YTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFYSGPLGSS
          +L+LARS+ T  +R+K VF                                                                +A A  +    + SS
Subjt:  YTILLLARSLATIFSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFYSGPLGSS

Query:  ITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISSNGIHTEL
        + +S                    L+SS+    L+                          H  PN                                  
Subjt:  ITKSGPISGPVSGTNRNFNSYSGPLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISSNGIHTEL

Query:  LNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAKSLSSSAY
                   +  +KT      L    RLL  P  TLG A +ALHYAN+I+V+EK+   P L+GLDARDDLY+MLPA VR+SLR+ L    K +  +A 
Subjt:  LNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDAPPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAKSLSSSAY

Query:  DTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTN----MLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKAL
        D GLA EW  A+  IL WL PLA NM+RWQSERSFEQQ+  + TN    ++LVQTL FA++ KTEA ITELLVGLNY+WRF RE+ AKAL
Subjt:  DTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSRTN----MLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTCAGAATCTTGGTTTCGCAGTCTGTGGAAGCCTCCAAGGAAGCGTGAGTCTAGCACGCAGAAGGTCGTAATTGGAGTATTGGCATTTGAAATTGCTACCTTGAT
GTCCAAGTTGGTTCACCTATGGCAGTCCTTGAGCGATAAGCAAGTCAGGAGGTTGAGAGAAGAGATAAACAGTTCATTGGGCATAAAGAAGCTTGTGTCAGATAACGACG
AGCATATAGTTCGTTTGATATATGCAGAGATGACCGAGAGTTTAGCTCATGTGGCTAAATCTGTTGCCAGACTGGGAAAGAAATGCAGTGATCCTAGTTTAAAAAACTTC
GAGCATATTTTTGATGATTTAATTCAAATTGGAGCCGATCCCTATGGTTGGATATACTCATGGAAGAAGATGGAAAAGAAGGTGAAGAAGATGGAGACATTTATTTCTGT
CAATGCAAATTTGTATCAAGAGATGGAGATGCTTGCAGATCTTGAGCAAACGTTGAGGAGAATGAAGGTTAATGAGGAATCGGATGTAATCAATCTAGTCGAGTTTCAAA
AGAAGGTTGCATGGAAACAGCAGGAGGTGAAAAATTTGCGAGAAATGTCACTTTGGAAAAGGACCTATGATTACACAATTCTTCTGCTAGCTAGATCCCTAGCCACAATA
TTCAGTAGGATCAAACATGTATTTGGAATTGAGCAGAGTGAAGATAATGGTGAAATGAATGATTCACGAGATATGAATTCTGACTATATTGCACGTAGTCAATCAGTTTC
TTCATTAATGCAATCCATGGTTCATCCTTCAGAGCACAACGGTCTTACTAAATTTGCTTCGGGTCCCCTGAAGAGGTTTACAACTAAATCAGGGCCAATTTCAAAAACTG
CTAAAGCCAATAATTTCTATTCAGGTCCTCTTGGTAGCTCTATTACAAAATCAGGTCCTATTTCAGGTCCTGTTTCTGGAACCAACAGAAACTTCAACTCTTATTCAGGT
CCTCTCACAAGTTCAACGATGAGATCAGGTCTAATTTCTGGAGTACACAACAAAACCAATCAGAAGAATTGGCTTGTCGGTGGCTACTCTTCCATATTTAATGGGAAGAA
ATCTCATCAGAAACCAAACAGACTAACTCAAGTTGGACCTTTTAAAGGATGCATGATATCTGGCACTAGTTCTCCAGTAGCGAATTGCCACATCAGTTCAAATGGGATTC
ATACAGAGCTTCTCAATGGAACCAAAGATGCTGGAAAGATTGTTGATCCTTGCAACAAAACCATTCCTTGCAAACAGTTGCTCGGTGCCAAGCGCCGATTATTGGATGCC
CCACCCGAGACTCTAGGTGCTGCTGCTTTGGCACTGCACTATGCAAATGTCATTATTGTAATTGAGAAGCTTGCTGCATCTCCTCATTTGATTGGCCTGGATGCTAGAGA
TGATCTCTACAATATGCTGCCCGCCAAAGTTCGAGCTAGCCTTAGGGCATCCCTAAAGCCATATGCCAAGAGCTTGTCTTCATCTGCTTATGACACAGGTCTTGCAGGAG
AGTGGAGTGAGGCAATAACTGGGATACTAGAATGGCTTGCTCCACTGGCTCACAACATGGTTAGATGGCAATCTGAACGTAGTTTTGAACAACAGAATTTTGTTTCCAGG
ACTAATATGCTTCTTGTACAAACCCTTTTCTTTGCAAATCAAGAAAAGACCGAAGCAATCATCACCGAACTTCTTGTTGGCCTGAATTACCTATGGAGGTTTGGTAGGGA
ACTCAATGCAAAAGCTCTCAATGAGTGTGCCAGCAGTAGAATACATGATGAATATTTGGATATCGTTGGG
mRNA sequenceShow/hide mRNA sequence
ATGGTTTCAGAATCTTGGTTTCGCAGTCTGTGGAAGCCTCCAAGGAAGCGTGAGTCTAGCACGCAGAAGGTCGTAATTGGAGTATTGGCATTTGAAATTGCTACCTTGAT
GTCCAAGTTGGTTCACCTATGGCAGTCCTTGAGCGATAAGCAAGTCAGGAGGTTGAGAGAAGAGATAAACAGTTCATTGGGCATAAAGAAGCTTGTGTCAGATAACGACG
AGCATATAGTTCGTTTGATATATGCAGAGATGACCGAGAGTTTAGCTCATGTGGCTAAATCTGTTGCCAGACTGGGAAAGAAATGCAGTGATCCTAGTTTAAAAAACTTC
GAGCATATTTTTGATGATTTAATTCAAATTGGAGCCGATCCCTATGGTTGGATATACTCATGGAAGAAGATGGAAAAGAAGGTGAAGAAGATGGAGACATTTATTTCTGT
CAATGCAAATTTGTATCAAGAGATGGAGATGCTTGCAGATCTTGAGCAAACGTTGAGGAGAATGAAGGTTAATGAGGAATCGGATGTAATCAATCTAGTCGAGTTTCAAA
AGAAGGTTGCATGGAAACAGCAGGAGGTGAAAAATTTGCGAGAAATGTCACTTTGGAAAAGGACCTATGATTACACAATTCTTCTGCTAGCTAGATCCCTAGCCACAATA
TTCAGTAGGATCAAACATGTATTTGGAATTGAGCAGAGTGAAGATAATGGTGAAATGAATGATTCACGAGATATGAATTCTGACTATATTGCACGTAGTCAATCAGTTTC
TTCATTAATGCAATCCATGGTTCATCCTTCAGAGCACAACGGTCTTACTAAATTTGCTTCGGGTCCCCTGAAGAGGTTTACAACTAAATCAGGGCCAATTTCAAAAACTG
CTAAAGCCAATAATTTCTATTCAGGTCCTCTTGGTAGCTCTATTACAAAATCAGGTCCTATTTCAGGTCCTGTTTCTGGAACCAACAGAAACTTCAACTCTTATTCAGGT
CCTCTCACAAGTTCAACGATGAGATCAGGTCTAATTTCTGGAGTACACAACAAAACCAATCAGAAGAATTGGCTTGTCGGTGGCTACTCTTCCATATTTAATGGGAAGAA
ATCTCATCAGAAACCAAACAGACTAACTCAAGTTGGACCTTTTAAAGGATGCATGATATCTGGCACTAGTTCTCCAGTAGCGAATTGCCACATCAGTTCAAATGGGATTC
ATACAGAGCTTCTCAATGGAACCAAAGATGCTGGAAAGATTGTTGATCCTTGCAACAAAACCATTCCTTGCAAACAGTTGCTCGGTGCCAAGCGCCGATTATTGGATGCC
CCACCCGAGACTCTAGGTGCTGCTGCTTTGGCACTGCACTATGCAAATGTCATTATTGTAATTGAGAAGCTTGCTGCATCTCCTCATTTGATTGGCCTGGATGCTAGAGA
TGATCTCTACAATATGCTGCCCGCCAAAGTTCGAGCTAGCCTTAGGGCATCCCTAAAGCCATATGCCAAGAGCTTGTCTTCATCTGCTTATGACACAGGTCTTGCAGGAG
AGTGGAGTGAGGCAATAACTGGGATACTAGAATGGCTTGCTCCACTGGCTCACAACATGGTTAGATGGCAATCTGAACGTAGTTTTGAACAACAGAATTTTGTTTCCAGG
ACTAATATGCTTCTTGTACAAACCCTTTTCTTTGCAAATCAAGAAAAGACCGAAGCAATCATCACCGAACTTCTTGTTGGCCTGAATTACCTATGGAGGTTTGGTAGGGA
ACTCAATGCAAAAGCTCTCAATGAGTGTGCCAGCAGTAGAATACATGATGAATATTTGGATATCGTTGGG
Protein sequenceShow/hide protein sequence
MVSESWFRSLWKPPRKRESSTQKVVIGVLAFEIATLMSKLVHLWQSLSDKQVRRLREEINSSLGIKKLVSDNDEHIVRLIYAEMTESLAHVAKSVARLGKKCSDPSLKNF
EHIFDDLIQIGADPYGWIYSWKKMEKKVKKMETFISVNANLYQEMEMLADLEQTLRRMKVNEESDVINLVEFQKKVAWKQQEVKNLREMSLWKRTYDYTILLLARSLATI
FSRIKHVFGIEQSEDNGEMNDSRDMNSDYIARSQSVSSLMQSMVHPSEHNGLTKFASGPLKRFTTKSGPISKTAKANNFYSGPLGSSITKSGPISGPVSGTNRNFNSYSG
PLTSSTMRSGLISGVHNKTNQKNWLVGGYSSIFNGKKSHQKPNRLTQVGPFKGCMISGTSSPVANCHISSNGIHTELLNGTKDAGKIVDPCNKTIPCKQLLGAKRRLLDA
PPETLGAAALALHYANVIIVIEKLAASPHLIGLDARDDLYNMLPAKVRASLRASLKPYAKSLSSSAYDTGLAGEWSEAITGILEWLAPLAHNMVRWQSERSFEQQNFVSR
TNMLLVQTLFFANQEKTEAIITELLVGLNYLWRFGRELNAKALNECASSRIHDEYLDIVG