; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g0169 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g0169
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionprotein FAF-like, chloroplastic
Genome locationMC02:1666431..1667582
RNA-Seq ExpressionMC02g0169
SyntenyMC02g0169
Gene Ontology termsNA
InterPro domainsIPR021410 - The fantastic four family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140267.1 protein FAF-like, chloroplastic [Cucumis sativus]3.51e-15266.06Show/hide
Query:  LPLSAADSDDVNKEFRSPNKRELD---AWSSILFQKS-ADAAPKSPALL-PYVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQ
        +P S A  D+ N      +K  +     WSSIL   S +D  PKSP +  PYVHPL+KK++ SL++ SL +CTESLGSETGSDGFSSYP SED D     
Subjt:  LPLSAADSDDVNKEFRSPNKRELD---AWSSILFQKS-ADAAPKSPALL-PYVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQ

Query:  GRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSFVTTP---VVEEEDEFEQLL
            +  +  TF+W+P+KFSRKKSPPRSFPPP+ S  SPDG S+CI SRRE+GRL+LDAVSVPSRKNF A+RRDGRL+LS   TP   +V EE+E E+++
Subjt:  GRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSFVTTP---VVEEEDEFEQLL

Query:  ARELEEVKDSEIAEEKDEGNDLETEELELEI---PRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPKEKDTSE-PLTPLSQSLPPRPPS-PLATAATSLN
        A E EEVK+SEI E  ++ N LE EELE+ I   PRLSSSVMNFHRL  MMKK  NGL+NRNP WPKEKD SE P TPLSQSLPPRPPS   ATA T LN
Subjt:  ARELEEVKDSEIAEEKDEGNDLETEELELEI---PRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPKEKDTSE-PLTPLSQSLPPRPPS-PLATAATSLN

Query:  AYEYYWRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSLLLREPCCIATT
        AYEYYWRSKPTGKSAGIQN  GQQQ Q    +TRKLISS+NQMA+EKQQ+LVL+GNRGDYLVPLSNGCK+PRRS+LLREPCCIATT
Subjt:  AYEYYWRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSLLLREPCCIATT

XP_022153041.1 protein FAF-like, chloroplastic [Momordica charantia]9.18e-19499.64Show/hide
Query:  MKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSFVTTPVVEEEDEFEQLL
        MKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPS SSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSFVTTPVVEEEDEFEQLL
Subjt:  MKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSFVTTPVVEEEDEFEQLL

Query:  ARELEEVKDSEIAEEKDEGNDLETEELELEIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPKEKDTSEPLTPLSQSLPPRPPSPLATAATSLNAYEYY
        ARELEEVKDSEIAEEKDEGNDLETEELELEIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPKEKDTSEPLTPLSQSLPPRPPSPLATAATSLNAYEYY
Subjt:  ARELEEVKDSEIAEEKDEGNDLETEELELEIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPKEKDTSEPLTPLSQSLPPRPPSPLATAATSLNAYEYY

Query:  WRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSLLLREPCCIATT
        WRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSLLLREPCCIATT
Subjt:  WRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSLLLREPCCIATT

XP_022986115.1 protein FAF-like, chloroplastic [Cucurbita maxima]8.10e-15062.44Show/hide
Query:  LNKVASSHGFSLP------------LSAADSDDVNKEF------------------RSPNKRELDAWSSILFQKSADAAPKSPALLPYVHPLVKKSASSL
        + +VASS  FSLP             S+ D  D ++ F                  +  N  + D+W+SILFQ SA   PKSP ++PYVHP ++KSA SL
Subjt:  LNKVASSHGFSLP------------LSAADSDDVNKEF------------------RSPNKRELDAWSSILFQKSADAAPKSPALLPYVHPLVKKSASSL

Query:  TENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPS
        +ENSLL+CTESLGSETGSDGFSSYP SE  D         + P  QTFQW+PIKF RKKSPPRSFPPP+ S  SPDGASVCI SRRENGRL+LDAVSVPS
Subjt:  TENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPS

Query:  RKNFIADRRDGRLVLSFVTTPVVEEEDEFEQLLARELEEVKDSEIAEEKDEGNDLETEELEL---EIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPK
        +KNF ADRRDGRLVLSFVTTP          LL      V++S+I EE+D   DLE +E E+   E+ RLSSSVMNFHRLA+MMK KPNG +NRNP WPK
Subjt:  RKNFIADRRDGRLVLSFVTTPVVEEEDEFEQLLARELEEVKDSEIAEEKDEGNDLETEELEL---EIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPK

Query:  EKDTSEPLTPLSQSLPPRPPS-PLATAATSLNAYEYYWRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGC
        +KD+ E LTPLSQSLPPRPPS   ATAATSLNAYEYYWRSKPTGKSAGIQN   QQQ QPI+S+TRK ISS+NQMA+EKQQ+L       +YLVPLSNGC
Subjt:  EKDTSEPLTPLSQSLPPRPPS-PLATAATSLNAYEYYWRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGC

Query:  KLPRRSLLLREPCCIATT
        K+PRRS+LLREPCCIATT
Subjt:  KLPRRSLLLREPCCIATT

XP_023513350.1 protein FAF-like, chloroplastic [Cucurbita pepo subsp. pepo]7.74e-14963.77Show/hide
Query:  LNKVASSHGFSL-PLSAADSDD-----------------VNKEFRSP--------NKRELDAWSSILFQKSADAAPKSPALLPYVHPLVKKSASSLTENS
        + KVASS  FSL P S++ S                   V++E R          N  + D+W SILFQ SA   PKSP + PYVHPL++KSA SL+ENS
Subjt:  LNKVASSHGFSL-PLSAADSDD-----------------VNKEFRSP--------NKRELDAWSSILFQKSADAAPKSPALLPYVHPLVKKSASSLTENS

Query:  LLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNF
        LL+CTESLGSETGSDGFSSYP SE  D        ++ P  QTFQW+PIKF RKKSPPRSFPPP+ S  SPDGASVCI SRRENGRL+LDAVSVPS+KNF
Subjt:  LLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNF

Query:  IADRRDGRLVLSFVTTPVVEEEDEFEQLLARELEEVKDSEIAEEKDEGNDLETEELEL---EIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPKEKDT
         ADRRDGRLVLSFVTTP          LL      VK+S+I EE+D   DLE +E E+   E+ RLSSSVMNFHRLA+MMK KPNG +NRNP WPK KD 
Subjt:  IADRRDGRLVLSFVTTPVVEEEDEFEQLLARELEEVKDSEIAEEKDEGNDLETEELEL---EIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPKEKDT

Query:  SEPLTPLSQSLPPRPPS-PLATAATSLNAYEYYWRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPR
         E LTPLSQSLPPRPPS   ATAATSLNAYEYYWRSKPTGKSAGIQN   QQQ QPI+S+TRK ISS+NQ+A+EKQQ+L       +YLVPLSNGCK+PR
Subjt:  SEPLTPLSQSLPPRPPS-PLATAATSLNAYEYYWRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPR

Query:  RSLLLREPCCIATT
        RS+LLREPCCIATT
Subjt:  RSLLLREPCCIATT

XP_038901797.1 protein FAF-like, chloroplastic [Benincasa hispida]2.27e-15966.58Show/hide
Query:  LNKVASSHGFSLPLSAADSDDVNKEFRSPNKRELDAWSSILFQKSADAAPKSPALLPYVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDD
        + KVASS  FSLP    D    +K   S   R+ D+WSSIL   S    PKS   +PYVHPL+KK++ SL+E SL +CTESLGSETGSD FSS   SED 
Subjt:  LNKVASSHGFSLPLSAADSDDVNKEFRSPNKRELDAWSSILFQKSADAAPKSPALLPYVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDD

Query:  DEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSFVTTP----VVEEE
        D      + ++ P   T +W+P+KF R KSPPRSFPPP+ S +SPDGASVCI SRRENGRL+LDAVSVPSRKNF A+RRDGRLVLSF+TTP    V EEE
Subjt:  DEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSFVTTP----VVEEE

Query:  DEFEQLLARELEEVKDSEIAEEKDEGNDLETEELEL---EIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPKEKDTSEPLTPLSQSLPPRPPS-PLAT
        DE E+L+ARE EEVK+SEI  E+D+ NDLE EELE+   ++PRL SS+MNFHRL LM+K KP+  +NRNP WPKEK+ SE  T LSQSLPPRPPS   AT
Subjt:  DEFEQLLARELEEVKDSEIAEEKDEGNDLETEELEL---EIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPKEKDTSEPLTPLSQSLPPRPPS-PLAT

Query:  AATSLNAYEYYWRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSLLLREPCCIATT
        AA SLNAYEYYWR K TGKSAGIQN IG QQ QPI SVTRKLISS+NQMA+EK Q  +LRGNRGDY+VPLSNGCK+PRRS+LLREPCCIATT
Subjt:  AATSLNAYEYYWRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSLLLREPCCIATT

TrEMBL top hitse value%identityAlignment
A0A0A0KI67 Uncharacterized protein1.70e-15266.06Show/hide
Query:  LPLSAADSDDVNKEFRSPNKRELD---AWSSILFQKS-ADAAPKSPALL-PYVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQ
        +P S A  D+ N      +K  +     WSSIL   S +D  PKSP +  PYVHPL+KK++ SL++ SL +CTESLGSETGSDGFSSYP SED D     
Subjt:  LPLSAADSDDVNKEFRSPNKRELD---AWSSILFQKS-ADAAPKSPALL-PYVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQ

Query:  GRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSFVTTP---VVEEEDEFEQLL
            +  +  TF+W+P+KFSRKKSPPRSFPPP+ S  SPDG S+CI SRRE+GRL+LDAVSVPSRKNF A+RRDGRL+LS   TP   +V EE+E E+++
Subjt:  GRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSFVTTP---VVEEEDEFEQLL

Query:  ARELEEVKDSEIAEEKDEGNDLETEELELEI---PRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPKEKDTSE-PLTPLSQSLPPRPPS-PLATAATSLN
        A E EEVK+SEI E  ++ N LE EELE+ I   PRLSSSVMNFHRL  MMKK  NGL+NRNP WPKEKD SE P TPLSQSLPPRPPS   ATA T LN
Subjt:  ARELEEVKDSEIAEEKDEGNDLETEELELEI---PRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPKEKDTSE-PLTPLSQSLPPRPPS-PLATAATSLN

Query:  AYEYYWRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSLLLREPCCIATT
        AYEYYWRSKPTGKSAGIQN  GQQQ Q    +TRKLISS+NQMA+EKQQ+LVL+GNRGDYLVPLSNGCK+PRRS+LLREPCCIATT
Subjt:  AYEYYWRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSLLLREPCCIATT

A0A5D3B9V0 Protein FAF-like2.94e-14265.94Show/hide
Query:  AWSSILFQKS-ADAAPKS------PALLPYVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRK
        +WSSIL   S +D  PKS      P   PYVHPL+KK++ SL++ SL +CTESLGSETGSDGFSSYP SED D               TF+W+P+KFSRK
Subjt:  AWSSILFQKS-ADAAPKS------PALLPYVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRK

Query:  KSPPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSFVTTPV-----VEEEDEFEQLLARELEEVKDSEIAEEKDEGN
        KSPPRSFPP + S  SPDG S+ I SRR++GRL+LDAVSVPSRKNF A+RRDGRL+LS  TTP       EEE+E E+L+A E EEVK+SEI E  ++ N
Subjt:  KSPPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSFVTTPV-----VEEEDEFEQLLARELEEVKDSEIAEEKDEGN

Query:  DLETEELELEI---PRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPKEKDTSEP--LTPLSQSLPPRPPS-PLATAATSLNAYEYYWRSKPTGKSAGIQN
         LE EELE+ I   PRLSSSVMNFHRL  MMKK  NGL+NRNP W  EKD  E    TPLSQSLPPRPPS   ATA + LNAYEYYWRSKPTGKSAGIQN
Subjt:  DLETEELELEI---PRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPKEKDTSEP--LTPLSQSLPPRPPS-PLATAATSLNAYEYYWRSKPTGKSAGIQN

Query:  SIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSLLLREPCCIATT
          GQQQ Q    +TRKLISS+NQM +EKQQ+LVL+GNRGDYLVPLSNGCK+PRRS+LLREPCCIATT
Subjt:  SIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSLLLREPCCIATT

A0A6J1DJI0 protein FAF-like, chloroplastic4.45e-19499.64Show/hide
Query:  MKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSFVTTPVVEEEDEFEQLL
        MKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPS SSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSFVTTPVVEEEDEFEQLL
Subjt:  MKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSFVTTPVVEEEDEFEQLL

Query:  ARELEEVKDSEIAEEKDEGNDLETEELELEIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPKEKDTSEPLTPLSQSLPPRPPSPLATAATSLNAYEYY
        ARELEEVKDSEIAEEKDEGNDLETEELELEIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPKEKDTSEPLTPLSQSLPPRPPSPLATAATSLNAYEYY
Subjt:  ARELEEVKDSEIAEEKDEGNDLETEELELEIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPKEKDTSEPLTPLSQSLPPRPPSPLATAATSLNAYEYY

Query:  WRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSLLLREPCCIATT
        WRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSLLLREPCCIATT
Subjt:  WRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSLLLREPCCIATT

A0A6J1FYT2 protein FAF-like, chloroplastic5.65e-14762.93Show/hide
Query:  LNKVASSHGFSLPLSAADSDD-------------VNKEFRSP---------NKRELDAWSSILFQKSADAAPKSPALLPYVHPLVKKSASSLTENSLLVC
        + KVASS  FSLP  ++ S+D             V++E R           N  + D+W SILFQ SA   PKS +++PYVHPL++KSA SL+ENSLL+C
Subjt:  LNKVASSHGFSLPLSAADSDD-------------VNKEFRSP---------NKRELDAWSSILFQKSADAAPKSPALLPYVHPLVKKSASSLTENSLLVC

Query:  TESLGSETGSDGFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADR
        TESLGSETGSDGFSSYP SE  D        ++ P  QTFQW+PIKF RKKSPPRSFPPP+ S  SPDGASVCI SRRENGRL+LDAVSVPS+KNF ADR
Subjt:  TESLGSETGSDGFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADR

Query:  RDGRLVLSFVTTPVVEEEDEFEQLLARELEEVKDSEIAEEKDEGNDLETEELEL---EIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPKEKDTSEPL
        RDGRL+LSFVTTP          LL      +K+S+I EE+D   DLE ++ E+   E+ RLSSSVMNFHRLA+MMK KPNG +NRNP WPKEKD  EPL
Subjt:  RDGRLVLSFVTTPVVEEEDEFEQLLARELEEVKDSEIAEEKDEGNDLETEELEL---EIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPKEKDTSEPL

Query:  TPLSQSLPPRPPS-PLATAATSLNAYEYYWRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSLL
        TPLSQSLPPRPPS   ATAATSLNAYEYYWRSKPTG    IQN   QQQ QPI+S+T K ISS+NQMA+EKQQ+L       +YLVPLSNGCK+PRRS+L
Subjt:  TPLSQSLPPRPPS-PLATAATSLNAYEYYWRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSLL

Query:  LREPCCIATT
        LREPCCIATT
Subjt:  LREPCCIATT

A0A6J1JD58 protein FAF-like, chloroplastic3.92e-15062.44Show/hide
Query:  LNKVASSHGFSLP------------LSAADSDDVNKEF------------------RSPNKRELDAWSSILFQKSADAAPKSPALLPYVHPLVKKSASSL
        + +VASS  FSLP             S+ D  D ++ F                  +  N  + D+W+SILFQ SA   PKSP ++PYVHP ++KSA SL
Subjt:  LNKVASSHGFSLP------------LSAADSDDVNKEF------------------RSPNKRELDAWSSILFQKSADAAPKSPALLPYVHPLVKKSASSL

Query:  TENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPS
        +ENSLL+CTESLGSETGSDGFSSYP SE  D         + P  QTFQW+PIKF RKKSPPRSFPPP+ S  SPDGASVCI SRRENGRL+LDAVSVPS
Subjt:  TENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPS

Query:  RKNFIADRRDGRLVLSFVTTPVVEEEDEFEQLLARELEEVKDSEIAEEKDEGNDLETEELEL---EIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPK
        +KNF ADRRDGRLVLSFVTTP          LL      V++S+I EE+D   DLE +E E+   E+ RLSSSVMNFHRLA+MMK KPNG +NRNP WPK
Subjt:  RKNFIADRRDGRLVLSFVTTPVVEEEDEFEQLLARELEEVKDSEIAEEKDEGNDLETEELEL---EIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPK

Query:  EKDTSEPLTPLSQSLPPRPPS-PLATAATSLNAYEYYWRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGC
        +KD+ E LTPLSQSLPPRPPS   ATAATSLNAYEYYWRSKPTGKSAGIQN   QQQ QPI+S+TRK ISS+NQMA+EKQQ+L       +YLVPLSNGC
Subjt:  EKDTSEPLTPLSQSLPPRPPS-PLATAATSLNAYEYYWRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGC

Query:  KLPRRSLLLREPCCIATT
        K+PRRS+LLREPCCIATT
Subjt:  KLPRRSLLLREPCCIATT

SwissProt top hitse value%identityAlignment
Q0V865 Protein FAF-like, chloroplastic2.3e-3436.59Show/hide
Query:  DSDDVNKEFRSPNKRELDAWSSILFQKSADAAPKSPALLPYVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQH------
        D +D  KE       + D WSSIL +K    + K     PYVHPL+K+ ASSL+E SL +CTESLGSETG DGFSS+  SE  D  ++     +      
Subjt:  DSDDVNKEFRSPNKRELDAWSSILFQKSADAAPKSPALLPYVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQH------

Query:  HPQTQTFQWRPIKFSRKKS----------PPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSFV---TTPVVEEEDE
          + +      I   ++ S          PP SFPPP+ S SS  G+S+ + +RR+NGRLVL+AVS+PS  NF A R+DGRL+L+F      P  ++EDE
Subjt:  HPQTQTFQWRPIKFSRKKS----------PPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSFV---TTPVVEEEDE

Query:  FE---QLLARELEEVKDSEIAEEK--DE----GNDLETEELELEIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWP--KEKDTSEPL-TPLSQSLPPR-
         +   Q    E EE ++ E  EE+  DE     N L  +  +  IP      +  HRLA     KP G+  RN  WP   E DT   L TP+  SLPPR 
Subjt:  FE---QLLARELEEVKDSEIAEEK--DE----GNDLETEELELEIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWP--KEKDTSEPL-TPLSQSLPPR-

Query:  ----------PPSPL--ATAATSLNAYEYYWRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSL
                  PPS +     A   N  +Y W+S  T        S G          T+    + N           +  + GD  +   NGCK  RRSL
Subjt:  ----------PPSPL--ATAATSLNAYEYYWRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSL

Query:  LLREPCCIAT
        L  EP CIAT
Subjt:  LLREPCCIAT

Q6NMR8 Protein FANTASTIC FOUR 32.2e-0836.42Show/hide
Query:  SASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSR--RENGRLVL
        S  +L++ SL +CTE+LGSE+GSD         D DE         +    T + R +K  ++   P   PPPL +         CI  R  RENGRLV+
Subjt:  SASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSR--RENGRLVL

Query:  DAVSVPSRKN-FIADRRDGRLVLSFVTTPVVEEEDEFEQLLARELEEVKDSEIAEEKDEGND
         A + P R   F ADR +GRL LS +       E+E E +   E EE ++ E  EE+DE  D
Subjt:  DAVSVPSRKN-FIADRRDGRLVLSFVTTPVVEEEDEFEQLLARELEEVKDSEIAEEKDEGND

Q8GXU9 Protein FANTASTIC FOUR 21.6e-0632.42Show/hide
Query:  YVHPLVKKSASSLTENSLLVCTESLGSETGSD-----GFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPP----RSFPPPLPSPSSPDGA
        YVHP+ K+S S L E SL +CTESLG+ETGS+        ++  +     P +Q +PQ                  K+PP     SFPPP+         
Subjt:  YVHPLVKKSASSLTENSLLVCTESLGSETGSD-----GFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPP----RSFPPPLPSPSSPDGA

Query:  SVCIHSRRENGRLVLDAVSVPSRKN-FIADRRDGRLVLSFVTTPVVEEEDEFEQLLARELEEVKDSEIAEEKDEGNDLETEE
        +  +    E+GR+V+ A+ V S  + F+++R +GRL L   +            LL+   EE    E  EE +EG D ET E
Subjt:  SVCIHSRRENGRLVLDAVSVPSRKN-FIADRRDGRLVLSFVTTPVVEEEDEFEQLLARELEEVKDSEIAEEKDEGNDLETEE

Q9SFG6 Protein FANTASTIC FOUR 41.4e-0731.19Show/hide
Query:  AWSSILFQKSADAAPKSPALLPYVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSF
        +WS +    ++ +  K    LP        S  +L++ SL +CTESLGSETGSD         +D   +         +T +    P +  RK++   S 
Subjt:  AWSSILFQKSADAAPKSPALLPYVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSF

Query:  PPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSF-------VTTPVVEEEDEFEQLLARELEEVKDS--EIAEEKDEGNDLE
        PPPL S    D   + + S RENGRLV+ A   P R   + DR +G + L+        + T   EE++E E+     +E V+D+  EI E K+E  + E
Subjt:  PPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSF-------VTTPVVEEEDEFEQLLARELEEVKDS--EIAEEKDEGNDLE

Query:  TE
         E
Subjt:  TE

Q9SY06 Protein FANTASTIC FOUR 14.5e-0934.05Show/hide
Query:  YVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRE
        YV+P+ K+S + L   SL +CTESLG+E GSD          D+  +      +  ++     +P K +   +   SFPPPL S +  + + + + S +E
Subjt:  YVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRE

Query:  NGRLVLDAVSVPS-RKNFIADRRDGRLVLSFVTTPV--------VEEEDEFEQLLARELEEVKDSEIAEEKDEGNDLETEELELE
        +GRLV+ A+ V S  + F+++RR+GRL L      +         EEEDE +Q  A E EE ++ E  EE++E  + E EE E E
Subjt:  NGRLVLDAVSVPS-RKNFIADRRDGRLVLSFVTTPV--------VEEEDEFEQLLARELEEVKDSEIAEEKDEGNDLETEELELE

Arabidopsis top hitse value%identityAlignment
AT3G06020.1 Protein of unknown function (DUF3049)1.0e-0831.19Show/hide
Query:  AWSSILFQKSADAAPKSPALLPYVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSF
        +WS +    ++ +  K    LP        S  +L++ SL +CTESLGSETGSD         +D   +         +T +    P +  RK++   S 
Subjt:  AWSSILFQKSADAAPKSPALLPYVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSF

Query:  PPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSF-------VTTPVVEEEDEFEQLLARELEEVKDS--EIAEEKDEGNDLE
        PPPL S    D   + + S RENGRLV+ A   P R   + DR +G + L+        + T   EE++E E+     +E V+D+  EI E K+E  + E
Subjt:  PPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSF-------VTTPVVEEEDEFEQLLARELEEVKDS--EIAEEKDEGNDLE

Query:  TE
         E
Subjt:  TE

AT4G02810.1 Protein of unknown function (DUF3049)3.2e-1034.05Show/hide
Query:  YVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRE
        YV+P+ K+S + L   SL +CTESLG+E GSD          D+  +      +  ++     +P K +   +   SFPPPL S +  + + + + S +E
Subjt:  YVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRE

Query:  NGRLVLDAVSVPS-RKNFIADRRDGRLVLSFVTTPV--------VEEEDEFEQLLARELEEVKDSEIAEEKDEGNDLETEELELE
        +GRLV+ A+ V S  + F+++RR+GRL L      +         EEEDE +Q  A E EE ++ E  EE++E  + E EE E E
Subjt:  NGRLVLDAVSVPS-RKNFIADRRDGRLVLSFVTTPV--------VEEEDEFEQLLARELEEVKDSEIAEEKDEGNDLETEELELE

AT5G19260.1 Protein of unknown function (DUF3049)1.6e-0936.42Show/hide
Query:  SASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSR--RENGRLVL
        S  +L++ SL +CTE+LGSE+GSD         D DE         +    T + R +K  ++   P   PPPL +         CI  R  RENGRLV+
Subjt:  SASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQHHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSR--RENGRLVL

Query:  DAVSVPSRKN-FIADRRDGRLVLSFVTTPVVEEEDEFEQLLARELEEVKDSEIAEEKDEGND
         A + P R   F ADR +GRL LS +       E+E E +   E EE ++ E  EE+DE  D
Subjt:  DAVSVPSRKN-FIADRRDGRLVLSFVTTPVVEEEDEFEQLLARELEEVKDSEIAEEKDEGND

AT5G22090.1 Protein of unknown function (DUF3049)1.7e-3536.59Show/hide
Query:  DSDDVNKEFRSPNKRELDAWSSILFQKSADAAPKSPALLPYVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQH------
        D +D  KE       + D WSSIL +K    + K     PYVHPL+K+ ASSL+E SL +CTESLGSETG DGFSS+  SE  D  ++     +      
Subjt:  DSDDVNKEFRSPNKRELDAWSSILFQKSADAAPKSPALLPYVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQH------

Query:  HPQTQTFQWRPIKFSRKKS----------PPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSFV---TTPVVEEEDE
          + +      I   ++ S          PP SFPPP+ S SS  G+S+ + +RR+NGRLVL+AVS+PS  NF A R+DGRL+L+F      P  ++EDE
Subjt:  HPQTQTFQWRPIKFSRKKS----------PPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSFV---TTPVVEEEDE

Query:  FE---QLLARELEEVKDSEIAEEK--DE----GNDLETEELELEIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWP--KEKDTSEPL-TPLSQSLPPR-
         +   Q    E EE ++ E  EE+  DE     N L  +  +  IP      +  HRLA     KP G+  RN  WP   E DT   L TP+  SLPPR 
Subjt:  FE---QLLARELEEVKDSEIAEEK--DE----GNDLETEELELEIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWP--KEKDTSEPL-TPLSQSLPPR-

Query:  ----------PPSPL--ATAATSLNAYEYYWRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSL
                  PPS +     A   N  +Y W+S  T        S G          T+    + N           +  + GD  +   NGCK  RRSL
Subjt:  ----------PPSPL--ATAATSLNAYEYYWRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSL

Query:  LLREPCCIAT
        L  EP CIAT
Subjt:  LLREPCCIAT

AT5G22090.2 Protein of unknown function (DUF3049)1.7e-3536.59Show/hide
Query:  DSDDVNKEFRSPNKRELDAWSSILFQKSADAAPKSPALLPYVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQH------
        D +D  KE       + D WSSIL +K    + K     PYVHPL+K+ ASSL+E SL +CTESLGSETG DGFSS+  SE  D  ++     +      
Subjt:  DSDDVNKEFRSPNKRELDAWSSILFQKSADAAPKSPALLPYVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQH------

Query:  HPQTQTFQWRPIKFSRKKS----------PPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSFV---TTPVVEEEDE
          + +      I   ++ S          PP SFPPP+ S SS  G+S+ + +RR+NGRLVL+AVS+PS  NF A R+DGRL+L+F      P  ++EDE
Subjt:  HPQTQTFQWRPIKFSRKKS----------PPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSFV---TTPVVEEEDE

Query:  FE---QLLARELEEVKDSEIAEEK--DE----GNDLETEELELEIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWP--KEKDTSEPL-TPLSQSLPPR-
         +   Q    E EE ++ E  EE+  DE     N L  +  +  IP      +  HRLA     KP G+  RN  WP   E DT   L TP+  SLPPR 
Subjt:  FE---QLLARELEEVKDSEIAEEK--DE----GNDLETEELELEIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWP--KEKDTSEPL-TPLSQSLPPR-

Query:  ----------PPSPL--ATAATSLNAYEYYWRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSL
                  PPS +     A   N  +Y W+S  T        S G          T+    + N           +  + GD  +   NGCK  RRSL
Subjt:  ----------PPSPL--ATAATSLNAYEYYWRSKPTGKSAGIQNSIGQQQPQPIKSVTRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSL

Query:  LLREPCCIAT
        L  EP CIAT
Subjt:  LLREPCCIAT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CTCAACAAGGTTGCCTCTTCCCATGGATTCTCTCTCCCTCTCTCCGCCGCCGATTCCGATGACGTCAACAAGGAATTTCGGTCTCCAAACAAACGGGAACTAGATGCTTG
GAGTTCGATCCTGTTTCAGAAATCTGCCGATGCTGCTCCAAAATCGCCTGCGCTTCTTCCTTATGTTCATCCTCTTGTCAAGAAATCCGCCTCCTCTCTCACTGAAAACA
GCCTTCTGGTTTGTACTGAGAGCCTCGGATCCGAGACCGGATCCGACGGCTTCTCCTCCTACCCGCCGTCGGAAGACGACGATGAGCCGATGAAACAGGGGCGGCCGCAG
CACCACCCACAGACACAGACATTTCAGTGGAGGCCGATCAAGTTCAGCCGCAAGAAATCGCCGCCGAGATCGTTTCCGCCGCCGCTTCCTTCCCCGAGTTCCCCCGACGG
CGCCTCCGTTTGCATTCACTCTCGCCGTGAAAATGGACGGTTAGTTCTCGACGCCGTCTCTGTTCCGTCTCGGAAGAATTTCATCGCCGATCGTCGCGATGGCCGCCTCG
TTCTGTCTTTCGTTACCACTCCGGTTGTAGAGGAAGAAGACGAGTTCGAACAATTACTCGCTCGGGAGTTGGAGGAAGTGAAAGATTCGGAAATTGCCGAGGAAAAAGAC
GAGGGGAACGATTTGGAAACAGAGGAATTGGAATTGGAAATCCCGAGACTGTCGAGTTCTGTGATGAATTTCCACCGATTAGCGCTAATGATGAAGAAGAAGCCAAATGG
ATTGGTCAATCGGAATCCGCCATGGCCGAAGGAGAAAGATACATCGGAGCCGCTAACTCCACTCTCACAGTCACTCCCACCGCGCCCGCCGTCGCCTCTCGCTACGGCGG
CGACTTCTCTGAATGCCTACGAGTACTACTGGCGGTCAAAACCCACCGGAAAATCTGCCGGAATTCAAAATTCGATCGGGCAACAACAGCCGCAGCCGATCAAGAGCGTG
ACCCGGAAACTTATTTCTTCAGATAATCAAATGGCCAATGAGAAACAGCAAGTTTTGGTACTGAGAGGGAACAGAGGAGACTACTTGGTTCCATTGTCGAACGGCTGTAA
ATTGCCCAGAAGGTCTCTTCTTCTCCGGGAGCCCTGCTGCATTGCCACCACC
mRNA sequenceShow/hide mRNA sequence
CTCAACAAGGTTGCCTCTTCCCATGGATTCTCTCTCCCTCTCTCCGCCGCCGATTCCGATGACGTCAACAAGGAATTTCGGTCTCCAAACAAACGGGAACTAGATGCTTG
GAGTTCGATCCTGTTTCAGAAATCTGCCGATGCTGCTCCAAAATCGCCTGCGCTTCTTCCTTATGTTCATCCTCTTGTCAAGAAATCCGCCTCCTCTCTCACTGAAAACA
GCCTTCTGGTTTGTACTGAGAGCCTCGGATCCGAGACCGGATCCGACGGCTTCTCCTCCTACCCGCCGTCGGAAGACGACGATGAGCCGATGAAACAGGGGCGGCCGCAG
CACCACCCACAGACACAGACATTTCAGTGGAGGCCGATCAAGTTCAGCCGCAAGAAATCGCCGCCGAGATCGTTTCCGCCGCCGCTTCCTTCCCCGAGTTCCCCCGACGG
CGCCTCCGTTTGCATTCACTCTCGCCGTGAAAATGGACGGTTAGTTCTCGACGCCGTCTCTGTTCCGTCTCGGAAGAATTTCATCGCCGATCGTCGCGATGGCCGCCTCG
TTCTGTCTTTCGTTACCACTCCGGTTGTAGAGGAAGAAGACGAGTTCGAACAATTACTCGCTCGGGAGTTGGAGGAAGTGAAAGATTCGGAAATTGCCGAGGAAAAAGAC
GAGGGGAACGATTTGGAAACAGAGGAATTGGAATTGGAAATCCCGAGACTGTCGAGTTCTGTGATGAATTTCCACCGATTAGCGCTAATGATGAAGAAGAAGCCAAATGG
ATTGGTCAATCGGAATCCGCCATGGCCGAAGGAGAAAGATACATCGGAGCCGCTAACTCCACTCTCACAGTCACTCCCACCGCGCCCGCCGTCGCCTCTCGCTACGGCGG
CGACTTCTCTGAATGCCTACGAGTACTACTGGCGGTCAAAACCCACCGGAAAATCTGCCGGAATTCAAAATTCGATCGGGCAACAACAGCCGCAGCCGATCAAGAGCGTG
ACCCGGAAACTTATTTCTTCAGATAATCAAATGGCCAATGAGAAACAGCAAGTTTTGGTACTGAGAGGGAACAGAGGAGACTACTTGGTTCCATTGTCGAACGGCTGTAA
ATTGCCCAGAAGGTCTCTTCTTCTCCGGGAGCCCTGCTGCATTGCCACCACC
Protein sequenceShow/hide protein sequence
LNKVASSHGFSLPLSAADSDDVNKEFRSPNKRELDAWSSILFQKSADAAPKSPALLPYVHPLVKKSASSLTENSLLVCTESLGSETGSDGFSSYPPSEDDDEPMKQGRPQ
HHPQTQTFQWRPIKFSRKKSPPRSFPPPLPSPSSPDGASVCIHSRRENGRLVLDAVSVPSRKNFIADRRDGRLVLSFVTTPVVEEEDEFEQLLARELEEVKDSEIAEEKD
EGNDLETEELELEIPRLSSSVMNFHRLALMMKKKPNGLVNRNPPWPKEKDTSEPLTPLSQSLPPRPPSPLATAATSLNAYEYYWRSKPTGKSAGIQNSIGQQQPQPIKSV
TRKLISSDNQMANEKQQVLVLRGNRGDYLVPLSNGCKLPRRSLLLREPCCIATT