CuGenDBv2

Tan0018326 (gene) of Snake gourd v1 genome

Gene ID: Tan0018326
Organism: Trichosanthes anguina (Snake gourd v1)
Description: T4P13.26 protein isoform 1
Genome location: LG11:3646143..3648281
RNA-Seq Expression: Tan0018326
Synteny: Tan0018326
Gene Ontology terms: GO:0016874 - ligase activity (molecular function)
InterPro domains: IPR025638 - Protein of unknown function DUF4336
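
As a rough illustration, the genome location string above can be split into a sequence ID and a coordinate pair. The helper name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that the record does not state.

```python
import re

def parse_location(loc):
    """Split a string like 'LG11:3646143..3648281' into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

seqid, start, end = parse_location("LG11:3646143..3648281")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 2139 bp on LG11.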


Homology
GenBank top hits (e value, % identity, alignment)
KAG6602525.1 hypothetical protein SDJN03_07758, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.2e-235, % identity: 92.7)
Query:  MTAAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKDSI
        MTAAIAVPSPKSAILG++PFS KDPASNFLGRPLKG GF PKPRTK D LSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVK  I
Subjt:  MTAAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKDSI

Query:  WLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF
        WLFEQEQALGFSSVSTNIRMTVIKLKS  L     + P   C    LL ELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF
Subjt:  WLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF

Query:  LGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK
         GIFGAKTL+DEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHK+SRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDN 
Subjt:  LGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK

Query:  TNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER
         NRQKGWERMVLQILFLGPSNLLEP ASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER
Subjt:  TNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER

Query:  YVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
        Y+NRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  YVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

XP_022957685.1 uncharacterized protein LOC111459147 [Cucurbita moschata] (e value: 1.1e-234, % identity: 92.48)
Query:  MTAAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKDSI
        MTAAIAVPSPKSAILG++PFS KDPASNFLGRPLKG GF PKPRTK D LSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVK  I
Subjt:  MTAAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKDSI

Query:  WLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF
        WLFEQEQALGFSSVSTNIRMTVIKLKS  L     + P   C    LL ELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF
Subjt:  WLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF

Query:  LGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK
         GIFGAKTL+DEDL APWADEIEQKVLSSPEVGIGPYVEVAFYHK+SRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDN 
Subjt:  LGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK

Query:  TNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER
         NRQKGWERMVLQILFLGPSNLLEP ASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER
Subjt:  TNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER

Query:  YVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
        Y+NRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  YVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

XP_022990931.1 uncharacterized protein LOC111487673 [Cucurbita maxima] (e value: 2.5e-234, % identity: 92.26)
Query:  MTAAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKDSI
        MTAAIAVPSPKSAILG++PFS KDPASNFLGRPLKG GF PKPRTK D LSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVK  I
Subjt:  MTAAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKDSI

Query:  WLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF
        WLFEQEQALGFSSVSTNIRMTVIKLKS  L     + P   C    LL ELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF
Subjt:  WLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF

Query:  LGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK
         GIFGAKTL+DEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHK+SRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDN 
Subjt:  LGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK

Query:  TNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER
         NR+KGWERMVLQILFLGPSNLLEP ASFA+MSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER
Subjt:  TNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER

Query:  YVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
        Y+NRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  YVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

XP_023541326.1 uncharacterized protein LOC111801534 [Cucurbita pepo subsp. pepo] (e value: 4.2e-234, % identity: 92.26)
Query:  MTAAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKDSI
        MTAAIAVPSPKSAIL ++PFS KDPASNFLGRPLKG GF PKPRTK D LSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVK  I
Subjt:  MTAAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKDSI

Query:  WLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF
        WLFEQEQALGFSSVSTNIRMTVIKLKS  L     + P   C    LL ELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF
Subjt:  WLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF

Query:  LGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK
         GIFGAKTL+DEDLSAPW DEIEQKVLSSPEVGIGPYVEVAFYHK+SRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDN 
Subjt:  LGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK

Query:  TNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER
         NRQKGWERMVLQILFLGPSNLLEP ASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER
Subjt:  TNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER

Query:  YVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
        Y+NRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  YVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

XP_038889479.1 uncharacterized protein LOC120079388 [Benincasa hispida] (e value: 2.5e-234, % identity: 92.48)
Query:  MTAAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKDSI
        MTAAIAVPSPKSAILGQ PF  KDPAS+FLGR LKG GFQPK RTKLDPL LIVASATPSSSSD N  ERFYFNFTGFPFPLGPFLNRRTIRTEAVK SI
Subjt:  MTAAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKDSI

Query:  WLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF
        WLFEQEQALGFSSVSTNIRMTVIKLKS  L     + P   C    LLKELEAPVEYI+LPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF
Subjt:  WLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF

Query:  LGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK
        LGIFGAKTL+DEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHK+SRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK
Subjt:  LGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK

Query:  TNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER
         NRQKGWERMVLQILFLGPSNLLEP ASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNAS SDFLTAFGFLDDLLGER
Subjt:  TNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER

Query:  YVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
        YVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  YVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KVS5 Uncharacterized protein (e value: 1.4e-222, % identity: 89.16)
Query:  MTAAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKDSI
        MTAAIAV  PKSAIL Q+PFS KDP  NFLGR  KG     K RTKL PL LIVAS+T SSSSD +  ERFYFNFTGFPFPLGPFLNRRTIRTEAVK SI
Subjt:  MTAAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKDSI

Query:  WLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF
        WLFEQEQALGFSSVSTNIRMTVIKLKS  L     + P   C    LLKEL+APVEYI+LPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF
Subjt:  WLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF

Query:  LGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK
        LGIF AKTL+DEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHK+SRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEE VVDNK
Subjt:  LGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK

Query:  TNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER
         NRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNAS SDFLTAFGFLDDLLGER
Subjt:  TNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER

Query:  YVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
        YVNRPSLSLLF SLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  YVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

A0A1S3C9H7 uncharacterized protein LOC103498519 (e value: 9.5e-224, % identity: 89.16)
Query:  MTAAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKDSI
        MTAAIAV SPKSAIL Q+PFS KDP  +FLGR  KG     K RTKL PL LIVASAT SSSS+ N  ERFYFNFTGFPFPLGPFLNRRTIRTEAVK SI
Subjt:  MTAAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKDSI

Query:  WLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF
        WLFEQEQALGFSSVSTNIRMTVIKLKS  L     + P   C    LLKEL+APVEYI+LPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF
Subjt:  WLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF

Query:  LGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK
        LGIFGAKTL+DEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHK+S+TLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK
Subjt:  LGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK

Query:  TNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER
         NRQKGWERMVLQILFLGPSNLLEP ASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNAS SDFLTAFGFLDDLLGER
Subjt:  TNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER

Query:  YVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
        YVNRPSLS+LF SLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  YVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

A0A6J1BW70 uncharacterized protein LOC111006077 (e value: 8.6e-233, % identity: 91.59)
Query:  MTAAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKDSI
        M+AAIAVPSPKSAILG++P S KDPAS FL RPLKG GF PKPRTK+DP+ LIVASAT SSSSD N GERFYFNFTGFPFPLGPFLNRRTIRTEAVK SI
Subjt:  MTAAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKDSI

Query:  WLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF
        WLFEQEQALGFSSVSTNIRMTVIKLKS  L     + P   C    LLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF
Subjt:  WLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF

Query:  LGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK
        LGIFGAKTL+DEDLS PWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK
Subjt:  LGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK

Query:  TNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER
        TNRQKGWERMVLQILFLGPSNLLEP ASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPV ASRSDFLTAFGFLDDLLGER
Subjt:  TNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER

Query:  YVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
        YVNRPSLSLLF S+MGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  YVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

A0A6J1H184 uncharacterized protein LOC111459147 (e value: 5.4e-235, % identity: 92.48)
Query:  MTAAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKDSI
        MTAAIAVPSPKSAILG++PFS KDPASNFLGRPLKG GF PKPRTK D LSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVK  I
Subjt:  MTAAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKDSI

Query:  WLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF
        WLFEQEQALGFSSVSTNIRMTVIKLKS  L     + P   C    LL ELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF
Subjt:  WLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF

Query:  LGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK
         GIFGAKTL+DEDL APWADEIEQKVLSSPEVGIGPYVEVAFYHK+SRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDN 
Subjt:  LGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK

Query:  TNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER
         NRQKGWERMVLQILFLGPSNLLEP ASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER
Subjt:  TNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER

Query:  YVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
        Y+NRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  YVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

A0A6J1JPB2 uncharacterized protein LOC111487673 (e value: 1.2e-234, % identity: 92.26)
Query:  MTAAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKDSI
        MTAAIAVPSPKSAILG++PFS KDPASNFLGRPLKG GF PKPRTK D LSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVK  I
Subjt:  MTAAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKDSI

Query:  WLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF
        WLFEQEQALGFSSVSTNIRMTVIKLKS  L     + P   C    LL ELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF
Subjt:  WLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEF

Query:  LGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK
         GIFGAKTL+DEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHK+SRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDN 
Subjt:  LGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNK

Query:  TNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER
         NR+KGWERMVLQILFLGPSNLLEP ASFA+MSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER
Subjt:  TNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGER

Query:  YVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
        Y+NRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  YVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G01060.1 unknown protein (e value: 5.6e-192, % identity: 75.55)
Query:  AAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATP----SSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKD
        AA+AV  PK +     P    D  +NFLG     +G       + +   ++ AS+T     +   D +  +RFY NFTGFPFPLGPFLNRRTIRTEAVK 
Subjt:  AAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATP----SSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKD

Query:  SIWLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPL
         IW+FEQEQALGFSSVSTNIRMTVIKLKS  L     + P   C    L+KEL APVEYIVLPTFAYEHKIFVGPFSRKFP+AQ+WVAPRQWSWPLNLPL
Subjt:  SIWLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPL

Query:  EFLGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVD
        EF GIF AK ++D DLS PWA+EIEQKVLSSPEVGIGPYVEVAFYHK+SRTLLVTDAVIFVPR+PP  IS ESLLASAKNGLAVK+LSKGK++P +PVVD
Subjt:  EFLGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVD

Query:  NKTNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLG
        N   RQKGWERMVLQILFLGPSNLLEP ASFA+MSQKLIVSPIVKTLVFSKVPEKVRDWID I RDW+FKRIIPAHF AP+NA RSDFL AFGFL+DLLG
Subjt:  NKTNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLG

Query:  ERYVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
        ERYV RPSLSLLFTSLMGKAASYFPPDDM+TLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  ERYVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

AT3G01060.2 unknown protein (e value: 8.6e-185, % identity: 73.79)
Query:  AAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATP----SSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKD
        AA+AV  PK +     P    D  +NFLG     +G       + +   ++ AS+T     +   D +  +RFY NFTGFPFPLGPFLNRRTIRTEAVK 
Subjt:  AAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATP----SSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKD

Query:  SIWLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPL
         IW+FEQEQALGFSSVSTNIRMTVIKLKS  L     + P   C    L+KEL APVEYIVLPTFAYEHKIFVGPFSRKFP+AQ+WVAPRQWSWPLNLPL
Subjt:  SIWLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPL

Query:  EFLGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVD
        EF GIF AK ++D DLS PWA+EIEQKVLSSP        EVAFYHK+SRTLLVTDAVIFVPR+PP  IS ESLLASAKNGLAVK+LSKGK++P +PVVD
Subjt:  EFLGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVD

Query:  NKTNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLG
        N   RQKGWERMVLQILFLGPSNLLEP ASFA+MSQKLIVSPIVKTLVFSKVPEKVRDWID I RDW+FKRIIPAHF AP+NA RSDFL AFGFL+DLLG
Subjt:  NKTNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLG

Query:  ERYVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
        ERYV RPSLSLLFTSLMGKAASYFPPDDM+TLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  ERYVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

AT3G01060.3 unknown protein (e value: 3.9e-137, % identity: 71.55)
Query:  AAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATP----SSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKD
        AA+AV  PK +     P    D  +NFLG     +G       + +   ++ AS+T     +   D +  +RFY NFTGFPFPLGPFLNRRTIRTEAVK 
Subjt:  AAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATP----SSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKD

Query:  SIWLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPL
         IW+FEQEQALGFSSVSTNIRMTVIKLKS  L     + P   C    L+KEL APVEYIVLPTFAYEHKIFVGPFSRKFP+AQ+WVAPRQWSWPLNLPL
Subjt:  SIWLFEQEQALGFSSVSTNIRMTVIKLKSWRL-----MGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPL

Query:  EFLGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVD
        EF GIF AK ++D DLS PWA+EIEQKVLSSPEVGIGPYVEVAFYHK+SRTLLVTDAVIFVPR+PP  IS ESLLASAKNGLAVK+LSKGK++P +PVVD
Subjt:  EFLGIFGAKTLQDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVD

Query:  NKTNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEK
        N   RQKGWERMVLQILFLGPSNLLEP ASFA+MSQKLIVSPIVKTLVFSKVPEK
Subjt:  NKTNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQKLIVSPIVKTLVFSKVPEK


Sequences
CDS sequence
ATGACAGCAGCCATTGCTGTTCCTTCCCCTAAATCCGCCATTTTAGGCCAAAGTCCATTCTCCCTCAAAGACCCTGCTTCCAATTTTCTCGGTCGACCGCTTAAGGGTAT
CGGTTTTCAGCCCAAACCGAGGACGAAACTAGATCCTTTAAGTTTAATTGTTGCTTCAGCCACTCCATCTAGCAGCAGCGACGCAAATGCCGGTGAAAGATTCTACTTCA
ATTTCACTGGATTCCCTTTTCCCCTCGGGCCCTTTCTTAATCGGCGCACTATAAGAACTGAGGCTGTAAAAGATTCTATATGGCTTTTTGAGCAAGAACAAGCGTTGGGA
TTCAGTAGTGTCTCAACAAACATTAGGATGACAGTCATCAAACTCAAATCCTGGAGGCTTATGGGTCCATGCACCCATTGCTCCAACCAAGGACTTCTGAAAGAGTTGGA
GGCTCCTGTAGAGTACATTGTTTTACCAACTTTTGCATATGAGCACAAAATTTTTGTTGGACCGTTTTCGAGGAAGTTCCCACGCGCTCAGATATGGGTGGCACCAAGGC
AATGGAGTTGGCCATTGAACTTGCCATTGGAATTTTTGGGAATTTTTGGAGCTAAAACATTGCAAGACGAGGATTTATCTGCTCCATGGGCCGATGAGATTGAGCAGAAA
GTTTTAAGCTCACCAGAAGTTGGGATTGGACCATATGTGGAGGTTGCTTTTTATCATAAGCAGTCGAGAACGCTACTGGTAACAGATGCCGTAATCTTTGTCCCTCGACA
ACCTCCTGAATGCATTAGCAAAGAATCCTTGTTGGCATCAGCAAAGAATGGTTTGGCCGTAAAACTACTTAGTAAAGGAAAGGAAGTCCCTGAAGAGCCAGTTGTTGACA
ACAAGACCAACCGTCAAAAAGGGTGGGAAAGAATGGTTCTTCAGATATTGTTCCTTGGGCCTTCAAATCTCTTGGAACCTAAAGCTAGCTTTGCTCAAATGTCACAGAAA
CTTATTGTTTCACCCATTGTAAAGACTCTCGTCTTTAGCAAAGTTCCTGAAAAGGTAAGGGACTGGATTGATAGGATCGTCCGAGATTGGAAGTTCAAGAGAATCATCCC
CGCTCACTTTGCAGCTCCAGTAAATGCAAGTAGGTCCGATTTCTTAACAGCATTCGGGTTTCTCGATGACCTTCTTGGAGAGCGCTACGTCAACCGACCTTCACTCTCTC
TTCTCTTTACATCACTCATGGGGAAGGCTGCCAGTTACTTTCCACCAGATGATATGAAGACCTTATCATCCCTTGACCAGTTTTTAGTATCAGTTGGAGCCGTGAAGAAG
ACCGTCTCGGGCAGAAAAAGGTAA
mRNA sequence
ATGACAGCAGCCATTGCTGTTCCTTCCCCTAAATCCGCCATTTTAGGCCAAAGTCCATTCTCCCTCAAAGACCCTGCTTCCAATTTTCTCGGTCGACCGCTTAAGGGTAT
CGGTTTTCAGCCCAAACCGAGGACGAAACTAGATCCTTTAAGTTTAATTGTTGCTTCAGCCACTCCATCTAGCAGCAGCGACGCAAATGCCGGTGAAAGATTCTACTTCA
ATTTCACTGGATTCCCTTTTCCCCTCGGGCCCTTTCTTAATCGGCGCACTATAAGAACTGAGGCTGTAAAAGATTCTATATGGCTTTTTGAGCAAGAACAAGCGTTGGGA
TTCAGTAGTGTCTCAACAAACATTAGGATGACAGTCATCAAACTCAAATCCTGGAGGCTTATGGGTCCATGCACCCATTGCTCCAACCAAGGACTTCTGAAAGAGTTGGA
GGCTCCTGTAGAGTACATTGTTTTACCAACTTTTGCATATGAGCACAAAATTTTTGTTGGACCGTTTTCGAGGAAGTTCCCACGCGCTCAGATATGGGTGGCACCAAGGC
AATGGAGTTGGCCATTGAACTTGCCATTGGAATTTTTGGGAATTTTTGGAGCTAAAACATTGCAAGACGAGGATTTATCTGCTCCATGGGCCGATGAGATTGAGCAGAAA
GTTTTAAGCTCACCAGAAGTTGGGATTGGACCATATGTGGAGGTTGCTTTTTATCATAAGCAGTCGAGAACGCTACTGGTAACAGATGCCGTAATCTTTGTCCCTCGACA
ACCTCCTGAATGCATTAGCAAAGAATCCTTGTTGGCATCAGCAAAGAATGGTTTGGCCGTAAAACTACTTAGTAAAGGAAAGGAAGTCCCTGAAGAGCCAGTTGTTGACA
ACAAGACCAACCGTCAAAAAGGGTGGGAAAGAATGGTTCTTCAGATATTGTTCCTTGGGCCTTCAAATCTCTTGGAACCTAAAGCTAGCTTTGCTCAAATGTCACAGAAA
CTTATTGTTTCACCCATTGTAAAGACTCTCGTCTTTAGCAAAGTTCCTGAAAAGGTAAGGGACTGGATTGATAGGATCGTCCGAGATTGGAAGTTCAAGAGAATCATCCC
CGCTCACTTTGCAGCTCCAGTAAATGCAAGTAGGTCCGATTTCTTAACAGCATTCGGGTTTCTCGATGACCTTCTTGGAGAGCGCTACGTCAACCGACCTTCACTCTCTC
TTCTCTTTACATCACTCATGGGGAAGGCTGCCAGTTACTTTCCACCAGATGATATGAAGACCTTATCATCCCTTGACCAGTTTTTAGTATCAGTTGGAGCCGTGAAGAAG
ACCGTCTCGGGCAGAAAAAGGTAA
Protein sequence
MTAAIAVPSPKSAILGQSPFSLKDPASNFLGRPLKGIGFQPKPRTKLDPLSLIVASATPSSSSDANAGERFYFNFTGFPFPLGPFLNRRTIRTEAVKDSIWLFEQEQALG
FSSVSTNIRMTVIKLKSWRLMGPCTHCSNQGLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLQDEDLSAPWADEIEQK
VLSSPEVGIGPYVEVAFYHKQSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNKTNRQKGWERMVLQILFLGPSNLLEPKASFAQMSQK
LIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKRIIPAHFAAPVNASRSDFLTAFGFLDDLLGERYVNRPSLSLLFTSLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKK
TVSGRKR
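
The CDS and protein listings can be spot-checked against each other by translating the two ends of the CDS. The sketch below is illustrative only: it assumes the standard genetic code, the `translate` helper and the `cds_head`/`cds_tail` names are hypothetical, and the two fragments are the first 30 nt and the final 24 nt copied from the CDS sequence above.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(nt):
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))

cds_head = "ATGACAGCAGCCATTGCTGTTCCTTCCCCT"  # first 30 nt of the CDS
cds_tail = "ACCGTCTCGGGCAGAAAAAGGTAA"        # last 24 nt of the CDS

print(translate(cds_head))  # MTAAIAVPSP
print(translate(cds_tail))  # TVSGRKR*
```

Both fragments agree with the start (MTAAIAVPSP) and end (TVSGRKR) of the protein sequence, and the CDS ends in a TAA stop codon.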