CuGenDBv2

Tan0019799 (gene) of Snake gourd v1 genome

Gene ID: Tan0019799
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein of unknown function (DUF620)
Genome location: LG08:63952132..63958540
RNA-Seq Expression: Tan0019799
Synteny: Tan0019799
Gene Ontology terms: NA
InterPro domains: IPR006873 - Protein of unknown function DUF620
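
The genome location in this record is written as a sequence name, a colon, and a start..end pair. A minimal sketch of reading that string follows; the function name `parse_location` is hypothetical and not part of CuGenDB, and the coordinates are assumed to be 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(location):
    """Split a string like 'LG08:63952132..63958540' into (name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    name = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return name, start, end

name, start, end = parse_location("LG08:63952132..63958540")
# Length of the gene region, ASSUMING inclusive 1-based coordinates.
span = end - start + 1
print(name, start, end, span)
```

Under that assumption, the span covers the whole gene region on LG08, which is longer than the mRNA because it presumably includes introns.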


Homology
GenBank top hits (e value, % identity, alignment)
KAG6596660.1 hypothetical protein SDJN03_09840, partial [Cucurbita argyrosperma subsp. sororia]; e value: 1.1e-195; % identity: 91.27
Query:  MNRLAPLSEEPIDEHDGRTRNRNRSTTAGS--GGRSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIEHFAAAT
        MNRLAPLSEEPIDE+DGRTR+RNRST AGS  GGRSWRNWIRTHLSIL CGKKSD LNVLLSVLGCPLFPVSVQPN  VSSTNQ+SSSSQYIIEHF AAT
Subjt:  MNRLAPLSEEPIDEHDGRTRNRNRSTTAGS--GGRSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIEHFAAAT

Query:  GCRKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLR
        GCRKLKGRVKNIF TGKLTMGMADEVSS    GGGGGGGPT GVA+KGCFVMWQMIPNKWLIEL+VGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLR
Subjt:  GCRKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLR

Query:  RAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKIGD
        RAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDL DRSDNTAEMIKH IYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKI D
Subjt:  RAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKIGD

Query:  YRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKDRTDGGLGLDVTAR
        YRPVD VMIAHSGETNVVITRFGDDLKTGPMITR+QE WSIDDVAFNVPGLS+DSFIPPKQ+Q D+T+  LGLD TAR
Subjt:  YRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKDRTDGGLGLDVTAR

XP_008443008.1 PREDICTED: uncharacterized protein LOC103486737 [Cucumis melo]; e value: 2.4e-193; % identity: 93.11
Query:  MNRLAPLSEEPIDEHDGRTRNRNRSTTAGSGGRSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIEHFAAATGC
        MNRLAPLSEEPIDEHD RTRNR+R TTAG GGRSWRNWIRTH SILS GKKSDGLNVLLSVLGCPLFPVS+QPN  VS TNQ+SSSSQYIIEHFAAATGC
Subjt:  MNRLAPLSEEPIDEHDGRTRNRNRSTTAGSGGRSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIEHFAAATGC

Query:  RKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRA
        RKL+GRVKNIFATGK+TMGMA+EVSS  GGGGGGGGGPTGGV +KGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRA
Subjt:  RKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRA

Query:  FQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKIGDYR
        FQGLDPLAISEVFSPAQYMGEKQIM+VDCFVLKLSA+QTDLADRSDNTAEMIKH IYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKI DYR
Subjt:  FQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKIGDYR

Query:  PVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKD
         VDGVMIAHSGET+V+ITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKD
Subjt:  PVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKD

XP_022942787.1 uncharacterized protein LOC111447715 [Cucurbita moschata]; e value: 5.6e-195; % identity: 90.74
Query:  MNRLAPLSEEPIDEHDGRTRNRNRSTTAGS--GGRSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIEHFAAAT
        MNRLAPLSEEPIDE+DGRTR+RNRST  GS  GGRSWRNWIRTHLSIL CGKKSD LNVLLSVLGCPLFPVSVQPN  VSS NQ+SSSSQYIIEHF AAT
Subjt:  MNRLAPLSEEPIDEHDGRTRNRNRSTTAGS--GGRSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIEHFAAAT

Query:  GCRKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLR
        GCRKLKGRVKNIF TGKLTMGMADEVSS    GGGGGGGPT GVA+KGCFVMWQMIPNKWLIEL+VGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLR
Subjt:  GCRKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLR

Query:  RAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKIGD
        RAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDL DRSDNTAEMIKH IYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKI +
Subjt:  RAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKIGD

Query:  YRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKDRTDGGLGLDVTAR
        YRPVDGVMIAHSGETNVVITRFGDDLKTGPMITR+QE WSIDDVAFNVPGLS+DSFIPPKQ+Q D+T+  LGLD TAR
Subjt:  YRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKDRTDGGLGLDVTAR

XP_023540199.1 uncharacterized protein LOC111800643 [Cucurbita pepo subsp. pepo]; e value: 5.1e-196; % identity: 91.27
Query:  MNRLAPLSEEPIDEHDGRTRNRNRSTTAGS--GGRSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIEHFAAAT
        MNRLAPLSEEPIDE+DGRTRNRNRSTTAGS  GGRSWRNWIRTHLSIL CGKKSD LNVLLSVLGCPLFPVSVQPN  VSSTNQ+SSSSQYIIEHF AAT
Subjt:  MNRLAPLSEEPIDEHDGRTRNRNRSTTAGS--GGRSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIEHFAAAT

Query:  GCRKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLR
        GCRKLKGRVKNIF TGKLTMGMADEVSS       GGGGPT GVA+KGCFVMWQMIPNKWLIEL+VGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLR
Subjt:  GCRKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLR

Query:  RAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKIGD
        RAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDL DRSDNTAEMIKH IYGYFCQRRGLL+YLEDSSLTRIQSPGSHPMYWETTMSTKI D
Subjt:  RAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKIGD

Query:  YRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKDRTDGGLGLDVTAR
        YRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQE WSIDDVAFNVPGLS+DSFIPPKQ+Q D+T+  LGLD TAR
Subjt:  YRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKDRTDGGLGLDVTAR

XP_038906374.1 uncharacterized protein LOC120092207 [Benincasa hispida]; e value: 8.1e-194; % identity: 92.7
Query:  MQLQRMNRLAPLSEEPIDEHDGRTRNRNRSTTAGSGG--RSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIEH
        M LQRMNRLAPLSEEPIDEHD RTRNRNRS+  G GG  RSWRNWIRTH SILSCGKKSDGLNVLLSVLGCPLFPVS+QPN +VS TNQ+SSSSQYIIEH
Subjt:  MQLQRMNRLAPLSEEPIDEHDGRTRNRNRSTTAGSGG--RSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIEH

Query:  FAAATGCRKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKGA
        FAAATGCRKLKGRVKNIF TGK+TMGMADEVSS  GGGG GGGG TGGV +KGCFVMWQMIPNKWLIEL+VGGHSIVAGSDGNVAWRHTPWLGSHAAKG 
Subjt:  FAAATGCRKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKGA

Query:  VRPLRRAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMS
        VRPLRRAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKH IYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMS
Subjt:  VRPLRRAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMS

Query:  TKIGDYRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKD
        TKI DYR VDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKD
Subjt:  TKIGDYRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKD

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B7U5 uncharacterized protein LOC103486737; e value: 1.1e-193; % identity: 93.11
Query:  MNRLAPLSEEPIDEHDGRTRNRNRSTTAGSGGRSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIEHFAAATGC
        MNRLAPLSEEPIDEHD RTRNR+R TTAG GGRSWRNWIRTH SILS GKKSDGLNVLLSVLGCPLFPVS+QPN  VS TNQ+SSSSQYIIEHFAAATGC
Subjt:  MNRLAPLSEEPIDEHDGRTRNRNRSTTAGSGGRSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIEHFAAATGC

Query:  RKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRA
        RKL+GRVKNIFATGK+TMGMA+EVSS  GGGGGGGGGPTGGV +KGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRA
Subjt:  RKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRA

Query:  FQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKIGDYR
        FQGLDPLAISEVFSPAQYMGEKQIM+VDCFVLKLSA+QTDLADRSDNTAEMIKH IYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKI DYR
Subjt:  FQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKIGDYR

Query:  PVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKD
         VDGVMIAHSGET+V+ITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKD
Subjt:  PVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKD

A0A6J1F6M9 uncharacterized protein LOC111441359; e value: 1.8e-191; % identity: 89.81
Query:  MQLQRMNRLAPLSEEPIDEHDGRTRNRNRS---TTAGSGGRSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIE
        MQLQRM+RLAPLSEEPIDE DGRTRNRNRS   +  G GGRSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSV+PN  VSS NQ+SSSSQYIIE
Subjt:  MQLQRMNRLAPLSEEPIDEHDGRTRNRNRS---TTAGSGGRSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIE

Query:  HFAAATGCRKLKGRVKNIFATGKLTMGMADEVSSGGGGG--GGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAA
        HFAAATGCRKL GRVKNIFATGKLTMG+ DEVSSGGGGG  GGGGGGPTGGV +KGCFVMWQMIPNKWLIEL+VGGHSIVAGSDGNVAWRHTPWLGSHAA
Subjt:  HFAAATGCRKLKGRVKNIFATGKLTMGMADEVSSGGGGG--GGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAA

Query:  KGAVRPLRRAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWET
        KGAVRPLRRAFQGLDPLAISEVFSP+QYMGEKQ+M VDCFVLKLS DQTDL +RSDNTAEMIKH IYGYFCQ+RGLLVYLEDSSLTRIQSPGSHPMYWET
Subjt:  KGAVRPLRRAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWET

Query:  TMSTKIGDYRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKD
        TMSTKIGDYR VDGVMIAHSGETNV+ITRFGDDLKTGPMITR+QE WSIDDVAFNV GL MDSFIPP+QV+KD
Subjt:  TMSTKIGDYRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKD

A0A6J1FR88 uncharacterized protein LOC111447715; e value: 2.7e-195; % identity: 90.74
Query:  MNRLAPLSEEPIDEHDGRTRNRNRSTTAGS--GGRSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIEHFAAAT
        MNRLAPLSEEPIDE+DGRTR+RNRST  GS  GGRSWRNWIRTHLSIL CGKKSD LNVLLSVLGCPLFPVSVQPN  VSS NQ+SSSSQYIIEHF AAT
Subjt:  MNRLAPLSEEPIDEHDGRTRNRNRSTTAGS--GGRSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIEHFAAAT

Query:  GCRKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLR
        GCRKLKGRVKNIF TGKLTMGMADEVSS    GGGGGGGPT GVA+KGCFVMWQMIPNKWLIEL+VGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLR
Subjt:  GCRKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLR

Query:  RAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKIGD
        RAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDL DRSDNTAEMIKH IYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKI +
Subjt:  RAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKIGD

Query:  YRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKDRTDGGLGLDVTAR
        YRPVDGVMIAHSGETNVVITRFGDDLKTGPMITR+QE WSIDDVAFNVPGLS+DSFIPPKQ+Q D+T+  LGLD TAR
Subjt:  YRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKDRTDGGLGLDVTAR

A0A6J1J631 uncharacterized protein LOC111481672; e value: 1.7e-192; % identity: 90.84
Query:  MQLQRMNRLAPLSEEPIDEHDGRTRNRNR--STTAGSGGRSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIEH
        MQLQRMNRLAPLSEEPIDEHDGRTRNRNR  S + G GGRSWRNWIRTHLSILS GK+SDGLNVLLSVLGCPLFPVSVQPN  VSS NQ+SSSSQYIIEH
Subjt:  MQLQRMNRLAPLSEEPIDEHDGRTRNRNR--STTAGSGGRSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIEH

Query:  FAAATGCRKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGG-GGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKG
        FAAATGCRKL GRVKNIFATGKLTMG+ DEVSSGGG  GGGG GGPTGGV +KGCFVMWQMIPNKWLIEL+VGGHSIVAGSDGNVAWRHTPWLGSHAAKG
Subjt:  FAAATGCRKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGG-GGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKG

Query:  AVRPLRRAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTM
        AVRPLRRAFQGLDPLAISEVFSPAQYMGEKQ+M +DCFVLKLS DQTDLA+RSDNTAEMIKH IYGYFCQ+RGLLVYLEDSSLTRIQSPGSHPMYWETTM
Subjt:  AVRPLRRAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTM

Query:  STKIGDYRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKD
        STKIGDYR VDGVMIAHSGETNVVITRFGDDLKTGPMITR+QE WSIDD+AFNVPGL MDSFIPP+QVQKD
Subjt:  STKIGDYRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKD

A0A6J1KUK6 uncharacterized protein LOC111498804; e value: 3.7e-192; % identity: 89.42
Query:  MNRLAPLSEEPIDEHDGRTRNRNRSTTAGS--GGRSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIEHFAAAT
        MNRLAPLSEEPIDE DGRTR+RNRST AGS  GGRSWRNWIRTHLSIL CGKKSD LNVLLSVLGCPLFPVSVQPN  VSSTNQ+SSSSQYIIEHF AAT
Subjt:  MNRLAPLSEEPIDEHDGRTRNRNRSTTAGS--GGRSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIEHFAAAT

Query:  GCRKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLR
        GCRKLKGRVKNIF TGKLTMGM DEV+S       GGGGPT GVA+KGCFVMWQMIPNKWLIEL+VGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLR
Subjt:  GCRKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLR

Query:  RAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKIGD
        RAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDL DRSDNTAEMIKH IYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKI D
Subjt:  RAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKIGD

Query:  YRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKDRTDGGLGLDVTAR
        YR +DGVMIAH GETNVVITRFGDDLKTGPMITR+QE W+IDDVAFNVPGLSMDSFIPPKQ+Q D+T+  LGLD TAR
Subjt:  YRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKDRTDGGLGLDVTAR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G27690.1 Protein of unknown function (DUF620); e value: 2.4e-82; % identity: 43.94
Query:  LAPLSEEPIDEHDGRTRNRNRSTTAGSGGRSWRNWIRTHLSIL------SCGKKSDGLNVLLSVLGCPLFPVSVQ-----PNCTVSSTNQISSSSQYIIE
        LAP+ E P    D    +   S       R W NW++  L +       S   K   L +LL VLG PL PV V      P+ ++ +T   +SS+QYI++
Subjt:  LAPLSEEPIDEHDGRTRNRNRSTTAGSGGRSWRNWIRTHLSIL------SCGKKSDGLNVLLSVLGCPLFPVSVQ-----PNCTVSSTNQISSSSQYIIE

Query:  HFAAATGCRKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKG
         + AA+G +KL   V+N +  G++   MA E  +G  G        +    + G FV+W M P+ W +EL +GG  ++AG DG + WRHTPWLG HAAKG
Subjt:  HFAAATGCRKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKG

Query:  AVRPLRRAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTM
         VRPLRRA QGLDP   + +F+ A+ +GEK+I   DCF+LKL AD   L  RS+  +E I+HT++GYF Q+ GLLV+LEDS LTRIQ+ G   +YWETT+
Subjt:  AVRPLRRAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTM

Query:  STKIGDYRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKD
        ++ + DY+PV+G+MIAHSG +   + RFGD        T +QE W ID+++FNVPGLS+D FIPP +++ D
Subjt:  STKIGDYRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKD

AT1G49840.1 Protein of unknown function (DUF620); e value: 1.9e-76; % identity: 43.28
Query:  RMNRLAPLSEEPIDEHDGRTRNRNRSTTAGSGGRSW--RNWIR--THLSILSCGKKSDGLNVLLSVLGCPLFPVSVQP-----NCTVSSTNQISSSSQYI
        R + L P+ E P D  +G     + S   GSG   W    W R  +  S     +KSD L +LL V+G PL P++V       + T+  +   +SS+QYI
Subjt:  RMNRLAPLSEEPIDEHDGRTRNRNRSTTAGSGGRSW--RNWIR--THLSILSCGKKSDGLNVLLSVLGCPLFPVSVQP-----NCTVSSTNQISSSSQYI

Query:  IEHFAAATGCRKLKGRVKNIFATGKLTMGMAD-EVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHA
        ++ + AA G  KL   +KN +A GKL M  ++ E  +G           TGG      FV+WQM P+ W +ELSVGG  + AG +G + WRHTPWLGSH 
Subjt:  IEHFAAATGCRKLKGRVKNIFATGKLTMGMAD-EVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHA

Query:  AKGAVRPLRRAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWE
        AKG VRPLRRA QGLDP   + +F+ ++ +GE+++   DCF+LKL  D   L  RS+  AE+++H ++GYF QR GLL  +EDS LTRIQS     +YWE
Subjt:  AKGAVRPLRRAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWE

Query:  TTMSTKIGDYRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQ
        TT+++ + DY+ V+G+MIAHSG + V + RFG ++      T+++E W+I++VAFNVPGLS+D FIPP  ++
Subjt:  TTMSTKIGDYRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQ

AT1G79420.1 Protein of unknown function (DUF620); e value: 3.3e-84; % identity: 45.27
Query:  LAPLSEEP-IDEHDGRTRNRNRSTTAGSGGRSWRNWIRTHLSILS----------CGK-----KSDGLNVLLSVLGCPLFPVSVQ-----PNCTVSSTNQ
        L PL E P  D  D RT+         S   + R W + H  I            C       K   L +LL VLGCPL P+SV      P+  +  + Q
Subjt:  LAPLSEEP-IDEHDGRTRNRNRSTTAGSGGRSWRNWIRTHLSILS----------CGK-----KSDGLNVLLSVLGCPLFPVSVQ-----PNCTVSSTNQ

Query:  I------SSSSQYIIEHFAAATGCRKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGN
        I      +S++ YII+ + AATGC K     KN++ATG + M   +   + G      GGG  G     GCFV+WQM P  W +EL +GG  +++GSDG 
Subjt:  I------SSSSQYIIEHFAAATGCRKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGN

Query:  VAWRHTPWLGSHAAKGAVRPLRRAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSD--NTAEMIKHTIYGYFCQRRGLLVYLEDSS
          WRHTPWLG+HAAKG  RPLRR  QGLDP   + +F+ AQ +GE++I   DCFVLK+SAD+  L +R+D    AE+I+H +YGYFCQ+ GLLVYLEDS 
Subjt:  VAWRHTPWLGSHAAKGAVRPLRRAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSD--NTAEMIKHTIYGYFCQRRGLLVYLEDSS

Query:  LTRIQ--SPGSHPMYWETTMSTKIGDYRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKD
        LTR+   SP    +YWETT+ T IGDYR VDGV +AH G     + RFG +       TR++EIW IDDV F+VPGLS+DSFIPP  + +D
Subjt:  LTRIQ--SPGSHPMYWETTMSTKIGDYRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKD

AT3G19540.1 Protein of unknown function (DUF620); e value: 2.2e-80; % identity: 42.9
Query:  RMNRLAPLSEEPIDEHDGRTRNRNRSTTAGSGGRSWRNWIRTHLS-----ILSCGKKSDGLNVLLSVLGCPLFPVSVQ-----PNCTVSSTNQISSSSQY
        R   L P+ E P  +  G   N   S   GSG     +W++  LS       +   + + L +LL V+G PL P+ V      P+ ++ +T   +SS+QY
Subjt:  RMNRLAPLSEEPIDEHDGRTRNRNRSTTAGSGGRSWRNWIRTHLS-----ILSCGKKSDGLNVLLSVLGCPLFPVSVQ-----PNCTVSSTNQISSSSQY

Query:  IIEHFAAATGCRKLKGRVKNIFATGKLTMGMAD-EVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSH
        I++ + AA+G +KL+  +KN +A GKL M  ++ E ++            TGG      FV+WQM P+ W +EL+VGG  + AG +G + WRHTPWLGSH
Subjt:  IIEHFAAATGCRKLKGRVKNIFATGKLTMGMAD-EVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSH

Query:  AAKGAVRPLRRAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYW
         AKG VRPLRR  QGLDP   + +F+ A+ +GEK++   DCF+LKL  D   L  RS+  AE+I+H ++GYF Q+ GLLV++EDS LTRIQS G   ++W
Subjt:  AAKGAVRPLRRAFQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYW

Query:  ETTMSTKIGDYRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQ
        ETT ++ + DYR V+G+MIAHSG + V + RFG ++ T    T+++E W+I++VAFNVPGLS+D FIPP  ++
Subjt:  ETTMSTKIGDYRPVDGVMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQ

AT5G06610.1 Protein of unknown function (DUF620); e value: 1.4e-119; % identity: 58.63
Query:  MNRLAPLSEEPIDEHDGRTRNRNRSTTAGSGGRSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIEHFAAATGC
        M RLAPL EEPIDE D       R  +  S  +SW+ WI+T L  +   KK D + +LLSV+GCPLFPV   P  +  S  Q+SSS+QYII+ FAAATGC
Subjt:  MNRLAPLSEEPIDEHDGRTRNRNRSTTAGSGGRSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIEHFAAATGC

Query:  RKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRA
        +KL G +KN F TGK+TM M  +++S               V+ KGCFVMWQM+P KWLIEL  GGH + AGSDG + WR+TPWLG HAAKGA+RPLRRA
Subjt:  RKLKGRVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRA

Query:  FQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKIGDYR
         QGLDPL IS VFS AQ++GEK+I   DCF+LKLS DQ DL+ RSD+TAEMIKH  +GYF Q+ GLL+ LEDSSLTRIQ PG+ P YWET+MS+ + DYR
Subjt:  FQGLDPLAISEVFSPAQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKIGDYR

Query:  PVDG--VMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKD
         ++G  V+IAHSG+T+V+I+RFG+ LK G  +TR++E W+IDDVAF+VPGLS+D FIPPK+++ D
Subjt:  PVDG--VMIAHSGETNVVITRFGDDLKTGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKD


Sequences
CDS sequence
ATGCAGTTGCAGAGGATGAATCGTCTAGCGCCATTATCGGAGGAGCCGATCGACGAGCACGACGGCCGGACTCGGAATCGCAATCGCAGCACCACCGCCGGCAGCGGCGG
ACGATCGTGGCGGAACTGGATCAGAACTCATTTGTCGATCCTTTCTTGTGGAAAAAAGTCCGATGGCCTTAATGTTCTCCTCAGCGTCCTCGGTTGCCCTCTCTTTCCGG
TCTCCGTTCAACCTAACTGTACCGTCTCCTCTACCAATCAGATTTCCTCGTCGTCTCAATATATCATTGAACATTTCGCGGCGGCGACGGGATGCCGGAAGTTGAAAGGG
AGAGTCAAGAACATATTTGCAACGGGGAAACTGACGATGGGGATGGCGGATGAGGTTAGCTCCGGCGGTGGTGGCGGAGGAGGAGGAGGAGGAGGACCCACCGGCGGGGT
AGCAAAAAAAGGTTGCTTTGTGATGTGGCAAATGATTCCGAATAAGTGGCTGATAGAGCTGTCTGTGGGAGGCCACAGCATTGTGGCCGGCAGCGATGGCAACGTCGCTT
GGAGGCACACACCTTGGCTTGGCTCTCACGCCGCTAAGGGCGCCGTCCGCCCTCTCCGCCGTGCTTTTCAGGGGCTGGATCCGCTAGCAATTTCCGAAGTATTCTCTCCG
GCGCAGTACATGGGAGAGAAGCAAATCATGGCCGTTGATTGCTTCGTACTAAAATTGTCAGCAGATCAGACGGACCTCGCCGACCGAAGCGACAACACAGCCGAGATGAT
CAAGCACACGATCTACGGCTACTTTTGTCAACGACGTGGACTTTTGGTCTACTTGGAAGACTCCTCGTTGACTCGGATTCAATCCCCTGGTTCTCATCCCATGTACTGGG
AAACGACTATGTCCACAAAAATCGGCGATTATCGACCGGTCGACGGCGTTATGATCGCCCACTCCGGCGAGACCAACGTTGTCATCACACGGTTCGGAGACGATCTCAAA
ACCGGTCCTATGATTACACGGTTGCAGGAGATTTGGAGTATTGATGACGTGGCGTTTAATGTTCCTGGATTGTCTATGGATAGTTTTATACCACCTAAGCAGGTTCAGAA
AGATCGGACAGATGGCGGGCTGGGATTGGATGTCACCGCTCGATGA
mRNA sequence
CTTTTTAAGTAATCTCCCACTCTCTTCTCTCTGCTTCTTCCAATTCCTCTTCTTCTCTCTCAGAAATTGCAAATGCAGTTGCAGAGGATGAATCGTCTAGCGCCATTATC
GGAGGAGCCGATCGACGAGCACGACGGCCGGACTCGGAATCGCAATCGCAGCACCACCGCCGGCAGCGGCGGACGATCGTGGCGGAACTGGATCAGAACTCATTTGTCGA
TCCTTTCTTGTGGAAAAAAGTCCGATGGCCTTAATGTTCTCCTCAGCGTCCTCGGTTGCCCTCTCTTTCCGGTCTCCGTTCAACCTAACTGTACCGTCTCCTCTACCAAT
CAGATTTCCTCGTCGTCTCAATATATCATTGAACATTTCGCGGCGGCGACGGGATGCCGGAAGTTGAAAGGGAGAGTCAAGAACATATTTGCAACGGGGAAACTGACGAT
GGGGATGGCGGATGAGGTTAGCTCCGGCGGTGGTGGCGGAGGAGGAGGAGGAGGAGGACCCACCGGCGGGGTAGCAAAAAAAGGTTGCTTTGTGATGTGGCAAATGATTC
CGAATAAGTGGCTGATAGAGCTGTCTGTGGGAGGCCACAGCATTGTGGCCGGCAGCGATGGCAACGTCGCTTGGAGGCACACACCTTGGCTTGGCTCTCACGCCGCTAAG
GGCGCCGTCCGCCCTCTCCGCCGTGCTTTTCAGGGGCTGGATCCGCTAGCAATTTCCGAAGTATTCTCTCCGGCGCAGTACATGGGAGAGAAGCAAATCATGGCCGTTGA
TTGCTTCGTACTAAAATTGTCAGCAGATCAGACGGACCTCGCCGACCGAAGCGACAACACAGCCGAGATGATCAAGCACACGATCTACGGCTACTTTTGTCAACGACGTG
GACTTTTGGTCTACTTGGAAGACTCCTCGTTGACTCGGATTCAATCCCCTGGTTCTCATCCCATGTACTGGGAAACGACTATGTCCACAAAAATCGGCGATTATCGACCG
GTCGACGGCGTTATGATCGCCCACTCCGGCGAGACCAACGTTGTCATCACACGGTTCGGAGACGATCTCAAAACCGGTCCTATGATTACACGGTTGCAGGAGATTTGGAG
TATTGATGACGTGGCGTTTAATGTTCCTGGATTGTCTATGGATAGTTTTATACCACCTAAGCAGGTTCAGAAAGATCGGACAGATGGCGGGCTGGGATTGGATGTCACCG
CTCGATGATCGACGGCCTTGATCGATGGGTTTAACACAATAAGCTTTTACTTAAAAAGAAAATATAGGGTGAGTATATGTTTTAATTCTTAATGTTTGGGTGTTTTTTTT
TTAATTTAGTCCCTTTTTTAAAAAAGTTTTGATTTAGTTTATTATGTTTTAACATTGGTCGCTGGTTGATGATGAATTTATATAACATAATTCTTTGTGAATTGGCAAAA
ATTTAGAAGAGAAACAAAGTCATCTAATAGAGGAAACTTTCTAATTTCTCTTCCAAGTTTCTACAAAATTGGTAGAAATTTTGAGTTTTCTTCTCTTATATGACTTTTTT
TCCTTATAATTTTTTTTTTCAATTCACTGAGTATTATGTCATGTAAGTTTGTCTTTTACTAATTTTAACGTTTATCAACCATGATTGACTAAATTGAAAAATTTTAAGAC
ATTAGAGACTAAGCTAAAACTTTTTAACCGATAGGGACTAAATTGAAAAAATATGAAAACATTAAGAATTAGAAAATATATTTAATCATATATATCAATTTTTTCTTTTT
CTTTTCCTCTATTTCCGTCATAAGAGATGTTTTTAATGAGATGGATGTTAGAAGTGAAATATGAGTAACATTTTTAAACAGTGAGTGTAATGAGGTTTTTTTCTTTACTT
CTTTTTTTGTATCCTAAATATTAACAACGTACCGTTTAAACGTTTGACTGAGTTATGTGATT
Protein sequence
MQLQRMNRLAPLSEEPIDEHDGRTRNRNRSTTAGSGGRSWRNWIRTHLSILSCGKKSDGLNVLLSVLGCPLFPVSVQPNCTVSSTNQISSSSQYIIEHFAAATGCRKLKG
RVKNIFATGKLTMGMADEVSSGGGGGGGGGGGPTGGVAKKGCFVMWQMIPNKWLIELSVGGHSIVAGSDGNVAWRHTPWLGSHAAKGAVRPLRRAFQGLDPLAISEVFSP
AQYMGEKQIMAVDCFVLKLSADQTDLADRSDNTAEMIKHTIYGYFCQRRGLLVYLEDSSLTRIQSPGSHPMYWETTMSTKIGDYRPVDGVMIAHSGETNVVITRFGDDLK
TGPMITRLQEIWSIDDVAFNVPGLSMDSFIPPKQVQKDRTDGGLGLDVTAR
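
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch, not the method CuGenDB uses: it assumes the standard nuclear genetic code, and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration. Only the first 30 nucleotides of the CDS are used in the example.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
# ASSUMPTION: the standard (nuclear) code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 30 nucleotides of the CDS sequence in this record.
cds_start = "ATGCAGTTGCAGAGGATGAATCGTCTAGCG"
print(translate(cds_start))  # MQLQRMNRLA
```

The printed residues match the start of the protein sequence in this record (MQLQRMNRLA). Passing the full CDS, with line breaks removed, would allow the same comparison over the whole protein.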