CuGenDBv2

Lsi02G020330 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi02G020330
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Protein of unknown function (DUF616)
Genome location: chr02:26357699..26363455
RNA-Seq Expression: Lsi02G020330
Synteny: Lsi02G020330
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR006852 - Protein of unknown function DUF616
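
The genome location field uses a `chromosome:start..end` string. As a minimal sketch, the snippet below splits that string and computes the span of the gene on the chromosome. The function name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr02:26357699..26363455")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)  # chr02 26357699 26363455 5757
```

The span covers the whole gene model on the chromosome, so it is expected to be longer than the CDS whenever the gene contains introns.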


Homology
GenBank top hits (e value, % identity, alignment)
XP_016902799.1 PREDICTED: uncharacterized protein LOC103500457 isoform X1 [Cucumis melo] (e value: 5.8e-217; % identity: 93.94)
Query:  SQTRYEPRFGRNVMMGLFGNNNEQLPERRGIVSGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDKIGDSMSVSEV
        S TRYEPRFG NVMMGLFGNNNEQLPERRG+VSGFFKS GKTEYGSRAVRRGRRLGR+TRKRFSCLFLT+ TSFLLYLAVFGLKVNFYD IGDS SVSEV
Subjt:  SQTRYEPRFGRNVMMGLFGNNNEQLPERRGIVSGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDKIGDSMSVSEV

Query:  QGEHLASRDQRPPTSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGY
        Q EHL SRDQRPPTSKHRRKQHFPCDVEFA+SVAYLVEP GFMNVTQFSLEFIEHEEK SETDLYMPRFGGHQTLEERE SFYATNQKLHCGFIKGPPGY
Subjt:  QGEHLASRDQRPPTSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGY

Query:  PSTGFDLDEKDDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPK
        PSTGFDLDEKD AYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVD+QTLSKLS+EGNIPDDKG IGLWKIVVVRNLPYEDMRRTGKVPK
Subjt:  PSTGFDLDEKDDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPK

Query:  FLSHRLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPS
        FLSHRLFPSARYSIWLDSKMRLQ+DPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDP ++ SGLPS
Subjt:  FLSHRLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPS

XP_031745279.1 uncharacterized protein LOC101206756 isoform X1 [Cucumis sativus] (e value: 5.4e-215; % identity: 93.16)
Query:  RYEPRFGRNVMMGLFGNNNEQLPERRGIVSGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDKIGDSMSVSEVQGE
        RYE RFG NVMMGLFGNNNEQLPERRGIVSGFFKS GKTEYGSRAVRRGRRLGR+TRKRFSCLF TL TSFLLYLAVFGLKVNFYD IGDS SVSEV+ E
Subjt:  RYEPRFGRNVMMGLFGNNNEQLPERRGIVSGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDKIGDSMSVSEVQGE

Query:  HLASRDQRPPTSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGYPST
        HL SRDQRPPTSKHRRKQHFPCDVEFA+SVAYLVEPEGFMNVTQFSLEFIE EEK  E DL+MPRFGGHQTLEERE SFYATNQKLHCGFIKGPPGYPST
Subjt:  HLASRDQRPPTSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGYPST

Query:  GFDLDEKDDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPKFLS
        GFDLDEKDDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVD+QTLSKLSAEGNIPDDKGCIGLWKIVVV NLPYEDMRRTGKVPKFLS
Subjt:  GFDLDEKDDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPKFLS

Query:  HRLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPSRE
        HRLFPSARYSIWLDSKMRLQ+DPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDP ++ SGLPS +
Subjt:  HRLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPSRE

XP_031745282.1 uncharacterized protein LOC101206756 isoform X5 [Cucumis sativus] (e value: 5.4e-215; % identity: 93.16)
Query:  RYEPRFGRNVMMGLFGNNNEQLPERRGIVSGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDKIGDSMSVSEVQGE
        RYE RFG NVMMGLFGNNNEQLPERRGIVSGFFKS GKTEYGSRAVRRGRRLGR+TRKRFSCLF TL TSFLLYLAVFGLKVNFYD IGDS SVSEV+ E
Subjt:  RYEPRFGRNVMMGLFGNNNEQLPERRGIVSGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDKIGDSMSVSEVQGE

Query:  HLASRDQRPPTSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGYPST
        HL SRDQRPPTSKHRRKQHFPCDVEFA+SVAYLVEPEGFMNVTQFSLEFIE EEK  E DL+MPRFGGHQTLEERE SFYATNQKLHCGFIKGPPGYPST
Subjt:  HLASRDQRPPTSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGYPST

Query:  GFDLDEKDDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPKFLS
        GFDLDEKDDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVD+QTLSKLSAEGNIPDDKGCIGLWKIVVV NLPYEDMRRTGKVPKFLS
Subjt:  GFDLDEKDDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPKFLS

Query:  HRLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPSRE
        HRLFPSARYSIWLDSKMRLQ+DPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDP ++ SGLPS +
Subjt:  HRLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPSRE

XP_038899595.1 uncharacterized protein LOC120086852 isoform X1 [Benincasa hispida] (e value: 6.6e-221; % identity: 95.66)
Query:  YEPRFGRNVMMGLFGNNNEQLPERRGIVSGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDKIGDSMSVSEVQGEH
        YEPRFG  VMMGLFGNNNEQLPERR IVSGFFKSFGK EYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLK+NFYDK GDSMSVSEVQGEH
Subjt:  YEPRFGRNVMMGLFGNNNEQLPERRGIVSGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDKIGDSMSVSEVQGEH

Query:  LASRDQRPPTSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGYPSTG
        L SRDQRPPTSKHRRKQHFPCDVEFA+SVAYLVEPEGFMNVT+FSLEFIEHEEKPSET LY PRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGYPSTG
Subjt:  LASRDQRPPTSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGYPSTG

Query:  FDLDEKDDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPKFLSH
        FDLDEKDDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKL+AEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPKFLSH
Subjt:  FDLDEKDDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPKFLSH

Query:  RLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPS
        RLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDP +++S LPS
Subjt:  RLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPS

XP_038899597.1 uncharacterized protein LOC120086852 isoform X2 [Benincasa hispida] (e value: 9.9e-217; % identity: 96.08)
Query:  MMGLFGNNNEQLPERRGIVSGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDKIGDSMSVSEVQGEHLASRDQRPP
        MMGLFGNNNEQLPERR IVSGFFKSFGK EYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLK+NFYDK GDSMSVSEVQGEHL SRDQRPP
Subjt:  MMGLFGNNNEQLPERRGIVSGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDKIGDSMSVSEVQGEHLASRDQRPP

Query:  TSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGYPSTGFDLDEKDDA
        TSKHRRKQHFPCDVEFA+SVAYLVEPEGFMNVT+FSLEFIEHEEKPSET LY PRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGYPSTGFDLDEKDDA
Subjt:  TSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGYPSTGFDLDEKDDA

Query:  YMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFPSARYS
        YMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKL+AEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFPSARYS
Subjt:  YMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFPSARYS

Query:  IWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPS
        IWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDP +++S LPS
Subjt:  IWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K521 Uncharacterized protein (e value: 4.5e-215; % identity: 93.64)
Query:  RYEPRFGRNVMMGLFGNNNEQLPERRGIVSGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDKIGDSMSVSEVQGE
        RYE RFG NVMMGLFGNNNEQLPERRGIVSGFFKS GKTEYGSRAVRRGRRLGR+TRKRFSCLF TL TSFLLYLAVFGLKVNFYD IGDS SVSEV+ E
Subjt:  RYEPRFGRNVMMGLFGNNNEQLPERRGIVSGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDKIGDSMSVSEVQGE

Query:  HLASRDQRPPTSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGYPST
        HL SRDQRPPTSKHRRKQHFPCDVEFA+SVAYLVEPEGFMNVTQFSLEFIE EEK  E DL+MPRFGGHQTLEERE SFYATNQKLHCGFIKGPPGYPST
Subjt:  HLASRDQRPPTSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGYPST

Query:  GFDLDEKDDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPKFLS
        GFDLDEKDDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVD+QTLSKLSAEGNIPDDKGCIGLWKIVVV NLPYEDMRRTGKVPKFLS
Subjt:  GFDLDEKDDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPKFLS

Query:  HRLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPS
        HRLFPSARYSIWLDSKMRLQ+DPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDP ++ SGLPS
Subjt:  HRLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPS

A0A1S4E3K1 uncharacterized protein LOC103500457 isoform X2 (e value: 1.5e-210; % identity: 94.26)
Query:  MMGLFGNNNEQLPERRGIVSGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDKIGDSMSVSEVQGEHLASRDQRPP
        MMGLFGNNNEQLPERRG+VSGFFKS GKTEYGSRAVRRGRRLGR+TRKRFSCLFLT+ TSFLLYLAVFGLKVNFYD IGDS SVSEVQ EHL SRDQRPP
Subjt:  MMGLFGNNNEQLPERRGIVSGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDKIGDSMSVSEVQGEHLASRDQRPP

Query:  TSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGYPSTGFDLDEKDDA
        TSKHRRKQHFPCDVEFA+SVAYLVEP GFMNVTQFSLEFIEHEEK SETDLYMPRFGGHQTLEERE SFYATNQKLHCGFIKGPPGYPSTGFDLDEKD A
Subjt:  TSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGYPSTGFDLDEKDDA

Query:  YMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFPSARYS
        YMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVD+QTLSKLS+EGNIPDDKG IGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFPSARYS
Subjt:  YMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFPSARYS

Query:  IWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPS
        IWLDSKMRLQ+DPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDP ++ SGLPS
Subjt:  IWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPS

A0A1S4E493 uncharacterized protein LOC103500457 isoform X1 (e value: 2.8e-217; % identity: 93.94)
Query:  SQTRYEPRFGRNVMMGLFGNNNEQLPERRGIVSGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDKIGDSMSVSEV
        S TRYEPRFG NVMMGLFGNNNEQLPERRG+VSGFFKS GKTEYGSRAVRRGRRLGR+TRKRFSCLFLT+ TSFLLYLAVFGLKVNFYD IGDS SVSEV
Subjt:  SQTRYEPRFGRNVMMGLFGNNNEQLPERRGIVSGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDKIGDSMSVSEV

Query:  QGEHLASRDQRPPTSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGY
        Q EHL SRDQRPPTSKHRRKQHFPCDVEFA+SVAYLVEP GFMNVTQFSLEFIEHEEK SETDLYMPRFGGHQTLEERE SFYATNQKLHCGFIKGPPGY
Subjt:  QGEHLASRDQRPPTSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGY

Query:  PSTGFDLDEKDDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPK
        PSTGFDLDEKD AYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVD+QTLSKLS+EGNIPDDKG IGLWKIVVVRNLPYEDMRRTGKVPK
Subjt:  PSTGFDLDEKDDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPK

Query:  FLSHRLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPS
        FLSHRLFPSARYSIWLDSKMRLQ+DPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDP ++ SGLPS
Subjt:  FLSHRLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPS

A0A6J1GIF1 uncharacterized protein LOC111454452 (e value: 4.5e-207; % identity: 88.72)
Query:  SILSQTRYEPRFGRNVMMGLFGNNNEQLPERRGIVSGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDKIGDSMSV
        S L QTRY+ RFG NVMM LFGNNNEQLPERR  VSGFFKSF KT +GSR VRRGRRLGRTTR +FSCLFLTLTTSFLLYLAVFGLKVNF+ KIG+S+SV
Subjt:  SILSQTRYEPRFGRNVMMGLFGNNNEQLPERRGIVSGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDKIGDSMSV

Query:  SEVQGEHLASRDQRPPTSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGP
        +EVQGEHL SRDQ+PPT+KH RKQH PCDVEFA+SV YLVEPEGFMNV QFSLEF+E EE+ SETDLY PRFGGHQTLEEREKSFYATNQKLHCGF+KGP
Subjt:  SEVQGEHLASRDQRPPTSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGP

Query:  PGYPSTGFDLDEKDDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGK
        PGYPSTGFDLDEKD+AYMK CKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGN  DDKG IG+WKIVVVRNLPY+DMRRTGK
Subjt:  PGYPSTGFDLDEKDDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGK

Query:  VPKFLSHRLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPS
        VPKFLSHRLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDP N  + LPS
Subjt:  VPKFLSHRLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPS

A0A6J1ICN5 uncharacterized protein LOC111471866 (e value: 2.0e-202; % identity: 90.08)
Query:  MMGLFGNNNEQLPERRGIVSGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDKIGDSMSVSEVQGEHLASRDQRPP
        MM LFGNNNEQLPERR  VSGFFKSF KT +GSRAVRRGRRLGRTTR +FSCLFLTLTTSFLLYLAVFGLKVNF+ KIG+S+SV+EVQGEHL SRDQ+PP
Subjt:  MMGLFGNNNEQLPERRGIVSGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDKIGDSMSVSEVQGEHLASRDQRPP

Query:  TSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGYPSTGFDLDEKDDA
        T+KH RKQH PCDVEFA+SV YLVEPEGFMNV QFSLEF+E EE+ SETDLY PRFGGHQTLEEREKSFYATNQKLHCGF+KGPPGYPSTGFDLDEKD+A
Subjt:  TSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGYPSTGFDLDEKDDA

Query:  YMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFPSARYS
        YMK CKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGN PDDKG IG+WKIVVVRNLPY+DMRRTGKVPKFLSHRLFPSARYS
Subjt:  YMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFPSARYS

Query:  IWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPS
        IWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDP N  + LPS
Subjt:  IWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPS

SwissProt top hits (e value, % identity, alignment)
Q9FZ97 Probable hexosyltransferase MUCI70 (e value: 1.3e-57; % identity: 42.07)
Query:  QGEHLASRDQRPPTSKHRRKQHFPCDVEF---ADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSF-YATNQKLHCGFIKG
        QG    S    PP +  +R    PC V +    ++VA +     F  V + +L +I  E    ET+     FGG+ TL+ R  SF       +HCGF+KG
Subjt:  QGEHLASRDQRPPTSKHRRKQHFPCDVEF---ADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSF-YATNQKLHCGFIKG

Query:  PPGYPSTGFDLDEKDDAYMKTCK-VAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRT
        P    +TGFD+DE D   MK C+ + V+S +F + D ++ P  + IS+Y+++ VCF MFVDE+T S L  E  +  +K  +G+W++VVV NLPY D RR 
Subjt:  PPGYPSTGFDLDEKDDAYMKTCK-VAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRT

Query:  GKVPKFLSHRLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKF
        GKVPK L HR+FP+ARYS+W+D K+ L +DP  I+E FLWRK + +AIS HY R  V  E + NK   KY++ +ID Q  FY+++GL  +
Subjt:  GKVPKFLSHRLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKF

Arabidopsis top hits (e value, % identity, alignment)
AT1G28240.1 Protein of unknown function (DUF616) (e value: 9.0e-59; % identity: 42.07)
Query:  QGEHLASRDQRPPTSKHRRKQHFPCDVEF---ADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSF-YATNQKLHCGFIKG
        QG    S    PP +  +R    PC V +    ++VA +     F  V + +L +I  E    ET+     FGG+ TL+ R  SF       +HCGF+KG
Subjt:  QGEHLASRDQRPPTSKHRRKQHFPCDVEF---ADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSF-YATNQKLHCGFIKG

Query:  PPGYPSTGFDLDEKDDAYMKTCK-VAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRT
        P    +TGFD+DE D   MK C+ + V+S +F + D ++ P  + IS+Y+++ VCF MFVDE+T S L  E  +  +K  +G+W++VVV NLPY D RR 
Subjt:  PPGYPSTGFDLDEKDDAYMKTCK-VAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRT

Query:  GKVPKFLSHRLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKF
        GKVPK L HR+FP+ARYS+W+D K+ L +DP  I+E FLWRK + +AIS HY R  V  E + NK   KY++ +ID Q  FY+++GL  +
Subjt:  GKVPKFLSHRLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKF

AT1G34550.1 Protein of unknown function (DUF616) (e value: 4.9e-105; % identity: 58.9)
Query:  IGDSMSVSEVQGEHLASRDQRPPTSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLH
        +GDS  VS  +G       +     + RR+    C+++  +S   +VEP       +FSL++IE E+KP E + + PRF GHQ+L+ERE SF A ++K+H
Subjt:  IGDSMSVSEVQGEHLASRDQRPPTSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLH

Query:  CGFIKGPPGYPSTGFDLDEKDDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYE
        CGF+KGP G  STGFDL E D  Y+  C +AVSSCIFG+SD LR P +K IS  S+KNVCF++FVDE T+  LSAEG+ PD  G IGLWK+VVV+NLPY 
Subjt:  CGFIKGPPGYPSTGFDLDEKDDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYE

Query:  DMRRTGKVPKFLSHRLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKF---DP
        DMRR GK+PK L HRLFPSARYSIWLDSK+RLQ+DP+LI+EYFLWRK  EYAISNHYDRHC+WEEV QNK+LNKYNHT I++QF FY++DGL +F   DP
Subjt:  DMRRTGKVPKFLSHRLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKF---DP

Query:  FNLRSGLPS
        F L   LPS
Subjt:  FNLRSGLPS

AT1G53040.1 Protein of unknown function (DUF616) (e value: 1.1e-53; % identity: 41.61)
Query:  PPTSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQF--------SLEFIEHEE--KPSETDLYMPRFGGHQTLEEREKSF-YATNQKLHCGFIKGPPGY
        PP    RR    PC       V YL   E   ++ ++        +L +I  E   KP E++     FGG+ +LE R  SF    +  +HCGFIKG    
Subjt:  PPTSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQF--------SLEFIEHEE--KPSETDLYMPRFGGHQTLEEREKSF-YATNQKLHCGFIKGPPGY

Query:  PSTGFDLDEKD-DAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVP
          TGFD+DE       ++  V V+S IFG  D ++ P +  ISE ++KN+ F MFVDE+T   L    +  DD   +GLW+I+VV N+PY D RR GKVP
Subjt:  PSTGFDLDEKD-DAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVP

Query:  KFLSHRLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKF
        K L HRLFP+ RYSIW+D+K++L +DP  I+E FLWR  S +AIS HY R  V+ E + NK   KY++ +ID Q  FY+ +GL  +
Subjt:  KFLSHRLFPSARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKF

AT2G02910.1 Protein of unknown function (DUF616) (e value: 1.4e-120; % identity: 57.73)
Query:  LPERRGIV----SGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDK-----------IGDSMSV-SEVQGEHLASR
        LPERR  V    S +    G  +     +RRG++ GR  + + S  FL   + F       G K+ F+             +   M V SE+      S 
Subjt:  LPERRGIV----SGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDK-----------IGDSMSV-SEVQGEHLASR

Query:  DQRPPTSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGYPSTGFDLD
         +RPP SK R K H PC+V  A+S   ++EP+ ++N T+FSL F+E E   +      PRFGGHQTL ERE+S+ A NQ +HCGF+KG      TGFDL 
Subjt:  DQRPPTSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGYPSTGFDLD

Query:  EKDDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFP
        EKD AYMK C V+VSSCIFGSSDFLRRP +K+ISE+SK+NVCFVMFVDEQTLSKL++EG++PD +G +GLWK VVV NLPY DMR+TGKVPKFLSHRLFP
Subjt:  EKDDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFP

Query:  SARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPS
        S+RYSIWLDSKMRL  DPMLII++FLWR KSE+AISNHYDRHCVW+EV QNKRLNKYNH+AIDEQF FY+SDGL KFDP +  S LPS
Subjt:  SARYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPS

AT4G09630.1 Protein of unknown function (DUF616) (e value: 2.1e-100; % identity: 58.39)
Query:  SKHRRKQH----FPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGYPSTGFDLDEK
        +K R + H      C+++  +S   + EP    N    SL++I+ E+KP   + + P+F GHQ+L+ERE SF    QK+HCGF+K P G PSTGFDL E 
Subjt:  SKHRRKQH----FPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGYPSTGFDLDEK

Query:  DDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFPSA
        D  Y+  C +AV SCIFG+SD LR P +K +S  S+K+VCFV+FVDE T+  LSAEG +PD  G +GLWK+VVVRNLPY DMRR GK+PK L HRLF SA
Subjt:  DDAYMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFPSA

Query:  RYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPS
        RYSIWLDSK+RLQ+DP++I+EYFLWR+  EYAISNHYDRHC+WEEV QNK+LNKYNHT ID+QF FYQSDGL +F+  +    LPS
Subjt:  RYSIWLDSKMRLQIDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPS
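
In the alignment blocks above, the unlabeled middle line repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. As a rough sketch of how an identity figure relates to such a block, the hypothetical helper below counts identical columns in a single query/midline pair. It works on one block only, so it is not expected to reproduce the whole-alignment % identity values reported for each hit.

```python
def block_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical columns, block length) for one alignment block.

    Both strings must already have the 'Query:' label and the leading
    padding removed, so that column i of each string lines up.
    """
    length = len(query)
    # Trailing blanks on the midline are often stripped; pad them back.
    midline = midline.ljust(length)
    identical = sum(
        1 for q, m in zip(query, midline) if m == q and q != "-"
    )
    return identical, length


# Last block of the AT1G34550.1 alignment.
identical, length = block_identity("FNLRSGLPS", "F L   LPS")
print(identical, length)  # 5 9
print(f"{100 * identical / length:.1f}%")  # 55.6%
```

Summing the two counts over every block of a hit, and then dividing, would give a per-hit figure; whether that matches the database's own definition of % identity is an assumption that would need checking.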


Sequences
CDS sequence
ATGTTGATCACCTCCATCCTTTCCCAAACCAGATATGAACCGAGATTTGGTCGCAATGTTATGATGGGTCTGTTTGGGAACAATAATGAGCAGTTACCCGAAAGAAGAGG
CATTGTTTCTGGATTTTTTAAATCCTTTGGCAAAACGGAATATGGAAGTAGGGCTGTCCGTCGAGGAAGAAGGCTTGGTCGAACTACAAGAAAAAGATTCTCATGTTTGT
TTTTGACGCTTACAACTTCCTTCTTGTTGTATTTGGCGGTCTTTGGCTTGAAAGTGAACTTTTATGATAAAATTGGAGATAGCATGTCAGTGTCAGAAGTACAAGGAGAA
CATTTGGCATCCAGAGACCAGAGACCACCAACTAGCAAGCATCGTCGCAAGCAACATTTCCCTTGCGATGTTGAGTTTGCGGATTCAGTTGCCTACCTTGTGGAGCCTGA
GGGTTTCATGAATGTTACTCAGTTCTCCTTAGAGTTCATAGAGCACGAGGAAAAACCATCTGAAACCGATTTATATATGCCTAGATTTGGAGGACATCAAACCCTCGAGG
AGAGAGAGAAATCATTTTATGCAACAAATCAAAAACTTCATTGTGGTTTTATTAAAGGACCACCAGGATACCCAAGTACGGGATTTGATTTAGATGAAAAGGATGATGCA
TACATGAAAACATGCAAGGTTGCAGTTTCCTCGTGCATTTTTGGGAGCTCTGATTTTCTGAGGCGGCCTACTAGTAAACAGATCAGTGAGTATTCCAAGAAGAATGTATG
TTTTGTCATGTTTGTGGATGAACAAACACTATCAAAATTATCAGCTGAGGGAAATATTCCTGATGATAAGGGATGCATTGGACTGTGGAAGATAGTAGTTGTGAGGAATT
TACCTTACGAAGATATGCGCAGAACTGGAAAGGTGCCTAAATTTTTGTCACATCGCCTTTTCCCTTCTGCTAGGTATTCAATATGGCTAGACAGCAAAATGCGTCTTCAG
ATAGATCCAATGCTAATTATTGAATATTTTCTGTGGCGAAAGAAGTCAGAGTATGCAATTTCAAATCACTATGACCGCCATTGTGTCTGGGAGGAAGTACAGCAAAATAA
ACGTTTGAATAAGTACAATCATACTGCCATTGATGAACAGTTTGCTTTTTATCAGTCTGATGGCCTTGTGAAGTTTGACCCTTTTAATCTCAGGAGCGGTCTTCCTAGTC
GTGAGTGCTCCATGCATTTTACTGTGGAGTGCTTCATTTACTTCTGCATGCTTTTATAG
mRNA sequence
ATGTTGATCACCTCCATCCTTTCCCAAACCAGATATGAACCGAGATTTGGTCGCAATGTTATGATGGGTCTGTTTGGGAACAATAATGAGCAGTTACCCGAAAGAAGAGG
CATTGTTTCTGGATTTTTTAAATCCTTTGGCAAAACGGAATATGGAAGTAGGGCTGTCCGTCGAGGAAGAAGGCTTGGTCGAACTACAAGAAAAAGATTCTCATGTTTGT
TTTTGACGCTTACAACTTCCTTCTTGTTGTATTTGGCGGTCTTTGGCTTGAAAGTGAACTTTTATGATAAAATTGGAGATAGCATGTCAGTGTCAGAAGTACAAGGAGAA
CATTTGGCATCCAGAGACCAGAGACCACCAACTAGCAAGCATCGTCGCAAGCAACATTTCCCTTGCGATGTTGAGTTTGCGGATTCAGTTGCCTACCTTGTGGAGCCTGA
GGGTTTCATGAATGTTACTCAGTTCTCCTTAGAGTTCATAGAGCACGAGGAAAAACCATCTGAAACCGATTTATATATGCCTAGATTTGGAGGACATCAAACCCTCGAGG
AGAGAGAGAAATCATTTTATGCAACAAATCAAAAACTTCATTGTGGTTTTATTAAAGGACCACCAGGATACCCAAGTACGGGATTTGATTTAGATGAAAAGGATGATGCA
TACATGAAAACATGCAAGGTTGCAGTTTCCTCGTGCATTTTTGGGAGCTCTGATTTTCTGAGGCGGCCTACTAGTAAACAGATCAGTGAGTATTCCAAGAAGAATGTATG
TTTTGTCATGTTTGTGGATGAACAAACACTATCAAAATTATCAGCTGAGGGAAATATTCCTGATGATAAGGGATGCATTGGACTGTGGAAGATAGTAGTTGTGAGGAATT
TACCTTACGAAGATATGCGCAGAACTGGAAAGGTGCCTAAATTTTTGTCACATCGCCTTTTCCCTTCTGCTAGGTATTCAATATGGCTAGACAGCAAAATGCGTCTTCAG
ATAGATCCAATGCTAATTATTGAATATTTTCTGTGGCGAAAGAAGTCAGAGTATGCAATTTCAAATCACTATGACCGCCATTGTGTCTGGGAGGAAGTACAGCAAAATAA
ACGTTTGAATAAGTACAATCATACTGCCATTGATGAACAGTTTGCTTTTTATCAGTCTGATGGCCTTGTGAAGTTTGACCCTTTTAATCTCAGGAGCGGTCTTCCTAGTC
GTGAGTGCTCCATGCATTTTACTGTGGAGTGCTTCATTTACTTCTGCATGCTTTTATAG
Protein sequence
MLITSILSQTRYEPRFGRNVMMGLFGNNNEQLPERRGIVSGFFKSFGKTEYGSRAVRRGRRLGRTTRKRFSCLFLTLTTSFLLYLAVFGLKVNFYDKIGDSMSVSEVQGE
HLASRDQRPPTSKHRRKQHFPCDVEFADSVAYLVEPEGFMNVTQFSLEFIEHEEKPSETDLYMPRFGGHQTLEEREKSFYATNQKLHCGFIKGPPGYPSTGFDLDEKDDA
YMKTCKVAVSSCIFGSSDFLRRPTSKQISEYSKKNVCFVMFVDEQTLSKLSAEGNIPDDKGCIGLWKIVVVRNLPYEDMRRTGKVPKFLSHRLFPSARYSIWLDSKMRLQ
IDPMLIIEYFLWRKKSEYAISNHYDRHCVWEEVQQNKRLNKYNHTAIDEQFAFYQSDGLVKFDPFNLRSGLPSRECSMHFTVECFIYFCMLL
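
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The sketch below is one way to do that with the standard genetic code; the `translate` function and the table construction are illustrative and not part of the database, and it assumes the CDS starts in frame at its first base. Only the first and last few codons of the CDS are used here as a spot check.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGTTGATCACCTCCATCCTTTCC"))  # MLITSILS
# Last five codons of the CDS, ending in the TAG stop codon.
print(translate("TGCATGCTTTTATAG"))  # CMLL
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence would extend this spot check to the whole record.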