; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh01G018280 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh01G018280
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationCmo_Chr01:13373568..13375387
RNA-Seq ExpressionCmoCh01G018280
SyntenyCmoCh01G018280
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608337.1 hypothetical protein SDJN03_01679, partial [Cucurbita argyrosperma subsp. sororia]7.9e-16968.2Show/hide
Query:  MRNTEHGGKLLAMVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVS
        MRNT H GKLLAMVA AIAAAILQ+HAAIP++NNSQQLS QI+ KLKLLNKPALHTIYS+DGDIIDCVDIYKQPAFDHPALKNHTIQ+            
Subjt:  MRNTEHGGKLLAMVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVS

Query:  LVIIFTLLTNACIYAYRWNPIGALIGRCRLSKTSHFKYGKEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVPNSGQK--STTSILYTAGYNYIG
                     +   W        +  + +   F+  +  G  P                +    +  + F  E    G++   +T+ILYTAG+NYIG
Subjt:  LVIIFTLLTNACIYAYRWNPIGALIGRCRLSKTSHFKYGKEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVPNSGQK--STTSILYTAGYNYIG

Query:  ASGQINVWNPKVDLPNDFTASRIWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQF
        A+GQINVWNPKVDLPNDFTASRIWLKNGPSE FES+EAGWMVNRRLYGDTKTR SVHWTVDSYKSTGCFDLTCSGFVQTNPK+VLGA+IDP+STRGGQQF
Subjt:  ASGQINVWNPKVDLPNDFTASRIWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQF

Query:  IISVGMFQDPRSRNWWLNVQGWPVGYWPPTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTW
        II+VG+FQDP+S NWWLN+QG PVGYWPPTLFGYLR+SATLVEWGGEVFSS++KKVPHT T MGSGDYAG HY++ASYV  PRIVD SLQLKYP RVGTW
Subjt:  IISVGMFQDPRSRNWWLNVQGWPVGYWPPTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTW

Query:  ANEPFCYSADNYQRTYATEPVFFYGGPGRSRDCH
        A+E  CYS DNY+ T  TEPVFFYGGPGRSRDCH
Subjt:  ANEPFCYSADNYQRTYATEPVFFYGGPGRSRDCH

KAG6608391.1 hypothetical protein SDJN03_01733, partial [Cucurbita argyrosperma subsp. sororia]1.8e-16867.74Show/hide
Query:  MRNTEHGGKLLAMVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVS
        MRNT H GKLLAMVA AIAAAILQ+HAAIP++NNSQQLS QI+ KLKLLNKPALHTIYS+DGDIIDCVDIYKQPAFDHPALKNHTIQ+            
Subjt:  MRNTEHGGKLLAMVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVS

Query:  LVIIFTLLTNACIYAYRWNPIGALIGRCRLSKTSHFKYGKEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVPNSGQK--STTSILYTAGYNYIG
                           P   +  +  + +   F+  +  G  P           +    +    +  + F  E    G++   +T+ILYTAG+NYIG
Subjt:  LVIIFTLLTNACIYAYRWNPIGALIGRCRLSKTSHFKYGKEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVPNSGQK--STTSILYTAGYNYIG

Query:  ASGQINVWNPKVDLPNDFTASRIWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQF
        A+GQINVWNPKVDLPNDFTASRIWLKNGPSE FES+EAGWMVNRRLYGDTKTR SVHWTVDSYKS GCFDLTCSGFVQTNPK+VLGA+IDP+STRGGQQF
Subjt:  ASGQINVWNPKVDLPNDFTASRIWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQF

Query:  IISVGMFQDPRSRNWWLNVQGWPVGYWPPTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTW
        II+VG+FQDP+S NWWLN+QG PVGYWPPTLFGYLR+SATLVEWGGEVFSS++KKVPHT T MGSGDYAG HY++ASYV  PRI+D SLQLKYP RVGTW
Subjt:  IISVGMFQDPRSRNWWLNVQGWPVGYWPPTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTW

Query:  ANEPFCYSADNYQRTYATEPVFFYGGPGRSRDCH
        A+E  CYS DNY+ T  TEPVFFYGGPGRSRDCH
Subjt:  ANEPFCYSADNYQRTYATEPVFFYGGPGRSRDCH

KAG7037688.1 hypothetical protein SDJN02_01318 [Cucurbita argyrosperma subsp. argyrosperma]3.9e-16867.51Show/hide
Query:  MRNTEHGGKLLAMVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVS
        MRNT H GKLLAMVA AIAAAILQ+HAAIP++NNSQQLS QI+ KLKLLNKPALHTIYS+DGDIIDCVDIYKQPAFDHPALKNHTIQ+            
Subjt:  MRNTEHGGKLLAMVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVS

Query:  LVIIFTLLTNACIYAYRWNPIGALIGRCRLSKTSHFKYGKEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVPNSGQK--STTSILYTAGYNYIG
                           P   +  +  + +   F+  +  G  P           +    +    +  + F  E    G++   +T+ILYTAG+NYIG
Subjt:  LVIIFTLLTNACIYAYRWNPIGALIGRCRLSKTSHFKYGKEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVPNSGQK--STTSILYTAGYNYIG

Query:  ASGQINVWNPKVDLPNDFTASRIWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQF
        A+GQINVWNPK+DLPNDFTASRIWLKNGPSE FES+EAGWMVNRRLYGDTKTR SVHWTVDSYKS GCFDLTCSGFVQTNPK+VLGA+IDP+STRGGQQF
Subjt:  ASGQINVWNPKVDLPNDFTASRIWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQF

Query:  IISVGMFQDPRSRNWWLNVQGWPVGYWPPTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTW
        II+VG+FQDP+S NWWLN+QG PVGYWPPTLFGYLR+SATLVEWGGEVFSS++KKVPHT T MGSGDYAG HY++ASYV  PRI+D SLQLKYP RVGTW
Subjt:  IISVGMFQDPRSRNWWLNVQGWPVGYWPPTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTW

Query:  ANEPFCYSADNYQRTYATEPVFFYGGPGRSRDCH
        A+E  CYS DNY+ T  TEPVFFYGGPGRSRDCH
Subjt:  ANEPFCYSADNYQRTYATEPVFFYGGPGRSRDCH

XP_022941230.1 uncharacterized protein LOC111446601 [Cucurbita moschata]5.8e-18877.46Show/hide
Query:  MVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVSLVIIFTLLTNAC
        MVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQ+                        
Subjt:  MVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVSLVIIFTLLTNAC

Query:  IYAYRWNPIGALIGRCRLSKTSHFKYGKEVGDVPKEPFQFAEFVNKTYSESILSI----ALERIFFMEVPNSGQK--STTSILYTAGYNYIGASGQINVW
               P   +  +  + +   F+  +  G  P+        + +   + +L +    +  + FF      G +   +TSILYTAGYNYIGASGQINVW
Subjt:  IYAYRWNPIGALIGRCRLSKTSHFKYGKEVGDVPKEPFQFAEFVNKTYSESILSI----ALERIFFMEVPNSGQK--STTSILYTAGYNYIGASGQINVW

Query:  NPKVDLPNDFTASRIWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVGMFQ
        NPKVDLPNDFTASRIWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVGMFQ
Subjt:  NPKVDLPNDFTASRIWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVGMFQ

Query:  DPRSRNWWLNVQGWPVGYWPPTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPFCYS
        DPRSRNWWLNVQGWPVGYWPPTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPFCYS
Subjt:  DPRSRNWWLNVQGWPVGYWPPTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPFCYS

Query:  ADNYQRTYATEPVFFYGGPGRSRDCH
        ADNYQRTYATEPVFFYGGPGRSRDCH
Subjt:  ADNYQRTYATEPVFFYGGPGRSRDCH

XP_023523387.1 uncharacterized protein LOC111787604 [Cucurbita pepo subsp. pepo]1.2e-18875.57Show/hide
Query:  MRNTEHGGKLLAMVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVS
        MRNT+H GKLLAMVA AIAAA+LQAHAAIP MNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQ+            
Subjt:  MRNTEHGGKLLAMVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVS

Query:  LVIIFTLLTNACIYAYRWNPIGALIGRCRLSKTSHFKYGKEVGDVPKEPFQFAEFVNKTYSESILSI----ALERIFFMEVPNSGQK--STTSILYTAGY
                           P   +  +  + +   F+  +  G  PK        + K   + +L +    +    FF      G +   +TSILYTAGY
Subjt:  LVIIFTLLTNACIYAYRWNPIGALIGRCRLSKTSHFKYGKEVGDVPKEPFQFAEFVNKTYSESILSI----ALERIFFMEVPNSGQK--STTSILYTAGY

Query:  NYIGASGQINVWNPKVDLPNDFTASRIWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRG
        NYIGASGQINVWNPKVDLPNDFTASRIWLKNGPSE FESVEAGWMVNRRLYGDTKTR SVHWTVDSYKSTGCFDLTCSGFVQTNPK+VLGAVIDPISTRG
Subjt:  NYIGASGQINVWNPKVDLPNDFTASRIWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRG

Query:  GQQFIISVGMFQDPRSRNWWLNVQGWPVGYWPPTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPER
        GQQFIISVG+FQDPRSRNWWLNVQGWPVGYWPPTLF YLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHY HASYVMLPRIVDNSLQLKYPER
Subjt:  GQQFIISVGMFQDPRSRNWWLNVQGWPVGYWPPTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPER

Query:  VGTWANEPFCYSADNYQRTYATEPVFFYGGPGRSRDCH
        VG WANEPFCYSADNYQRTY TEPVFFYGGPGRSRDCH
Subjt:  VGTWANEPFCYSADNYQRTYATEPVFFYGGPGRSRDCH

TrEMBL top hitse value%identityAlignment
A0A1S4DZ87 uncharacterized protein LOC1034938972.4e-13959.21Show/hide
Query:  LLAMVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVSLVIIFTLLT
        +L MVAL +  AI+  +A   EM+ S  L  QI+ KLKLLNKP++ TIYSEDGD+I CVDIYKQPAFDHP LKNHTIQ+        L + L +  T   
Subjt:  LLAMVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVSLVIIFTLLT

Query:  NACIYAYRWNPIGALIGRCRLSKTSHFKYGKEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVPNSGQK------STTSILYTAGYNYIGASGQI
                         +   S+++ F+  ++ G  PK        + +   E +L       F  + P    K       +T+IL T G NYIGASG I
Subjt:  NACIYAYRWNPIGALIGRCRLSKTSHFKYGKEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVPNSGQK------STTSILYTAGYNYIGASGQI

Query:  NVWNPKVDLPNDFTASRIWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVG
        NVWNPKVDLPNDFTAS+IWLKNGPSE FESVEAGWMVN +LYGD KTRFS++WTVDSYKSTGCFDLTCSGFVQTNP + +GAVIDP+S+  GQQ+ I +G
Subjt:  NVWNPKVDLPNDFTASRIWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVG

Query:  MFQDPRSRNWWLNVQGWPVGYWPPTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPF
        +FQDP+S NWWL  Q  PVGYWPPTLFGYL HSATLVEWGGEVFSS++K VPHT T MGSGDYA   Y +AS+V  PRIVD S+QLKYP +VGTWA+EP 
Subjt:  MFQDPRSRNWWLNVQGWPVGYWPPTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPF

Query:  CYSADNYQRTYATEPVFFYGGPGRSRDCH
        CYS DNYQRTY +EPVF++GGPG SRDCH
Subjt:  CYSADNYQRTYATEPVFFYGGPGRSRDCH

A0A5A7V8M6 Uncharacterized protein2.4e-13959.21Show/hide
Query:  LLAMVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVSLVIIFTLLT
        +L MVAL +  AI+  +A   EM+ S  L  QI+ KLKLLNKP++ TIYSEDGD+I CVDIYKQPAFDHP LKNHTIQ+        L + L +  T   
Subjt:  LLAMVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVSLVIIFTLLT

Query:  NACIYAYRWNPIGALIGRCRLSKTSHFKYGKEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVPNSGQK------STTSILYTAGYNYIGASGQI
                         +   S+++ F+  ++ G  PK        + +   E +L       F  + P    K       +T+IL T G NYIGASG I
Subjt:  NACIYAYRWNPIGALIGRCRLSKTSHFKYGKEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVPNSGQK------STTSILYTAGYNYIGASGQI

Query:  NVWNPKVDLPNDFTASRIWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVG
        NVWNPKVDLPNDFTAS+IWLKNGPSE FESVEAGWMVN +LYGD KTRFS++WTVDSYKSTGCFDLTCSGFVQTNP + +GAVIDP+S+  GQQ+ I +G
Subjt:  NVWNPKVDLPNDFTASRIWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVG

Query:  MFQDPRSRNWWLNVQGWPVGYWPPTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPF
        +FQDP+S NWWL  Q  PVGYWPPTLFGYL HSATLVEWGGEVFSS++K VPHT T MGSGDYA   Y +AS+V  PRIVD S+QLKYP +VGTWA+EP 
Subjt:  MFQDPRSRNWWLNVQGWPVGYWPPTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPF

Query:  CYSADNYQRTYATEPVFFYGGPGRSRDCH
        CYS DNYQRTY +EPVF++GGPG SRDCH
Subjt:  CYSADNYQRTYATEPVFFYGGPGRSRDCH

A0A5D3D964 Uncharacterized protein5.4e-13959.15Show/hide
Query:  LLAMVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVSLVIIFTLLT
        +L MVAL +  AI+  +A   EM+ S  L  QI+ KLKLLNKP++ TIYSEDGD+I CVDIYKQPAFDHP LKNHTIQ+                     
Subjt:  LLAMVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVSLVIIFTLLT

Query:  NACIYAYRWNPIGALIGRCRLSKTSHFKYGKEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVPNSGQK---STTSILYTAGYNYIGASGQINVW
                 N      G    S+++ F+  ++ G  PK        + +   E +L       F  + P    K       IL T G NYIGASG INVW
Subjt:  NACIYAYRWNPIGALIGRCRLSKTSHFKYGKEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVPNSGQK---STTSILYTAGYNYIGASGQINVW

Query:  NPKVDLPNDFTASRIWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVGMFQ
        NPKVDLPNDFTAS+IWLKNGPSE FESVEAGWMVN +LYGD KTRFS++WTVDSYKSTGCFDLTCSGFVQTNP + +GAVIDP+S+  GQQ+ I +G+FQ
Subjt:  NPKVDLPNDFTASRIWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVGMFQ

Query:  DPRSRNWWLNVQGWPVGYWPPTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPFCYS
        DP+S NWWL  Q  PVGYWPPTLFGYL HSATLVEWGGEVFSS++K VPHT T MGSGDYA   Y +AS+V  PRIVD S+QLKYP +VGTWA+EP CYS
Subjt:  DPRSRNWWLNVQGWPVGYWPPTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPFCYS

Query:  ADNYQRTYATEPVFFYGGPGRSRDCH
         DNYQRTY +EPVF++GGPG SRDCH
Subjt:  ADNYQRTYATEPVFFYGGPGRSRDCH

A0A6J1FRA8 uncharacterized protein LOC1114465395.7e-16566.89Show/hide
Query:  MRNTEHGGKLLAMVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVS
        MRNT H GKLLAMVA AIAAAILQA AAIPEMN SQQLS QI KKLKLLNKPALHTIY++DGDIIDCVDIYKQPAFDHPALKNHTIQ+            
Subjt:  MRNTEHGGKLLAMVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVS

Query:  LVIIFTLLTNACIYAYRWNPIGALIGRCRLSKTSHFKYGKEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVPNSGQK------STTSILYTAGY
                           P   +  +        F+  +  G  P         + +   + +L       F  + P    K       +T+ILYTAG+
Subjt:  LVIIFTLLTNACIYAYRWNPIGALIGRCRLSKTSHFKYGKEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVPNSGQK------STTSILYTAGY

Query:  NYIGASGQINVWNPKVDLPNDFTASRIWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRG
        NYIGASGQ+NVWNPKVDLP+DFTASRIWLKNGPSE FESVEAGWMVN RLYGDTKTR SVHWTVDSY+S GCFDLTCSGFVQTNPK+VLGAVIDP+STRG
Subjt:  NYIGASGQINVWNPKVDLPNDFTASRIWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRG

Query:  GQQFIISVGMFQDPRSRNWWLNVQGWPVGYWPPTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPER
        GQQFII+VG+FQDP+S NWWL +QG PVGYWPPTLFGYLR+SATLVEWGGEVFSS++KKVPHT T MGSGDYAG HY++AS+V  PRIVD SLQLKYP R
Subjt:  GQQFIISVGMFQDPRSRNWWLNVQGWPVGYWPPTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPER

Query:  VGTWANEPFCYSADNYQRTYATEPVFFYGGPGRSRDCH
        VGTW +E  CYS DNY+ T  TEPVFFYGGPGRSRDCH
Subjt:  VGTWANEPFCYSADNYQRTYATEPVFFYGGPGRSRDCH

A0A6J1FT01 uncharacterized protein LOC1114466012.8e-18877.46Show/hide
Query:  MVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVSLVIIFTLLTNAC
        MVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQ+                        
Subjt:  MVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVSLVIIFTLLTNAC

Query:  IYAYRWNPIGALIGRCRLSKTSHFKYGKEVGDVPKEPFQFAEFVNKTYSESILSI----ALERIFFMEVPNSGQK--STTSILYTAGYNYIGASGQINVW
               P   +  +  + +   F+  +  G  P+        + +   + +L +    +  + FF      G +   +TSILYTAGYNYIGASGQINVW
Subjt:  IYAYRWNPIGALIGRCRLSKTSHFKYGKEVGDVPKEPFQFAEFVNKTYSESILSI----ALERIFFMEVPNSGQK--STTSILYTAGYNYIGASGQINVW

Query:  NPKVDLPNDFTASRIWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVGMFQ
        NPKVDLPNDFTASRIWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVGMFQ
Subjt:  NPKVDLPNDFTASRIWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVGMFQ

Query:  DPRSRNWWLNVQGWPVGYWPPTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPFCYS
        DPRSRNWWLNVQGWPVGYWPPTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPFCYS
Subjt:  DPRSRNWWLNVQGWPVGYWPPTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPFCYS

Query:  ADNYQRTYATEPVFFYGGPGRSRDCH
        ADNYQRTYATEPVFFYGGPGRSRDCH
Subjt:  ADNYQRTYATEPVFFYGGPGRSRDCH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)1.2e-6937.04Show/hide
Query:  QIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVSLVIIFTLLTNACIYAYRWNPIGALIGRCRLSKTSHF----
        +++K L  LNKPA+ +I S DGD+IDCV I KQPAFDHP LK+H IQ+      + L                  +  N + A        K  H     
Subjt:  QIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVSLVIIFTLLTNACIYAYRWNPIGALIGRCRLSKTSHF----

Query:  -KYGK-EVGDVP-----KEPFQFAEFVNKTYSESILSIALERIFFMEVPN-SGQKSTTSILYTAGYNYIGASGQINVWNPKVDLPNDFTASRIWLKNGP-
         +YGK   G +P     ++    A  V +   +   S+ L +    ++ N SG +   +I Y  G  Y GA   INVW PK+   N+F+ S+IWL  G  
Subjt:  -KYGK-EVGDVP-----KEPFQFAEFVNKTYSESILSIALERIFFMEVPN-SGQKSTTSILYTAGYNYIGASGQINVWNPKVDLPNDFTASRIWLKNGP-

Query:  SENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVGMFQDPRSRNWWLNV-QGWPVGYWP
         ++  S+EAGW V+  LYGD  TR   +WT D+Y++TGC++L CSGF+Q N  I +GA I P+S     Q+ IS+ +++DP+  +WW+    G+ +GYWP
Subjt:  SENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVGMFQDPRSRNWWLNV-QGWPVGYWP

Query:  PTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPFCYSADNYQRTYATEPVFFYGGPG
          LF YL  SA+++EWGGEV +S      HT T MGSG +    +  ASY    ++VD S  LK P+ +GT+  +  CY              F+YGGPG
Subjt:  PTLFGYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPFCYSADNYQRTYATEPVFFYGGPG

Query:  RSRDC
        +++ C
Subjt:  RSRDC

AT5G25950.1 Protein of Unknown Function (DUF239)5.4e-9144Show/hide
Query:  SVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVSLVIIFTLLTNACIYAYRWNPIGALIGRCRLSKTSHFKY
        S+ I  KLK LNKPAL TI SEDGDIIDC+DIYKQ AFDHPALKNH IQ+          V      T + N     +  + I +  G+C +      + 
Subjt:  SVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVSLVIIFTLLTNACIYAYRWNPIGALIGRCRLSKTSHFKY

Query:  GKEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVP----NSGQKSTTS--ILYTAGYNYIGASGQINVWNPKVDLPNDFTASRIWLKNGPSENFE
         +E       P  F       Y  S L  AL+      +     N  Q    S   +   G+N++GA   IN+WNP      D++ ++IWL  G SENFE
Subjt:  GKEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVP----NSGQKSTTS--ILYTAGYNYIGASGQINVWNPKVDLPNDFTASRIWLKNGPSENFE

Query:  SVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVGMFQDPRSRNWWLNVQGWPVGYWPPTLFGY
        SVE GWMVN  ++GD++TR  + WT D Y  TGC +L C+GFVQT+ K  LGA ++P+S+    Q+ I+V +F DP S NWWL  +   +GYWP TLF Y
Subjt:  SVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVGMFQDPRSRNWWLNVQGWPVGYWPPTLFGY

Query:  LRHSATLVEWGGEVFSSH-VKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPFCYSADNYQRTYATEPVFFYGGPGRSRDC
        L+HSAT V+WGGEV S + V K PHT TAMGSG +A   +  A +    RI D S+QLKYP+ +  +A+E  CYS   +++TY +EP F++GGPGR+  C
Subjt:  LRHSATLVEWGGEVFSSH-VKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPFCYSADNYQRTYATEPVFFYGGPGRSRDC

AT5G25960.1 Protein of Unknown Function (DUF239)1.1e-8341.77Show/hide
Query:  SVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVSLVIIFTLLTNACIYAYRWNPIGALIGRCRLSKTSHFKY
        S+ I  KLK LNKP+L TI SEDGDIIDC+DIYKQ AFDHPAL+NH IQ+                                                 +
Subjt:  SVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVSLVIIFTLLTNACIYAYRWNPIGALIGRCRLSKTSHFKY

Query:  GKEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVPNSGQKST--TSILYTAGYNYIGASGQINVWNPKVDLPNDFTASRIWLKNGPSENFESVEA
        G +   +P          N   SE I S    +        +  K T   ++L   GYN+IGA   INVWNP     +D+++++IWL  G S+ FES+EA
Subjt:  GKEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVPNSGQKST--TSILYTAGYNYIGASGQINVWNPKVDLPNDFTASRIWLKNGPSENFESVEA

Query:  GWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVGMFQDPRSRNWWLNVQGWPVGYWPPTLFGYLRHS
        GW VN  ++GD++TR   +WT D Y  TGC +L C+GFVQT  K  LGA I+P+ST   +Q  I+     D  S NWWL      +GYWP TLF YL+HS
Subjt:  GWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVGMFQDPRSRNWWLNVQGWPVGYWPPTLFGYLRHS

Query:  ATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPFCYSADNYQRTYATEPVFFYGGPGRSRDC
        AT V+ GGEV S +V K PHTRT+MGSG +A   +  A Y    RI D SLQ+KYP+ +  +A+E  CYS   +++TY +EP F++GGPG++  C
Subjt:  ATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPFCYSADNYQRTYATEPVFFYGGPGRSRDC

AT5G56530.1 Protein of Unknown Function (DUF239)1.2e-6937.72Show/hide
Query:  QIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVSLVIIFTLLTNACIYAYRWNPIGALI-GRCRLSKTSHFKYG
        ++ K L  LNKPA+ +I S DGDIIDCV I KQPAFDHP LK+H IQ+      +SL     +      +       W+  G    G   + +T      
Subjt:  QIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVSLVIIFTLLTNACIYAYRWNPIGALI-GRCRLSKTSHFKYG

Query:  KEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVPN-SGQKSTTSILYTAGYNYIGASGQINVWNPKVDLPNDFTASRIWLKNGP-SENFESVEAG
               KE    A  V +   +  LS+ L R    ++ N SG +   +I Y  G  + GA   INVW PKV   N+F+ S++W+  G   ++  S+EAG
Subjt:  KEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVPN-SGQKSTTSILYTAGYNYIGASGQINVWNPKVDLPNDFTASRIWLKNGP-SENFESVEAG

Query:  WMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVGMFQDPRSRNWWLNV-QGWPVGYWPPTLFGYLRHS
        W V+  LYGD  TR   +WT D+Y++TGC++L CSGF+Q N +I +GA I P+S     Q+ IS+ +++DP+  +WW+    G+ +GYWP  LF YL  S
Subjt:  WMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVGMFQDPRSRNWWLNV-QGWPVGYWPPTLFGYLRHS

Query:  ATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPFCYSADNYQRTYATEPVFFYGGPGRSRDC
        A++VEWGGEV +   +   HT T MGSG +    +  ASY    ++VD+S  LK P+ + T+  +  CY  +   +       F+YGGPGR+ +C
Subjt:  ATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPFCYSADNYQRTYATEPVFFYGGPGRSRDC

AT5G56530.2 Protein of Unknown Function (DUF239)1.2e-6937.72Show/hide
Query:  QIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVSLVIIFTLLTNACIYAYRWNPIGALI-GRCRLSKTSHFKYG
        ++ K L  LNKPA+ +I S DGDIIDCV I KQPAFDHP LK+H IQ+      +SL     +      +       W+  G    G   + +T      
Subjt:  QIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVSLVIIFTLLTNACIYAYRWNPIGALI-GRCRLSKTSHFKYG

Query:  KEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVPN-SGQKSTTSILYTAGYNYIGASGQINVWNPKVDLPNDFTASRIWLKNGP-SENFESVEAG
               KE    A  V +   +  LS+ L R    ++ N SG +   +I Y  G  + GA   INVW PKV   N+F+ S++W+  G   ++  S+EAG
Subjt:  KEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVPN-SGQKSTTSILYTAGYNYIGASGQINVWNPKVDLPNDFTASRIWLKNGP-SENFESVEAG

Query:  WMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVGMFQDPRSRNWWLNV-QGWPVGYWPPTLFGYLRHS
        W V+  LYGD  TR   +WT D+Y++TGC++L CSGF+Q N +I +GA I P+S     Q+ IS+ +++DP+  +WW+    G+ +GYWP  LF YL  S
Subjt:  WMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVGMFQDPRSRNWWLNV-QGWPVGYWPPTLFGYLRHS

Query:  ATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPFCYSADNYQRTYATEPVFFYGGPGRSRDC
        A++VEWGGEV +   +   HT T MGSG +    +  ASY    ++VD+S  LK P+ + T+  +  CY  +   +       F+YGGPGR+ +C
Subjt:  ATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPFCYSADNYQRTYATEPVFFYGGPGRSRDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAACACAGAACATGGAGGGAAGCTGTTGGCAATGGTGGCTTTGGCCATTGCTGCTGCCATTCTCCAAGCTCATGCAGCCATTCCGGAGATGAATAATTCTCAACA
ATTATCGGTGCAGATTCAAAAGAAGTTAAAGCTTCTTAATAAGCCTGCCCTCCACACCATCTACAGTGAAGATGGAGATATTATCGACTGTGTTGATATTTACAAACAGC
CTGCTTTTGACCATCCGGCTCTAAAGAATCACACCATTCAGGTTCATTTCTTCATGTTAATCAAATCATTATGTGTAAGTCTAGTCATAATCTTTACGCTCTTAACCAAT
GCTTGCATATATGCATATAGATGGAACCCGATTGGGGCGTTGATTGGAAGATGTCGGTTGAGCAAAACGAGCCATTTCAAGTATGGCAAAGAAGTGGGAGATGTCCCGAA
GGAACCATTCCAATTCGCAGAGTTCGTGAACAAGACTTACTCAGAGTCAATTCTCTCGATAGCTTTGGAAAGAATTTTTTTTATGGAAGTTCCAAACTCGGGGCAGAAGT
CAACCACGTCAATCCTGTATACGGCAGGCTACAATTACATTGGCGCTTCAGGACAGATTAATGTTTGGAACCCTAAAGTTGATTTGCCAAATGACTTCACGGCTTCAAGA
ATTTGGTTGAAAAATGGGCCGTCTGAAAATTTTGAAAGCGTTGAAGCTGGCTGGATGGTAAATCGGAGGCTATATGGAGATACAAAAACTCGATTCAGTGTACATTGGAC
GGTGGATTCATACAAATCAACGGGGTGCTTTGATTTAACTTGTAGTGGGTTTGTCCAAACGAACCCCAAAATAGTACTTGGTGCTGTCATTGATCCCATATCAACTAGAG
GTGGACAACAATTCATTATCTCTGTCGGTATGTTTCAGGATCCTCGGTCACGCAATTGGTGGTTGAATGTGCAAGGGTGGCCTGTGGGGTATTGGCCACCGACGCTATTC
GGATATCTGCGTCACAGCGCAACACTGGTGGAATGGGGCGGTGAAGTGTTCAGCTCACATGTAAAGAAAGTGCCACACACGAGGACGGCCATGGGGAGTGGAGATTATGC
AGGGAGGCATTACCGCCACGCTAGTTATGTGATGCTGCCAAGGATCGTGGACAATTCGCTACAATTGAAGTATCCAGAGAGAGTCGGAACTTGGGCTAATGAGCCTTTTT
GTTACTCTGCTGATAATTATCAACGAACCTACGCTACTGAGCCTGTCTTCTTCTATGGCGGTCCTGGTCGCAGTCGTGACTGCCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGGAACACAGAACATGGAGGGAAGCTGTTGGCAATGGTGGCTTTGGCCATTGCTGCTGCCATTCTCCAAGCTCATGCAGCCATTCCGGAGATGAATAATTCTCAACA
ATTATCGGTGCAGATTCAAAAGAAGTTAAAGCTTCTTAATAAGCCTGCCCTCCACACCATCTACAGTGAAGATGGAGATATTATCGACTGTGTTGATATTTACAAACAGC
CTGCTTTTGACCATCCGGCTCTAAAGAATCACACCATTCAGGTTCATTTCTTCATGTTAATCAAATCATTATGTGTAAGTCTAGTCATAATCTTTACGCTCTTAACCAAT
GCTTGCATATATGCATATAGATGGAACCCGATTGGGGCGTTGATTGGAAGATGTCGGTTGAGCAAAACGAGCCATTTCAAGTATGGCAAAGAAGTGGGAGATGTCCCGAA
GGAACCATTCCAATTCGCAGAGTTCGTGAACAAGACTTACTCAGAGTCAATTCTCTCGATAGCTTTGGAAAGAATTTTTTTTATGGAAGTTCCAAACTCGGGGCAGAAGT
CAACCACGTCAATCCTGTATACGGCAGGCTACAATTACATTGGCGCTTCAGGACAGATTAATGTTTGGAACCCTAAAGTTGATTTGCCAAATGACTTCACGGCTTCAAGA
ATTTGGTTGAAAAATGGGCCGTCTGAAAATTTTGAAAGCGTTGAAGCTGGCTGGATGGTAAATCGGAGGCTATATGGAGATACAAAAACTCGATTCAGTGTACATTGGAC
GGTGGATTCATACAAATCAACGGGGTGCTTTGATTTAACTTGTAGTGGGTTTGTCCAAACGAACCCCAAAATAGTACTTGGTGCTGTCATTGATCCCATATCAACTAGAG
GTGGACAACAATTCATTATCTCTGTCGGTATGTTTCAGGATCCTCGGTCACGCAATTGGTGGTTGAATGTGCAAGGGTGGCCTGTGGGGTATTGGCCACCGACGCTATTC
GGATATCTGCGTCACAGCGCAACACTGGTGGAATGGGGCGGTGAAGTGTTCAGCTCACATGTAAAGAAAGTGCCACACACGAGGACGGCCATGGGGAGTGGAGATTATGC
AGGGAGGCATTACCGCCACGCTAGTTATGTGATGCTGCCAAGGATCGTGGACAATTCGCTACAATTGAAGTATCCAGAGAGAGTCGGAACTTGGGCTAATGAGCCTTTTT
GTTACTCTGCTGATAATTATCAACGAACCTACGCTACTGAGCCTGTCTTCTTCTATGGCGGTCCTGGTCGCAGTCGTGACTGCCATTAA
Protein sequenceShow/hide protein sequence
MRNTEHGGKLLAMVALAIAAAILQAHAAIPEMNNSQQLSVQIQKKLKLLNKPALHTIYSEDGDIIDCVDIYKQPAFDHPALKNHTIQVHFFMLIKSLCVSLVIIFTLLTN
ACIYAYRWNPIGALIGRCRLSKTSHFKYGKEVGDVPKEPFQFAEFVNKTYSESILSIALERIFFMEVPNSGQKSTTSILYTAGYNYIGASGQINVWNPKVDLPNDFTASR
IWLKNGPSENFESVEAGWMVNRRLYGDTKTRFSVHWTVDSYKSTGCFDLTCSGFVQTNPKIVLGAVIDPISTRGGQQFIISVGMFQDPRSRNWWLNVQGWPVGYWPPTLF
GYLRHSATLVEWGGEVFSSHVKKVPHTRTAMGSGDYAGRHYRHASYVMLPRIVDNSLQLKYPERVGTWANEPFCYSADNYQRTYATEPVFFYGGPGRSRDCH