; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g0010 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g0010
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionProtein of unknown function (DUF620)
Genome locationMC04:107021..110135
RNA-Seq ExpressionMC04g0010
SyntenyMC04g0010
Gene Ontology termsNA
InterPro domainsIPR006873 - Protein of unknown function DUF620


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572489.1 hypothetical protein SDJN03_29217, partial [Cucurbita argyrosperma subsp. sororia]1.42e-24985.13Show/hide
Query:  MATAVVIGGGGSSSSS----KSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP------SF-DIPNSKTQDLKLLL
        MATAVVIGGGGSSSSS    KSRR+ IWYS PLTPL+E PDPQ QDQE   NKKDSSAS  NWEFLRDWFKI RNLP      SF ++PNSKTQDLKLLL
Subjt:  MATAVVIGGGGSSSSS----KSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP------SF-DIPNSKTQDLKLLL

Query:  GVLACPLAPIPLSASPTPLDHHSHC------HFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGC
        GVLACPLAPIPL         HSH       HFPRDTPLETSVAHYIIQQYLAATGCLKQQK AKNMY +GSV+MIRCETEVSSGK+VK++GTR  D GC
Subjt:  GVLACPLAPIPLSASPTPLDHHSHC------HFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGC

Query:  FVLWQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEG
        FVLWQMLPGMWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPK TARLF KAQCLGEKRIGE++CFVLKVSAEREAVMERNEG
Subjt:  FVLWQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEG

Query:  PAEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGL
        PAEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGS IGDYRDVDGVLIAH GRSIATVFKFGEMSTQFSRTRMEE+W+IDDVMFNVAGL
Subjt:  PAEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGL

Query:  SIDYFIPPADIFDTLHS
        SIDYF+PPADI DTLHS
Subjt:  SIDYFIPPADIFDTLHS

XP_022147834.1 uncharacterized protein LOC111016677 [Momordica charantia]3.13e-301100Show/hide
Query:  MATAVVIGGGGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLPSFDIPNSKTQDLKLLLGVLACPLAPIP
        MATAVVIGGGGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLPSFDIPNSKTQDLKLLLGVLACPLAPIP
Subjt:  MATAVVIGGGGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLPSFDIPNSKTQDLKLLLGVLACPLAPIP

Query:  LSASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVV
        LSASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVV
Subjt:  LSASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVV

Query:  GGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKS
        GGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKS
Subjt:  GGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKS

Query:  GVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADIFDTLHS
        GVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADIFDTLHS
Subjt:  GVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADIFDTLHS

Query:  P
        P
Subjt:  P

XP_022952934.1 uncharacterized protein LOC111455462 [Cucurbita moschata]1.55e-25085.99Show/hide
Query:  MATAVVIGGGGSSSSS-KSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP------SF-DIPNSKTQDLKLLLGVL
        MATAVVIGGGGSSSSS KSRR+ IWYS PLTPL+E PDPQ QDQE   NKKDSSAS  NWEFLRDWFKI RNLP      SF ++PNSKTQDLKLLLGVL
Subjt:  MATAVVIGGGGSSSSS-KSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP------SF-DIPNSKTQDLKLLLGVL

Query:  ACPLAPIPLSASPTPLDHHSHC------HFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVL
        ACPLAPIPL         HSH       HFPRDTPLETSVAHYIIQQYLAATGCLKQQK AKNMY +GSVKMIRCETEVSSGK+VK++GTR  D GCFVL
Subjt:  ACPLAPIPLSASPTPLDHHSHC------HFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVL

Query:  WQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAE
        WQMLPGMWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPK TARLF KAQCLGEKRIGE++CFVLKVSAEREAVMERNEGPAE
Subjt:  WQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAE

Query:  VIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSID
        VIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGS IGDYRDVDGVLIAH GRSIATVFKFGEMSTQFSRTRMEE+W+IDDVMFNVAGLSID
Subjt:  VIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSID

Query:  YFIPPADIFDTLHS
        YF+PPADI DTLHS
Subjt:  YFIPPADIFDTLHS

XP_022969378.1 uncharacterized protein LOC111468400 [Cucurbita maxima]3.08e-24985.99Show/hide
Query:  MATAVVIGGGGSSSSS------KSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP------SF-DIPNSKTQDLKL
        MATAVVIGGGGSSSSS      KSRR+ IWYS PLTPL+E PDPQ QDQE   NKKDSSAS  NWEFLRDWFKI RNLP      SF ++PNSKTQDLKL
Subjt:  MATAVVIGGGGSSSSS------KSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP------SF-DIPNSKTQDLKL

Query:  LLGVLACPLAPIPL-SASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVL
        LLGVLACPLAPIPL S S +P       HFPRDTPLETSVAHYIIQQYLAATGCLKQQK AKNMY +GSVKMIRCETEVSSGK+VK +GTR  D GCFVL
Subjt:  LLGVLACPLAPIPL-SASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVL

Query:  WQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAE
        WQMLPGMWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPK TARLF KAQCLGEKRIGE++CFVLKVSAEREAVMERNEGPAE
Subjt:  WQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAE

Query:  VIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSID
        VIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGS IGDYRDVDGVLIAH GRSIATVFKFGEMSTQFSRTRMEE+W+IDDVMFNVAGLSID
Subjt:  VIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSID

Query:  YFIPPADIFDTLHS
        YF+PPADI DTLHS
Subjt:  YFIPPADIFDTLHS

XP_038887014.1 uncharacterized protein LOC120077181 [Benincasa hispida]2.22e-24885.34Show/hide
Query:  MATAVVIGGGGSSSSS-----KSRR--KAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNL-PSFDIP--------NSKTQD
        MATAVVIG GGSSSSS     KSRR  K IWYS PLTPL+E PDPQ QDQE  +NKKDSSAS  NWEFLRDWFKI RNL PS  IP        NSKTQD
Subjt:  MATAVVIGGGGSSSSS-----KSRR--KAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNL-PSFDIP--------NSKTQD

Query:  LKLLLGVLACPLAPIPLSASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCF
        LKLLLGVLACPLAPIPL +   P  H     FPRDTPLETSVAHYIIQQYLAATGCLKQQK AKNMY +GSVKMIRCETEVSSGK+VK++GTR  D+GCF
Subjt:  LKLLLGVLACPLAPIPLSASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCF

Query:  VLWQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGP
        VLWQMLPGMWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLF KAQCLGEKRIG+++CFVLKVSAEREAVMERNEGP
Subjt:  VLWQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGP

Query:  AEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLS
        AEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCI DY+DVDGVLIAH GRSIATVFKFGEMSTQFSRTRMEE+W+IDDVMFNVAGLS
Subjt:  AEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLS

Query:  IDYFIPPADIFDTLHS
        IDYFIPPADIFDTLHS
Subjt:  IDYFIPPADIFDTLHS

TrEMBL top hitse value%identityAlignment
A0A1S3BNV0 uncharacterized protein LOC1034915876.75e-24685.57Show/hide
Query:  MATAVVIGGGGSSSSS-KSRR-KAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNL-PSF------DIPNSKTQDLKLLLGV
        MATAV IG GGSSSSS KSRR K IWYS PLTPL+E PDPQ QDQE   NKKDSS S  NWEFLRDWFKI RNL PS       ++PNSKTQDLKLLLGV
Subjt:  MATAVVIGGGGSSSSS-KSRR-KAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNL-PSF------DIPNSKTQDLKLLLGV

Query:  LACPLAPIPLSASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLP
        LACPLAPIPL ++  P   H    FP   PLETSVAHYIIQQYLAATGCLKQQK AKNMY +GSVKMIRCETEVSSGK+VK++GTR  D GCFVLWQMLP
Subjt:  LACPLAPIPLSASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLP

Query:  GMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHV
         MWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLF KAQCLGEKRIGE+DCFVLKVSAEREAVMERNEGPAEVIRHV
Subjt:  GMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHV

Query:  LYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPP
        LYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAH GRSIATVFKFGEMSTQFSRTRMEE+W+IDDVMFNVAGLS+DYFIPP
Subjt:  LYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPP

Query:  ADIFDTLHS
        ADIFD+LHS
Subjt:  ADIFDTLHS

A0A6J1D267 uncharacterized protein LOC1110166771.52e-301100Show/hide
Query:  MATAVVIGGGGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLPSFDIPNSKTQDLKLLLGVLACPLAPIP
        MATAVVIGGGGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLPSFDIPNSKTQDLKLLLGVLACPLAPIP
Subjt:  MATAVVIGGGGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLPSFDIPNSKTQDLKLLLGVLACPLAPIP

Query:  LSASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVV
        LSASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVV
Subjt:  LSASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVV

Query:  GGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKS
        GGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKS
Subjt:  GGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKS

Query:  GVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADIFDTLHS
        GVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADIFDTLHS
Subjt:  GVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADIFDTLHS

Query:  P
        P
Subjt:  P

A0A6J1F0N4 uncharacterized protein LOC1114412529.46e-24583.41Show/hide
Query:  MATAVVIGG-GGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLPSF-----------DIPNSK-TQDLKL
        MAT  VIGG G SSSSSKSRRKAIWYS PLTPL+E P PQ QDQE   NKKDS    SNWEF RDWFKI RNLPS            ++PNSK + DLKL
Subjt:  MATAVVIGG-GGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLPSF-----------DIPNSK-TQDLKL

Query:  LLGVLACPLAPIPLSASPTPLDHHSHCHFPRD---TPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCF
        LLGVLACPLAPIPL ++P       H HFPRD   TPLETSVAHYIIQQYLAATGCLKQQK AKNMY +GSVKMIRCETEVSSGKTVK++GTR  DNGCF
Subjt:  LLGVLACPLAPIPLSASPTPLDHHSHCHFPRD---TPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCF

Query:  VLWQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGP
        VLWQM+P MWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRI+QGLDPKSTARLF KAQCLGEKRIG++DCFVLKVSAEREAVMERNEGP
Subjt:  VLWQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGP

Query:  AEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLS
        AEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAH GRSIATVFKFGEMS QFSRTRMEE+W+IDDVMFNVAGLS
Subjt:  AEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLS

Query:  IDYFIPPADIFDTLHS
        +DYFIPPAD FDT+HS
Subjt:  IDYFIPPADIFDTLHS

A0A6J1GN81 uncharacterized protein LOC1114554627.49e-25185.99Show/hide
Query:  MATAVVIGGGGSSSSS-KSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP------SF-DIPNSKTQDLKLLLGVL
        MATAVVIGGGGSSSSS KSRR+ IWYS PLTPL+E PDPQ QDQE   NKKDSSAS  NWEFLRDWFKI RNLP      SF ++PNSKTQDLKLLLGVL
Subjt:  MATAVVIGGGGSSSSS-KSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP------SF-DIPNSKTQDLKLLLGVL

Query:  ACPLAPIPLSASPTPLDHHSHC------HFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVL
        ACPLAPIPL         HSH       HFPRDTPLETSVAHYIIQQYLAATGCLKQQK AKNMY +GSVKMIRCETEVSSGK+VK++GTR  D GCFVL
Subjt:  ACPLAPIPLSASPTPLDHHSHC------HFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVL

Query:  WQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAE
        WQMLPGMWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPK TARLF KAQCLGEKRIGE++CFVLKVSAEREAVMERNEGPAE
Subjt:  WQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAE

Query:  VIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSID
        VIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGS IGDYRDVDGVLIAH GRSIATVFKFGEMSTQFSRTRMEE+W+IDDVMFNVAGLSID
Subjt:  VIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSID

Query:  YFIPPADIFDTLHS
        YF+PPADI DTLHS
Subjt:  YFIPPADIFDTLHS

A0A6J1I0T2 uncharacterized protein LOC1114684001.49e-24985.99Show/hide
Query:  MATAVVIGGGGSSSSS------KSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP------SF-DIPNSKTQDLKL
        MATAVVIGGGGSSSSS      KSRR+ IWYS PLTPL+E PDPQ QDQE   NKKDSSAS  NWEFLRDWFKI RNLP      SF ++PNSKTQDLKL
Subjt:  MATAVVIGGGGSSSSS------KSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP------SF-DIPNSKTQDLKL

Query:  LLGVLACPLAPIPL-SASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVL
        LLGVLACPLAPIPL S S +P       HFPRDTPLETSVAHYIIQQYLAATGCLKQQK AKNMY +GSVKMIRCETEVSSGK+VK +GTR  D GCFVL
Subjt:  LLGVLACPLAPIPL-SASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVL

Query:  WQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAE
        WQMLPGMWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPK TARLF KAQCLGEKRIGE++CFVLKVSAEREAVMERNEGPAE
Subjt:  WQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAE

Query:  VIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSID
        VIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGS IGDYRDVDGVLIAH GRSIATVFKFGEMSTQFSRTRMEE+W+IDDVMFNVAGLSID
Subjt:  VIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSID

Query:  YFIPPADIFDTLHS
        YF+PPADI DTLHS
Subjt:  YFIPPADIFDTLHS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G27690.1 Protein of unknown function (DUF620)2.0e-10049.36Show/hide
Query:  RRKAIWYSHP-------LTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLPSFDIPNS---KTQDLKLLLGVLACPLAPIPLSASPTPL
        RRK  + + P       L P++E PDP   D E   +  D S     W    +W K    +    + +S   K  DL+LLLGVL  PL P+ +SA    L
Subjt:  RRKAIWYSHP-------LTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLPSFDIPNS---KTQDLKLLLGVLACPLAPIPLSASPTPL

Query:  DHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEV-SSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVVGGSKVVA
        D   H    ++TP+ETS A YI+QQY AA+G  K   S +N YV G ++ +  E E  S G   K+  +++ ++G FVLW M P MW +ELV+GGSKV+A
Subjt:  DHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEV-SSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVVGGSKVVA

Query:  GSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSGVLVYLE
        G DGK VWRHTPWLG HAAKGP RPLRR +QGLDP++TA +FA A+C+GEK+I  EDCF+LK+ A+   +  R+EG +E IRH L+GYF QK+G+LV+LE
Subjt:  GSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSGVLVYLE

Query:  DSHLTRVQTE-GDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMST-QFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADI-FDT
        DS LTR+Q   G+AVYWETTI S + DY+ V+G++IAH GRS+AT+ +FG+MS+   ++T M+E W ID++ FNV GLSID FIPP+++ FD+
Subjt:  DSHLTRVQTE-GDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMST-QFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADI-FDT

AT1G49840.1 Protein of unknown function (DUF620)7.4e-10349.11Show/hide
Query:  VIGGGGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIH-RNLPSF--DIPNSKTQDLKLLLGVLACPLAPIPLS
        VIGG             I  S  L P++E PDP   +    ++K+  S        L  W K      PS     P  +  DL+LLLGV+  PLAPI +S
Subjt:  VIGGGGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIH-RNLPSF--DIPNSKTQDLKLLLGVLACPLAPIPLS

Query:  ASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVVGG
        +S     H  H    RD+P ETS A YI+QQY AA G  K   + KN Y  G +KMI  E E  +G TV++  +   + G FVLWQM P MW +EL VGG
Subjt:  ASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVVGG

Query:  SKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSGV
        SKV AG +GK VWRHTPWLG+H AKGP RPLRR +QGLDP++TA +FA+++C+GE+++  EDCF+LK+  + E +  R+EGPAE++RH+L+GYF Q++G+
Subjt:  SKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSGV

Query:  LVYLEDSHLTRVQT-EGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADI
        L  +EDS LTR+Q+ +GDAVYWETTI S + DY+ V+G++IAH GRS+ T+F+FGE++   +RT+MEE W I++V FNV GLS+D FIPPAD+
Subjt:  LVYLEDSHLTRVQT-EGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADI

AT1G79420.1 Protein of unknown function (DUF620)7.6e-14864.55Show/hide
Query:  SSSSKSRRKAIWYSHP--LTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP-------------SFDIPNSKTQDLKLLLGVLACPLA
        S+S    RK  W + P  LTPL+E PDP MQD+     KK+SS     WE +R+WFK+H+ +              S+D+P +K QDL+LLLGVL CPLA
Subjt:  SSSSKSRRKAIWYSHP--LTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP-------------SFDIPNSKTQDLKLLLGVLACPLA

Query:  PIPLSASPT-PLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIG----TRSGDNGCFVLWQMLPG
        PI +  S   P D        ++ P ETS AHYIIQQYLAATGCLK+ K+AKNMY +G +KM  CETE+++GK+VK++G     RSGD+GCFVLWQM PG
Subjt:  PIPLSASPT-PLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIG----TRSGDNGCFVLWQMLPG

Query:  MWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNE--GPAEVIRH
        MWSLELV+GG+K+++GSDGKTVWRHTPWLGTHAAKGPQRPLRR+IQGLDPK+TA LFAKAQCLGE+RIG++DCFVLKVSA+R++++ERN+   PAEVIRH
Subjt:  MWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNE--GPAEVIRH

Query:  VLYGYFCQKSGVLVYLEDSHLTRVQT---EGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDY
         LYGYFCQKSG+LVYLEDSHLTRV T   E +AVYWETTIG+ IGDYRDVDGV +AH GR++ATVF+FGE S Q+SRTRMEE+W IDDV+F+V GLS+D 
Subjt:  VLYGYFCQKSGVLVYLEDSHLTRVQT---EGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDY

Query:  FIPPADIFD
        FIPPADIF+
Subjt:  FIPPADIFD

AT3G19540.1 Protein of unknown function (DUF620)6.1e-10550.77Show/hide
Query:  GGGGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFK--IHRNLPSFDIPNSKTQDLKLLLGVLACPLAPIPLSASP
        GGGG         + I  S  L P++E PDP   +     N  +S    S    L  W K  + R          +  DL+LLLGV+  PLAPI +S+S 
Subjt:  GGGGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFK--IHRNLPSFDIPNSKTQDLKLLLGVLACPLAPIPLSASP

Query:  TPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVVGGSKV
         PL H S     ++TP+ETS A YI+QQY AA+G  K Q S KN Y  G +KMI  E E ++ +TV++      + G FVLWQM P MW +EL VGGSKV
Subjt:  TPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVVGGSKV

Query:  VAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSGVLVY
         AG +GK VWRHTPWLG+H AKGP RPLRR +QGLDP++TA +FA+A+C+GEK++  EDCF+LK+  + E +  R+EGPAE+IRHVL+GYF QK+G+LV+
Subjt:  VAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSGVLVY

Query:  LEDSHLTRVQTE-GDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADI
        +EDSHLTR+Q+  G+ V+WETT  S + DYR V+G++IAH G S+ T+F+FGE++T  +RT+MEE W I++V FNV GLS+D FIPPAD+
Subjt:  LEDSHLTRVQTE-GDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADI

AT5G05840.1 Protein of unknown function (DUF620)9.7e-8748.16Show/hide
Query:  KTQDLKLLLGVLACPLAPIPLSASPTPLDHHSHCHFP-----RDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSG----KTV
        +  +++LLLGV+  PL P+P+       DHH+    P     +D PLE S+A YI++QY+AA G  +   + ++MY  G V+M   E     G    K V
Subjt:  KTQDLKLLLGVLACPLAPIPLSASPTPLDHHSHCHFP-----RDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSG----KTV

Query:  K--SIGTRSGDNGCFVLWQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLK
        K  SI +  G+ G FVLWQ    +W LELVV G K+ AGSD K  WR TPW  +HA++GP RPLRR +QGLDPKSTA LFA++ C+GEK+I +EDCF+LK
Subjt:  K--SIGTRSGDNGCFVLWQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLK

Query:  VSAEREAVMERNEGPAEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGD-AVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRME
        + AE  A+  R+    E+IRH ++G F Q++G+L+ LEDSHL R++ + D +++WETT+ S I DYR VDG+L+AH G+S  ++F+FGE S   SRTRME
Subjt:  VSAEREAVMERNEGPAEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGD-AVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRME

Query:  ELWNIDDVMFNVAGLSIDYFIPPADI
        E W I+++ FN+ GLS+D F+PP+D+
Subjt:  ELWNIDDVMFNVAGLSIDYFIPPADI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACGGCGGTTGTGATTGGGGGTGGGGGATCTTCCAGTTCCAGCAAGTCCCGCAGAAAGGCAATTTGGTACTCTCACCCACTGACTCCGTTGTTGGAAAGTCCGGA
TCCCCAAATGCAAGACCAAGAACAACCCAACAACAAGAAAGACTCCTCGGCCTCGGCGTCCAACTGGGAATTCCTCCGCGACTGGTTCAAGATCCACCGCAACCTCCCCT
CCTTCGACATTCCCAATTCCAAGACCCAAGATTTGAAGCTTTTGCTCGGCGTCCTCGCATGCCCCCTCGCTCCCATTCCCCTCTCTGCCTCTCCCACTCCCCTCGACCAC
CACTCCCACTGCCACTTCCCACGCGATACGCCTCTTGAAACTTCTGTCGCGCATTACATCATACAACAGTATCTGGCCGCTACCGGATGTCTGAAACAACAAAAGTCTGC
CAAGAACATGTACGTCTCCGGAAGCGTGAAGATGATTCGGTGTGAAACAGAGGTCTCTTCTGGGAAAACTGTGAAGAGCATAGGGACAAGAAGCGGGGACAACGGCTGCT
TTGTTCTGTGGCAAATGCTGCCGGGCATGTGGTCTCTCGAATTGGTGGTCGGAGGCAGTAAGGTGGTTGCCGGCAGCGACGGCAAGACCGTGTGGCGCCACACTCCATGG
CTCGGCACCCATGCCGCCAAAGGTCCCCAACGACCTCTGCGTCGCATCATTCAGGGGCTAGACCCAAAGAGCACGGCTCGGCTGTTCGCGAAAGCCCAATGCCTTGGGGA
GAAGCGGATCGGGGAGGAGGATTGCTTTGTGCTGAAAGTGTCGGCGGAGCGAGAGGCAGTGATGGAAAGAAACGAGGGGCCTGCGGAAGTGATCAGGCACGTGCTGTATG
GGTACTTCTGCCAGAAGAGCGGAGTGCTGGTGTACTTGGAGGACTCACACCTCACCAGAGTGCAGACCGAAGGAGACGCCGTCTACTGGGAAACCACCATCGGAAGCTGC
ATCGGAGACTACAGGGACGTCGACGGCGTCCTCATCGCTCACCACGGCAGGTCCATCGCCACCGTCTTCAAGTTTGGGGAAATGTCCACCCAATTTAGCAGGACCCGAAT
GGAAGAGCTTTGGAACATTGACGATGTCATGTTCAACGTCGCAGGCCTTAGCATCGACTACTTTATTCCTCCAGCCGATATTTTTGATACCCTTCATTCTCCATGA
mRNA sequenceShow/hide mRNA sequence
GCCCAGCCCAATATACGTCTAAAACTCAAACTTGGGCCGACCTCTGAACTGAGACCAACCCTATTTTGTTTTGTTTACATTTTCACTCCCACCTCTCAGCCCCCTTGCTG
ATCCAGGTCACCCAAACCAAAACCTCTCAGCCCCCTCTTTCGCACCGCCGCCAATTCCTCTCTTTTTTCTTGCTCTCAAGTTGCTGTTGATGCCATGCCGCTGTTATATT
AGATCTCTCTGTCAGGTAGTCTTAGTTCAGATACTATGACTGTATATATCTCTCTTTACGTTGAGCCAAAGAACCGTGAAAACCGAGCCGATCCGTTCGTATTCAAAATC
GAAAATGTTGGTTTGGTCGGTGTGGGGCTAGTGGCAGGGGCCGGGAACTTTTAAAAATTAAAATACAGTGTCGCCGAATTGGAACTTACTGTACCGCCTTTATTTGCCGA
TGAGTTTGCTTTATTTATATATATAATATAGCGTTGGGTTGGGTTGGGTTATACTGTGAGTCTTGCTTAGTTTGTCCGGTAGCGTCCCACCCTAAAGGCCTAAACTAACT
CTCTTTTATCATATTCTCCAGTGTTCACCAAAGGTGTTTGTGAGTTTGTCCCTGTGAAATGAATTAGGTTTTGAATTGGATCTGGTTAAGGGCATGGCGACGGCGGTTGT
GATTGGGGGTGGGGGATCTTCCAGTTCCAGCAAGTCCCGCAGAAAGGCAATTTGGTACTCTCACCCACTGACTCCGTTGTTGGAAAGTCCGGATCCCCAAATGCAAGACC
AAGAACAACCCAACAACAAGAAAGACTCCTCGGCCTCGGCGTCCAACTGGGAATTCCTCCGCGACTGGTTCAAGATCCACCGCAACCTCCCCTCCTTCGACATTCCCAAT
TCCAAGACCCAAGATTTGAAGCTTTTGCTCGGCGTCCTCGCATGCCCCCTCGCTCCCATTCCCCTCTCTGCCTCTCCCACTCCCCTCGACCACCACTCCCACTGCCACTT
CCCACGCGATACGCCTCTTGAAACTTCTGTCGCGCATTACATCATACAACAGTATCTGGCCGCTACCGGATGTCTGAAACAACAAAAGTCTGCCAAGAACATGTACGTCT
CCGGAAGCGTGAAGATGATTCGGTGTGAAACAGAGGTCTCTTCTGGGAAAACTGTGAAGAGCATAGGGACAAGAAGCGGGGACAACGGCTGCTTTGTTCTGTGGCAAATG
CTGCCGGGCATGTGGTCTCTCGAATTGGTGGTCGGAGGCAGTAAGGTGGTTGCCGGCAGCGACGGCAAGACCGTGTGGCGCCACACTCCATGGCTCGGCACCCATGCCGC
CAAAGGTCCCCAACGACCTCTGCGTCGCATCATTCAGGGGCTAGACCCAAAGAGCACGGCTCGGCTGTTCGCGAAAGCCCAATGCCTTGGGGAGAAGCGGATCGGGGAGG
AGGATTGCTTTGTGCTGAAAGTGTCGGCGGAGCGAGAGGCAGTGATGGAAAGAAACGAGGGGCCTGCGGAAGTGATCAGGCACGTGCTGTATGGGTACTTCTGCCAGAAG
AGCGGAGTGCTGGTGTACTTGGAGGACTCACACCTCACCAGAGTGCAGACCGAAGGAGACGCCGTCTACTGGGAAACCACCATCGGAAGCTGCATCGGAGACTACAGGGA
CGTCGACGGCGTCCTCATCGCTCACCACGGCAGGTCCATCGCCACCGTCTTCAAGTTTGGGGAAATGTCCACCCAATTTAGCAGGACCCGAATGGAAGAGCTTTGGAACA
TTGACGATGTCATGTTCAACGTCGCAGGCCTTAGCATCGACTACTTTATTCCTCCAGCCGATATTTTTGATACCCTTCATTCTCCATGACTCCACCCTCACCCTCCTGTA
TTTGTATTCTATTACGCATAACGTCTCCCTAAGCTTCGCTAATCAATCATATCACTGGACACTACTCATCTCATCTGTATACATCTCAACATAACTTTCTTCATCCCTTT
CCCTAGTACTCTCATTAAACCAGCGCAAATGAGTATTATTTTCCTTTTCCTTGGAATCTTTTAACTAATGAATGATCATGAA
Protein sequenceShow/hide protein sequence
MATAVVIGGGGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLPSFDIPNSKTQDLKLLLGVLACPLAPIPLSASPTPLDH
HSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPW
LGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSC
IGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADIFDTLHSP