; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS006446 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS006446
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function (DUF620)
Genome locationscaffold327:665993..668245
RNA-Seq ExpressionMS006446
SyntenyMS006446
Gene Ontology termsNA
InterPro domainsIPR006873 - Protein of unknown function DUF620


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572489.1 hypothetical protein SDJN03_29217, partial [Cucurbita argyrosperma subsp. sororia]1.4e-19686.41Show/hide
Query:  MATAVVIGGGG----SSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP------SF-DVPNSKTQDLKLLL
        MATAVVIGGGG    SSSSSKSRR+ IWYS PLTPL+E PDPQ QDQE   NKKDS  SASNWEFLRDWFKI RNLP      SF +VPNSKTQDLKLLL
Subjt:  MATAVVIGGGG----SSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP------SF-DVPNSKTQDLKLLL

Query:  GVLACPLAPIPL-SASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQ
        GVLACPLAPIPL S S +P       HFPRDTPLETSVAHYIIQQYLAATGCLKQQK AKNMY +GSV+MIRCETEVSSGK+VK++GTR  D GCFVLWQ
Subjt:  GVLACPLAPIPL-SASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQ

Query:  MLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVI
        MLPGMWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPK TARLF KAQCLGEKRIGE++CFVLKVSAEREAVMERNEGPAEVI
Subjt:  MLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVI

Query:  RHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYF
        RHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGS IGDYRDVDGVLIAH GRSIATVFKFGEMSTQFSRTRMEE+W+IDDVMFNVAGLSIDYF
Subjt:  RHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYF

Query:  IPPADIFDTLHS
        +PPADI DTLHS
Subjt:  IPPADIFDTLHS

XP_022147834.1 uncharacterized protein LOC111016677 [Momordica charantia]1.0e-23499.75Show/hide
Query:  MATAVVIGGGGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLPSFDVPNSKTQDLKLLLGVLACPLAPIP
        MATAVVIGGGGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLPSFD+PNSKTQDLKLLLGVLACPLAPIP
Subjt:  MATAVVIGGGGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLPSFDVPNSKTQDLKLLLGVLACPLAPIP

Query:  LSASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVV
        LSASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVV
Subjt:  LSASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVV

Query:  GGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKS
        GGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKS
Subjt:  GGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKS

Query:  GVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADIFDTLHS
        GVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADIFDTLHS
Subjt:  GVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADIFDTLHS

XP_022952934.1 uncharacterized protein LOC111455462 [Cucurbita moschata]2.8e-19787.29Show/hide
Query:  MATAVVIGGGG-SSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP------SF-DVPNSKTQDLKLLLGVL
        MATAVVIGGGG SSSSSKSRR+ IWYS PLTPL+E PDPQ QDQE   NKKDS  SASNWEFLRDWFKI RNLP      SF +VPNSKTQDLKLLLGVL
Subjt:  MATAVVIGGGG-SSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP------SF-DVPNSKTQDLKLLLGVL

Query:  ACPLAPIPL-SASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLP
        ACPLAPIPL S S +P       HFPRDTPLETSVAHYIIQQYLAATGCLKQQK AKNMY +GSVKMIRCETEVSSGK+VK++GTR  D GCFVLWQMLP
Subjt:  ACPLAPIPL-SASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLP

Query:  GMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHV
        GMWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPK TARLF KAQCLGEKRIGE++CFVLKVSAEREAVMERNEGPAEVIRHV
Subjt:  GMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHV

Query:  LYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPP
        LYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGS IGDYRDVDGVLIAH GRSIATVFKFGEMSTQFSRTRMEE+W+IDDVMFNVAGLSIDYF+PP
Subjt:  LYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPP

Query:  ADIFDTLHS
        ADI DTLHS
Subjt:  ADIFDTLHS

XP_022969378.1 uncharacterized protein LOC111468400 [Cucurbita maxima]1.8e-19686.23Show/hide
Query:  MATAVVIGGGG------SSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP------SF-DVPNSKTQDLKL
        MATAVVIGGGG      SSSSSKSRR+ IWYS PLTPL+E PDPQ QDQE   NKKDS  SASNWEFLRDWFKI RNLP      SF +VPNSKTQDLKL
Subjt:  MATAVVIGGGG------SSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP------SF-DVPNSKTQDLKL

Query:  LLGVLACPLAPIPL-SASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVL
        LLGVLACPLAPIPL S S +P       HFPRDTPLETSVAHYIIQQYLAATGCLKQQK AKNMY +GSVKMIRCETEVSSGK+VK +GTR  D GCFVL
Subjt:  LLGVLACPLAPIPL-SASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVL

Query:  WQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAE
        WQMLPGMWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPK TARLF KAQCLGEKRIGE++CFVLKVSAEREAVMERNEGPAE
Subjt:  WQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAE

Query:  VIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSID
        VIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGS IGDYRDVDGVLIAH GRSIATVFKFGEMSTQFSRTRMEE+W+IDDVMFNVAGLSID
Subjt:  VIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSID

Query:  YFIPPADIFDTLHS
        YF+PPADI DTLHS
Subjt:  YFIPPADIFDTLHS

XP_038887014.1 uncharacterized protein LOC120077181 [Benincasa hispida]1.6e-19585.1Show/hide
Query:  MATAVVIGGGG-----SSSSSKSRR--KAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNL-PSFDV--------PNSKTQD
        MATAVVIG GG     SSSSSKSRR  K IWYS PLTPL+E PDPQ QDQE  +NKKDS  SASNWEFLRDWFKI RNL PS  +        PNSKTQD
Subjt:  MATAVVIGGGG-----SSSSSKSRR--KAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNL-PSFDV--------PNSKTQD

Query:  LKLLLGVLACPLAPIPLSASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCF
        LKLLLGVLACPLAPIPL +   P       HFPRDTPLETSVAHYIIQQYLAATGCLKQQK AKNMY +GSVKMIRCETEVSSGK+VK++GTR  D+GCF
Subjt:  LKLLLGVLACPLAPIPLSASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCF

Query:  VLWQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGP
        VLWQMLPGMWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLF KAQCLGEKRIG+++CFVLKVSAEREAVMERNEGP
Subjt:  VLWQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGP

Query:  AEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLS
        AEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCI DY+DVDGVLIAH GRSIATVFKFGEMSTQFSRTRMEE+W+IDDVMFNVAGLS
Subjt:  AEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLS

Query:  IDYFIPPADIFDTLHS
        IDYFIPPADIFDTLHS
Subjt:  IDYFIPPADIFDTLHS

TrEMBL top hitse value%identityAlignment
A0A1S3BNV0 uncharacterized protein LOC1034915879.2e-19485.57Show/hide
Query:  MATAVVIG-GGGSSSSSKSRR-KAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNL-PSF------DVPNSKTQDLKLLLGV
        MATAV IG GG SSSSSKSRR K IWYS PLTPL+E PDPQ QDQE   NKKDS  S SNWEFLRDWFKI RNL PS       ++PNSKTQDLKLLLGV
Subjt:  MATAVVIG-GGGSSSSSKSRR-KAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNL-PSF------DVPNSKTQDLKLLLGV

Query:  LACPLAPIPLSASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLP
        LACPLAPIPL ++  P       HFP   PLETSVAHYIIQQYLAATGCLKQQK AKNMY +GSVKMIRCETEVSSGK+VK++GTR  D GCFVLWQMLP
Subjt:  LACPLAPIPLSASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLP

Query:  GMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHV
         MWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLF KAQCLGEKRIGE+DCFVLKVSAEREAVMERNEGPAEVIRHV
Subjt:  GMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHV

Query:  LYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPP
        LYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAH GRSIATVFKFGEMSTQFSRTRMEE+W+IDDVMFNVAGLS+DYFIPP
Subjt:  LYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPP

Query:  ADIFDTLHS
        ADIFD+LHS
Subjt:  ADIFDTLHS

A0A6J1D267 uncharacterized protein LOC1110166774.8e-23599.75Show/hide
Query:  MATAVVIGGGGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLPSFDVPNSKTQDLKLLLGVLACPLAPIP
        MATAVVIGGGGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLPSFD+PNSKTQDLKLLLGVLACPLAPIP
Subjt:  MATAVVIGGGGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLPSFDVPNSKTQDLKLLLGVLACPLAPIP

Query:  LSASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVV
        LSASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVV
Subjt:  LSASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVV

Query:  GGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKS
        GGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKS
Subjt:  GGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKS

Query:  GVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADIFDTLHS
        GVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADIFDTLHS
Subjt:  GVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADIFDTLHS

A0A6J1F0N4 uncharacterized protein LOC1114412523.5e-19383.65Show/hide
Query:  MATAVVIGG-GGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLPSF-----------DVPNSK-TQDLKL
        MAT  VIGG G SSSSSKSRRKAIWYS PLTPL+E P PQ QDQE   NKKD   S SNWEF RDWFKI RNLPS            +VPNSK + DLKL
Subjt:  MATAVVIGG-GGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLPSF-----------DVPNSK-TQDLKL

Query:  LLGVLACPLAPIPLSASPTPLDHHSHCHFPR---DTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCF
        LLGVLACPLAPIPL ++P       H HFPR   DTPLETSVAHYIIQQYLAATGCLKQQK AKNMY +GSVKMIRCETEVSSGKTVK++GTR  DNGCF
Subjt:  LLGVLACPLAPIPLSASPTPLDHHSHCHFPR---DTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCF

Query:  VLWQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGP
        VLWQM+P MWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRI+QGLDPKSTARLF KAQCLGEKRIG++DCFVLKVSAEREAVMERNEGP
Subjt:  VLWQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGP

Query:  AEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLS
        AEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAH GRSIATVFKFGEMS QFSRTRMEE+W+IDDVMFNVAGLS
Subjt:  AEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLS

Query:  IDYFIPPADIFDTLHS
        +DYFIPPAD FDT+HS
Subjt:  IDYFIPPADIFDTLHS

A0A6J1GN81 uncharacterized protein LOC1114554621.4e-19787.29Show/hide
Query:  MATAVVIGGGG-SSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP------SF-DVPNSKTQDLKLLLGVL
        MATAVVIGGGG SSSSSKSRR+ IWYS PLTPL+E PDPQ QDQE   NKKDS  SASNWEFLRDWFKI RNLP      SF +VPNSKTQDLKLLLGVL
Subjt:  MATAVVIGGGG-SSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP------SF-DVPNSKTQDLKLLLGVL

Query:  ACPLAPIPL-SASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLP
        ACPLAPIPL S S +P       HFPRDTPLETSVAHYIIQQYLAATGCLKQQK AKNMY +GSVKMIRCETEVSSGK+VK++GTR  D GCFVLWQMLP
Subjt:  ACPLAPIPL-SASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLP

Query:  GMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHV
        GMWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPK TARLF KAQCLGEKRIGE++CFVLKVSAEREAVMERNEGPAEVIRHV
Subjt:  GMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHV

Query:  LYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPP
        LYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGS IGDYRDVDGVLIAH GRSIATVFKFGEMSTQFSRTRMEE+W+IDDVMFNVAGLSIDYF+PP
Subjt:  LYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPP

Query:  ADIFDTLHS
        ADI DTLHS
Subjt:  ADIFDTLHS

A0A6J1I0T2 uncharacterized protein LOC1114684008.9e-19786.23Show/hide
Query:  MATAVVIGGGG------SSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP------SF-DVPNSKTQDLKL
        MATAVVIGGGG      SSSSSKSRR+ IWYS PLTPL+E PDPQ QDQE   NKKDS  SASNWEFLRDWFKI RNLP      SF +VPNSKTQDLKL
Subjt:  MATAVVIGGGG------SSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP------SF-DVPNSKTQDLKL

Query:  LLGVLACPLAPIPL-SASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVL
        LLGVLACPLAPIPL S S +P       HFPRDTPLETSVAHYIIQQYLAATGCLKQQK AKNMY +GSVKMIRCETEVSSGK+VK +GTR  D GCFVL
Subjt:  LLGVLACPLAPIPL-SASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVL

Query:  WQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAE
        WQMLPGMWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPK TARLF KAQCLGEKRIGE++CFVLKVSAEREAVMERNEGPAE
Subjt:  WQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAE

Query:  VIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSID
        VIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGS IGDYRDVDGVLIAH GRSIATVFKFGEMSTQFSRTRMEE+W+IDDVMFNVAGLSID
Subjt:  VIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSID

Query:  YFIPPADIFDTLHS
        YF+PPADI DTLHS
Subjt:  YFIPPADIFDTLHS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G27690.1 Protein of unknown function (DUF620)1.5e-10049.62Show/hide
Query:  RRKAIWYSHP-------LTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLPSFDVPNS---KTQDLKLLLGVLACPLAPIPLSASPTPL
        RRK  + + P       L P++E PDP   D E   +  D S     W    +W K    +    V +S   K  DL+LLLGVL  PL P+ +SA    L
Subjt:  RRKAIWYSHP-------LTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLPSFDVPNS---KTQDLKLLLGVLACPLAPIPLSASPTPL

Query:  DHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEV-SSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVVGGSKVVA
        D   H    ++TP+ETS A YI+QQY AA+G  K   S +N YV G ++ +  E E  S G   K+  +++ ++G FVLW M P MW +ELV+GGSKV+A
Subjt:  DHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEV-SSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVVGGSKVVA

Query:  GSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSGVLVYLE
        G DGK VWRHTPWLG HAAKGP RPLRR +QGLDP++TA +FA A+C+GEK+I  EDCF+LK+ A+   +  R+EG +E IRH L+GYF QK+G+LV+LE
Subjt:  GSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSGVLVYLE

Query:  DSHLTRVQTE-GDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMST-QFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADI-FDT
        DS LTR+Q   G+AVYWETTI S + DY+ V+G++IAH GRS+AT+ +FG+MS+   ++T M+E W ID++ FNV GLSID FIPP+++ FD+
Subjt:  DSHLTRVQTE-GDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMST-QFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADI-FDT

AT1G49840.1 Protein of unknown function (DUF620)4.3e-10349.11Show/hide
Query:  VIGGGGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIH-RNLPSF--DVPNSKTQDLKLLLGVLACPLAPIPLS
        VIGG             I  S  L P++E PDP   +    ++K+  S        L  W K      PS     P  +  DL+LLLGV+  PLAPI +S
Subjt:  VIGGGGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIH-RNLPSF--DVPNSKTQDLKLLLGVLACPLAPIPLS

Query:  ASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVVGG
        +S     H  H    RD+P ETS A YI+QQY AA G  K   + KN Y  G +KMI  E E  +G TV++  +   + G FVLWQM P MW +EL VGG
Subjt:  ASPTPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVVGG

Query:  SKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSGV
        SKV AG +GK VWRHTPWLG+H AKGP RPLRR +QGLDP++TA +FA+++C+GE+++  EDCF+LK+  + E +  R+EGPAE++RH+L+GYF Q++G+
Subjt:  SKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSGV

Query:  LVYLEDSHLTRVQT-EGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADI
        L  +EDS LTR+Q+ +GDAVYWETTI S + DY+ V+G++IAH GRS+ T+F+FGE++   +RT+MEE W I++V FNV GLS+D FIPPAD+
Subjt:  LVYLEDSHLTRVQT-EGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADI

AT1G79420.1 Protein of unknown function (DUF620)5.8e-14864.79Show/hide
Query:  SSSSKSRRKAIWYSHP--LTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP-------------SFDVPNSKTQDLKLLLGVLACPLA
        S+S    RK  W + P  LTPL+E PDP MQD+     KK+SS     WE +R+WFK+H+ +              S+DVP +K QDL+LLLGVL CPLA
Subjt:  SSSSKSRRKAIWYSHP--LTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLP-------------SFDVPNSKTQDLKLLLGVLACPLA

Query:  PIPLSASPT-PLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIG----TRSGDNGCFVLWQMLPG
        PI +  S   P D        ++ P ETS AHYIIQQYLAATGCLK+ K+AKNMY +G +KM  CETE+++GK+VK++G     RSGD+GCFVLWQM PG
Subjt:  PIPLSASPT-PLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIG----TRSGDNGCFVLWQMLPG

Query:  MWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNE--GPAEVIRH
        MWSLELV+GG+K+++GSDGKTVWRHTPWLGTHAAKGPQRPLRR+IQGLDPK+TA LFAKAQCLGE+RIG++DCFVLKVSA+R++++ERN+   PAEVIRH
Subjt:  MWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNE--GPAEVIRH

Query:  VLYGYFCQKSGVLVYLEDSHLTRVQT---EGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDY
         LYGYFCQKSG+LVYLEDSHLTRV T   E +AVYWETTIG+ IGDYRDVDGV +AH GR++ATVF+FGE S Q+SRTRMEE+W IDDV+F+V GLS+D 
Subjt:  VLYGYFCQKSGVLVYLEDSHLTRVQT---EGDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDY

Query:  FIPPADIFD
        FIPPADIF+
Subjt:  FIPPADIFD

AT3G19540.1 Protein of unknown function (DUF620)3.5e-10550.77Show/hide
Query:  GGGGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFK--IHRNLPSFDVPNSKTQDLKLLLGVLACPLAPIPLSASP
        GGGG         + I  S  L P++E PDP   +     N  +S    S    L  W K  + R          +  DL+LLLGV+  PLAPI +S+S 
Subjt:  GGGGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFK--IHRNLPSFDVPNSKTQDLKLLLGVLACPLAPIPLSASP

Query:  TPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVVGGSKV
         PL H S     ++TP+ETS A YI+QQY AA+G  K Q S KN Y  G +KMI  E E ++ +TV++      + G FVLWQM P MW +EL VGGSKV
Subjt:  TPLDHHSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVVGGSKV

Query:  VAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSGVLVY
         AG +GK VWRHTPWLG+H AKGP RPLRR +QGLDP++TA +FA+A+C+GEK++  EDCF+LK+  + E +  R+EGPAE+IRHVL+GYF QK+G+LV+
Subjt:  VAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSGVLVY

Query:  LEDSHLTRVQTE-GDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADI
        +EDSHLTR+Q+  G+ V+WETT  S + DYR V+G++IAH G S+ T+F+FGE++T  +RT+MEE W I++V FNV GLS+D FIPPAD+
Subjt:  LEDSHLTRVQTE-GDAVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADI

AT5G05840.1 Protein of unknown function (DUF620)9.7e-8748.16Show/hide
Query:  KTQDLKLLLGVLACPLAPIPLSASPTPLDHHSHCHFP-----RDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSG----KTV
        +  +++LLLGV+  PL P+P+       DHH+    P     +D PLE S+A YI++QY+AA G  +   + ++MY  G V+M   E     G    K V
Subjt:  KTQDLKLLLGVLACPLAPIPLSASPTPLDHHSHCHFP-----RDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSG----KTV

Query:  K--SIGTRSGDNGCFVLWQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLK
        K  SI +  G+ G FVLWQ    +W LELVV G K+ AGSD K  WR TPW  +HA++GP RPLRR +QGLDPKSTA LFA++ C+GEK+I +EDCF+LK
Subjt:  K--SIGTRSGDNGCFVLWQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLK

Query:  VSAEREAVMERNEGPAEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGD-AVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRME
        + AE  A+  R+    E+IRH ++G F Q++G+L+ LEDSHL R++ + D +++WETT+ S I DYR VDG+L+AH G+S  ++F+FGE S   SRTRME
Subjt:  VSAEREAVMERNEGPAEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGD-AVYWETTIGSCIGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRME

Query:  ELWNIDDVMFNVAGLSIDYFIPPADI
        E W I+++ FN+ GLS+D F+PP+D+
Subjt:  ELWNIDDVMFNVAGLSIDYFIPPADI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACGGCGGTTGTGATTGGGGGTGGGGGATCTTCCAGTTCCAGCAAGTCCCGCAGAAAGGCAATTTGGTACTCTCACCCACTGACTCCGTTGTTGGAAAGTCCGGA
TCCCCAAATGCAAGACCAAGAACAACCCAACAACAAGAAAGACTCCTCGGCCTCGGCGTCCAACTGGGAATTCCTCCGCGACTGGTTCAAGATCCACCGCAACCTCCCCT
CCTTCGACGTTCCCAATTCCAAGACCCAAGATTTGAAGCTTTTGCTCGGCGTCCTCGCATGCCCCCTCGCTCCCATTCCCCTCTCTGCCTCTCCCACTCCCCTCGACCAC
CACTCCCACTGCCACTTCCCACGCGATACGCCTCTTGAAACTTCTGTCGCGCATTACATCATACAACAGTATCTGGCCGCTACCGGATGTCTGAAACAACAAAAGTCTGC
CAAGAACATGTACGTCTCCGGAAGCGTGAAGATGATTCGGTGTGAAACAGAGGTCTCTTCTGGGAAAACTGTGAAGAGCATAGGGACAAGAAGCGGGGACAACGGCTGCT
TTGTTCTGTGGCAAATGCTGCCGGGCATGTGGTCTCTCGAATTGGTGGTCGGAGGCAGTAAGGTGGTTGCCGGCAGCGACGGCAAGACCGTGTGGCGCCACACTCCATGG
CTCGGCACCCATGCCGCCAAAGGTCCCCAACGACCTCTGCGTCGCATCATTCAGGGGCTAGACCCAAAGAGCACGGCTCGGCTGTTCGCGAAAGCCCAATGCCTTGGGGA
GAAGCGGATCGGGGAGGAGGATTGCTTTGTGCTGAAAGTGTCGGCGGAGCGAGAGGCAGTGATGGAAAGAAACGAGGGGCCTGCGGAAGTGATCAGGCACGTGCTGTATG
GGTACTTCTGCCAGAAGAGCGGAGTGCTGGTGTACTTGGAGGACTCACACCTCACCAGAGTGCAGACCGAAGGAGACGCCGTCTACTGGGAAACCACCATCGGAAGCTGC
ATCGGAGACTACAGGGACGTCGACGGCGTCCTCATCGCTCACCACGGCAGGTCCATCGCCACCGTCTTCAAGTTTGGGGAAATGTCCACCCAATTTAGCAGGACCCGAAT
GGAAGAGCTTTGGAACATTGACGATGTCATGTTCAACGTCGCAGGCCTTAGCATCGACTACTTTATTCCTCCAGCCGATATTTTTGATACCCTTCATTCT
mRNA sequenceShow/hide mRNA sequence
ATGGCGACGGCGGTTGTGATTGGGGGTGGGGGATCTTCCAGTTCCAGCAAGTCCCGCAGAAAGGCAATTTGGTACTCTCACCCACTGACTCCGTTGTTGGAAAGTCCGGA
TCCCCAAATGCAAGACCAAGAACAACCCAACAACAAGAAAGACTCCTCGGCCTCGGCGTCCAACTGGGAATTCCTCCGCGACTGGTTCAAGATCCACCGCAACCTCCCCT
CCTTCGACGTTCCCAATTCCAAGACCCAAGATTTGAAGCTTTTGCTCGGCGTCCTCGCATGCCCCCTCGCTCCCATTCCCCTCTCTGCCTCTCCCACTCCCCTCGACCAC
CACTCCCACTGCCACTTCCCACGCGATACGCCTCTTGAAACTTCTGTCGCGCATTACATCATACAACAGTATCTGGCCGCTACCGGATGTCTGAAACAACAAAAGTCTGC
CAAGAACATGTACGTCTCCGGAAGCGTGAAGATGATTCGGTGTGAAACAGAGGTCTCTTCTGGGAAAACTGTGAAGAGCATAGGGACAAGAAGCGGGGACAACGGCTGCT
TTGTTCTGTGGCAAATGCTGCCGGGCATGTGGTCTCTCGAATTGGTGGTCGGAGGCAGTAAGGTGGTTGCCGGCAGCGACGGCAAGACCGTGTGGCGCCACACTCCATGG
CTCGGCACCCATGCCGCCAAAGGTCCCCAACGACCTCTGCGTCGCATCATTCAGGGGCTAGACCCAAAGAGCACGGCTCGGCTGTTCGCGAAAGCCCAATGCCTTGGGGA
GAAGCGGATCGGGGAGGAGGATTGCTTTGTGCTGAAAGTGTCGGCGGAGCGAGAGGCAGTGATGGAAAGAAACGAGGGGCCTGCGGAAGTGATCAGGCACGTGCTGTATG
GGTACTTCTGCCAGAAGAGCGGAGTGCTGGTGTACTTGGAGGACTCACACCTCACCAGAGTGCAGACCGAAGGAGACGCCGTCTACTGGGAAACCACCATCGGAAGCTGC
ATCGGAGACTACAGGGACGTCGACGGCGTCCTCATCGCTCACCACGGCAGGTCCATCGCCACCGTCTTCAAGTTTGGGGAAATGTCCACCCAATTTAGCAGGACCCGAAT
GGAAGAGCTTTGGAACATTGACGATGTCATGTTCAACGTCGCAGGCCTTAGCATCGACTACTTTATTCCTCCAGCCGATATTTTTGATACCCTTCATTCT
Protein sequenceShow/hide protein sequence
MATAVVIGGGGSSSSSKSRRKAIWYSHPLTPLLESPDPQMQDQEQPNNKKDSSASASNWEFLRDWFKIHRNLPSFDVPNSKTQDLKLLLGVLACPLAPIPLSASPTPLDH
HSHCHFPRDTPLETSVAHYIIQQYLAATGCLKQQKSAKNMYVSGSVKMIRCETEVSSGKTVKSIGTRSGDNGCFVLWQMLPGMWSLELVVGGSKVVAGSDGKTVWRHTPW
LGTHAAKGPQRPLRRIIQGLDPKSTARLFAKAQCLGEKRIGEEDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSC
IGDYRDVDGVLIAHHGRSIATVFKFGEMSTQFSRTRMEELWNIDDVMFNVAGLSIDYFIPPADIFDTLHS