CuGenDBv2

Clc06G01800 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc06G01800
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Glycosyl transferase
Genome location: ClcChr06:1655395..1656870
RNA-Seq Expression: Clc06G01800
Synteny: Clc06G01800
Gene Ontology terms: GO:0016740 - transferase activity (molecular function)
InterPro domains: IPR038941 - At4g14100-like


Homology
GenBank top hits (e value, % identity, alignment)
KAG6576096.1 hypothetical protein SDJN03_26735, partial [Cucurbita argyrosperma subsp. sororia] (e value: 9.9e-109; % identity: 87.68)
Query:  MASKVT-FLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE
        M SK T FL+LSLL+VS+R S+SEDPVPTPWPLQFHSIL  N++GILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSF+YTLDSSKTCS  + E
Subjt:  MASKVT-FLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE

Query:  VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQNLPVMD
        VGILRPNWLDGA YLGQRHV+GFLCNVWEKVDFIWYYEDVETKRPVHW+FYTGR+AHVMTFEVGAVLEDEKWQAPVYCFD GKG   NNGVLHQNLPVMD
Subjt:  VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQNLPVMD

Query:  GVLKGSLFPAI
        GVL+GSLFPAI
Subjt:  GVLKGSLFPAI

XP_022991566.1 uncharacterized protein At4g14100-like [Cucurbita maxima] (e value: 1.2e-111; % identity: 90.52)
Query:  MASKVT-FLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE
        M SK T FL+LSLL+VS+R S+SEDPVP PWPLQFHSIL MN++GILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE
Subjt:  MASKVT-FLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE

Query:  VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQNLPVMD
        VGILRPNWLDGA YLGQRHV+GFLCNVWEKVDFIWYYEDVETKRPVHW+FYTGRQAHVMTFEVGAVLEDEKWQAPVYCFD GKG   NNGVLHQNLPVMD
Subjt:  VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQNLPVMD

Query:  GVLKGSLFPAI
        GVL+GSLFPAI
Subjt:  GVLKGSLFPAI

XP_023520922.1 uncharacterized protein At4g14100-like [Cucurbita pepo subsp. pepo] (e value: 2.9e-108; % identity: 86.32)
Query:  MASKVTF--LILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQL
        M SK TF  ++LSLL+VS+R S+SEDP+PTPWPLQFHSIL  N++GILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSF+YTLDSSKTCS  Q 
Subjt:  MASKVTF--LILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQL

Query:  EVGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQNLPVM
        EVGILRPNWLDGA Y+GQRHV+GFLCNVWEKVDFIWYYEDVETKRPVHW+FYTGR+AHVMTFEVGAVLEDEKWQAPVYCFD GKG   NNGVLHQNLPVM
Subjt:  EVGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQNLPVM

Query:  DGVLKGSLFPAI
        DGVL+GSLFPAI
Subjt:  DGVLKGSLFPAI

XP_023548717.1 uncharacterized protein At4g14100-like isoform X1 [Cucurbita pepo subsp. pepo] (e value: 3.3e-112; % identity: 90.52)
Query:  MASKVT-FLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE
        M SK T FL+LSLL+VS+R S+SEDP+PTPWPLQFHSIL MN++GILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE
Subjt:  MASKVT-FLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE

Query:  VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQNLPVMD
        VGILRPNWLDGA YLGQRHV+GFLCNVWEKVDFIWYYEDVETKRPVHW+FYTGRQAHVMTFEVGAVLEDEKWQAPVYCFD GKG   NNGVLHQNLPVMD
Subjt:  VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQNLPVMD

Query:  GVLKGSLFPAI
        GVL+GSLFPAI
Subjt:  GVLKGSLFPAI

XP_023548718.1 uncharacterized protein At4g14100-like isoform X2 [Cucurbita pepo subsp. pepo] (e value: 7.3e-112; % identity: 90.05)
Query:  MASKVT-FLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE
        M SK T FL+LSLL+VS+R S+SEDP+PTPWPLQFHSIL MN++GILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE
Subjt:  MASKVT-FLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE

Query:  VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQNLPVMD
        VGILRPNWLDGA Y+GQRHV+GFLCNVWEKVDFIWYYEDVETKRPVHW+FYTGRQAHVMTFEVGAVLEDEKWQAPVYCFD GKG   NNGVLHQNLPVMD
Subjt:  VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQNLPVMD

Query:  GVLKGSLFPAI
        GVL+GSLFPAI
Subjt:  GVLKGSLFPAI

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KDE7 Uncharacterized protein (e value: 8.5e-106; % identity: 86.92)
Query:  MASKVT-FLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE
        MASK   FLILSLLSVS+R+SISEDPVPTPWPLQFHS+LLMN++GI QII+LWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDS+KTCS AQLE
Subjt:  MASKVT-FLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE

Query:  VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQNLPVMD
        VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFD G GT+VN+  LHQNLP+M 
Subjt:  VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQNLPVMD

Query:  GV---LKGSLFPAI
        GV   L    +PAI
Subjt:  GV---LKGSLFPAI

A0A1S3CAT8 uncharacterized protein At4g14100-like (e value: 4.2e-105; % identity: 86.92)
Query:  MASKVT-FLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE
        MASK   FLILSLLS S+ YSI+EDPVPTPWPLQFHSILLMN++GI QII+LWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE
Subjt:  MASKVT-FLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE

Query:  VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQNLPVMD
        VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFD G GT+VN+  LHQNLP+M 
Subjt:  VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQNLPVMD

Query:  GV---LKGSLFPAI
         V   L    +PAI
Subjt:  GV---LKGSLFPAI

A0A5A7T8W2 Glycosyl transferase (e value: 5.5e-105; % identity: 86.92)
Query:  MASKVT-FLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE
        MASK   FLILSLLS S+ YSI+EDPVPTPWPLQFHSILLMN++GI QII+LWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE
Subjt:  MASKVT-FLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE

Query:  VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQNLPVMD
        VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFD G GT+VN+  LHQNLP+M 
Subjt:  VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQNLPVMD

Query:  GV---LKGSLFPAI
         V   L    +PAI
Subjt:  GV---LKGSLFPAI

A0A6J1GMU2 uncharacterized protein At4g14100-like (e value: 1.4e-108; % identity: 87.2)
Query:  MASKVT-FLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE
        M SK T FL+LSLL+VS+R S+SEDPVPTPWPLQFHSIL  N++GILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSF+YTLDSSKTCS  + E
Subjt:  MASKVT-FLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE

Query:  VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQNLPVMD
        VGILRPNWLDGA YLGQRHV+GFLCNVWEKVDFIWYYEDVETKRPVHW+FYTGR+AHVMTFEVGAVLEDEKWQAPVYCFD GKG   NNGVLHQN PVMD
Subjt:  VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQNLPVMD

Query:  GVLKGSLFPAI
        GVL+GSLFPAI
Subjt:  GVLKGSLFPAI

A0A6J1JR40 uncharacterized protein At4g14100-like (e value: 6.0e-112; % identity: 90.52)
Query:  MASKVT-FLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE
        M SK T FL+LSLL+VS+R S+SEDPVP PWPLQFHSIL MN++GILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE
Subjt:  MASKVT-FLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE

Query:  VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQNLPVMD
        VGILRPNWLDGA YLGQRHV+GFLCNVWEKVDFIWYYEDVETKRPVHW+FYTGRQAHVMTFEVGAVLEDEKWQAPVYCFD GKG   NNGVLHQNLPVMD
Subjt:  VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQNLPVMD

Query:  GVLKGSLFPAI
        GVL+GSLFPAI
Subjt:  GVLKGSLFPAI

SwissProt top hits (e value, % identity, alignment)
Q67YC9 Uncharacterized protein At4g14100 (e value: 2.6e-83; % identity: 69.31)
Query:  VTFLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLEVGILR
        +  +I   ++  ++ + +++PVPTPWP QFH+++ MN++G L +IDLWYDW NGRNFNIIQ QLG + YDLEWNNGTSFFYTLD SK+C S QLEVGILR
Subjt:  VTFLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLEVGILR

Query:  PNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQ
        PNWLDGAKYLGQ++V GFLCNVWEKVDFIWYYEDVETKRPV W+FYTGR+AH+MT+EVGAVLEDEKWQAPVYCF+  K +    G L +
Subjt:  PNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQ

Arabidopsis top hits (e value, % identity, alignment)
AT3G23760.1 FUNCTIONS IN: molecular_function unknown (e value: 7.1e-81; % identity: 71.04)
Query:  TFLILSLLSVSIRYSIS------EDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE
        +F+++ LL +S+   IS      ++PVP  WP QFH+++LMN +G L+I+DLWYDW NGRNFNIIQ QLG + YDLEWNNGTSF+YTLD+SKTC +   E
Subjt:  TFLILSLLSVSIRYSIS------EDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLE

Query:  VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGK
        VGILRPNWLDGAKY+GQRHV+GFLCNVWEKV+F+WYYEDV TKRPV W+FYTGR+AHVMTFEVGAVLEDEKWQAPVYCF   K
Subjt:  VGILRPNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGK

AT4G14100.1 transferases, transferring glycosyl groups (e value: 1.8e-84; % identity: 69.31)
Query:  VTFLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLEVGILR
        +  +I   ++  ++ + +++PVPTPWP QFH+++ MN++G L +IDLWYDW NGRNFNIIQ QLG + YDLEWNNGTSFFYTLD SK+C S QLEVGILR
Subjt:  VTFLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLEVGILR

Query:  PNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQ
        PNWLDGAKYLGQ++V GFLCNVWEKVDFIWYYEDVETKRPV W+FYTGR+AH+MT+EVGAVLEDEKWQAPVYCF+  K +    G L +
Subjt:  PNWLDGAKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQ


Sequences
CDS sequence
ATGGCTTCCAAGGTGACCTTCCTTATCCTTTCATTACTGAGTGTTAGTATACGTTATTCCATATCGGAGGATCCCGTTCCAACCCCATGGCCTCTCCAATTTCACTCCAT
TCTCCTCATGAATCACACCGGAATTCTGCAGATAATCGACCTCTGGTACGACTGGCCTAATGGTCGCAACTTCAACATCATCCAACACCAGCTCGGCAACGTTCTCTACG
ACCTCGAATGGAACAACGGCACTTCCTTCTTCTACACTTTGGACTCTTCTAAGACTTGCTCTTCCGCTCAGCTTGAGGTTGGTATTCTACGACCCAATTGGCTCGACGGA
GCCAAATATCTGGGTCAACGCCATGTCGATGGCTTCCTCTGTAATGTCTGGGAGAAGGTCGATTTCATCTGGTATTACGAGGATGTTGAGACCAAGAGACCCGTTCATTG
GCTCTTCTACACTGGAAGACAGGCTCATGTGATGACATTTGAAGTTGGTGCTGTGCTGGAAGATGAGAAATGGCAAGCCCCTGTCTACTGCTTTGACGGAGGCAAGGGAA
CTAGTGTCAATAATGGGGTTCTCCACCAGAATTTGCCTGTAATGGATGGAGTTCTAAAGGGTTCTCTATTCCCCGCCATTTGA
mRNA sequence
GGTAGGACCAATTTTGCATTTTAACTAAATACGAAACTAAAAAAATGGTCAAATGAGAAATTATGATACAGCAATTGAAATGAGCACAATCGTAGTCAGCACTGAGTAGT
TTTTTTTTTGTAAAGAAATATTCCAACAAAAGATTAAAAAAAGAAAAAAAAAAGTGGGTTTTGATTGGAGGAGAAGAAAATAAATTGCCGAAAGCGCGAGAGAGGAGAGA
GAACCTAGAAACCATGGCTTCCAAGGTGACCTTCCTTATCCTTTCATTACTGAGTGTTAGTATACGTTATTCCATATCGGAGGATCCCGTTCCAACCCCATGGCCTCTCC
AATTTCACTCCATTCTCCTCATGAATCACACCGGAATTCTGCAGATAATCGACCTCTGGTACGACTGGCCTAATGGTCGCAACTTCAACATCATCCAACACCAGCTCGGC
AACGTTCTCTACGACCTCGAATGGAACAACGGCACTTCCTTCTTCTACACTTTGGACTCTTCTAAGACTTGCTCTTCCGCTCAGCTTGAGGTTGGTATTCTACGACCCAA
TTGGCTCGACGGAGCCAAATATCTGGGTCAACGCCATGTCGATGGCTTCCTCTGTAATGTCTGGGAGAAGGTCGATTTCATCTGGTATTACGAGGATGTTGAGACCAAGA
GACCCGTTCATTGGCTCTTCTACACTGGAAGACAGGCTCATGTGATGACATTTGAAGTTGGTGCTGTGCTGGAAGATGAGAAATGGCAAGCCCCTGTCTACTGCTTTGAC
GGAGGCAAGGGAACTAGTGTCAATAATGGGGTTCTCCACCAGAATTTGCCTGTAATGGATGGAGTTCTAAAGGGTTCTCTATTCCCCGCCATTTGAAACCTTTTTTTTTT
TTCTTCCCTGCCTAGTCCCATCTCGCTGAACTTGTTGCACAATTTCAATAAATTAGACACTGATGTTTATATTTATATCTTCATGTTTGTTCTTGTTCAAGTAAATATGG
AGTTCCCAATTATCCTTCCACACCCTATCATTGGTAAATTTCTAGTGTATATATTATACTACAAAAATAACATGAACAGCTACTATTCGTAGCGTTCTGTATAAACTATT
AACTATAGTACAAGATGTAGTGAGAAAGTATGACTTATCGTCTGCTAAGCTCCTCGATTGAACGATTCTTCCTGCAGGGTGTTCTAACGTTCTAATCAGCATTCTGACTA
AACTCTTGCTGCTCAAGCTTTGCAATCAAGGCCTTCTTTTCGTCCTCTCCAAGTGCATTGAACATGAAAACAGATTTGTCGCCAATATCATCCTGGAAATGAATATGAAA
CCAGTCGGACAATGCAACTACATATTGATGGTGTTCATTTTTCA
Protein sequence
MASKVTFLILSLLSVSIRYSISEDPVPTPWPLQFHSILLMNHTGILQIIDLWYDWPNGRNFNIIQHQLGNVLYDLEWNNGTSFFYTLDSSKTCSSAQLEVGILRPNWLDG
AKYLGQRHVDGFLCNVWEKVDFIWYYEDVETKRPVHWLFYTGRQAHVMTFEVGAVLEDEKWQAPVYCFDGGKGTSVNNGVLHQNLPVMDGVLKGSLFPAI
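
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the function name `translate` and the variable names are illustrative and not part of CuGenDB. Only the first 18 bases of the CDS are used in the example.

```python
# Build the standard genetic code (assumed: NCBI translation table 1).
# Codons are enumerated in T, C, A, G order to match the amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    # Walk the sequence in steps of three; ignore a trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the CDS sequence listed above.
cds_prefix = "ATGGCTTCCAAGGTGACC"
print(translate(cds_prefix))  # MASKVT
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence listed above gives a quick consistency check on a copied record.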