; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS024916 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS024916
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionVacuolar iron family transporter
Genome locationscaffold451:41888..44052
RNA-Seq ExpressionMS024916
SyntenyMS024916
Gene Ontology termsGO:0006880 - intracellular sequestering of iron ion (biological process)
GO:0030026 - cellular manganese ion homeostasis (biological process)
GO:0034755 - iron ion transmembrane transport (biological process)
GO:0071421 - manganese ion transmembrane transport (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0005381 - iron ion transmembrane transporter activity (molecular function)
GO:0005384 - manganese ion transmembrane transporter activity (molecular function)
InterPro domainsIPR008217 - Ccc1 family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156110.1 uncharacterized protein LOC111023075 [Momordica charantia]8.2e-13799.24Show/hide
Query:  MASSAAAAAAEALISPENKGNERPKEPWNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRAA
        MASSAAAAAAE LISPENKGNERPKEPWNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVS+ATGRDVAAERRAA
Subjt:  MASSAAAAAAEALISPENKGNERPKEPWNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRAA

Query:  AEWDVDNRRAQQQQLLLRHYQTLGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNESI
        AEWDVDNRRAQQQQLLLRHYQTLGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNESI
Subjt:  AEWDVDNRRAQQQQLLLRHYQTLGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNESI

Query:  KFGGACLLAALALVLLGIARAKIAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVEDET
        KFGGACLLAALALVLLGIARAKIAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVEDET
Subjt:  KFGGACLLAALALVLLGIARAKIAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVEDET

XP_022959342.1 uncharacterized protein LOC111460342 [Cucurbita moschata]2.0e-10677.65Show/hide
Query:  ASSAAAAAAEALISPEN--KGNERPKEPWNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRA
        ASS   AA+E LI  E+  KG +RP EPWNG+LAKSIVYGGLDAIVTCFSLIASISA+RHSAVDVLVLGFANLIADGISMGFGD+V++ T R V  + RA
Subjt:  ASSAAAAAAEALISPEN--KGNERPKEPWNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRA

Query:  AAEWDVDNRRAQQQQLLLRHYQTLGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNES
        A EWD+DNR   QQ LLL+HYQ+LGMDF+DASTVVNI+AKYKHI+VEEK     G AAPP++SK+RPWKNG+ TFGSFLAFGC+PLLSFI+LIPFTDNE+
Subjt:  AAEWDVDNRRAQQQQLLLRHYQTLGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNES

Query:  IKFGGACLLAALALVLLGIARAKIAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVEDET
        +KFGGAC+LA LALVLLG+ARAKIAAGNYGFSVA+TVLNGAVAA AAY+LGW LRNVAGVE+ET
Subjt:  IKFGGACLLAALALVLLGIARAKIAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVEDET

XP_023006691.1 uncharacterized protein LOC111499350 [Cucurbita maxima]6.4e-10576.52Show/hide
Query:  ASSAAAAAAEALISPEN--KGNERPKEPWNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRA
        AS+   AA+E LI  E+  KG +RP EPWNG+L KSIVYGGLDAIVTCFSLIASISA+RHSAVDVLVLGFANLIADGISMGFGD+V++ T R V+ + RA
Subjt:  ASSAAAAAAEALISPEN--KGNERPKEPWNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRA

Query:  AAEWDVDNRRAQQQQLLLRHYQTLGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNES
        AAEWD+DNR   Q  LLL+HYQ+LGMDF+DASTVVNI+AKYKHI+VEEK     G AAPP++SK+RPWKNG+ TFGSFLAFGC+PLLSFI+LIPFTDNE+
Subjt:  AAEWDVDNRRAQQQQLLLRHYQTLGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNES

Query:  IKFGGACLLAALALVLLGIARAKIAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVEDET
        +KFGGAC+LA LALVLLG+ARAKIAAGNYGFSVA+TVLNGAVAA AAY+LGW LRNVAGV++ET
Subjt:  IKFGGACLLAALALVLLGIARAKIAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVEDET

XP_023548049.1 uncharacterized protein LOC111806802 [Cucurbita pepo subsp. pepo]1.4e-10476.52Show/hide
Query:  ASSAAAAAAEALISPEN--KGNERPKEPWNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRA
        AS+   AA+E LI  E+  K  +RP EPWNG+LAKSIVYGGLDAIVTCFSLIASISA+RHSAVDVLVLGFANLIADGISMGFGD+V++ T R V  + RA
Subjt:  ASSAAAAAAEALISPEN--KGNERPKEPWNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRA

Query:  AAEWDVDNRRAQQQQLLLRHYQTLGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNES
        A EWD+DNR   QQ LLL+HYQ+LGMDF+DASTVVNI+AKYKHI+VEEK     G AAPP++SK+RPWKNG+ TFGSFLAFGC+PLLSFI+LIPFTDNE+
Subjt:  AAEWDVDNRRAQQQQLLLRHYQTLGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNES

Query:  IKFGGACLLAALALVLLGIARAKIAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVEDET
        +KFGGAC+LA LALVLLG+ARAKIAAGNYGFSVA+TVLNGAVAA AAY+LGW LRN AGVE+ET
Subjt:  IKFGGACLLAALALVLLGIARAKIAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVEDET

XP_038876440.1 protein CCC1-like [Benincasa hispida]5.4e-9671.1Show/hide
Query:  MASSAAAAAAEALISPENKGNERPKEP-WNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRA
        MA+S+           E    +RP E  W+GE+AKS+VYGGLDAIVTCFSLIASISA+RH+AVDVLVLGFANLIADGISMGFGD+V++ T R ++AE RA
Subjt:  MASSAAAAAAEALISPENKGNERPKEP-WNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRA

Query:  AAEWDVDNRRAQQQQLLLRHYQTLGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNES
        A EWDVDN     + LLL+HYQ+LGMDF+DASTVVNI++KYK I+VEEK     GMA PP ES+ RPWKNG+ TFGSFL FGC+PLLSFI+LIPFTDNES
Subjt:  AAEWDVDNRRAQQQQLLLRHYQTLGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNES

Query:  IKFGGACLLAALALVLLGIARAKIAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVEDE
        +KFGGAC+LA LALVLLGIARAKIAA NYGFS+A+TVLNGA+AA AAY LGW LRNV GVED+
Subjt:  IKFGGACLLAALALVLLGIARAKIAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVEDE

TrEMBL top hitse value%identityAlignment
A0A6J1DPD7 uncharacterized protein LOC1110230754.0e-13799.24Show/hide
Query:  MASSAAAAAAEALISPENKGNERPKEPWNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRAA
        MASSAAAAAAE LISPENKGNERPKEPWNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVS+ATGRDVAAERRAA
Subjt:  MASSAAAAAAEALISPENKGNERPKEPWNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRAA

Query:  AEWDVDNRRAQQQQLLLRHYQTLGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNESI
        AEWDVDNRRAQQQQLLLRHYQTLGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNESI
Subjt:  AEWDVDNRRAQQQQLLLRHYQTLGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNESI

Query:  KFGGACLLAALALVLLGIARAKIAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVEDET
        KFGGACLLAALALVLLGIARAKIAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVEDET
Subjt:  KFGGACLLAALALVLLGIARAKIAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVEDET

A0A6J1H491 uncharacterized protein LOC1114603429.6e-10777.65Show/hide
Query:  ASSAAAAAAEALISPEN--KGNERPKEPWNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRA
        ASS   AA+E LI  E+  KG +RP EPWNG+LAKSIVYGGLDAIVTCFSLIASISA+RHSAVDVLVLGFANLIADGISMGFGD+V++ T R V  + RA
Subjt:  ASSAAAAAAEALISPEN--KGNERPKEPWNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRA

Query:  AAEWDVDNRRAQQQQLLLRHYQTLGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNES
        A EWD+DNR   QQ LLL+HYQ+LGMDF+DASTVVNI+AKYKHI+VEEK     G AAPP++SK+RPWKNG+ TFGSFLAFGC+PLLSFI+LIPFTDNE+
Subjt:  AAEWDVDNRRAQQQQLLLRHYQTLGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNES

Query:  IKFGGACLLAALALVLLGIARAKIAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVEDET
        +KFGGAC+LA LALVLLG+ARAKIAAGNYGFSVA+TVLNGAVAA AAY+LGW LRNVAGVE+ET
Subjt:  IKFGGACLLAALALVLLGIARAKIAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVEDET

A0A6J1KYG2 uncharacterized protein LOC1114993503.1e-10576.52Show/hide
Query:  ASSAAAAAAEALISPEN--KGNERPKEPWNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRA
        AS+   AA+E LI  E+  KG +RP EPWNG+L KSIVYGGLDAIVTCFSLIASISA+RHSAVDVLVLGFANLIADGISMGFGD+V++ T R V+ + RA
Subjt:  ASSAAAAAAEALISPEN--KGNERPKEPWNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRA

Query:  AAEWDVDNRRAQQQQLLLRHYQTLGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNES
        AAEWD+DNR   Q  LLL+HYQ+LGMDF+DASTVVNI+AKYKHI+VEEK     G AAPP++SK+RPWKNG+ TFGSFLAFGC+PLLSFI+LIPFTDNE+
Subjt:  AAEWDVDNRRAQQQQLLLRHYQTLGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNES

Query:  IKFGGACLLAALALVLLGIARAKIAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVEDET
        +KFGGAC+LA LALVLLG+ARAKIAAGNYGFSVA+TVLNGAVAA AAY+LGW LRNVAGV++ET
Subjt:  IKFGGACLLAALALVLLGIARAKIAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVEDET

A0A6J5WJT0 Uncharacterized protein5.1e-8468.85Show/hide
Query:  KGNERPKEPWNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRAAAEWDVDNRRAQQQQL-LL
        K +ERPKEPW GE AKSI+Y GLDAIVTCFSLI+SISASR S+VDVLVLGFANL+ADGISMGFGDF+SS++ +DVAA+ +A  EWDV N    ++ + LL
Subjt:  KGNERPKEPWNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRAAAEWDVDNRRAQQQQL-LL

Query:  RHYQTLGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNESIKFGGACLLAALALVLLG
        R YQ LGMD NDA+TVVNI AKY +ILV EKM A +GM  P E   E+PWKNG+VTF +FL FG  PLLSFI+LIPFT+N+S+KF GAC+L+ALAL LLG
Subjt:  RHYQTLGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNESIKFGGACLLAALALVLLG

Query:  IARAKIAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVED
         A+AKIA  NY FSVAVT+ NGA+AA AAYALGW L+N+AG+E+
Subjt:  IARAKIAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVED

A0A7N2MDQ6 Uncharacterized protein3.0e-8468.2Show/hide
Query:  RPKEPWNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRAAAEWDVDNRRAQQQQLLLRHYQT
        RP+EPW G+  KSIVY GLDAIVTCFSLI+SISASR+S+VDVLVLGF+NL+ADGISMGFGDFVSS+T +DVAA+ RA   WDV N    +Q+ LLR YQ 
Subjt:  RPKEPWNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRAAAEWDVDNRRAQQQQLLLRHYQT

Query:  LGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNESIKFGGACLLAALALVLLGIARAK
        LGMD NDA+TVVNI AKYK I V+EKM A  G+  PP+E+ ++PWKNG+VTF +FL FG  PLLSFI+LIPFT+N+SIKF GAC+L+ALAL LLG+A+AK
Subjt:  LGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNESIKFGGACLLAALALVLLGIARAK

Query:  IAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVED
        IA  NY FS+A+T+  GA+AA AAY LG VL+NVAG++D
Subjt:  IAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G25190.1 Vacuolar iron transporter (VIT) family protein1.0e-0423.29Show/hide
Query:  KSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRAAAEWDVDNRRAQQQQLLLRHYQTLGMDFNDASTV
        ++ + G  D +VT  SL+  + + +     +L++GFA L+A   SM  G+FVS  T RD+                  Q +  + H  +L          
Subjt:  KSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRAAAEWDVDNRRAQQQQLLLRHYQTLGMDFNDASTV

Query:  VNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAF---GCVPLLSFIVLIPFTDNESIKFGGACLLAALALVLLGIARAKIAAGNYGF
                   ++E+           EE KER    G     S LAF     +PLL  +    F +N  ++     ++A +ALV+ G+  A +   +   
Subjt:  VNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAF---GCVPLLSFIVLIPFTDNESIKFGGACLLAALALVLLGIARAKIAAGNYGF

Query:  SVAVTVLNGAVAAGAAYAL
        S    V+ G +A    + L
Subjt:  SVAVTVLNGAVAAGAAYAL

AT4G27860.1 vacuolar iron transporter (VIT) family protein8.8e-0444.44Show/hide
Query:  PENKGNERPK--EPWNG----ELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANL
        P N  +E P   EP  G    E+ KSIVYGGL   +T    + S +AS  S ++VL LG ANL
Subjt:  PENKGNERPK--EPWNG----ELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANL

AT4G27860.2 vacuolar iron transporter (VIT) family protein8.8e-0444.44Show/hide
Query:  PENKGNERPK--EPWNG----ELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANL
        P N  +E P   EP  G    E+ KSIVYGGL   +T    + S +AS  S ++VL LG ANL
Subjt:  PENKGNERPK--EPWNG----ELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCATCCGCCGCCGCCGCCGCCGCCGAGGCGCTAATCTCGCCGGAAAATAAGGGAAACGAAAGGCCAAAGGAACCGTGGAACGGAGAGCTGGCAAAAAGCATCGT
TTATGGCGGTCTGGACGCCATTGTTACTTGTTTCTCTCTCATTGCTTCAATCTCCGCTAGCCGCCACTCCGCTGTGGACGTGCTGGTGCTTGGATTTGCGAACTTAATAG
CGGATGGAATATCGATGGGGTTTGGGGATTTCGTGTCCAGCGCCACCGGCAGGGACGTCGCCGCCGAGCGGAGGGCGGCGGCGGAGTGGGACGTCGACAACCGCCGTGCC
CAGCAGCAGCAGCTCCTCCTCCGCCACTACCAGACCCTCGGCATGGACTTTAACGACGCCTCTACGGTGGTGAACATAATAGCGAAGTACAAACACATCCTGGTGGAGGA
GAAGATGGCGGCGGAGAATGGCATGGCGGCGCCACCGGAGGAGAGCAAAGAGCGGCCGTGGAAGAACGGCATTGTGACATTCGGATCCTTCCTTGCGTTCGGCTGCGTCC
CACTACTCTCCTTCATCGTCCTCATCCCGTTCACCGATAACGAGTCCATCAAGTTCGGCGGAGCTTGCCTCCTGGCTGCGCTTGCTCTCGTGCTTCTCGGGATTGCAAGA
GCGAAGATTGCAGCCGGGAACTATGGCTTCTCCGTAGCAGTAACGGTGCTGAACGGCGCCGTTGCCGCGGGGGCGGCATATGCTCTCGGCTGGGTTTTGAGGAATGTGGC
CGGTGTTGAAGACGAGACGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCATCCGCCGCCGCCGCCGCCGCCGAGGCGCTAATCTCGCCGGAAAATAAGGGAAACGAAAGGCCAAAGGAACCGTGGAACGGAGAGCTGGCAAAAAGCATCGT
TTATGGCGGTCTGGACGCCATTGTTACTTGTTTCTCTCTCATTGCTTCAATCTCCGCTAGCCGCCACTCCGCTGTGGACGTGCTGGTGCTTGGATTTGCGAACTTAATAG
CGGATGGAATATCGATGGGGTTTGGGGATTTCGTGTCCAGCGCCACCGGCAGGGACGTCGCCGCCGAGCGGAGGGCGGCGGCGGAGTGGGACGTCGACAACCGCCGTGCC
CAGCAGCAGCAGCTCCTCCTCCGCCACTACCAGACCCTCGGCATGGACTTTAACGACGCCTCTACGGTGGTGAACATAATAGCGAAGTACAAACACATCCTGGTGGAGGA
GAAGATGGCGGCGGAGAATGGCATGGCGGCGCCACCGGAGGAGAGCAAAGAGCGGCCGTGGAAGAACGGCATTGTGACATTCGGATCCTTCCTTGCGTTCGGCTGCGTCC
CACTACTCTCCTTCATCGTCCTCATCCCGTTCACCGATAACGAGTCCATCAAGTTCGGCGGAGCTTGCCTCCTGGCTGCGCTTGCTCTCGTGCTTCTCGGGATTGCAAGA
GCGAAGATTGCAGCCGGGAACTATGGCTTCTCCGTAGCAGTAACGGTGCTGAACGGCGCCGTTGCCGCGGGGGCGGCATATGCTCTCGGCTGGGTTTTGAGGAATGTGGC
CGGTGTTGAAGACGAGACGTAG
Protein sequenceShow/hide protein sequence
MASSAAAAAAEALISPENKGNERPKEPWNGELAKSIVYGGLDAIVTCFSLIASISASRHSAVDVLVLGFANLIADGISMGFGDFVSSATGRDVAAERRAAAEWDVDNRRA
QQQQLLLRHYQTLGMDFNDASTVVNIIAKYKHILVEEKMAAENGMAAPPEESKERPWKNGIVTFGSFLAFGCVPLLSFIVLIPFTDNESIKFGGACLLAALALVLLGIAR
AKIAAGNYGFSVAVTVLNGAVAAGAAYALGWVLRNVAGVEDET