; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0011709 (gene) of Snake gourd v1 genome

Gene IDTan0011709
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionNifU domain-containing protein
Genome locationLG08:74595161..74603227
RNA-Seq ExpressionTan0011709
SyntenyTan0011709
Gene Ontology termsGO:0006631 - fatty acid metabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0031969 - chloroplast membrane (cellular component)
InterPro domainsIPR021788 - Protein CHAPERONE-LIKE PROTEIN OF POR1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031803.1 uncharacterized protein E6C27_scaffold848G00460 [Cucumis melo var. makuwa]3.7e-14393.36Show/hide
Query:  MALAVSNIFHCPKFRLSQRQLHSKCFVLQLQPSSIRFREITRERRMVICSAASAAGSSSPDSDFNPYE-VLGVNPIEGFDMVKAAYTKKRKEAERIGDEA
        MALAVSNIFHCPK RLSQRQ HSK  VLQL+ SSIR REI+RERRM+ICSAASAAGSS+PDSDFNPYE VLGVNPIEGFDMVKAAYTKKR+EAERIGDEA
Subjt:  MALAVSNIFHCPKFRLSQRQLHSKCFVLQLQPSSIRFREITRERRMVICSAASAAGSSSPDSDFNPYE-VLGVNPIEGFDMVKAAYTKKRKEAERIGDEA

Query:  TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKA
        TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSP+DMQINMAISAVFTAWVLIKR+AEYKPLQFLAF FVYRIFEKLKA
Subjt:  TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKA

Query:  FEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR
        FEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAV SLAYTG+LNFIEFMG YIPVFLYNNQELL+TSSSALMLYIMASYYR
Subjt:  FEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR

KAG6607296.1 hypothetical protein SDJN03_00638, partial [Cucurbita argyrosperma subsp. sororia]1.1e-13991.58Show/hide
Query:  MALAVSNIFHCPKFRLSQRQLHSKCFVLQLQPSSIRFREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT
        MALA S IFHCPK RLSQ Q+HSKC VLQL  SSIRFREIT+ERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAER GDEA 
Subjt:  MALAVSNIFHCPKFRLSQRQLHSKCFVLQLQPSSIRFREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKG+TFGSVKVSKDIKYADNQPIVPWGPR SKSSP+DMQINMAISAVFTAW L    AEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKR+LRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPV LYN+QELLITSSSA+MLY MASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR

XP_004150624.1 uncharacterized protein LOC101213790 [Cucumis sativus]8.3e-14393.68Show/hide
Query:  MALAVSNIFHCPKFRLSQRQLHSKCFVLQLQPSSIRFREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT
        MALAVSNIFHCPK RLSQRQ HSK  VLQL  SSIR REITRERRMVICSAASAAGSS+PDSD NPYEVLGVNPIEGFDMVKAAYTKKR+EAERIGDEAT
Subjt:  MALAVSNIFHCPKFRLSQRQLHSKCFVLQLQPSSIRFREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSP+DMQINMAISAVFTAWVLIK SAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAV SL YTGILNFIEF+G+YIP FLYNNQELL+TSSSALMLYIMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR

XP_022150965.1 uncharacterized protein LOC111018989 [Momordica charantia]9.2e-14292.98Show/hide
Query:  MALAVSNIFHCPKFRLSQRQLHSKCFVLQLQPSSIRFREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT
        MALAVSNIFHCPK R+SQRQ HSK  V  LQ SSIRFREITRERR VI +AASAAGSSSP+SDFNPYEV+ VNPIEGFDM+KAAYTKKRKEAER+GDEAT
Subjt:  MALAVSNIFHCPKFRLSQRQLHSKCFVLQLQPSSIRFREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR
        EP VSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEF+GSYIP FLYNNQELLITS+SALMLYIMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR

XP_038893848.1 uncharacterized protein LOC120082659 [Benincasa hispida]3.4e-14495.09Show/hide
Query:  MALAVSNIFHCPKFRLSQRQLHSKCFVLQLQPSSIRFREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT
        MALAVSNIF+CPK RLS+RQLH K  VLQLQ SSIR REITRERR VICSAASAAGSSS DSDFNPYEVLGVNPIEGFDMVKAAYTKKR+EAERIGDEAT
Subjt:  MALAVSNIFHCPKFRLSQRQLHSKCFVLQLQPSSIRFREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSK SPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFMG YIPVFLYNNQELLITSSSA+MLYIMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR

TrEMBL top hitse value%identityAlignment
A0A0A0M0Q8 Uncharacterized protein4.0e-14393.68Show/hide
Query:  MALAVSNIFHCPKFRLSQRQLHSKCFVLQLQPSSIRFREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT
        MALAVSNIFHCPK RLSQRQ HSK  VLQL  SSIR REITRERRMVICSAASAAGSS+PDSD NPYEVLGVNPIEGFDMVKAAYTKKR+EAERIGDEAT
Subjt:  MALAVSNIFHCPKFRLSQRQLHSKCFVLQLQPSSIRFREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSP+DMQINMAISAVFTAWVLIK SAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAV SL YTGILNFIEF+G+YIP FLYNNQELL+TSSSALMLYIMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR

A0A5A7SNI3 Uncharacterized protein1.8e-14393.36Show/hide
Query:  MALAVSNIFHCPKFRLSQRQLHSKCFVLQLQPSSIRFREITRERRMVICSAASAAGSSSPDSDFNPYE-VLGVNPIEGFDMVKAAYTKKRKEAERIGDEA
        MALAVSNIFHCPK RLSQRQ HSK  VLQL+ SSIR REI+RERRM+ICSAASAAGSS+PDSDFNPYE VLGVNPIEGFDMVKAAYTKKR+EAERIGDEA
Subjt:  MALAVSNIFHCPKFRLSQRQLHSKCFVLQLQPSSIRFREITRERRMVICSAASAAGSSSPDSDFNPYE-VLGVNPIEGFDMVKAAYTKKRKEAERIGDEA

Query:  TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKA
        TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSP+DMQINMAISAVFTAWVLIKR+AEYKPLQFLAF FVYRIFEKLKA
Subjt:  TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKA

Query:  FEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR
        FEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAV SLAYTG+LNFIEFMG YIPVFLYNNQELL+TSSSALMLYIMASYYR
Subjt:  FEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR

A0A6J1D9X7 uncharacterized protein LOC1110189894.5e-14292.98Show/hide
Query:  MALAVSNIFHCPKFRLSQRQLHSKCFVLQLQPSSIRFREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT
        MALAVSNIFHCPK R+SQRQ HSK  V  LQ SSIRFREITRERR VI +AASAAGSSSP+SDFNPYEV+ VNPIEGFDM+KAAYTKKRKEAER+GDEAT
Subjt:  MALAVSNIFHCPKFRLSQRQLHSKCFVLQLQPSSIRFREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR
        EP VSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEF+GSYIP FLYNNQELLITS+SALMLYIMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR

A0A6J1GBN1 uncharacterized protein LOC1114526989.3e-14091.58Show/hide
Query:  MALAVSNIFHCPKFRLSQRQLHSKCFVLQLQPSSIRFREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT
        MALA S IFHCPK RLSQ Q HSKC VLQL  SSIRFREIT+ERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAER GDEA 
Subjt:  MALAVSNIFHCPKFRLSQRQLHSKCFVLQLQPSSIRFREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKG+TFGSVKVSKDIKYADNQPIVPWGPR SKSSP+DMQINMAISAVFTAW L    AEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKR+LRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPV LYN+QELLITSSSA+MLY MASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR

A0A6J1KH23 uncharacterized protein LOC1114931473.0e-13891.23Show/hide
Query:  MALAVSNIFHCPKFRLSQRQLHSKCFVLQLQPSSIRFREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT
        MALA S IFHCPK RLSQ+  HSK  VLQL  SSIR REITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAER GDEA 
Subjt:  MALAVSNIFHCPKFRLSQRQLHSKCFVLQLQPSSIRFREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKG+TFGSVKVSKDIKYADNQPIVPWGPR SKSSP+DMQINMAISAVFTAW LI   AEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKR+LRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPV LYN+QELLITSSSA+MLY MASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G08640.1 Chloroplast J-like domain 11.2e-9469.96Show/hide
Query:  ERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEATAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPI
        + R+VI +A+SAAG+   D+DFNPYEVLGVNPIEGFD +K  Y +K K+A+R GDEATAA LEKAYDK+M AQ  NRKKGVTFGS KVSKDIKYAD QPI
Subjt:  ERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEATAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPI

Query:  VPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAFEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLA
        +PWGPR S+SS  DM IN+AIS VF+AW+ IKR+ EYKPLQF++F FVYRIFEKLK+FE   SP + E+GE+SGRG+RMGKRLLRSL+LVFG I ++SLA
Subjt:  VPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAFEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLA

Query:  YTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR
        YTG LN IE+MG  IP+ LYNNQEL++T+SSA MLY++AS+YR
Subjt:  YTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTTAGCAGTCTCCAACATCTTCCACTGCCCTAAATTTCGGCTTTCTCAGAGGCAGTTGCATTCCAAATGCTTCGTTTTGCAGTTACAGCCGTCGTCAATCAGGTT
TAGAGAAATAACACGAGAGAGAAGGATGGTTATTTGCTCAGCAGCTTCTGCTGCAGGAAGTTCTAGTCCTGACAGTGACTTCAACCCGTATGAGGTTCTGGGTGTGAACC
CAATTGAGGGATTTGATATGGTCAAAGCAGCATATACTAAAAAACGCAAGGAGGCTGAGAGGATAGGTGATGAAGCAACTGCAGCCAGACTGGAGAAAGCTTATGACAAG
GTTATGATGGCACAATTCACAAACCGGAAGAAGGGTGTCACTTTTGGCTCAGTGAAGGTTTCTAAGGACATCAAGTATGCTGATAACCAGCCAATTGTACCATGGGGACC
AAGGTCTTCAAAGTCGAGCCCCAAAGATATGCAAATTAATATGGCAATATCTGCTGTATTTACTGCTTGGGTCCTTATCAAACGCAGTGCTGAATATAAACCTCTGCAGT
TCTTGGCATTTGCTTTTGTTTATCGGATTTTTGAAAAGCTGAAAGCTTTTGAACCAGCTGTATCCCCCTCATTTACAGAAGATGGGGAAGATTCAGGACGAGGTATACGG
ATGGGAAAGCGGTTACTACGATCTCTTGCTCTAGTGTTTGGATGTATTGCTGTTTCCTCTTTGGCATATACTGGTATTTTGAATTTCATTGAGTTCATGGGCAGCTATAT
TCCAGTTTTTCTCTACAATAACCAGGAACTATTGATCACCAGTTCATCAGCTCTCATGCTTTACATCATGGCATCTTACTACAGATGA
mRNA sequenceShow/hide mRNA sequence
TCGTCTCTTTCACCCGCCTTTGACACAGAGCTGAGCAATGGCGTTAGCAGTCTCCAACATCTTCCACTGCCCTAAATTTCGGCTTTCTCAGAGGCAGTTGCATTCCAAAT
GCTTCGTTTTGCAGTTACAGCCGTCGTCAATCAGGTTTAGAGAAATAACACGAGAGAGAAGGATGGTTATTTGCTCAGCAGCTTCTGCTGCAGGAAGTTCTAGTCCTGAC
AGTGACTTCAACCCGTATGAGGTTCTGGGTGTGAACCCAATTGAGGGATTTGATATGGTCAAAGCAGCATATACTAAAAAACGCAAGGAGGCTGAGAGGATAGGTGATGA
AGCAACTGCAGCCAGACTGGAGAAAGCTTATGACAAGGTTATGATGGCACAATTCACAAACCGGAAGAAGGGTGTCACTTTTGGCTCAGTGAAGGTTTCTAAGGACATCA
AGTATGCTGATAACCAGCCAATTGTACCATGGGGACCAAGGTCTTCAAAGTCGAGCCCCAAAGATATGCAAATTAATATGGCAATATCTGCTGTATTTACTGCTTGGGTC
CTTATCAAACGCAGTGCTGAATATAAACCTCTGCAGTTCTTGGCATTTGCTTTTGTTTATCGGATTTTTGAAAAGCTGAAAGCTTTTGAACCAGCTGTATCCCCCTCATT
TACAGAAGATGGGGAAGATTCAGGACGAGGTATACGGATGGGAAAGCGGTTACTACGATCTCTTGCTCTAGTGTTTGGATGTATTGCTGTTTCCTCTTTGGCATATACTG
GTATTTTGAATTTCATTGAGTTCATGGGCAGCTATATTCCAGTTTTTCTCTACAATAACCAGGAACTATTGATCACCAGTTCATCAGCTCTCATGCTTTACATCATGGCA
TCTTACTACAGATGATATTATTCAGCATGAATCATTCAATCTCGCAGCCAACTCGCAATGAAATCAGTTTTGATATATTGTTGATTTGGTAATCTAGTTTTGTAAAATGA
GGCACAGTACTGATGTCATTATCGAGGAAATGTAATGCTAGCTTCAGAAATTTTATGTTCAATGCGATATATACGTCAGCTATCGTATATAAAGAGGTCACTTTAACTTT
TTCATACTGATATAAACATTCTCATGGGGATGAAGTTTATGGTTTTAAAGGGTCTGCAGGAGTTGAAAAGATAGGATATCTAAAAAGAGCTTGTGCTTCAACTGGAGTCC
TGCAATTTCTCTGCATTTGGTTGGAAAATTCAAGTAAAATTGGCATCATATTCCAGTTAAAACTATAGAACACCTTTCCAGATCTTGTCGTTTTCGTCCCGATCGCTTGC
GAAAGTCCCAGATAAGTTCTCTAACTCAATCCTATGTTCTTTGATTGAAAACTCTTCTACGTTTTCCCCCACTATTATGAAACTGTAAGCTTATGTCCATTCATCGGATC
AATCTTATTAGGTATTTCATTTAATGATACTTTGAATGTTTGTTTTTATCGTAAGTAATAAGTATATTGTTTTAGTCGCAGCTCTACCTGCAAATTGTGTGCAT
Protein sequenceShow/hide protein sequence
MALAVSNIFHCPKFRLSQRQLHSKCFVLQLQPSSIRFREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEATAARLEKAYDK
VMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAFEPAVSPSFTEDGEDSGRGIR
MGKRLLRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVFLYNNQELLITSSSALMLYIMASYYR