; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017226 (gene) of Snake gourd v1 genome

Gene IDTan0017226
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPhospho-N-acetylmuramoyl-pentapeptide-transferase
Genome locationLG11:8520749..8537404
RNA-Seq ExpressionTan0017226
SyntenyTan0017226
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009943 - Protein of unknown function DUF1475


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578860.1 hypothetical protein SDJN03_23308, partial [Cucurbita argyrosperma subsp. sororia]9.5e-10883.97Show/hide
Query:  MASSAVIGWRILFVLLGFTMVATLAYTITIDGSPFRTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS
        MASSAVIGWRILF+LLG TMVATL YT+  DGSPFR ELLSRLMV VLIDFY NVIVIAAWVCYKESNWIAA +WIVFLVCLGSIATCAYIL QLWQLSS
Subjt:  MASSAVIGWRILFVLLGFTMVATLAYTITIDGSPFRTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS

Query:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYVLSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWIV
        QESFEDI+YN+LIK+PNK+ MQQH+KHSNI+  KI+ GALGCLMVV L ++ + GSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTAL WI 
Subjt:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYVLSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWIV

Query:  LLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR
        L II GS SSC FIVKELFKLNSEDPAYL+LFK+SNR
Subjt:  LLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR

XP_022992682.1 uncharacterized protein LOC111488951 isoform X1 [Cucurbita maxima]8.0e-10783.54Show/hide
Query:  MASSAVIGWRILFVLLGFTMVATLAYTITIDGSPFRTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS
        MASSAVIGWRILF+LLG TMVATL YT+  DGSPFR ELLSRLMV VLIDFY NVIVIAAWVCYKESNWIAA +WIVFLVCLGSIATCAYIL QLWQLSS
Subjt:  MASSAVIGWRILFVLLGFTMVATLAYTITIDGSPFRTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS

Query:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYVLSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWIV
        QESFEDI+YN+LIK+PNK+ MQQH+KHSNI+  KI+ GALGC MVV L ++ S GSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTAL WI 
Subjt:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYVLSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWIV

Query:  LLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR
        L II GS S C FIVKELFKLNSEDPAYL+LFK+SNR
Subjt:  LLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR

XP_023551243.1 uncharacterized protein LOC111809123 isoform X1 [Cucurbita pepo subsp. pepo]1.6e-10783.54Show/hide
Query:  MASSAVIGWRILFVLLGFTMVATLAYTITIDGSPFRTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS
        MASSAVIGWRILF+LLG TMVATL YT+  DGSPFR ELLSRLMV VLIDFY NVIVIAAWVCYKESNWIAA +WIVFLVCLGSIATCAYIL QLWQLSS
Subjt:  MASSAVIGWRILFVLLGFTMVATLAYTITIDGSPFRTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS

Query:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYVLSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWIV
        QESFEDI+YN+LIK+PNK+ MQQH+KHSNI+  KI+ GALGCLMVV L ++ + GSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTAL WI 
Subjt:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYVLSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWIV

Query:  LLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR
        L +I GS SSC FIVKELFKLNSEDPAYL+LFK+SNR
Subjt:  LLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR

XP_023551244.1 uncharacterized protein LOC111809123 isoform X2 [Cucurbita pepo subsp. pepo]1.6e-10783.54Show/hide
Query:  MASSAVIGWRILFVLLGFTMVATLAYTITIDGSPFRTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS
        MASSAVIGWRILF+LLG TMVATL YT+  DGSPFR ELLSRLMV VLIDFY NVIVIAAWVCYKESNWIAA +WIVFLVCLGSIATCAYIL QLWQLSS
Subjt:  MASSAVIGWRILFVLLGFTMVATLAYTITIDGSPFRTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS

Query:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYVLSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWIV
        QESFEDI+YN+LIK+PNK+ MQQH+KHSNI+  KI+ GALGCLMVV L ++ + GSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTAL WI 
Subjt:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYVLSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWIV

Query:  LLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR
        L +I GS SSC FIVKELFKLNSEDPAYL+LFK+SNR
Subjt:  LLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR

XP_023551245.1 uncharacterized protein LOC111809123 isoform X3 [Cucurbita pepo subsp. pepo]1.6e-10783.54Show/hide
Query:  MASSAVIGWRILFVLLGFTMVATLAYTITIDGSPFRTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS
        MASSAVIGWRILF+LLG TMVATL YT+  DGSPFR ELLSRLMV VLIDFY NVIVIAAWVCYKESNWIAA +WIVFLVCLGSIATCAYIL QLWQLSS
Subjt:  MASSAVIGWRILFVLLGFTMVATLAYTITIDGSPFRTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS

Query:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYVLSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWIV
        QESFEDI+YN+LIK+PNK+ MQQH+KHSNI+  KI+ GALGCLMVV L ++ + GSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTAL WI 
Subjt:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYVLSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWIV

Query:  LLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR
        L +I GS SSC FIVKELFKLNSEDPAYL+LFK+SNR
Subjt:  LLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR

TrEMBL top hitse value%identityAlignment
A0A6J1FFW0 uncharacterized protein LOC111445346 isoform X12.5e-10683.12Show/hide
Query:  MASSAVIGWRILFVLLGFTMVATLAYTITIDGSPFRTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS
        MASSAVIGWRILF+LLG TMVATL YT+  DGSPFR ELLSRLMV  LIDFY NVIVIAAWVCYKESNWIAA IWIVFLVCLGSIATCAYIL  LWQLSS
Subjt:  MASSAVIGWRILFVLLGFTMVATLAYTITIDGSPFRTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS

Query:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYVLSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWIV
        QESFEDI+YN+L K+PNK+ MQQH+KHSNI+  KI+ GALGCLMVV L ++ + GSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTAL WI 
Subjt:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYVLSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWIV

Query:  LLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR
        L II GS SSC FIVKELFKLNSEDPAYL+LFK+SNR
Subjt:  LLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR

A0A6J1FLM6 uncharacterized protein LOC111445346 isoform X32.5e-10683.12Show/hide
Query:  MASSAVIGWRILFVLLGFTMVATLAYTITIDGSPFRTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS
        MASSAVIGWRILF+LLG TMVATL YT+  DGSPFR ELLSRLMV  LIDFY NVIVIAAWVCYKESNWIAA IWIVFLVCLGSIATCAYIL  LWQLSS
Subjt:  MASSAVIGWRILFVLLGFTMVATLAYTITIDGSPFRTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS

Query:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYVLSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWIV
        QESFEDI+YN+L K+PNK+ MQQH+KHSNI+  KI+ GALGCLMVV L ++ + GSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTAL WI 
Subjt:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYVLSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWIV

Query:  LLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR
        L II GS SSC FIVKELFKLNSEDPAYL+LFK+SNR
Subjt:  LLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR

A0A6J1JQK7 uncharacterized protein LOC111488951 isoform X13.9e-10783.54Show/hide
Query:  MASSAVIGWRILFVLLGFTMVATLAYTITIDGSPFRTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS
        MASSAVIGWRILF+LLG TMVATL YT+  DGSPFR ELLSRLMV VLIDFY NVIVIAAWVCYKESNWIAA +WIVFLVCLGSIATCAYIL QLWQLSS
Subjt:  MASSAVIGWRILFVLLGFTMVATLAYTITIDGSPFRTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS

Query:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYVLSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWIV
        QESFEDI+YN+LIK+PNK+ MQQH+KHSNI+  KI+ GALGC MVV L ++ S GSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTAL WI 
Subjt:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYVLSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWIV

Query:  LLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR
        L II GS S C FIVKELFKLNSEDPAYL+LFK+SNR
Subjt:  LLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR

A0A6J1JY71 uncharacterized protein LOC111488951 isoform X33.9e-10783.54Show/hide
Query:  MASSAVIGWRILFVLLGFTMVATLAYTITIDGSPFRTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS
        MASSAVIGWRILF+LLG TMVATL YT+  DGSPFR ELLSRLMV VLIDFY NVIVIAAWVCYKESNWIAA +WIVFLVCLGSIATCAYIL QLWQLSS
Subjt:  MASSAVIGWRILFVLLGFTMVATLAYTITIDGSPFRTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS

Query:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYVLSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWIV
        QESFEDI+YN+LIK+PNK+ MQQH+KHSNI+  KI+ GALGC MVV L ++ S GSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTAL WI 
Subjt:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYVLSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWIV

Query:  LLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR
        L II GS S C FIVKELFKLNSEDPAYL+LFK+SNR
Subjt:  LLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR

A0A6J1JZY5 uncharacterized protein LOC111488951 isoform X23.9e-10783.54Show/hide
Query:  MASSAVIGWRILFVLLGFTMVATLAYTITIDGSPFRTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS
        MASSAVIGWRILF+LLG TMVATL YT+  DGSPFR ELLSRLMV VLIDFY NVIVIAAWVCYKESNWIAA +WIVFLVCLGSIATCAYIL QLWQLSS
Subjt:  MASSAVIGWRILFVLLGFTMVATLAYTITIDGSPFRTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS

Query:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYVLSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWIV
        QESFEDI+YN+LIK+PNK+ MQQH+KHSNI+  KI+ GALGC MVV L ++ S GSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTAL WI 
Subjt:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYVLSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWIV

Query:  LLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR
        L II GS S C FIVKELFKLNSEDPAYL+LFK+SNR
Subjt:  LLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G22750.1 unknown protein3.9e-5142.86Show/hide
Query:  SSAVIGWRILFVLLGFTMVATLAYTITIDGSPF--RTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS
        +S V G +++  ++   M+ATL YTI  DG P   R ++ +   V  ++DFYIN++ IA W+ YKES W  + +W + L+  GS+ TC Y+  QL +L++
Subjt:  SSAVIGWRILFVLLGFTMVATLAYTITIDGSPF--RTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS

Query:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYV-LSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWI
        QE+ ED MY +L++D  KDG+    K+S ++  + + GALGC+M+  L Y   ++GSPF  EL  PWMV  L++FYI+   LSVW+ YKE S +  + W+
Subjt:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYV-LSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWI

Query:  VLLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR
         LLI LGS  + A IV +LF+L+  DP YL+L  +SNR
Subjt:  VLLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR

AT1G22750.2 unknown protein3.9e-5142.86Show/hide
Query:  SSAVIGWRILFVLLGFTMVATLAYTITIDGSPF--RTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS
        +S V G +++  ++   M+ATL YTI  DG P   R ++ +   V  ++DFYIN++ IA W+ YKES W  + +W + L+  GS+ TC Y+  QL +L++
Subjt:  SSAVIGWRILFVLLGFTMVATLAYTITIDGSPF--RTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS

Query:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYV-LSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWI
        QE+ ED MY +L++D  KDG+    K+S ++  + + GALGC+M+  L Y   ++GSPF  EL  PWMV  L++FYI+   LSVW+ YKE S +  + W+
Subjt:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYV-LSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWI

Query:  VLLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR
         LLI LGS  + A IV +LF+L+  DP YL+L  +SNR
Subjt:  VLLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR

AT1G22750.3 unknown protein3.9e-5142.86Show/hide
Query:  SSAVIGWRILFVLLGFTMVATLAYTITIDGSPF--RTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS
        +S V G +++  ++   M+ATL YTI  DG P   R ++ +   V  ++DFYIN++ IA W+ YKES W  + +W + L+  GS+ TC Y+  QL +L++
Subjt:  SSAVIGWRILFVLLGFTMVATLAYTITIDGSPF--RTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS

Query:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYV-LSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWI
        QE+ ED MY +L++D  KDG+    K+S ++  + + GALGC+M+  L Y   ++GSPF  EL  PWMV  L++FYI+   LSVW+ YKE S +  + W+
Subjt:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYV-LSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWI

Query:  VLLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR
         LLI LGS  + A IV +LF+L+  DP YL+L  +SNR
Subjt:  VLLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR

AT1G22750.4 unknown protein3.9e-5142.86Show/hide
Query:  SSAVIGWRILFVLLGFTMVATLAYTITIDGSPF--RTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS
        +S V G +++  ++   M+ATL YTI  DG P   R ++ +   V  ++DFYIN++ IA W+ YKES W  + +W + L+  GS+ TC Y+  QL +L++
Subjt:  SSAVIGWRILFVLLGFTMVATLAYTITIDGSPF--RTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSS

Query:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYV-LSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWI
        QE+ ED MY +L++D  KDG+    K+S ++  + + GALGC+M+  L Y   ++GSPF  EL  PWMV  L++FYI+   LSVW+ YKE S +  + W+
Subjt:  QESFEDIMYNILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYV-LSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWI

Query:  VLLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR
         LLI LGS  + A IV +LF+L+  DP YL+L  +SNR
Subjt:  VLLIILGSTSSCAFIVKELFKLNSEDPAYLILFKSSNR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAGCTCGGCGGTAATTGGGTGGAGGATTCTGTTCGTTCTACTGGGTTTTACAATGGTTGCAACTCTCGCATACACAATCACCATTGACGGCTCTCCTTTCCGCAC
AGAACTTCTCTCACGGTTAATGGTGGCAGTATTGATTGATTTCTATATCAATGTCATAGTTATTGCGGCATGGGTTTGCTATAAGGAATCAAACTGGATTGCTGCAACAA
TTTGGATAGTTTTTCTTGTATGTCTTGGCAGCATTGCTACTTGTGCCTACATTCTCCGGCAGTTGTGGCAACTTTCATCCCAGGAATCCTTTGAAGATATTATGTACAAT
ATTCTGATCAAGGACCCAAATAAGGATGGCATGCAGCAGCATAGGAAGCACTCCAATATTATGATTCCAAAAATAATTTCCGGTGCTTTGGGTTGCTTGATGGTGGTAAA
TTTGGCCTATGTTCTCAGTCATGGTTCACCTTTTCGCAAGGAGCTTTATACGCCCTGGATGGTGGCCACGCTGATCGATTTCTATATAAATGGCACTGCTTTATCAGTCT
GGATGTTCTATAAAGAAGAATCGTGGCTTACTGCGCTCTTTTGGATCGTTCTATTGATAATCTTGGGGAGCACCTCTTCATGTGCCTTCATTGTTAAGGAGCTATTCAAG
CTCAACTCCGAAGATCCAGCATACCTTATTTTATTCAAAAGTTCCAACAGGTAG
mRNA sequenceShow/hide mRNA sequence
AAACAAACAAGACCACGTAGACGGTGGACGCTTCCTTTCCATTAACTACCGTCGTCTTCATTTCAACTCCTCCAGTTGCTCTCGTCCTTCTACTCCCCATTCTCTCTCTC
TCAGAACCACTGCGTGTGTAATGGCGAGCTCGGCGGTAATTGGGTGGAGGATTCTGTTCGTTCTACTGGGTTTTACAATGGTTGCAACTCTCGCATACACAATCACCATT
GACGGCTCTCCTTTCCGCACAGAACTTCTCTCACGGTTAATGGTGGCAGTATTGATTGATTTCTATATCAATGTCATAGTTATTGCGGCATGGGTTTGCTATAAGGAATC
AAACTGGATTGCTGCAACAATTTGGATAGTTTTTCTTGTATGTCTTGGCAGCATTGCTACTTGTGCCTACATTCTCCGGCAGTTGTGGCAACTTTCATCCCAGGAATCCT
TTGAAGATATTATGTACAATATTCTGATCAAGGACCCAAATAAGGATGGCATGCAGCAGCATAGGAAGCACTCCAATATTATGATTCCAAAAATAATTTCCGGTGCTTTG
GGTTGCTTGATGGTGGTAAATTTGGCCTATGTTCTCAGTCATGGTTCACCTTTTCGCAAGGAGCTTTATACGCCCTGGATGGTGGCCACGCTGATCGATTTCTATATAAA
TGGCACTGCTTTATCAGTCTGGATGTTCTATAAAGAAGAATCGTGGCTTACTGCGCTCTTTTGGATCGTTCTATTGATAATCTTGGGGAGCACCTCTTCATGTGCCTTCA
TTGTTAAGGAGCTATTCAAGCTCAACTCCGAAGATCCAGCATACCTTATTTTATTCAAAAGTTCCAACAGGTAGTTGATTAGTCCGGTGGACGCCATTGGCTCTCTACGT
AGCTTTCTGGCTTAATAGAGGTAAAAACTCACTATGGATTCTTCTTGCATATTGTAATTTATGGTAGGTCAGAAAGAAGGTATGAGAGAACATCGTCCTCATAAGCTGCA
AAGGAAGTTAGAGGACTTTAGCTCATTCTGGTACGTATCGAAATGTTACATTAGAAGCAGTGTCCATAGGACTTTACCTCTTATGTTTCTCGATGATATAAGCTTGATGA
ATTTGTAGAAAAATTCCATATAGCACAATGTAACTAATAGAGGTGTATCTATTTTATATACGACCTTACTTGAACATGATCTATTTTATAGGTTAGGTTGTACAACGGTA
TCCTTGAGTTCTAAAAATTATATCTATGGTATATATCTCAAGTTTTCCTTTTCCATCCTCAAGAAAGGGAATAAACAGAAGTTGGTACCTCAACCATTTAAAGCTGCTCA
CGTCCTTCTCTTCTACTCCCTCACAAAGCTTCATTGCGAGTGCGAGGGTTCCAATGGCGAGCTCGGCGGTAATTGGATGGAGGATTCTGTTCGTTCTGCTGGGCTCTACA
ATGGTTGCAGCTCTCGCATACACAGTCTCCATTGATGGCTCTCCTTTCCGCAGAGAACATTACTCACGGTACTCTGTATTCTTCACTGTGCATTTTTCATTTTCTTTCTT
ATACATTCCGTCTTCTTCTTCATCTTGTTGCTTCAGTTACTCTGTTCATGATCATGCTCCTACTGATAATCTTAGGTATCTTTTATTGTGTTTAAAAAATGATCCTTCCT
ACGAGGCGAGTTTAAAGTGCTGGAAATCCATAGGCCTAGGGATTTTGGGATTGTTTGACAACTGCCACGAAGAATAAATAATTTTGTCAAGTTTAAATTGTAAATTCACT
TGCTGAATTTCGATGGTTTTGTCTATGGGGACATAGTTGAACTGCGGTGTTATTTAGAAAAAGATGACATGACATGTCTACTGAAAATTTAGATTGATTCAACACGAACT
AGTAGGCTAGGCACGTGGTTGAAGAAGTGGAAAATTGAGTAAAATTAAGTGAAACTTGTAACATACCGTAATATCATCACTTAGTCTATAGTGTCAATGATAAGGACTAT
CTAAAATAGCAATTCAAAGTTTTAGGAACATAATTGCACGGTATTATTTTATTATTTTTTTGATAAAACACCAAACTTTCATGGAGATAAAAATGGAAGAACACACAAGC
ATACAAAAGAAGAAGCCAACAAAAAAACGAGAGCCAACTAGAGAAAGAGGCTCCAGTCGTGTAAGATAAGACTTAAAGAATAGTTACAAAAAAGCCTCGACACCAAAGCC
TAAAGAGAAATATATAATATAACAAGGGATCACACATACATAAAAGAACTCTTTCTCCCCTTGGAGATTTTATTATTTCTCTCCCGTTACAGACTCCACAAAATAGCACA
AACCCCAATCTGCCGCAAAAATAAGTATTTCTCATGAAAAGGCGAACTAATGAGAAGCTCCATGAATTAGGGGTGTATATAAGCCGAGTTGGGTTGGGTTGAGGGGATTT
TTTGGACCAACCCAAAAGTTTGGGTTGGTCATTCCTTCAACTCAACCAACCCTATTCATGAGAGTAACCCAACCCAACCCAATGTTTTTCGGGTTGGGTTGGGTTGGATC
GCCAGGTTATTATTATTTTTTTATTTTTTACAATTTTTTAATTATTTAAAATATCTTTTTCTTAAATTAATAACTAAAATCACA
Protein sequenceShow/hide protein sequence
MASSAVIGWRILFVLLGFTMVATLAYTITIDGSPFRTELLSRLMVAVLIDFYINVIVIAAWVCYKESNWIAATIWIVFLVCLGSIATCAYILRQLWQLSSQESFEDIMYN
ILIKDPNKDGMQQHRKHSNIMIPKIISGALGCLMVVNLAYVLSHGSPFRKELYTPWMVATLIDFYINGTALSVWMFYKEESWLTALFWIVLLIILGSTSSCAFIVKELFK
LNSEDPAYLILFKSSNR