CuGenDBv2

Tan0012701 (gene) of Snake gourd v1 genome

Gene ID: Tan0012701
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Unknown protein
Genome location: LG01:12103589..12104362
RNA-Seq Expression: Tan0012701
Synteny: Tan0012701
Gene Ontology terms: GO:0032259 - methylation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domains: NA
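
The genome location string above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention), it can be split into its parts as follows. The names `parse_location` and `span_length` are hypothetical helpers, not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'SEQ:START..END' into (sequence name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    # Assumes 1-based inclusive coordinates (assumption, not stated on the page).
    return end - start + 1


name, start, end = parse_location("LG01:12103589..12104362")
print(name, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span for this gene is 774 bp.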


Homology
GenBank top hits (e value, % identity, alignment)
KAG6600586.1 hypothetical protein SDJN03_05819, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.2e-88; % identity: 76.67)
Query:  MDLFFFDKAEKVSNPIPRCNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFSDHSALTV--
        MDLFFF K EKV +P    NLLQ+  K FRFLELCFLL+FLSW LSRLPIA+ ISA++F KLFGF+A+PLFGFLLCNAII AL+AKP++FSD SA+T+  
Subjt:  MDLFFFDKAEKVSNPIPRCNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFSDHSALTV--

Query:  GDETDRIYEDLIEKSGSGNLPESLSEHEVEIVYQDKQIIAEAITSIDREIEVKSADPESESGLDHPKVILRTLSEKFKPKTQTEKLRRSETEKCRNLEHS
        GDE DRIY DLIEKSGS    + + E +VE VYQDKQIIAE I+SID +IE ++AD ESESG+DHPKVILRTLSEK KPKTQ+EKLRRSETEKCRNLEHS
Subjt:  GDETDRIYEDLIEKSGSGNLPESLSEHEVEIVYQDKQIIAEAITSIDREIEVKSADPESESGLDHPKVILRTLSEKFKPKTQTEKLRRSETEKCRNLEHS

Query:  RDILCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLH
         DIL YQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLH
Subjt:  RDILCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLH

KAG7031227.1 hypothetical protein SDJN02_05267, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.0e-88; % identity: 77.08)
Query:  MDLFFFDKAEKVSNPIPRCNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFSDHSALTV--
        MDLFFF K EKV +P    NLLQI  K FRFLELCFLL+FLSW LSRLPIA+ ISA++F KLFGF+A+PLFGFLLCNAII AL+AKP++FSD SA+T+  
Subjt:  MDLFFFDKAEKVSNPIPRCNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFSDHSALTV--

Query:  GDETDRIYEDLIEKSGSGNLPESLSEHEVEIVYQDKQIIAEAITSIDREIEVKSADPESESGLDHPKVILRTLSEKFKPKTQTEKLRRSETEKCRNLEHS
        GDE DRIY DLIEKSGS    + + E +VE VYQDKQIIAE I+SID +IE ++AD ESESG+DHPKVILRTLSEK KPKTQ+EKLRRSETEKCRNLEHS
Subjt:  GDETDRIYEDLIEKSGSGNLPESLSEHEVEIVYQDKQIIAEAITSIDREIEVKSADPESESGLDHPKVILRTLSEKFKPKTQTEKLRRSETEKCRNLEHS

Query:  RDILCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLH
         DIL YQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLH
Subjt:  RDILCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLH

XP_022136879.1 uncharacterized protein LOC111008461, partial [Momordica charantia] (e value: 7.7e-89; % identity: 77.33)
Query:  MDLFFFDKAEKVSNPIPR--CNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFSDHS-ALT
        MD FFFDKAEK    IPR  CNLLQI AK+FRFLELC LLVFLSW LSRLP AVRIS EYF KLF FVA+PLFGF+LCNAIIVALVAKPSQ S    A+ 
Subjt:  MDLFFFDKAEKVSNPIPR--CNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFSDHS-ALT

Query:  VGDETDRIYEDLIEKSGSGNLPESLSEHEVEIVYQDKQIIAEAITSIDREIEVKSADPESESGLDHPKVILRTLSEKFK---PKTQTEKLRRSETEKCRN
         G ETDR+YEDLI KSG+G+LPES SE + EIV+QDK+IIAEAI SIDR++EVKS DPE+ESG+DHPKVI RTLSEK K    KTQ+EKLRRSETEKCRN
Subjt:  VGDETDRIYEDLIEKSGSGNLPESLSEHEVEIVYQDKQIIAEAITSIDREIEVKSADPESESGLDHPKVILRTLSEKFK---PKTQTEKLRRSETEKCRN

Query:  -LEHSRDILCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLHCD
         LE S DI+ YQDDLSSEEFQRKIEAFIA+EKKFRREESSAIVLH D
Subjt:  -LEHSRDILCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLHCD

XP_022989466.1 uncharacterized protein LOC111486513 [Cucurbita maxima] (e value: 2.6e-89; % identity: 78.33)
Query:  MDLFFFDKAEKVSNPIPRCNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFSDHSALTV--
        MDLFFF K EK  +P    NLLQI  K FRFLELCFLL+FLSWSLSRLPIA+ ISA++F KLFGF+A+PLFGFLLCNAII AL+AKP++FSD SA+TV  
Subjt:  MDLFFFDKAEKVSNPIPRCNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFSDHSALTV--

Query:  GDETDRIYEDLIEKSGSGNLPESLSEHEVEIVYQDKQIIAEAITSIDREIEVKSADPESESGLDHPKVILRTLSEKFKPKTQTEKLRRSETEKCRNLEHS
        GDE DRIY DLIEKSGS    + + E EVEIVYQDKQIIAE I+SI  +IE ++AD ESESGLDHPKVILRTLSEK KPKTQ+EKLRRSETEKCRNLEHS
Subjt:  GDETDRIYEDLIEKSGSGNLPESLSEHEVEIVYQDKQIIAEAITSIDREIEVKSADPESESGLDHPKVILRTLSEKFKPKTQTEKLRRSETEKCRNLEHS

Query:  RDILCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLH
         DIL YQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLH
Subjt:  RDILCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLH

XP_038888396.1 uncharacterized protein LOC120078241 [Benincasa hispida] (e value: 1.0e-93; % identity: 79.44)
Query:  MDLFFFDKAEKVSNPIPRCNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQF---SDHSALT
        MD FFFDKAEKVSNPIPRCNLLQI AK FRFLEL FLL+FLSW LSRLPIA+ +SAEYFGKLF FVATPLFGFLLCNAIIVALVAKP+QF   +  ++ T
Subjt:  MDLFFFDKAEKVSNPIPRCNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQF---SDHSALT

Query:  VGDETDRIYEDLIEKSGSGNLPESLSEHEVEIVYQDKQIIAEAITSIDREIEVKSADPESESGLDHPKVILRTLSEKFKPK---TQTEKLRRSETEKCRN
           E DRIYEDLIEK+GSG+L +SLSE EVEIVYQDKQIIAE I SIDREIEVK+ D E+ESGL H KV+LRTLSEK   +   T+ EKLRRSETEKCRN
Subjt:  VGDETDRIYEDLIEKSGSGNLPESLSEHEVEIVYQDKQIIAEAITSIDREIEVKSADPESESGLDHPKVILRTLSEKFKPK---TQTEKLRRSETEKCRN

Query:  LEHSRDILCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAI-VLHCDG
        LEHS DIL +QDDLSSEEFQRKIEAFIAKEKKFRREESSAI VLHCDG
Subjt:  LEHSRDILCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAI-VLHCDG

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BT40 uncharacterized protein LOC103493205 (e value: 1.5e-85; % identity: 74.4)
Query:  MDLFFFDKAEKVSNPIPRCNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFSDHSALTVGD
        MD FFFDKAEK+S PIPRCNLLQI +K FRFLEL FLL+ LSW  SRLPIA+RISA+YF KLF F+ATPLFGFLLCNAIIVALVAKPSQFS  +     D
Subjt:  MDLFFFDKAEKVSNPIPRCNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFSDHSALTVGD

Query:  ETDRIYEDLIEKSGSG-NLPESLSEHEVEIVYQDKQIIAEAIT----SIDREIEVKSADPESESGLDHPKVILRTLSEKFK---PKTQTEKLRRSETEKC
        +TDRIYEDLIEK+G+G +L +S SE   EIVYQDK+IIAE       S D EIE+K+ D ES+SGL H KVILR+LSEK      KTQ+EKLRRSETEKC
Subjt:  ETDRIYEDLIEKSGSG-NLPESLSEHEVEIVYQDKQIIAEAIT----SIDREIEVKSADPESESGLDHPKVILRTLSEKFK---PKTQTEKLRRSETEKC

Query:  RNLEHSRDILCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAI-VLHCDG
        RNLE+S DIL YQDDLSSEEFQRKIEAFIAKEKKFRREESSAI VLHCDG
Subjt:  RNLEHSRDILCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAI-VLHCDG

A0A5D3CYU3 Putative TRNA--methyltransferase non-catalytic subunit trm6MTase subunit trm6 (e value: 1.5e-85; % identity: 74.4)
Query:  MDLFFFDKAEKVSNPIPRCNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFSDHSALTVGD
        MD FFFDKAEK+S PIPRCNLLQI +K FRFLEL FLL+ LSW  SRLPIA+RISA+YF KLF F+ATPLFGFLLCNAIIVALVAKPSQFS  +     D
Subjt:  MDLFFFDKAEKVSNPIPRCNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFSDHSALTVGD

Query:  ETDRIYEDLIEKSGSG-NLPESLSEHEVEIVYQDKQIIAEAIT----SIDREIEVKSADPESESGLDHPKVILRTLSEKFK---PKTQTEKLRRSETEKC
        +TDRIYEDLIEK+G+G +L +S SE   EIVYQDK+IIAE       S D EIE+K+ D ES+SGL H KVILR+LSEK      KTQ+EKLRRSETEKC
Subjt:  ETDRIYEDLIEKSGSG-NLPESLSEHEVEIVYQDKQIIAEAIT----SIDREIEVKSADPESESGLDHPKVILRTLSEKFK---PKTQTEKLRRSETEKC

Query:  RNLEHSRDILCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAI-VLHCDG
        RNLE+S DIL YQDDLSSEEFQRKIEAFIAKEKKFRREESSAI VLHCDG
Subjt:  RNLEHSRDILCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAI-VLHCDG

A0A6J1C5K3 uncharacterized protein LOC111008461 (e value: 3.7e-89; % identity: 77.33)
Query:  MDLFFFDKAEKVSNPIPR--CNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFSDHS-ALT
        MD FFFDKAEK    IPR  CNLLQI AK+FRFLELC LLVFLSW LSRLP AVRIS EYF KLF FVA+PLFGF+LCNAIIVALVAKPSQ S    A+ 
Subjt:  MDLFFFDKAEKVSNPIPR--CNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFSDHS-ALT

Query:  VGDETDRIYEDLIEKSGSGNLPESLSEHEVEIVYQDKQIIAEAITSIDREIEVKSADPESESGLDHPKVILRTLSEKFK---PKTQTEKLRRSETEKCRN
         G ETDR+YEDLI KSG+G+LPES SE + EIV+QDK+IIAEAI SIDR++EVKS DPE+ESG+DHPKVI RTLSEK K    KTQ+EKLRRSETEKCRN
Subjt:  VGDETDRIYEDLIEKSGSGNLPESLSEHEVEIVYQDKQIIAEAITSIDREIEVKSADPESESGLDHPKVILRTLSEKFK---PKTQTEKLRRSETEKCRN

Query:  -LEHSRDILCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLHCD
         LE S DI+ YQDDLSSEEFQRKIEAFIA+EKKFRREESSAIVLH D
Subjt:  -LEHSRDILCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLHCD

A0A6J1FRE0 uncharacterized protein LOC111447754 (e value: 3.1e-88; % identity: 76.67)
Query:  MDLFFFDKAEKVSNPIPRCNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFSDHSALTV--
        MDLFFF K EKV +P    NLLQI  K FRFLELCFLL+FLSW LSRLPIA+ ISA++F KLFGF+A+PLFGFLLCNAII AL+ KP++FSD SA+TV  
Subjt:  MDLFFFDKAEKVSNPIPRCNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFSDHSALTV--

Query:  GDETDRIYEDLIEKSGSGNLPESLSEHEVEIVYQDKQIIAEAITSIDREIEVKSADPESESGLDHPKVILRTLSEKFKPKTQTEKLRRSETEKCRNLEHS
        GDE DRIY DLIEKSGS    + + E +VE VY+DKQIIAE I+SID +IE ++AD ESESG+DHPKVILRTLSEK KPKTQ+EKLRRSETEKCRNLEHS
Subjt:  GDETDRIYEDLIEKSGSGNLPESLSEHEVEIVYQDKQIIAEAITSIDREIEVKSADPESESGLDHPKVILRTLSEKFKPKTQTEKLRRSETEKCRNLEHS

Query:  RDILCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLH
         DIL YQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLH
Subjt:  RDILCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLH

A0A6J1JK52 uncharacterized protein LOC111486513 (e value: 1.3e-89; % identity: 78.33)
Query:  MDLFFFDKAEKVSNPIPRCNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFSDHSALTV--
        MDLFFF K EK  +P    NLLQI  K FRFLELCFLL+FLSWSLSRLPIA+ ISA++F KLFGF+A+PLFGFLLCNAII AL+AKP++FSD SA+TV  
Subjt:  MDLFFFDKAEKVSNPIPRCNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFSDHSALTV--

Query:  GDETDRIYEDLIEKSGSGNLPESLSEHEVEIVYQDKQIIAEAITSIDREIEVKSADPESESGLDHPKVILRTLSEKFKPKTQTEKLRRSETEKCRNLEHS
        GDE DRIY DLIEKSGS    + + E EVEIVYQDKQIIAE I+SI  +IE ++AD ESESGLDHPKVILRTLSEK KPKTQ+EKLRRSETEKCRNLEHS
Subjt:  GDETDRIYEDLIEKSGSGNLPESLSEHEVEIVYQDKQIIAEAITSIDREIEVKSADPESESGLDHPKVILRTLSEKFKPKTQTEKLRRSETEKCRNLEHS

Query:  RDILCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLH
         DIL YQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLH
Subjt:  RDILCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLH

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT4G34560.1 unknown protein (e value: 9.2e-08; % identity: 27.04)
Query:  KAEKVSNPIPRCNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFS---DHSALTVGDETDR
        K EK S  I     ++    LFR  EL  L++ +S  LS  P +V+IS + F +   F+ +P F F + NAI++ L+AK  ++S   + S       ++ 
Subjt:  KAEKVSNPIPRCNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFS---DHSALTVGDETDR

Query:  IYEDLIEKSGSGNLPESLSEHEVEIVYQDKQIIAEAITSIDREIEVKSADPESESGLDHPKVILRTLSEKFKPKTQTEK-LRRSETEKCRNLEHSRDILC
        +Y++ + K          SE +  +VY  K    E +        V     +S    +  K +  T  EK   +  +EK +R  +++K   +   +    
Subjt:  IYEDLIEKSGSGNLPESLSEHEVEIVYQDKQIIAEAITSIDREIEVKSADPESESGLDHPKVILRTLSEKFKPKTQTEK-LRRSETEKCRNLEHSRDILC

Query:  YQDDLSSEEFQRKIEAFIAKEKKFRREESSAIV
         +D +S+E+F+ KIEAFIA++K+ +++E   I+
Subjt:  YQDDLSSEEFQRKIEAFIAKEKKFRREESSAIV

AT5G66440.1 unknown protein (e value: 4.4e-26; % identity: 36.29)
Query:  MDLFFFD--KAEKVSNPIPRCNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFSDHSALTV
        MD F FD  KAEK +  + R N    + + FR  E+C  L+F+ W+ S+LP  V+IS  +  ++   ++TPLF FLL N+I+V L+ K    SD +   V
Subjt:  MDLFFFD--KAEKVSNPIPRCNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFSDHSALTV

Query:  GDETDRIYEDLIEKSGSGNLP--ESLSEHEVEIVYQDKQIIAEAITS-----IDREIEVKSADPESESGLDHPKVILRTLSEKFKPKTQT-------EKL
              IY+  +    + + P  E L+E    IVY DKQ+I   + S     +D  +     D +S     H KV  R+ S+    ++           L
Subjt:  GDETDRIYEDLIEKSGSGNLP--ESLSEHEVEIVYQDKQIIAEAITS-----IDREIEVKSADPESESGLDHPKVILRTLSEKFKPKTQT-------EKL

Query:  RRSETEK-CRNLEHSRDI----LCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLH
        +RSETEK C+  E+  D        +DDLS+EEFQ+ IEAFIAK++ FRR+ES A+V+H
Subjt:  RRSETEK-CRNLEHSRDI----LCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLH
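
The alignments above use a three-line layout in which the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. As a rough sketch, assuming that BLAST-style convention and assuming identity is counted over all alignment columns including gaps, a percent identity can be estimated from the `Query:` line and the middle line. The function name `percent_identity` is hypothetical.

```python
def percent_identity(query_aln: str, midline: str) -> float:
    """Percent of alignment columns whose midline shows an identical residue."""
    columns = len(query_aln)
    if columns == 0:
        raise ValueError("empty alignment")
    # Pad or trim the midline, since trailing spaces are often stripped.
    midline = midline.ljust(columns)[:columns]
    # Letters mark identities; '+' and spaces do not count.
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / columns


# Final segment of the first GenBank hit, with the 'Query:  ' prefix removed.
query = "RDILCYQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLH"
mid = " DIL YQDDLSSEEFQRKIEAFIAKEKKFRREESSAIVLH"
print(round(percent_identity(query, mid), 2))
```

This covers only one segment; the % identity reported for a hit refers to the whole alignment, so the segments of a hit would need to be concatenated first.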


Sequences
CDS sequence
ATGGATTTGTTCTTCTTCGACAAAGCAGAGAAGGTTTCCAACCCTATTCCGAGATGCAATCTCCTCCAAATCGTCGCCAAATTGTTCCGATTCCTAGAGCTCTGCTTTTT
ACTCGTTTTCCTCTCATGGAGTCTCTCTCGGCTCCCGATCGCCGTCAGAATCTCCGCCGAGTACTTCGGAAAACTCTTCGGCTTCGTCGCCACTCCGCTCTTCGGATTCC
TCCTCTGTAACGCCATCATCGTCGCACTCGTAGCAAAACCTAGCCAATTCTCAGACCACAGCGCCCTAACCGTCGGCGACGAAACCGATCGGATTTACGAAGACCTAATT
GAAAAATCCGGAAGCGGAAACCTACCTGAATCGCTCTCGGAACACGAGGTGGAGATTGTGTACCAGGACAAACAGATCATCGCGGAGGCGATTACTTCAATTGACCGCGA
AATTGAAGTGAAGAGCGCGGATCCGGAGTCGGAATCTGGATTAGATCATCCGAAAGTGATTTTGAGGACGCTGTCAGAGAAATTTAAGCCGAAAACGCAGACTGAGAAGC
TCCGGCGATCGGAGACGGAGAAATGCCGGAATCTGGAGCATTCGCGTGACATTTTGTGCTACCAGGACGATTTGAGTAGCGAGGAGTTTCAGAGGAAGATTGAGGCGTTC
ATAGCGAAAGAGAAGAAGTTTCGACGGGAAGAATCTTCCGCCATTGTACTCCATTGCGACGGGTAA
mRNA sequence
CTCTGTTTCTCTCTGAAATTATCAAATGGATTTGTTCTTCTTCGACAAAGCAGAGAAGGTTTCCAACCCTATTCCGAGATGCAATCTCCTCCAAATCGTCGCCAAATTGT
TCCGATTCCTAGAGCTCTGCTTTTTACTCGTTTTCCTCTCATGGAGTCTCTCTCGGCTCCCGATCGCCGTCAGAATCTCCGCCGAGTACTTCGGAAAACTCTTCGGCTTC
GTCGCCACTCCGCTCTTCGGATTCCTCCTCTGTAACGCCATCATCGTCGCACTCGTAGCAAAACCTAGCCAATTCTCAGACCACAGCGCCCTAACCGTCGGCGACGAAAC
CGATCGGATTTACGAAGACCTAATTGAAAAATCCGGAAGCGGAAACCTACCTGAATCGCTCTCGGAACACGAGGTGGAGATTGTGTACCAGGACAAACAGATCATCGCGG
AGGCGATTACTTCAATTGACCGCGAAATTGAAGTGAAGAGCGCGGATCCGGAGTCGGAATCTGGATTAGATCATCCGAAAGTGATTTTGAGGACGCTGTCAGAGAAATTT
AAGCCGAAAACGCAGACTGAGAAGCTCCGGCGATCGGAGACGGAGAAATGCCGGAATCTGGAGCATTCGCGTGACATTTTGTGCTACCAGGACGATTTGAGTAGCGAGGA
GTTTCAGAGGAAGATTGAGGCGTTCATAGCGAAAGAGAAGAAGTTTCGACGGGAAGAATCTTCCGCCATTGTACTCCATTGCGACGGGTAATCTCCGGTTGAAAATAAGA
CTAC
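
The mRNA sequence above contains the CDS plus flanking untranslated sequence. As a small sketch, assuming the CDS occurs as one contiguous substring of the spliced mRNA, the lengths of the flanking regions can be found with a plain substring search. The helper name `locate_cds` and the toy sequences are hypothetical.

```python
def locate_cds(mrna: str, cds: str) -> tuple[int, int]:
    """Return (5' flank length, 3' flank length) of the CDS within the mRNA."""
    mrna = "".join(mrna.split()).upper()  # drop line breaks and spaces
    cds = "".join(cds.split()).upper()
    offset = mrna.find(cds)
    if offset < 0:
        raise ValueError("CDS not found in mRNA")
    return offset, len(mrna) - offset - len(cds)


# Toy example, not the sequences from this page.
toy_mrna = "CTCAA" + "ATGGATTAA" + "TCT"
print(locate_cds(toy_mrna, "ATGGATTAA"))
```

For the toy input this prints `(5, 3)`. Pasting the multi-line mRNA and CDS from this page works the same way, because whitespace is removed before searching.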
Protein sequence
MDLFFFDKAEKVSNPIPRCNLLQIVAKLFRFLELCFLLVFLSWSLSRLPIAVRISAEYFGKLFGFVATPLFGFLLCNAIIVALVAKPSQFSDHSALTVGDETDRIYEDLI
EKSGSGNLPESLSEHEVEIVYQDKQIIAEAITSIDREIEVKSADPESESGLDHPKVILRTLSEKFKPKTQTEKLRRSETEKCRNLEHSRDILCYQDDLSSEEFQRKIEAF
IAKEKKFRREESSAIVLHCDG