CuGenDBv2

Tan0012647 (gene) of Snake gourd v1 genome

Gene ID: Tan0012647
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Unknown protein
Genome location: LG01:13981594..13985434
RNA-Seq Expression: Tan0012647
Synteny: Tan0012647
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_022136930.1 uncharacterized protein LOC111008504 [Momordica charantia] (e value: 1.0e-118; % identity: 87.12)
Query:  MAEATAGGSISTFAAMAVNHCQAATVSNGVHVQEKLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGFLSTEEFDNSPDCDPHLAFLSFLEV
        MAEAT   SIS+F AMAVN CQ A VSNGVHVQEKLAKV RLD  ERHCSLEILPILFEKASFPFQ+SSVRDSSG  S EEFDNSPDCDPHLAFLS LEV
Subjt:  MAEATAGGSISTFAAMAVNHCQAATVSNGVHVQEKLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGFLSTEEFDNSPDCDPHLAFLSFLEV

Query:  THPTKSKMSLETSDTRLTSQNVIDIHVNGGDAYSSCIVNIDIDKDKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKFS
        THPTKS+MSLETSDTRLT QNVIDIHVNGGDAYSSCIVNIDIDKDKLK  K C+G FESLKT+NTL RIEK LQRQSSLK+G KLV YLLDHGLMLLKFS
Subjt:  THPTKSKMSLETSDTRLTSQNVIDIHVNGGDAYSSCIVNIDIDKDKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKFS

Query:  SKAEKSGTERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGSV
        +K EK GTERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTL+LIYLTLRVRQQ GDGSV
Subjt:  SKAEKSGTERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGSV

XP_022941601.1 uncharacterized protein LOC111446909 [Cucurbita moschata] (e value: 2.8e-116; % identity: 89.64)
Query:  MAVNHCQAATVSNGVHVQEKLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSKMSLETSDT
        MAVNHCQAAT+SNGVHVQEKLAKVTR D  ERH SLEILP LFEKASFP Q+S  RDSSGF STEEFDNSPDCDPHLAFLSFLEVTHPT SKMSL TSD 
Subjt:  MAVNHCQAATVSNGVHVQEKLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSKMSLETSDT

Query:  RLTSQNVIDIHVNGGDAYSSCIVNIDIDKDKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKFSSKAEKSGTERVHDMP
         LT QNVIDIHVNGGDAYSSCIVNIDIDKDKLK PKSCEG FESLKTENTL RIEK LQRQSSLKMG KLVHYLLDHGLMLL+FSSK EKSGTERVHD P
Subjt:  RLTSQNVIDIHVNGGDAYSSCIVNIDIDKDKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKFSSKAEKSGTERVHDMP

Query:  NNRWRKYKRAASFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGSVAM
        NNRWRKYKRAASFDSRKIVILFSVLSSLGTL+LIYLTLRVRQQS DGSVAM
Subjt:  NNRWRKYKRAASFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGSVAM

XP_022991750.1 uncharacterized protein LOC111488280 [Cucurbita maxima] (e value: 2.8e-116; % identity: 89.64)
Query:  MAVNHCQAATVSNGVHVQEKLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSKMSLETSDT
        MAVNHCQAA +SNGVHVQEKLAKVTR D TERH SLEILP LF+KASFP Q+S  RDSSGFLSTEE DNSPDCDPHLAFLSFLEVTHPT SKMSL TSD 
Subjt:  MAVNHCQAATVSNGVHVQEKLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSKMSLETSDT

Query:  RLTSQNVIDIHVNGGDAYSSCIVNIDIDKDKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKFSSKAEKSGTERVHDMP
         LT QNVIDIHVNGGDAYSSCIVNIDIDKDKLK PKSCEG FESLKTENTL RIEK LQRQSSLKMG KLVHYLLDHGLMLLKFSSK EKSGTERVHD P
Subjt:  RLTSQNVIDIHVNGGDAYSSCIVNIDIDKDKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKFSSKAEKSGTERVHDMP

Query:  NNRWRKYKRAASFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGSVAM
        NNRWRKYKRAASFDSRKIVILFSVLSSLGTL+LIYLTLRVRQQS DGSVAM
Subjt:  NNRWRKYKRAASFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGSVAM

XP_023531596.1 uncharacterized protein LOC111793785 [Cucurbita pepo subsp. pepo] (e value: 4.0e-115; % identity: 89.24)
Query:  MAVNHCQAATVSNGVHVQEKLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSKMSLETSDT
        MAVNHCQAAT+SNGVHVQEKLAKV R D  ERH SLEILP LFEKASFP Q+S  RDSSGFLSTEE DNSPDCDPHLAFLSFLEVTHPT SKMSL TSD 
Subjt:  MAVNHCQAATVSNGVHVQEKLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSKMSLETSDT

Query:  RLTSQNVIDIHVNGGDAYSSCIVNIDIDKDKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKFSSKAEKSGTERVHDMP
         LT QNVIDIHVNGGDAYSSCIVNIDIDKDKLK PKSCEG F SLKTENTL RIEK LQRQSSLKMG KLVHYLLDHGLMLLKFSSK EKSGTERVHD P
Subjt:  RLTSQNVIDIHVNGGDAYSSCIVNIDIDKDKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKFSSKAEKSGTERVHDMP

Query:  NNRWRKYKRAASFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGSVAM
        NNRWRKYKRAASFDSRKIVILFSVLSSLGTL+LIYLTLRVRQQS DGSVAM
Subjt:  NNRWRKYKRAASFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGSVAM

XP_038903131.1 uncharacterized protein LOC120089804 isoform X1 [Benincasa hispida] (e value: 4.3e-117; % identity: 86.52)
Query:  MAEATAGGSI-STFAAMAVNHCQAATVSNGVHVQEKLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGFLSTEEFDNSPDCDPHLAFLSFLE
        MAEA A  SI STFA MAVN CQ A VSNGV VQEKLAKV+RL+ +ERHCSLEILP LFEKASFPFQNSS RDSSGFLSTEEFDNSP+CDPHLAFLSFLE
Subjt:  MAEATAGGSI-STFAAMAVNHCQAATVSNGVHVQEKLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGFLSTEEFDNSPDCDPHLAFLSFLE

Query:  VTHPTKSKMSLETSDTRLTSQNVIDIHVNGGDAYSSCIVNIDIDKDKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKF
        VTH TKS+MSLETSD RLT QNVIDIHVNGGDAYSSCIVNIDIDKDKL+  KSCEG+ ES+KTENTL RIEK LQRQSSLKMGAKL  YLLDHGLMLLKF
Subjt:  VTHPTKSKMSLETSDTRLTSQNVIDIHVNGGDAYSSCIVNIDIDKDKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKF

Query:  SSKAEKSGTERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGSVAM
        SSK EK GTER  DMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTL+LIYLTLRVRQQ GDGSVAM
Subjt:  SSKAEKSGTERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGSVAM

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C5A8 uncharacterized protein LOC111008504 (e value: 5.0e-119; % identity: 87.12)
Query:  MAEATAGGSISTFAAMAVNHCQAATVSNGVHVQEKLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGFLSTEEFDNSPDCDPHLAFLSFLEV
        MAEAT   SIS+F AMAVN CQ A VSNGVHVQEKLAKV RLD  ERHCSLEILPILFEKASFPFQ+SSVRDSSG  S EEFDNSPDCDPHLAFLS LEV
Subjt:  MAEATAGGSISTFAAMAVNHCQAATVSNGVHVQEKLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGFLSTEEFDNSPDCDPHLAFLSFLEV

Query:  THPTKSKMSLETSDTRLTSQNVIDIHVNGGDAYSSCIVNIDIDKDKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKFS
        THPTKS+MSLETSDTRLT QNVIDIHVNGGDAYSSCIVNIDIDKDKLK  K C+G FESLKT+NTL RIEK LQRQSSLK+G KLV YLLDHGLMLLKFS
Subjt:  THPTKSKMSLETSDTRLTSQNVIDIHVNGGDAYSSCIVNIDIDKDKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKFS

Query:  SKAEKSGTERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGSV
        +K EK GTERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTL+LIYLTLRVRQQ GDGSV
Subjt:  SKAEKSGTERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGSV

A0A6J1ENF8 uncharacterized protein LOC111436086 (e value: 9.7e-107; % identity: 82.68)
Query:  MAVNHCQAATVSNGVHVQEKLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSKMSLETSDT
        MAVN CQ A VSNGV VQEKLAKVT LD  ERHCSLEILPILFEK SFPFQNS   DSS FLSTE FDNSP+CDPHLAFLSFLEVTHPTK++MSLETSDT
Subjt:  MAVNHCQAATVSNGVHVQEKLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSKMSLETSDT

Query:  RLTSQNVIDIHVNGGDAYSSCIVNIDIDK---DKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKFSSKAEKSGTERVH
        RLT QNVIDIHVN GDA SSCIVNIDIDK   DKLK  KS EG+FESL+TE+TL RIEK LQRQSSLKMGAKL+ YLLDHGLMLLKFSSK EKSG ER  
Subjt:  RLTSQNVIDIHVNGGDAYSSCIVNIDIDK---DKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKFSSKAEKSGTERVH

Query:  DMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGSVAM
        D  NNRWRKYKR+ASFDSRKIV+LFSVLSSLGTL+LIYLTLRV+QQ GDG VAM
Subjt:  DMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGSVAM

A0A6J1FLJ9 uncharacterized protein LOC111446909 (e value: 1.3e-116; % identity: 89.64)
Query:  MAVNHCQAATVSNGVHVQEKLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSKMSLETSDT
        MAVNHCQAAT+SNGVHVQEKLAKVTR D  ERH SLEILP LFEKASFP Q+S  RDSSGF STEEFDNSPDCDPHLAFLSFLEVTHPT SKMSL TSD 
Subjt:  MAVNHCQAATVSNGVHVQEKLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSKMSLETSDT

Query:  RLTSQNVIDIHVNGGDAYSSCIVNIDIDKDKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKFSSKAEKSGTERVHDMP
         LT QNVIDIHVNGGDAYSSCIVNIDIDKDKLK PKSCEG FESLKTENTL RIEK LQRQSSLKMG KLVHYLLDHGLMLL+FSSK EKSGTERVHD P
Subjt:  RLTSQNVIDIHVNGGDAYSSCIVNIDIDKDKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKFSSKAEKSGTERVHDMP

Query:  NNRWRKYKRAASFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGSVAM
        NNRWRKYKRAASFDSRKIVILFSVLSSLGTL+LIYLTLRVRQQS DGSVAM
Subjt:  NNRWRKYKRAASFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGSVAM

A0A6J1JCA6 uncharacterized protein LOC111483162 (e value: 3.2e-110; % identity: 84.65)
Query:  MAVNHCQAATVSNGVHVQEKLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSKMSLETSDT
        MAVN CQ A VSNGV VQEKLAKVTRLD  ERHCSLEILPILFEK SFPFQNS  RDSS FLSTE FDNSP+CDPHLAFLSFLEVTHPTK++MSLETSDT
Subjt:  MAVNHCQAATVSNGVHVQEKLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSKMSLETSDT

Query:  RLTSQNVIDIHVNGGDAYSSCIVNIDIDK---DKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKFSSKAEKSGTERVH
        RLT QNVIDIHVN GDA SSCIVNIDIDK   DKLK  KSCEG FESL+TE+TL RIEK LQRQSSLKMGAKLV YLLDHGLMLLKFSSK EKSG E+  
Subjt:  RLTSQNVIDIHVNGGDAYSSCIVNIDIDK---DKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKFSSKAEKSGTERVH

Query:  DMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGSVAM
        D  NNRWRKYKR+ASFDSRKIV+LFSVLSSLGTL+LIYLTLRVRQQ GDGSVAM
Subjt:  DMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGSVAM

A0A6J1JTU6 uncharacterized protein LOC111488280 (e value: 1.3e-116; % identity: 89.64)
Query:  MAVNHCQAATVSNGVHVQEKLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSKMSLETSDT
        MAVNHCQAA +SNGVHVQEKLAKVTR D TERH SLEILP LF+KASFP Q+S  RDSSGFLSTEE DNSPDCDPHLAFLSFLEVTHPT SKMSL TSD 
Subjt:  MAVNHCQAATVSNGVHVQEKLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSKMSLETSDT

Query:  RLTSQNVIDIHVNGGDAYSSCIVNIDIDKDKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKFSSKAEKSGTERVHDMP
         LT QNVIDIHVNGGDAYSSCIVNIDIDKDKLK PKSCEG FESLKTENTL RIEK LQRQSSLKMG KLVHYLLDHGLMLLKFSSK EKSGTERVHD P
Subjt:  RLTSQNVIDIHVNGGDAYSSCIVNIDIDKDKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKFSSKAEKSGTERVHDMP

Query:  NNRWRKYKRAASFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGSVAM
        NNRWRKYKRAASFDSRKIVILFSVLSSLGTL+LIYLTLRVRQQS DGSVAM
Subjt:  NNRWRKYKRAASFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGSVAM

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT4G39900.1 unknown protein (e value: 1.4e-33; % identity: 37.97)
Query:  KLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGF-------LSTEEFDNSPDCDPHLAFLSFLEVTHPTKSKMSLETSDTRLTSQNVIDIHV
        KL K T  D T +H ++++ P+L ++A+FP +     D+S         +  +E +  P C  H   LSF++   P+K++M ++       +QN I++ +
Subjt:  KLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGF-------LSTEEFDNSPDCDPHLAFLSFLEVTHPTKSKMSLETSDTRLTSQNVIDIHV

Query:  NGGDAYSSCIVNIDIDK-DKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKFSSKAEKSGTERVHDMPNNRWRKYKRAA
         G D+Y SC+V+I+++K +  +   S +    S+K+E+    ++K LQRQ+SL                        +K+ +ER HD P NRWR+YKRAA
Subjt:  NGGDAYSSCIVNIDIDK-DKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKFSSKAEKSGTERVHDMPNNRWRKYKRAA

Query:  SFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGS
        SFDSRKIVILFS+LSS+GTL+LIYLTLRV+ Q+GD +
Subjt:  SFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGS


Sequences
CDS sequence
ATGGCGGAGGCCACAGCTGGGGGCTCAATCTCTACCTTCGCAGCAATGGCTGTTAATCACTGTCAGGCTGCAACTGTGTCGAATGGAGTTCACGTTCAAGAAAAACTAGC
GAAAGTTACTAGACTTGACGGGACGGAGCGCCATTGTTCTTTAGAAATTTTGCCAATTCTCTTTGAGAAGGCTTCGTTCCCCTTTCAAAATTCTTCGGTCCGTGATTCCT
CTGGCTTTTTAAGTACCGAGGAATTCGACAACAGTCCAGATTGTGATCCACATTTAGCTTTCCTCAGCTTCCTGGAAGTTACCCATCCAACAAAAAGTAAGATGTCATTG
GAAACTTCAGACACCCGCTTGACTTCCCAGAACGTGATTGACATACATGTGAATGGTGGAGATGCTTATTCCTCGTGCATAGTAAATATTGATATTGACAAGGACAAGCT
CAAAGCACCTAAATCCTGTGAAGGAGCTTTTGAAAGTTTGAAAACTGAGAATACATTGGCGCGCATAGAGAAGGCATTGCAGAGACAATCCAGCCTTAAAATGGGGGCGA
AACTTGTGCATTATTTGTTAGACCATGGACTAATGTTACTGAAGTTCTCATCTAAAGCAGAAAAATCAGGGACCGAGAGGGTTCACGATATGCCAAACAACAGGTGGAGA
AAATATAAACGTGCTGCTTCATTTGATTCAAGAAAGATTGTTATTCTCTTCTCAGTATTATCAAGCTTGGGAACCTTGGTATTGATATATTTGACTCTGAGGGTTAGGCA
GCAGAGTGGAGATGGATCTGTTGCTATGTAA
mRNA sequence
CCGATTCTGATGTAACGTCTATATCTATGGACCCTTCTCTCTCTGTTCATTTGAGTTTTCGATGGCTTTTTAATGGCGGAGGCCACAGCTGGGGGCTCAATCTCTACCTT
CGCAGCAATGGCTGTTAATCACTGTCAGGCTGCAACTGTGTCGAATGGAGTTCACGTTCAAGAAAAACTAGCGAAAGTTACTAGACTTGACGGGACGGAGCGCCATTGTT
CTTTAGAAATTTTGCCAATTCTCTTTGAGAAGGCTTCGTTCCCCTTTCAAAATTCTTCGGTCCGTGATTCCTCTGGCTTTTTAAGTACCGAGGAATTCGACAACAGTCCA
GATTGTGATCCACATTTAGCTTTCCTCAGCTTCCTGGAAGTTACCCATCCAACAAAAAGTAAGATGTCATTGGAAACTTCAGACACCCGCTTGACTTCCCAGAACGTGAT
TGACATACATGTGAATGGTGGAGATGCTTATTCCTCGTGCATAGTAAATATTGATATTGACAAGGACAAGCTCAAAGCACCTAAATCCTGTGAAGGAGCTTTTGAAAGTT
TGAAAACTGAGAATACATTGGCGCGCATAGAGAAGGCATTGCAGAGACAATCCAGCCTTAAAATGGGGGCGAAACTTGTGCATTATTTGTTAGACCATGGACTAATGTTA
CTGAAGTTCTCATCTAAAGCAGAAAAATCAGGGACCGAGAGGGTTCACGATATGCCAAACAACAGGTGGAGAAAATATAAACGTGCTGCTTCATTTGATTCAAGAAAGAT
TGTTATTCTCTTCTCAGTATTATCAAGCTTGGGAACCTTGGTATTGATATATTTGACTCTGAGGGTTAGGCAGCAGAGTGGAGATGGATCTGTTGCTATGTAACAGTTTT
TGCTCTCTGGTTTTGTATCGCTCTTCTTTTTTGCCTTTTTACAGTCTCTTAAAATAGTTGGGTTTTCATTTTTATCTTCCCTATAGAACAGATTTTTAGGCATGCGACTG
TTATATTTGTACATAAATGTCTTTCACTTTATGGTTGCCTTAGGAATTGATCACACTAGTGAATGTTTTGTGACATTGGGATGAACCTCGTTGGCAAAGAACATCAAAAT
CCTTTATGGAAATGTTGGGAGATCTCAATTTTATTCGAAC
Protein sequence
MAEATAGGSISTFAAMAVNHCQAATVSNGVHVQEKLAKVTRLDGTERHCSLEILPILFEKASFPFQNSSVRDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSKMSL
ETSDTRLTSQNVIDIHVNGGDAYSSCIVNIDIDKDKLKAPKSCEGAFESLKTENTLARIEKALQRQSSLKMGAKLVHYLLDHGLMLLKFSSKAEKSGTERVHDMPNNRWR
KYKRAASFDSRKIVILFSVLSSLGTLVLIYLTLRVRQQSGDGSVAM