; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10019113 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10019113
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionAdenine deaminase
Genome locationChr04:16978870..16982344
RNA-Seq ExpressionHG10019113
SyntenyHG10019113
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022136930.1 uncharacterized protein LOC111008504 [Momordica charantia]3.2e-12089.02Show/hide
Query:  MAEATARDSISSTFAAMAVNQCQTAAVSNGVLVQEKLAKVTRLDEAEQHCSLEILPILFEKASFPFQTSSARDSSSFLSAEEFDNNPECDPHLAFLSFLE
        MAEAT RDSISS F AMAVNQCQ AAVSNGV VQEKLAKV RLDEAE+HCSLEILPILFEKASFPFQ+SS RDSS   S EEFDN+P+CDPHLAFLS LE
Subjt:  MAEATARDSISSTFAAMAVNQCQTAAVSNGVLVQEKLAKVTRLDEAEQHCSLEILPILFEKASFPFQTSSARDSSSFLSAEEFDNNPECDPHLAFLSFLE

Query:  VTHPTKSRMSLETSDTRLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKF
        VTHPTKSRMSLETSDTRLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLK SK C+G FESLKT++TLVRIEK+LQRQSSLK+G KLVQYLLDHGLMLLKF
Subjt:  VTHPTKSRMSLETSDTRLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKF

Query:  SSKEKLGTERAHDTPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQAGDGSV
        S+KEKLGTER HD PNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQ GDGSV
Subjt:  SSKEKLGTERAHDTPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQAGDGSV

XP_022985084.1 uncharacterized protein LOC111483162 [Cucurbita maxima]2.8e-11689.33Show/hide
Query:  MAVNQCQTAAVSNGVLVQEKLAKVTRLDEAEQHCSLEILPILFEKASFPFQTSSARDSSSFLSAEEFDNNPECDPHLAFLSFLEVTHPTKSRMSLETSDT
        MAVNQCQTAAVSNGVLVQEKLAKVTRLDE E+HCSLEILPILFEK SFPFQ S ARDSSSFLS E FDN+PECDPHLAFLSFLEVTHPTK+RMSLETSDT
Subjt:  MAVNQCQTAAVSNGVLVQEKLAKVTRLDEAEQHCSLEILPILFEKASFPFQTSSARDSSSFLSAEEFDNNPECDPHLAFLSFLEVTHPTKSRMSLETSDT

Query:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDK---DKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKFSSKEKLGTERAHD
        RLTCQNVIDIHVN GDA SSCIVNIDIDK   DKLK SKSCEG+FESL+TESTLVRIEK+LQRQSSLKMGAKLVQYLLDHGLMLLKFSSKEK G E+A D
Subjt:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDK---DKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKFSSKEKLGTERAHD

Query:  TPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQAGDGSVAM
          NNRWRKYKR+ASFDSRKIV+LFSVLSSLGTLILIYLTLRVRQQ GDGSVAM
Subjt:  TPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQAGDGSVAM

XP_023551822.1 uncharacterized protein LOC111809675 [Cucurbita pepo subsp. pepo]2.4e-11588.93Show/hide
Query:  MAVNQCQTAAVSNGVLVQEKLAKVTRLDEAEQHCSLEILPILFEKASFPFQTSSARDSSSFLSAEEFDNNPECDPHLAFLSFLEVTHPTKSRMSLETSDT
        MAVNQCQTAAVSN VLVQEKLAKVTRLDE E+HCSLEILPILFEK SFPFQ S ARDSSSFLS E FDN+PECDPHLAFLSFLEVTHPTK+RMSLETSDT
Subjt:  MAVNQCQTAAVSNGVLVQEKLAKVTRLDEAEQHCSLEILPILFEKASFPFQTSSARDSSSFLSAEEFDNNPECDPHLAFLSFLEVTHPTKSRMSLETSDT

Query:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDK---DKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKFSSKEKLGTERAHD
        RLTCQNVIDIHVN GDA SSCIVNIDIDK   DKLK SKSCEG+FESL+TESTLVRIEK+LQRQSSLKMGAKL+QYLLDHGLMLLKFSSKEK G ERA D
Subjt:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDK---DKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKFSSKEKLGTERAHD

Query:  TPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQAGDGSVAM
          NNRWRKYKR+ASFDSRKIV+LFSVLSSLGTLILIYLTLRVRQQ GDGSVAM
Subjt:  TPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQAGDGSVAM

XP_038903131.1 uncharacterized protein LOC120089804 isoform X1 [Benincasa hispida]1.7e-12691.35Show/hide
Query:  MAEATARDSISSTFAAMAVNQCQTAAVSNGVLVQEKLAKVTRLDEAEQHCSLEILPILFEKASFPFQTSSARDSSSFLSAEEFDNNPECDPHLAFLSFLE
        MAEA ARDSISSTFA MAVNQCQTAAVSNGVLVQEKLAKV+RL+E+E+HCSLEILP LFEKASFPFQ SSARDSS FLS EEFDN+PECDPHLAFLSFLE
Subjt:  MAEATARDSISSTFAAMAVNQCQTAAVSNGVLVQEKLAKVTRLDEAEQHCSLEILPILFEKASFPFQTSSARDSSSFLSAEEFDNNPECDPHLAFLSFLE

Query:  VTHPTKSRMSLETSDTRLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKF
        VTH TKSRMSLETSD RLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKL+PSKSCEGS ES+KTE+TL+RIEK+LQRQSSLKMGAKL QYLLDHGLMLLKF
Subjt:  VTHPTKSRMSLETSDTRLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKF

Query:  SSKEKLGTERAHDTPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQAGDGSVAM
        SSKEKLGTERA D PNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQ GDGSVAM
Subjt:  SSKEKLGTERAHDTPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQAGDGSVAM

XP_038903132.1 uncharacterized protein LOC120089804 isoform X2 [Benincasa hispida]3.8e-11390.83Show/hide
Query:  MAEATARDSISSTFAAMAVNQCQTAAVSNGVLVQEKLAKVTRLDEAEQHCSLEILPILFEKASFPFQTSSARDSSSFLSAEEFDNNPECDPHLAFLSFLE
        MAEA ARDSISSTFA MAVNQCQTAAVSNGVLVQEKLAKV+RL+E+E+HCSLEILP LFEKASFPFQ SSARDSS FLS EEFDN+PECDPHLAFLSFLE
Subjt:  MAEATARDSISSTFAAMAVNQCQTAAVSNGVLVQEKLAKVTRLDEAEQHCSLEILPILFEKASFPFQTSSARDSSSFLSAEEFDNNPECDPHLAFLSFLE

Query:  VTHPTKSRMSLETSDTRLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKF
        VTH TKSRMSLETSD RLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKL+PSKSCEGS ES+KTE+TL+RIEK+LQRQSSLKMGAKL QYLLDHGLMLLKF
Subjt:  VTHPTKSRMSLETSDTRLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKF

Query:  SSKEKLGTERAHDTPNNRWRKYKRAASFDSRKIVILFSVL
        SSKEKLGTERA D PNNRWRKYKRAASFDSRKIVILFSVL
Subjt:  SSKEKLGTERAHDTPNNRWRKYKRAASFDSRKIVILFSVL

TrEMBL top hitse value%identityAlignment
A0A6J1C5A8 uncharacterized protein LOC1110085041.5e-12089.02Show/hide
Query:  MAEATARDSISSTFAAMAVNQCQTAAVSNGVLVQEKLAKVTRLDEAEQHCSLEILPILFEKASFPFQTSSARDSSSFLSAEEFDNNPECDPHLAFLSFLE
        MAEAT RDSISS F AMAVNQCQ AAVSNGV VQEKLAKV RLDEAE+HCSLEILPILFEKASFPFQ+SS RDSS   S EEFDN+P+CDPHLAFLS LE
Subjt:  MAEATARDSISSTFAAMAVNQCQTAAVSNGVLVQEKLAKVTRLDEAEQHCSLEILPILFEKASFPFQTSSARDSSSFLSAEEFDNNPECDPHLAFLSFLE

Query:  VTHPTKSRMSLETSDTRLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKF
        VTHPTKSRMSLETSDTRLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLK SK C+G FESLKT++TLVRIEK+LQRQSSLK+G KLVQYLLDHGLMLLKF
Subjt:  VTHPTKSRMSLETSDTRLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKF

Query:  SSKEKLGTERAHDTPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQAGDGSV
        S+KEKLGTER HD PNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQ GDGSV
Subjt:  SSKEKLGTERAHDTPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQAGDGSV

A0A6J1ENF8 uncharacterized protein LOC1114360862.4e-11387.75Show/hide
Query:  MAVNQCQTAAVSNGVLVQEKLAKVTRLDEAEQHCSLEILPILFEKASFPFQTSSARDSSSFLSAEEFDNNPECDPHLAFLSFLEVTHPTKSRMSLETSDT
        MAVNQCQTAAVSNGVLVQEKLAKVT LDE E+HCSLEILPILFEK SFPFQ S A DSSSFLS E FDN+PECDPHLAFLSFLEVTHPTK+RMSLETSDT
Subjt:  MAVNQCQTAAVSNGVLVQEKLAKVTRLDEAEQHCSLEILPILFEKASFPFQTSSARDSSSFLSAEEFDNNPECDPHLAFLSFLEVTHPTKSRMSLETSDT

Query:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDK---DKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKFSSKEKLGTERAHD
        RLTCQNVIDIHVN GDA SSCIVNIDIDK   DKLK SKS EGSFESL+TESTLVRIEK+LQRQSSLKMGAKL+QYLLDHGLMLLKFSSKEK G ERA D
Subjt:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDK---DKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKFSSKEKLGTERAHD

Query:  TPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQAGDGSVAM
          NNRWRKYKR+ASFDSRKIV+LFSVLSSLGTLILIYLTLRV+QQ GDG VAM
Subjt:  TPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQAGDGSVAM

A0A6J1FLJ9 uncharacterized protein LOC1114469098.5e-11186Show/hide
Query:  MAVNQCQTAAVSNGVLVQEKLAKVTRLDEAEQHCSLEILPILFEKASFPFQTSSARDSSSFLSAEEFDNNPECDPHLAFLSFLEVTHPTKSRMSLETSDT
        MAVN CQ A +SNGV VQEKLAKVTR DEAE+H SLEILP LFEKASFP Q S ARDSS F S EEFDN+P+CDPHLAFLSFLEVTHPT S+MSL TSD 
Subjt:  MAVNQCQTAAVSNGVLVQEKLAKVTRLDEAEQHCSLEILPILFEKASFPFQTSSARDSSSFLSAEEFDNNPECDPHLAFLSFLEVTHPTKSRMSLETSDT

Query:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKFSSKEKLGTERAHDTPN
         LTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLK  KSCEG+FESLKTE+TL+RIEK+LQRQSSLKMG KLV YLLDHGLMLL+FSSKEK GTER HDTPN
Subjt:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKFSSKEKLGTERAHDTPN

Query:  NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQAGDGSVAM
        NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQ+ DGSVAM
Subjt:  NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQAGDGSVAM

A0A6J1JCA6 uncharacterized protein LOC1114831621.3e-11689.33Show/hide
Query:  MAVNQCQTAAVSNGVLVQEKLAKVTRLDEAEQHCSLEILPILFEKASFPFQTSSARDSSSFLSAEEFDNNPECDPHLAFLSFLEVTHPTKSRMSLETSDT
        MAVNQCQTAAVSNGVLVQEKLAKVTRLDE E+HCSLEILPILFEK SFPFQ S ARDSSSFLS E FDN+PECDPHLAFLSFLEVTHPTK+RMSLETSDT
Subjt:  MAVNQCQTAAVSNGVLVQEKLAKVTRLDEAEQHCSLEILPILFEKASFPFQTSSARDSSSFLSAEEFDNNPECDPHLAFLSFLEVTHPTKSRMSLETSDT

Query:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDK---DKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKFSSKEKLGTERAHD
        RLTCQNVIDIHVN GDA SSCIVNIDIDK   DKLK SKSCEG+FESL+TESTLVRIEK+LQRQSSLKMGAKLVQYLLDHGLMLLKFSSKEK G E+A D
Subjt:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDK---DKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKFSSKEKLGTERAHD

Query:  TPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQAGDGSVAM
          NNRWRKYKR+ASFDSRKIV+LFSVLSSLGTLILIYLTLRVRQQ GDGSVAM
Subjt:  TPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQAGDGSVAM

A0A6J1JTU6 uncharacterized protein LOC1114882808.5e-11186Show/hide
Query:  MAVNQCQTAAVSNGVLVQEKLAKVTRLDEAEQHCSLEILPILFEKASFPFQTSSARDSSSFLSAEEFDNNPECDPHLAFLSFLEVTHPTKSRMSLETSDT
        MAVN CQ AA+SNGV VQEKLAKVTR DE E+H SLEILP LF+KASFP Q S ARDSS FLS EE DN+P+CDPHLAFLSFLEVTHPT S+MSL TSD 
Subjt:  MAVNQCQTAAVSNGVLVQEKLAKVTRLDEAEQHCSLEILPILFEKASFPFQTSSARDSSSFLSAEEFDNNPECDPHLAFLSFLEVTHPTKSRMSLETSDT

Query:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKFSSKEKLGTERAHDTPN
         LTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLK  KSCEG+FESLKTE+TL+RIEK+LQRQSSLKMG KLV YLLDHGLMLLKFSSKEK GTER HDTPN
Subjt:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKFSSKEKLGTERAHDTPN

Query:  NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQAGDGSVAM
        NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQ+ DGSVAM
Subjt:  NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQAGDGSVAM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G39900.1 unknown protein8.3e-3440.17Show/hide
Query:  KLAKVTRLDEAEQHCSLEILPILFEKASFPFQ------TSSARDSSSF-LSAEEFDNNPECDPHLAFLSFLEVTHPTKSRMSLETSDTRLTC---QNVID
        KL K T   +  +H ++++ P+L ++A+FP +      TS  +D  +  +  +E +  P+C  H   LSF++   P+K++M +   D +L C   QN I+
Subjt:  KLAKVTRLDEAEQHCSLEILPILFEKASFPFQ------TSSARDSSSF-LSAEEFDNNPECDPHLAFLSFLEVTHPTKSRMSLETSDTRLTC---QNVID

Query:  IHVNGGDAYSSCIVNIDIDK-DKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKFSSKEKLGTERAHDTPNNRWRKYKR
        + + G D+Y SC+V+I+++K +  +   S +    S+K+ES  V ++K+LQRQ+SL                     S +K  +ER HD P NRWR+YKR
Subjt:  IHVNGGDAYSSCIVNIDIDK-DKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKFSSKEKLGTERAHDTPNNRWRKYKR

Query:  AASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQAGDGS
        AASFDSRKIVILFS+LSS+GTLILIYLTLRV+Q  GD +
Subjt:  AASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQAGDGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGAGGCCACAGCTCGGGACTCAATCTCTTCAACCTTTGCAGCAATGGCTGTTAATCAATGTCAGACTGCAGCTGTGTCGAATGGAGTTCTAGTTCAAGAAAAATT
AGCGAAAGTTACTAGACTTGACGAGGCGGAGCAACATTGTTCTTTAGAAATTTTGCCAATTCTCTTTGAGAAGGCGTCGTTCCCCTTCCAAACTTCTTCGGCCCGTGATT
CTTCTAGCTTTTTAAGTGCTGAGGAATTCGACAACAATCCAGAGTGTGATCCACATTTAGCTTTCCTCAGCTTCCTGGAAGTTACCCATCCAACAAAAAGTAGGATGTCA
TTGGAAACTTCAGATACCCGCTTGACTTGCCAGAACGTAATTGATATTCACGTGAATGGCGGAGATGCTTATTCCTCGTGCATAGTAAATATTGATATTGACAAGGACAA
GCTCAAACCATCTAAATCCTGTGAAGGAAGTTTTGAAAGTTTGAAAACTGAGAGTACATTGGTGCGCATAGAGAAGTTACTGCAGAGACAATCCAGTCTTAAGATGGGGG
CGAAACTTGTTCAATATTTATTGGACCATGGTCTAATGTTACTGAAGTTCTCGTCTAAAGAAAAACTAGGGACTGAAAGGGCTCACGATACGCCAAACAATAGGTGGAGA
AAATATAAACGTGCTGCTTCATTTGATTCAAGAAAGATTGTTATTCTCTTCTCGGTATTGTCAAGCTTGGGAACCTTGATATTGATATATTTGACTCTGAGAGTTAGGCA
GCAGGCTGGAGATGGATCTGTTGCTATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGAGGCCACAGCTCGGGACTCAATCTCTTCAACCTTTGCAGCAATGGCTGTTAATCAATGTCAGACTGCAGCTGTGTCGAATGGAGTTCTAGTTCAAGAAAAATT
AGCGAAAGTTACTAGACTTGACGAGGCGGAGCAACATTGTTCTTTAGAAATTTTGCCAATTCTCTTTGAGAAGGCGTCGTTCCCCTTCCAAACTTCTTCGGCCCGTGATT
CTTCTAGCTTTTTAAGTGCTGAGGAATTCGACAACAATCCAGAGTGTGATCCACATTTAGCTTTCCTCAGCTTCCTGGAAGTTACCCATCCAACAAAAAGTAGGATGTCA
TTGGAAACTTCAGATACCCGCTTGACTTGCCAGAACGTAATTGATATTCACGTGAATGGCGGAGATGCTTATTCCTCGTGCATAGTAAATATTGATATTGACAAGGACAA
GCTCAAACCATCTAAATCCTGTGAAGGAAGTTTTGAAAGTTTGAAAACTGAGAGTACATTGGTGCGCATAGAGAAGTTACTGCAGAGACAATCCAGTCTTAAGATGGGGG
CGAAACTTGTTCAATATTTATTGGACCATGGTCTAATGTTACTGAAGTTCTCGTCTAAAGAAAAACTAGGGACTGAAAGGGCTCACGATACGCCAAACAATAGGTGGAGA
AAATATAAACGTGCTGCTTCATTTGATTCAAGAAAGATTGTTATTCTCTTCTCGGTATTGTCAAGCTTGGGAACCTTGATATTGATATATTTGACTCTGAGAGTTAGGCA
GCAGGCTGGAGATGGATCTGTTGCTATGTAA
Protein sequenceShow/hide protein sequence
MAEATARDSISSTFAAMAVNQCQTAAVSNGVLVQEKLAKVTRLDEAEQHCSLEILPILFEKASFPFQTSSARDSSSFLSAEEFDNNPECDPHLAFLSFLEVTHPTKSRMS
LETSDTRLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKPSKSCEGSFESLKTESTLVRIEKLLQRQSSLKMGAKLVQYLLDHGLMLLKFSSKEKLGTERAHDTPNNRWR
KYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQAGDGSVAM