; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005555 (gene) of Snake gourd v1 genome

Gene IDTan0005555
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPoly polymerase 1, putative
Genome locationLG01:116872179..116873666
RNA-Seq ExpressionTan0005555
SyntenyTan0005555
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008453039.1 PREDICTED: uncharacterized protein LOC103493864 [Cucumis melo]8.5e-5966.51Show/hide
Query:  MGACLSN-------NNNSALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHR
        MG CLSN       +++   PPPPTAKVISLQGHLREYP+PISVSRVLQTENS SSTSDSFLCNSDRLY+DDFIP LP D  L P+ IYF+LPSSNLHHR
Subjt:  MGACLSN-------NNNSALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHR

Query:  LAASDMAALAVKATLALQNASTNNNH-----NHRRKKGRISPLL-LPNPSDS------DHIVSKDQHHPDTDTSASSVKKLHRLTSRTAKMAVRSFKLKL
        L A DMAALAVKATLALQNASTNN H      ++ ++ RISPL  L +P+D       +H +S + +  +   S+SSVKKL RLTSR AKMAVRSFKL+L
Subjt:  LAASDMAALAVKATLALQNASTNNNH-----NHRRKKGRISPLL-LPNPSDS------DHIVSKDQHHPDTDTSASSVKKLHRLTSRTAKMAVRSFKLKL

Query:  STIYEGTVL
        STIYEGT L
Subjt:  STIYEGTVL

XP_011654294.1 uncharacterized protein LOC101220453 [Cucumis sativus]6.5e-5967.48Show/hide
Query:  MGACLSN-------NNNSALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHR
        MG C SN       +++   PPPPTAKVISLQGHLREYP+PISVSRVLQTENS SSTSDSFLCNSDRL+YDDFIP LP D  L P+ IYF+LPSSNLHHR
Subjt:  MGACLSN-------NNNSALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHR

Query:  LAASDMAALAVKATLALQNASTNNNH--NHRRKKGRISPLL-LPNPSDS------DHIVSKDQHHPDTDTSASSVKKLHRLTSRTAKMAVRSFKLKLSTI
        L A DMAALAVKATLALQNASTNN H  +++ ++ RISPL  L +P+D       +H +S + +  + +T++SSVKKL RLTSR AKMAVRSFKL+LSTI
Subjt:  LAASDMAALAVKATLALQNASTNNNH--NHRRKKGRISPLL-LPNPSDS------DHIVSKDQHHPDTDTSASSVKKLHRLTSRTAKMAVRSFKLKLSTI

Query:  YEGTVL
        YEGTVL
Subjt:  YEGTVL

XP_022940531.1 uncharacterized protein LOC111446101 [Cucurbita moschata]4.7e-6576.26Show/hide
Query:  MGACLSNNNN-----SALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHRLA
        MGACLS+  N     S  PPPPTAKVISLQGHLREYP+PISVSRVLQTENS SS SDSFLCNSDRLYYDDFIPPLP D+ LLP+ IYFLLPSSNLHHRL+
Subjt:  MGACLSNNNN-----SALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHRLA

Query:  ASDMAALAVKATLALQNASTNNNHNHRRKKGRISPLLLPNPSDSDHIVSKD--QHHPDTDTSAS-SVKKLHRLTSRTAKMAVRSFKLKLSTIYEGTVL
        AS MAALAVKA+LALQNAS N+    RRKKGR+SPLL  N SDSDHI+SK+  + +   DTSAS SV+KL RLTS+ AKMAVRSFKLKLSTIYEG VL
Subjt:  ASDMAALAVKATLALQNASTNNNHNHRRKKGRISPLLLPNPSDSDHIVSKD--QHHPDTDTSAS-SVKKLHRLTSRTAKMAVRSFKLKLSTIYEGTVL

XP_022981858.1 uncharacterized protein LOC111480876 [Cucurbita maxima]2.1e-6576.77Show/hide
Query:  MGACLSNNNN-----SALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHRLA
        MGACLS+  N     S  PPPPTAKVISLQGHLREYP+PISVSRVLQTENS SS SDSFLCNSDRLYYDDFIPPLP D+ LLP+ IYFLLPSSNLHHRL+
Subjt:  MGACLSNNNN-----SALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHRLA

Query:  ASDMAALAVKATLALQNASTNNNHNHRRKKGRISPLLLPNPSDSDHIVSKD--QHHPDTDTSAS-SVKKLHRLTSRTAKMAVRSFKLKLSTIYEGTVL
        AS MAALAVKA+LALQNAS N+    RRKKGR+SPLL  N SDSDHI+SK+  + +   DTSAS SV+KL RLTSR AKMAVRSFKLKLSTIYEG VL
Subjt:  ASDMAALAVKATLALQNASTNNNHNHRRKKGRISPLLLPNPSDSDHIVSKD--QHHPDTDTSAS-SVKKLHRLTSRTAKMAVRSFKLKLSTIYEGTVL

XP_038896630.1 uncharacterized protein LOC120084892 [Benincasa hispida]6.5e-5968.63Show/hide
Query:  MGACLSN-----NNNSALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHRLA
        MGACLSN       +S  PPPPTAKVI+LQG LREYP+PISVSRVLQTE+S SSTSDSFLCNSDRLYYDDFIPPLP D  L P+ IYFLL SS LH RL 
Subjt:  MGACLSN-----NNNSALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHRLA

Query:  ASDMAALAVKATLALQNASTNNNHNHRRKKGRISPLLLPNPSDSDHIVSKDQHHP---------DTDTSASSVKKLHRLTSRTAKMAVRSFKLKLSTIYE
        ASDMAALAVKATLALQN ST N+   RR KGRISP+LL +   SD   +KD+H P          T +++SSV++L RLTSR AKMAVRSFKL+LSTIYE
Subjt:  ASDMAALAVKATLALQNASTNNNHNHRRKKGRISPLLLPNPSDSDHIVSKDQHHP---------DTDTSASSVKKLHRLTSRTAKMAVRSFKLKLSTIYE

Query:  GTVL
        G VL
Subjt:  GTVL

TrEMBL top hitse value%identityAlignment
A0A0A0L5Z9 Uncharacterized protein3.2e-5967.48Show/hide
Query:  MGACLSN-------NNNSALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHR
        MG C SN       +++   PPPPTAKVISLQGHLREYP+PISVSRVLQTENS SSTSDSFLCNSDRL+YDDFIP LP D  L P+ IYF+LPSSNLHHR
Subjt:  MGACLSN-------NNNSALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHR

Query:  LAASDMAALAVKATLALQNASTNNNH--NHRRKKGRISPLL-LPNPSDS------DHIVSKDQHHPDTDTSASSVKKLHRLTSRTAKMAVRSFKLKLSTI
        L A DMAALAVKATLALQNASTNN H  +++ ++ RISPL  L +P+D       +H +S + +  + +T++SSVKKL RLTSR AKMAVRSFKL+LSTI
Subjt:  LAASDMAALAVKATLALQNASTNNNH--NHRRKKGRISPLL-LPNPSDS------DHIVSKDQHHPDTDTSASSVKKLHRLTSRTAKMAVRSFKLKLSTI

Query:  YEGTVL
        YEGTVL
Subjt:  YEGTVL

A0A1S3BUP5 uncharacterized protein LOC1034938644.1e-5966.51Show/hide
Query:  MGACLSN-------NNNSALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHR
        MG CLSN       +++   PPPPTAKVISLQGHLREYP+PISVSRVLQTENS SSTSDSFLCNSDRLY+DDFIP LP D  L P+ IYF+LPSSNLHHR
Subjt:  MGACLSN-------NNNSALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHR

Query:  LAASDMAALAVKATLALQNASTNNNH-----NHRRKKGRISPLL-LPNPSDS------DHIVSKDQHHPDTDTSASSVKKLHRLTSRTAKMAVRSFKLKL
        L A DMAALAVKATLALQNASTNN H      ++ ++ RISPL  L +P+D       +H +S + +  +   S+SSVKKL RLTSR AKMAVRSFKL+L
Subjt:  LAASDMAALAVKATLALQNASTNNNH-----NHRRKKGRISPLL-LPNPSDS------DHIVSKDQHHPDTDTSASSVKKLHRLTSRTAKMAVRSFKLKL

Query:  STIYEGTVL
        STIYEGT L
Subjt:  STIYEGTVL

A0A6J1DM27 uncharacterized protein LOC1110216759.5e-5665.53Show/hide
Query:  MGACLSNNNNSALPPP--PTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHRLAASD
        MGACLS +  S  PPP  PTAKVISL+G+LREYP PISVSRVLQTEN  SSTSDSFLCNSD LYYDDFIPP+P DD LL   IYFLLPSS L  RL+ASD
Subjt:  MGACLSNNNNSALPPP--PTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHRLAASD

Query:  MAALAVKATLALQNASTNNNHNHRRKKGRISPLLLPNPSDSDHIVSKDQHHPDTD--------------TSASSVKKLHRLTSRTAKMAVRSFKLKLSTI
        MAA+A+KA+LALQNAS+ +     RKKGRISPLL+PNP+   H  S     P                   +SSV+KL +LTSR AKMAVRSFKLKLSTI
Subjt:  MAALAVKATLALQNASTNNNHNHRRKKGRISPLLLPNPSDSDHIVSKDQHHPDTD--------------TSASSVKKLHRLTSRTAKMAVRSFKLKLSTI

Query:  YEGTVL
        YEGTVL
Subjt:  YEGTVL

A0A6J1FIQ7 uncharacterized protein LOC1114461012.3e-6576.26Show/hide
Query:  MGACLSNNNN-----SALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHRLA
        MGACLS+  N     S  PPPPTAKVISLQGHLREYP+PISVSRVLQTENS SS SDSFLCNSDRLYYDDFIPPLP D+ LLP+ IYFLLPSSNLHHRL+
Subjt:  MGACLSNNNN-----SALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHRLA

Query:  ASDMAALAVKATLALQNASTNNNHNHRRKKGRISPLLLPNPSDSDHIVSKD--QHHPDTDTSAS-SVKKLHRLTSRTAKMAVRSFKLKLSTIYEGTVL
        AS MAALAVKA+LALQNAS N+    RRKKGR+SPLL  N SDSDHI+SK+  + +   DTSAS SV+KL RLTS+ AKMAVRSFKLKLSTIYEG VL
Subjt:  ASDMAALAVKATLALQNASTNNNHNHRRKKGRISPLLLPNPSDSDHIVSKD--QHHPDTDTSAS-SVKKLHRLTSRTAKMAVRSFKLKLSTIYEGTVL

A0A6J1J381 uncharacterized protein LOC1114808761.0e-6576.77Show/hide
Query:  MGACLSNNNN-----SALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHRLA
        MGACLS+  N     S  PPPPTAKVISLQGHLREYP+PISVSRVLQTENS SS SDSFLCNSDRLYYDDFIPPLP D+ LLP+ IYFLLPSSNLHHRL+
Subjt:  MGACLSNNNN-----SALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHRLA

Query:  ASDMAALAVKATLALQNASTNNNHNHRRKKGRISPLLLPNPSDSDHIVSKD--QHHPDTDTSAS-SVKKLHRLTSRTAKMAVRSFKLKLSTIYEGTVL
        AS MAALAVKA+LALQNAS N+    RRKKGR+SPLL  N SDSDHI+SK+  + +   DTSAS SV+KL RLTSR AKMAVRSFKLKLSTIYEG VL
Subjt:  ASDMAALAVKATLALQNASTNNNHNHRRKKGRISPLLLPNPSDSDHIVSKD--QHHPDTDTSAS-SVKKLHRLTSRTAKMAVRSFKLKLSTIYEGTVL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21010.1 unknown protein1.3e-3343.98Show/hide
Query:  MGACLS---NNNNSALPPPPTAKVISLQGHLREYPIPISVSRVLQTEN--SYSSTSDS-----FLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNL
        MG C+S    ++NS+    PT K++++ G LREY +P+  S+VL+ E+  +YSS+S S     F+C+SD LYYDDFIP +  ++ L  D IYF+LP S  
Subjt:  MGACLS---NNNNSALPPPPTAKVISLQGHLREYPIPISVSRVLQTEN--SYSSTSDS-----FLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNL

Query:  HHRLAASDMAALAVKATLALQNASTNNNHNHRRKKGRISPLLL-------PNPSDSDHIVSKDQHHPDTD---------TSASSVKKLHRLTSRTAKMAV
          RL ASDMAALAVKA++A+QN+      + RRKK RISP+++        N + S+  V K +                 + SV+ L R TS+ AK+AV
Subjt:  HHRLAASDMAALAVKATLALQNASTNNNHNHRRKKGRISPLLL-------PNPSDSDHIVSKDQHHPDTD---------TSASSVKKLHRLTSRTAKMAV

Query:  RSFKLKLSTIYEGTVL
        RSF+LKLSTIYEG+V+
Subjt:  RSFKLKLSTIYEGTVL

AT1G76600.1 unknown protein3.4e-3744.19Show/hide
Query:  MGACLSNNNNSALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDS----FLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHRLAA
        MG C+S N N  +    TAK++++ G LREY +P+  S+VL++E++ SS+S S    FLCNSD LYYDDFIP +  D+ L  + IYF+LP S   +RL+A
Subjt:  MGACLSNNNNSALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDS----FLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHRLAA

Query:  SDMAALAVKATLALQNASTNNNHNHRRKKGRISPLLLPNPSDSDHIVS-------------------KDQHHPDTDTS----ASSVKKLHRLTSRTAKMA
        SDMAALAVKA++A++ A+     N RR+ GRISP++  N ++ + I +                    ++  P  DT+    + SV+KL R TS  AK+A
Subjt:  SDMAALAVKATLALQNASTNNNHNHRRKKGRISPLLLPNPSDSDHIVS-------------------KDQHHPDTDTS----ASSVKKLHRLTSRTAKMA

Query:  VRSFKLKLSTIYEGT
        VRSF+L+LSTIYEG+
Subjt:  VRSFKLKLSTIYEGT

AT2G23690.1 unknown protein4.1e-1133.83Show/hide
Query:  MGACLSNNNNSALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHRLAASDMA
        MG C S  +        TAK+I   G + E+  P+ V  VLQ           F+CNSD + +D+ +  +  D+      +YF LP S+LHH L A +MA
Subjt:  MGACLSNNNNSALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHRLAASDMA

Query:  ALAVKATLALQNASTNNNHNH-RRKKGRISPLL
        ALAVKA+ AL  +  +   +  R ++  +SP++
Subjt:  ALAVKATLALQNASTNNNHNH-RRKKGRISPLL

AT3G50800.1 unknown protein3.4e-1340.91Show/hide
Query:  MGACLSNNNNSALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHRLAASDMA
        MGAC S  +        TAK+I   G L+E+  P+ V ++LQ          SF+CNSD + +DD +  +P  + L P  +YF+LP + L+H L A +MA
Subjt:  MGACLSNNNNSALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHRLAASDMA

Query:  ALAVKATLAL
        ALAVKA+ AL
Subjt:  ALAVKATLAL

AT5G66580.1 unknown protein4.9e-1239.09Show/hide
Query:  MGACLSNNNNSALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHRLAASDMA
        MGAC S  +  +     +AK+I L G L+E+  P+ V ++LQ          SF+CNSD + +DD +  +  ++ L    +YF+LP + L+H L A +MA
Subjt:  MGACLSNNNNSALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHRLAASDMA

Query:  ALAVKATLAL
        ALAVKA+ AL
Subjt:  ALAVKATLAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGCCTGTTTGTCTAACAACAATAATTCTGCCCTTCCACCTCCTCCCACCGCGAAAGTGATATCTTTACAAGGCCATCTTCGCGAATACCCCATTCCTATCTCCGT
CTCCCGCGTTCTCCAAACCGAAAATTCATATTCTTCCACTTCCGACTCCTTTCTATGCAACTCCGATCGCTTATACTACGATGACTTCATTCCGCCTTTGCCTCACGACG
ATCACCTTCTCCCCGATCACATCTATTTCCTCCTTCCTTCCTCCAACCTCCACCACCGGTTAGCTGCCTCCGATATGGCCGCCTTGGCCGTCAAAGCCACCCTCGCACTC
CAAAATGCCTCCACCAACAATAATCATAATCATCGCCGTAAAAAGGGTCGTATCTCTCCTCTCCTCCTCCCCAACCCCTCGGATTCCGACCACATCGTCTCCAAGGATCA
ACACCACCCCGACACCGACACGTCTGCTTCCTCCGTTAAAAAATTGCACAGATTGACATCCAGAACAGCAAAAATGGCCGTTCGTTCTTTTAAACTCAAATTGAGCACCA
TCTATGAAGGCACCGTTCTGTAG
mRNA sequenceShow/hide mRNA sequence
CAATATTGACGCAACTCTCATTCTCTTTCCTCCTCATCACTTACCTCACAAGATGATGATCAGATTTCAATGGCAAGTAATCAATATATATATATATATATGTATATGGC
TATGAACCCCCTTCATAGCTATAGTTGAATAGATCTAATTGATATGGGCGCCTGTTTGTCTAACAACAATAATTCTGCCCTTCCACCTCCTCCCACCGCGAAAGTGATAT
CTTTACAAGGCCATCTTCGCGAATACCCCATTCCTATCTCCGTCTCCCGCGTTCTCCAAACCGAAAATTCATATTCTTCCACTTCCGACTCCTTTCTATGCAACTCCGAT
CGCTTATACTACGATGACTTCATTCCGCCTTTGCCTCACGACGATCACCTTCTCCCCGATCACATCTATTTCCTCCTTCCTTCCTCCAACCTCCACCACCGGTTAGCTGC
CTCCGATATGGCCGCCTTGGCCGTCAAAGCCACCCTCGCACTCCAAAATGCCTCCACCAACAATAATCATAATCATCGCCGTAAAAAGGGTCGTATCTCTCCTCTCCTCC
TCCCCAACCCCTCGGATTCCGACCACATCGTCTCCAAGGATCAACACCACCCCGACACCGACACGTCTGCTTCCTCCGTTAAAAAATTGCACAGATTGACATCCAGAACA
GCAAAAATGGCCGTTCGTTCTTTTAAACTCAAATTGAGCACCATCTATGAAGGCACCGTTCTGTAGGGAGAGTAATTACAAGGGATTTCAGCCCCCGATTTTTGTGTGGA
TACGTCCCGCCTCTGGTATATATAGAGGGAATTGTCCTCTTCCGAATTAAATCGTGATCTGATTCCAACTAATTGCAATTAAATTAATATTGTGTACGTTGAGACGACTG
GGGTGTAATTGGATTAATGGGAAGGCGATCAAGATGAAGAAAATTAAAAACGGTGGACCGACGGAGGCGGCAACGGGCAATGGGCAATCTAAGTCAAGAGTCAAATGGGA
CTCGGAGGGCATTTAATTGAAATACAATGTGAAGACGGAAGAATTGAGAATTTGAAGCCGCGCACCCAAATTTGATGCCATCCACGACTCTGTTGTGTTTACCAACGCAC
TAATACAATTCCTCTTTTTCTCCTTTCTTTTCTATTAATTCGACGTTCTTTGATCTCTCTACCATATTGTCGATGCCTATATCTATTTCCCTTCTTCTTCTTCTTTTTTT
CCCCTATTCTATTTATAATTTATTAATAATAGTCTTTTTTTTTTCTTTTTTTTTAATGTTAAACCAACGCTACGCTACCTAACCCATGCTTAAGTTAATAACTAAGGACT
ATAGTTTAATTAATTTATCAGATATTTGATTAGGAAATACTTTGTAAAGTCAA
Protein sequenceShow/hide protein sequence
MGACLSNNNNSALPPPPTAKVISLQGHLREYPIPISVSRVLQTENSYSSTSDSFLCNSDRLYYDDFIPPLPHDDHLLPDHIYFLLPSSNLHHRLAASDMAALAVKATLAL
QNASTNNNHNHRRKKGRISPLLLPNPSDSDHIVSKDQHHPDTDTSASSVKKLHRLTSRTAKMAVRSFKLKLSTIYEGTVL