; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013520 (gene) of Snake gourd v1 genome

Gene IDTan0013520
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationLG10:54526173..54527986
RNA-Seq ExpressionTan0013520
SyntenyTan0013520
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFY85402.1 hypothetical protein Acr_04g0001400 [Actinidia rufa]7.2e-4639.1Show/hide
Query:  ANTRSPSTPLPSQPAVYHPTPPVFQTPRPQFGAFQSAGFYPTPSQQ-AHFSPLQRAHFPVLAQAIPNVAPSLSVKLNDSNYLLWRNQLLNVVRAHNLQGL
        + T S STP    P   +P P               +   P P+ Q  + SPL           +P++   L+VKL+D NY++W+ QLLN+V A+ L+  
Subjt:  ANTRSPSTPLPSQPAVYHPTPPVFQTPRPQFGAFQSAGFYPTPSQQ-AHFSPLQRAHFPVLAQAIPNVAPSLSVKLNDSNYLLWRNQLLNVVRAHNLQGL

Query:  LDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMSWIYSSLTEEKMGEIISLSVVAEVWESFRKSYSSSASTRVLSLKSQIQRIKKEGLSVSQYLSQIKE
        LDGS  CPP+F+D  Q Q NPEF  WQ  N  VMSWIY+S+ E  +G+I+  +  +++WE+  + Y++++   +  L++ +Q IKKEGL+   Y+ + + 
Subjt:  LDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMSWIYSSLTEEKMGEIISLSVVAEVWESFRKSYSSSASTRVLSLKSQIQRIKKEGLSVSQYLSQIKE

Query:  VADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTSIQHRIELPSIEDVRTLLMSYDYRLEKQNTVE
        + + L +IGEP++  DH+ Y L GLG +YN FVTSIQ +   PSIE+V +LL+SYD RLE+Q+  +
Subjt:  VADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTSIQHRIELPSIEDVRTLLMSYDYRLEKQNTVE

PON47862.1 hypothetical protein TorRG33x02_321990 [Trema orientale]1.7e-5040.46Show/hide
Query:  IPNVAPSLSVKLNDSNYLLWRNQLLNVVRAHNLQGLLDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMSWIYSSLTEEKMGEIISLSVVAEVWESFRK
        +P++    ++KL+  NYL+W+NQLLNV+ A+ L+  +DGS PCPP+F D  +  VN E++ WQ  N  +MSWIY+SLT+  MG+I+  +   E+WE+  +
Subjt:  IPNVAPSLSVKLNDSNYLLWRNQLLNVVRAHNLQGLLDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMSWIYSSLTEEKMGEIISLSVVAEVWESFRK

Query:  SYSSSASTRVLSLKSQIQRIKKEGLSVSQYLSQIKEVADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTSIQHRIELPSIEDVRTLLMSYDYRLEKQNT
         Y+SS+  ++  L++++Q ++K+GL+  +Y+ + K + + L A+GEP+S KDH+ Y+  GL  EYNAFVTSI  R +   +E++ +LL+SY++RLE QN 
Subjt:  SYSSSASTRVLSLKSQIQRIKKEGLSVSQYLSQIKEVADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTSIQHRIELPSIEDVRTLLMSYDYRLEKQNT

Query:  VEQLPIIQANLTHMQ--------PFSS-----TKKINKQPNSFLQRNPPNN---APGLLGKP
          QL  +QANL H+          FS+     T+    +   F Q +PPN+    P +LGKP
Subjt:  VEQLPIIQANLTHMQ--------PFSS-----TKKINKQPNSFLQRNPPNN---APGLLGKP

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]5.2e-4442.26Show/hide
Query:  FYPTPSQQAHFSPLQRAH-FPVLAQAI---PNVAPSLSVKLNDSNYLLWRNQLLNVVRAHNLQGLLDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMS
        F PTP+  ++ +     +  P + Q     P+++ SLS+KL+++N LL ++QLLNV+ A+ L+  +D     PP+++D    QVNPEF+ W  LN  VMS
Subjt:  FYPTPSQQAHFSPLQRAH-FPVLAQAI---PNVAPSLSVKLNDSNYLLWRNQLLNVVRAHNLQGLLDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMS

Query:  WIYSSLTEEKMGEIISLSVVAEVWESFRKSYSSSASTRVLSLKSQIQRIKKEGLSVSQYLSQIKEVADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTS
        WIYSSLT   +G+I+  S   ++W S    Y S +   V+SL SQ+QRIKK  + +S+YLS++K V D+   IGEP+S +D ++ ILEGL  EY+ FVTS
Subjt:  WIYSSLTEEKMGEIISLSVVAEVWESFRKSYSSSASTRVLSLKSQIQRIKKEGLSVSQYLSQIKEVADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTS

Query:  IQHRIELPSIEDVRTLLMSYDYRLEKQNTVEQLPIIQAN
        I +R + PS+++V +LL +Y+YRL +++  + L   QAN
Subjt:  IQHRIELPSIEDVRTLLMSYDYRLEKQNTVEQLPIIQAN

RVX14312.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]4.4e-4338.76Show/hide
Query:  IPNVAPSLSVKLNDSNYLLWRNQLLNVVRAHNLQGLLDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMSWIYSSLTEEKMGEIISLSVVAEVWESFRK
        +P++  + +V L+ SNYLLWR Q+LN++ A+ L+ ++ G +P P +F+ + ++ +NPE+  WQ  N  VM WIYSSLTE  M +II L   +E+W +  K
Subjt:  IPNVAPSLSVKLNDSNYLLWRNQLLNVVRAHNLQGLLDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMSWIYSSLTEEKMGEIISLSVVAEVWESFRK

Query:  SYSSSASTRVLSLKSQIQRIKKEGLSVSQYLSQIKEVADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTSIQHRIELPSIEDVRTLLMSYDYRLEKQNT
         +S+++  R++ L+ Q+Q  KK GLS+ +YL +IK + D L AIGE I+ +D + Y+L GLG EYN+FV ++    E  S+E++ ++L++++ +LE+Q+ 
Subjt:  SYSSSASTRVLSLKSQIQRIKKEGLSVSQYLSQIKEVADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTSIQHRIELPSIEDVRTLLMSYDYRLEKQNT

Query:  VEQLPIIQANLTHMQPFSSTKKINKQPNSFLQ-RNPPNNAPGLLGKPNNFSSPHNRWP
         E+  ++QAN+T M      KK  K      Q R   NN     G  NN S   N  P
Subjt:  VEQLPIIQANLTHMQPFSSTKKINKQPNSFLQ-RNPPNNAPGLLGKPNNFSSPHNRWP

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]1.7e-6346.86Show/hide
Query:  PVLAQAIPNVAPSLSVKLNDSNYLLWRNQLLNVVRAHNLQGLLDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMSWIYSSLTEEKMGEIISLSVVAEV
        P  A   P +   L+VKLND+N+LLW+NQLLN V A+ L+G LDG++  PPQF+D+ Q Q NP +  W+  N  +M WIYSSL+EEKMGE++SL    ++
Subjt:  PVLAQAIPNVAPSLSVKLNDSNYLLWRNQLLNVVRAHNLQGLLDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMSWIYSSLTEEKMGEIISLSVVAEV

Query:  WESFRKSYSSSASTRVLSLKSQIQRIKKEGLSVSQYLSQIKEVADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTSIQHRIELPSIEDVRTLLMSYDYR
        W S  + Y S  + R++ LK+++Q ++K+G SVSQYL++IKE+ADK  A+GEP+S +DH++++L+GLG EYNAFVTSI +R + PS+EDVR+LL++Y+ R
Subjt:  WESFRKSYSSSASTRVLSLKSQIQRIKKEGLSVSQYLSQIKEVADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTSIQHRIELPSIEDVRTLLMSYDYR

Query:  LEKQNTVEQLPIIQANLTHMQPFSSTKKINKQ---PNSFLQRNPPN-----NAPGLLGKPNNFSSPHNRWP
        L+KQNTV+QL I QANL ++    ++K+   +   PN +    P +      +  +LGKP +     ++WP
Subjt:  LEKQNTVEQLPIIQANLTHMQPFSSTKKINKQ---PNSFLQRNPPN-----NAPGLLGKPNNFSSPHNRWP

TrEMBL top hitse value%identityAlignment
A0A2P5BGF8 Uncharacterized protein8.0e-5140.46Show/hide
Query:  IPNVAPSLSVKLNDSNYLLWRNQLLNVVRAHNLQGLLDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMSWIYSSLTEEKMGEIISLSVVAEVWESFRK
        +P++    ++KL+  NYL+W+NQLLNV+ A+ L+  +DGS PCPP+F D  +  VN E++ WQ  N  +MSWIY+SLT+  MG+I+  +   E+WE+  +
Subjt:  IPNVAPSLSVKLNDSNYLLWRNQLLNVVRAHNLQGLLDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMSWIYSSLTEEKMGEIISLSVVAEVWESFRK

Query:  SYSSSASTRVLSLKSQIQRIKKEGLSVSQYLSQIKEVADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTSIQHRIELPSIEDVRTLLMSYDYRLEKQNT
         Y+SS+  ++  L++++Q ++K+GL+  +Y+ + K + + L A+GEP+S KDH+ Y+  GL  EYNAFVTSI  R +   +E++ +LL+SY++RLE QN 
Subjt:  SYSSSASTRVLSLKSQIQRIKKEGLSVSQYLSQIKEVADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTSIQHRIELPSIEDVRTLLMSYDYRLEKQNT

Query:  VEQLPIIQANLTHMQ--------PFSS-----TKKINKQPNSFLQRNPPNN---APGLLGKP
          QL  +QANL H+          FS+     T+    +   F Q +PPN+    P +LGKP
Subjt:  VEQLPIIQANLTHMQ--------PFSS-----TKKINKQPNSFLQRNPPNN---APGLLGKP

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE12.5e-4442.26Show/hide
Query:  FYPTPSQQAHFSPLQRAH-FPVLAQAI---PNVAPSLSVKLNDSNYLLWRNQLLNVVRAHNLQGLLDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMS
        F PTP+  ++ +     +  P + Q     P+++ SLS+KL+++N LL ++QLLNV+ A+ L+  +D     PP+++D    QVNPEF+ W  LN  VMS
Subjt:  FYPTPSQQAHFSPLQRAH-FPVLAQAI---PNVAPSLSVKLNDSNYLLWRNQLLNVVRAHNLQGLLDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMS

Query:  WIYSSLTEEKMGEIISLSVVAEVWESFRKSYSSSASTRVLSLKSQIQRIKKEGLSVSQYLSQIKEVADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTS
        WIYSSLT   +G+I+  S   ++W S    Y S +   V+SL SQ+QRIKK  + +S+YLS++K V D+   IGEP+S +D ++ ILEGL  EY+ FVTS
Subjt:  WIYSSLTEEKMGEIISLSVVAEVWESFRKSYSSSASTRVLSLKSQIQRIKKEGLSVSQYLSQIKEVADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTS

Query:  IQHRIELPSIEDVRTLLMSYDYRLEKQNTVEQLPIIQAN
        I +R + PS+++V +LL +Y+YRL +++  + L   QAN
Subjt:  IQHRIELPSIEDVRTLLMSYDYRLEKQNTVEQLPIIQAN

A0A6J1DQX7 uncharacterized protein LOC1110223158.3e-6446.86Show/hide
Query:  PVLAQAIPNVAPSLSVKLNDSNYLLWRNQLLNVVRAHNLQGLLDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMSWIYSSLTEEKMGEIISLSVVAEV
        P  A   P +   L+VKLND+N+LLW+NQLLN V A+ L+G LDG++  PPQF+D+ Q Q NP +  W+  N  +M WIYSSL+EEKMGE++SL    ++
Subjt:  PVLAQAIPNVAPSLSVKLNDSNYLLWRNQLLNVVRAHNLQGLLDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMSWIYSSLTEEKMGEIISLSVVAEV

Query:  WESFRKSYSSSASTRVLSLKSQIQRIKKEGLSVSQYLSQIKEVADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTSIQHRIELPSIEDVRTLLMSYDYR
        W S  + Y S  + R++ LK+++Q ++K+G SVSQYL++IKE+ADK  A+GEP+S +DH++++L+GLG EYNAFVTSI +R + PS+EDVR+LL++Y+ R
Subjt:  WESFRKSYSSSASTRVLSLKSQIQRIKKEGLSVSQYLSQIKEVADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTSIQHRIELPSIEDVRTLLMSYDYR

Query:  LEKQNTVEQLPIIQANLTHMQPFSSTKKINKQ---PNSFLQRNPPN-----NAPGLLGKPNNFSSPHNRWP
        L+KQNTV+QL I QANL ++    ++K+   +   PN +    P +      +  +LGKP +     ++WP
Subjt:  LEKQNTVEQLPIIQANLTHMQPFSSTKKINKQ---PNSFLQRNPPN-----NAPGLLGKPNNFSSPHNRWP

A0A7J0EGI5 Uncharacterized protein3.5e-4639.1Show/hide
Query:  ANTRSPSTPLPSQPAVYHPTPPVFQTPRPQFGAFQSAGFYPTPSQQ-AHFSPLQRAHFPVLAQAIPNVAPSLSVKLNDSNYLLWRNQLLNVVRAHNLQGL
        + T S STP    P   +P P               +   P P+ Q  + SPL           +P++   L+VKL+D NY++W+ QLLN+V A+ L+  
Subjt:  ANTRSPSTPLPSQPAVYHPTPPVFQTPRPQFGAFQSAGFYPTPSQQ-AHFSPLQRAHFPVLAQAIPNVAPSLSVKLNDSNYLLWRNQLLNVVRAHNLQGL

Query:  LDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMSWIYSSLTEEKMGEIISLSVVAEVWESFRKSYSSSASTRVLSLKSQIQRIKKEGLSVSQYLSQIKE
        LDGS  CPP+F+D  Q Q NPEF  WQ  N  VMSWIY+S+ E  +G+I+  +  +++WE+  + Y++++   +  L++ +Q IKKEGL+   Y+ + + 
Subjt:  LDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMSWIYSSLTEEKMGEIISLSVVAEVWESFRKSYSSSASTRVLSLKSQIQRIKKEGLSVSQYLSQIKE

Query:  VADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTSIQHRIELPSIEDVRTLLMSYDYRLEKQNTVE
        + + L +IGEP++  DH+ Y L GLG +YN FVTSIQ +   PSIE+V +LL+SYD RLE+Q+  +
Subjt:  VADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTSIQHRIELPSIEDVRTLLMSYDYRLEKQNTVE

A0A803NL56 Uncharacterized protein1.7e-4533.83Show/hide
Query:  TNRQSHNTTSPSNIQANTRSPSTPLPSQPAVYHPTPPVFQTPRPQFGAFQSAGFYPTPSQQAHFSPLQRAHFPVLAQAIPNVAPSLSVKLNDSNYLLWRN
        TN Q+ NTT+   I A++ SPS P                   P F +F                                   ++SVKL+D+NYL+WR 
Subjt:  TNRQSHNTTSPSNIQANTRSPSTPLPSQPAVYHPTPPVFQTPRPQFGAFQSAGFYPTPSQQAHFSPLQRAHFPVLAQAIPNVAPSLSVKLNDSNYLLWRN

Query:  QLLNVVRAHNLQGLLDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMSWIYSSLTEEKMGEIISLSVVAEVWESFRKSYSSSASTRVLSLKSQIQRIKK
        Q+ N++ A+ L+G +DG+V C  QF ++   QV+P F  W   N  +MSW+Y+SL++  +G+I+  +   E+W S  ++YS+++  R    +  +Q +KK
Subjt:  QLLNVVRAHNLQGLLDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMSWIYSSLTEEKMGEIISLSVVAEVWESFRKSYSSSASTRVLSLKSQIQRIKK

Query:  EGLSVSQYLSQIKEVADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTSIQHRIELPSIEDVRTLLMSYDYRLEKQNTVEQLPIIQANLTHM-----QPF
        + L+ S YL ++K + + L ++G+PIS ++H++Y+L GLGLEYNAFVT I  R   P+IE+V  LL+SY+ RLE+QN       +QAN  ++     +P 
Subjt:  EGLSVSQYLSQIKEVADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTSIQHRIELPSIEDVRTLLMSYDYRLEKQNTVEQLPIIQANLTHM-----QPF

Query:  SSTKKINKQPN--SFLQRNPPNNAPGLLGKPNNFSSP
        SS+++ + QP   S  Q+N P   P      N++ SP
Subjt:  SSTKKINKQPN--SFLQRNPPNNAPGLLGKPNNFSSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.4e-1524.02Show/hide
Query:  PVLAQAIPNVAPSLSVKLN--DSNYLLWRNQLLNVVRAHNLQGLLDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMSWIYSSLTEEK-MGEIISLSVV
        P     + N+   + V L+  +SNY  WR   L    + ++ G +DG++              N   ++WQ  +  V   +Y +LT ++  G  ++ S  
Subjt:  PVLAQAIPNVAPSLSVKLN--DSNYLLWRNQLLNVVRAHNLQGLLDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMSWIYSSLTEEK-MGEIISLSVV

Query:  AEVWESFRKSYSSSASTRVLSLKSQIQRIKKEGLSVSQYLSQIKEVADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTSIQHRIELPSIEDVRTLLMSY
         ++W   +  + ++   R L L S+++      + V+ Y  ++K++AD L  +  P++ ++ V Y+L GL  +++  +  I+HR   PS +D  T+L   
Subjt:  AEVWESFRKSYSSSASTRVLSLKSQIQRIKKEGLSVSQYLSQIKEVADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTSIQHRIELPSIEDVRTLLMSY

Query:  DYRLEKQNTVEQLPIIQANLTHMQPFSST
        + RL++         I+ N TH+   SS+
Subjt:  DYRLEKQNTVEQLPIIQANLTHMQPFSST


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAGAATTCGTCTTCTTCTTCCTTTGGATCTGATCCAATACCATCTGTGGGAATAACCATTAACCCAACCAACCGTCAGTCTCACAACACAACCTCTCCTTCAAA
TATTCAAGCAAATACTCGATCTCCAAGCACTCCACTCCCCTCACAGCCTGCGGTTTACCATCCAACGCCTCCAGTTTTTCAAACCCCTCGTCCTCAGTTTGGCGCCTTTC
AATCTGCAGGCTTCTACCCAACACCTTCTCAGCAAGCTCATTTCTCACCATTGCAACGAGCTCATTTTCCAGTTCTTGCTCAGGCGATTCCGAATGTTGCTCCATCTTTA
TCGGTGAAACTCAATGATTCAAACTATCTTCTTTGGCGCAACCAGCTTCTCAATGTTGTTCGAGCTCATAACTTACAAGGCCTCCTTGATGGATCTGTACCTTGTCCACC
TCAGTTTATCGACAACCTACAACATCAAGTAAATCCAGAGTTCCTTCATTGGCAAACTCTAAATAGCACGGTAATGTCATGGATATACTCTTCTCTCACAGAAGAGAAAA
TGGGTGAGATTATCTCTTTATCTGTTGTTGCTGAGGTTTGGGAGTCTTTTAGAAAATCATACTCTTCTTCCGCATCTACTAGAGTGCTGAGTCTTAAATCTCAAATTCAA
AGAATTAAAAAGGAAGGGTTGTCTGTTTCTCAATATTTGTCTCAGATTAAGGAAGTGGCTGATAAACTTGATGCTATTGGAGAACCCATTTCGCTTAAAGATCATGTTTC
TTATATTTTGGAGGGTCTGGGCCTTGAATATAATGCTTTTGTGACTTCTATACAACATAGAATAGAGCTTCCTTCCATTGAGGATGTGAGAACTTTGTTAATGAGCTATG
ACTATCGTTTGGAAAAGCAAAATACAGTGGAGCAATTACCCATCATTCAAGCAAATCTTACACATATGCAACCCTTTTCCTCAACAAAAAAAATCAACAAACAACCAAAT
TCTTTTCTCCAGAGGAACCCACCGAACAATGCCCCAGGTCTTCTTGGCAAACCAAATAATTTTTCCTCTCCTCACAACCGATGGCCCAATCGTTCCCCCAACCATCAACA
AGGAAAAATTTAA
mRNA sequenceShow/hide mRNA sequence
CAGTCTCACTGTTTCCTCTTTCATGGAATCAGAGCTTTGGTAAGCTCATTTTTTTTTAATGGCTGAGAATTCGTCTTCTTCTTCCTTTGGATCTGATCCAATACCATCTG
TGGGAATAACCATTAACCCAACCAACCGTCAGTCTCACAACACAACCTCTCCTTCAAATATTCAAGCAAATACTCGATCTCCAAGCACTCCACTCCCCTCACAGCCTGCG
GTTTACCATCCAACGCCTCCAGTTTTTCAAACCCCTCGTCCTCAGTTTGGCGCCTTTCAATCTGCAGGCTTCTACCCAACACCTTCTCAGCAAGCTCATTTCTCACCATT
GCAACGAGCTCATTTTCCAGTTCTTGCTCAGGCGATTCCGAATGTTGCTCCATCTTTATCGGTGAAACTCAATGATTCAAACTATCTTCTTTGGCGCAACCAGCTTCTCA
ATGTTGTTCGAGCTCATAACTTACAAGGCCTCCTTGATGGATCTGTACCTTGTCCACCTCAGTTTATCGACAACCTACAACATCAAGTAAATCCAGAGTTCCTTCATTGG
CAAACTCTAAATAGCACGGTAATGTCATGGATATACTCTTCTCTCACAGAAGAGAAAATGGGTGAGATTATCTCTTTATCTGTTGTTGCTGAGGTTTGGGAGTCTTTTAG
AAAATCATACTCTTCTTCCGCATCTACTAGAGTGCTGAGTCTTAAATCTCAAATTCAAAGAATTAAAAAGGAAGGGTTGTCTGTTTCTCAATATTTGTCTCAGATTAAGG
AAGTGGCTGATAAACTTGATGCTATTGGAGAACCCATTTCGCTTAAAGATCATGTTTCTTATATTTTGGAGGGTCTGGGCCTTGAATATAATGCTTTTGTGACTTCTATA
CAACATAGAATAGAGCTTCCTTCCATTGAGGATGTGAGAACTTTGTTAATGAGCTATGACTATCGTTTGGAAAAGCAAAATACAGTGGAGCAATTACCCATCATTCAAGC
AAATCTTACACATATGCAACCCTTTTCCTCAACAAAAAAAATCAACAAACAACCAAATTCTTTTCTCCAGAGGAACCCACCGAACAATGCCCCAGGTCTTCTTGGCAAAC
CAAATAATTTTTCCTCTCCTCACAACCGATGGCCCAATCGTTCCCCCAACCATCAACAAGGAAAAATTTAATGCCAAATCTGTGGCAAGTTTGGCCATTCTGCTTTAATG
TGTTATCATCGGATGAATGCAAACTACCAACCACAAACATCTCCAAATCCACCTCAAGCCTTCTATCACAACATTCAAACAACTCAAATAGCCCAATCTGATCAGTCCTC
TGCCAACACATTCACTAATCCATCCAATATTCCTGATGAGGCATGGTGCATGGACTCTGGAGCCACTCACCACATTTCATCGGATATAAACTCTTTGTCCAATCCCATGC
CCTACACAGGTGGAGAGCAAGTGACCGTGGGAAATGGACCTTTCAACCAAGAAAATACTACTACAAGGCAATCTTGAGGGTGGGCTATATCGACTACAACCTTCTTCTTC
ACTTTCAACTTCGGGTTCTTTGACTTTACCGTTTGTTGCTGCTTTTCTTTCCACAAAAGATGCCGAGTTGTGGCGCAATCGTTTAGGAC
Protein sequenceShow/hide protein sequence
MAENSSSSSFGSDPIPSVGITINPTNRQSHNTTSPSNIQANTRSPSTPLPSQPAVYHPTPPVFQTPRPQFGAFQSAGFYPTPSQQAHFSPLQRAHFPVLAQAIPNVAPSL
SVKLNDSNYLLWRNQLLNVVRAHNLQGLLDGSVPCPPQFIDNLQHQVNPEFLHWQTLNSTVMSWIYSSLTEEKMGEIISLSVVAEVWESFRKSYSSSASTRVLSLKSQIQ
RIKKEGLSVSQYLSQIKEVADKLDAIGEPISLKDHVSYILEGLGLEYNAFVTSIQHRIELPSIEDVRTLLMSYDYRLEKQNTVEQLPIIQANLTHMQPFSSTKKINKQPN
SFLQRNPPNNAPGLLGKPNNFSSPHNRWPNRSPNHQQGKI