CuGenDBv2

Tan0011523 (gene) of Snake gourd v1 genome

Gene ID: Tan0011523
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag-pro-like protein
Genome location: LG07:36624738..36626342
RNA-Seq Expression: Tan0011523
Synteny: Tan0011523
Gene Ontology terms:
GO:0006310 - DNA recombination (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0005634 - nucleus (cellular component)
GO:0030430 - host cell cytoplasm (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0036933.1 uncharacterized protein E6C27_scaffold86G00060 [Cucumis melo var. makuwa] (e value: 1.1e-56; % identity: 34.36)
Query:  RNRDLEQENEKLRREVSHLTNLAARTNKKLEEVEKNSKDRSKRERD---YDEENKRLNKENHTLRNQNTALRKKVHLQENKIKDFVEAKGTLMKLVSELK
        +NR LEQENEKLR+E S   + A     +LE+ +   K++ K E D    D+E +R+NK N +++N+ T L+  V                         
Subjt:  RNRDLEQENEKLRREVSHLTNLAARTNKKLEEVEKNSKDRSKRERD---YDEENKRLNKENHTLRNQNTALRKKVHLQENKIKDFVEAKGTLMKLVSELK

Query:  ETVNKREVQFVEFEQANNTLCHTLDDLHMKLNDQSEEYEIMRNYASSLDHQLKACQRSSEQLLIQKEQLERQRHTMKEDYDVLRSDLQEIIEKVNQTMCT
                                  LH+K+ ++SEEY+I++NYA  L +QL A Q SS+++  + E L      MK DYD+   D Q ++E+V+QT+  
Subjt:  ETVNKREVQFVEFEQANNTLCHTLDDLHMKLNDQSEEYEIMRNYASSLDHQLKACQRSSEQLLIQKEQLERQRHTMKEDYDVLRSDLQEIIEKVNQTMCT

Query:  IAIMARRARGFAEWARDLRRSTSPMTSNADELYEFLGMISRDLGYFEMEKTRKDIEELRGKLDVVFILLE--KGKTTIDVAQPGKVVNDPLEVNFTPQYT
        + ++++RA GFAEWA     S +P+    +  Y+   M  +D    +M+K R++I  L  ++  +  LL   KGK  +D  Q    + D  +  + P +T
Subjt:  IAIMARRARGFAEWARDLRRSTSPMTSNADELYEFLGMISRDLGYFEMEKTRKDIEELRGKLDVVFILLE--KGKTTIDVAQPGKVVNDPLEVNFTPQYT

Query:  TYNPLYDIPVEQISFPFKTEHAPTSGQTGISISKKGPSRVTVTTPNLDNPEVRKGLSGGENVSSNEKLEVLEKRLRVIEGTDVFGNIDAGNLCLVPDVVL
         Y+                          I++ +  P                        + S +KL+VLE+RLR IE TDV+GNIDA  LCLVP +++
Subjt:  TYNPLYDIPVEQISFPFKTEHAPTSGQTGISISKKGPSRVTVTTPNLDNPEVRKGLSGGENVSSNEKLEVLEKRLRVIEGTDVFGNIDAGNLCLVPDVVL

Query:  PPKFKVPEFEKYDGTSCPKNHLIMYSRKMAAYVQNDRLLIHCFQDSLTGPASRW
        P KFKVPEF KYDG++CP++HLIMY RKMA ++ ND+LL+HCFQDSLT PASRW
Subjt:  PPKFKVPEFEKYDGTSCPKNHLIMYSRKMAAYVQNDRLLIHCFQDSLTGPASRW

XP_022147189.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111016200 [Momordica charantia] (e value: 9.1e-43; % identity: 60)
Query:  PQYTTYNPLYDIPVEQISFPFKTEHAPTSGQTGISISKKGPSRVTVTTPNLDN--PEVRKGLSGGENVSSNEKLEVLEKRLRVIEGTDVFGNIDAGNLCL
        PQYTTYNPLYD+PV Q   PF       + Q   +I  + P ++ ++ P + N    + K    G+N  SNEK EVL++RLR IE TDVFGNIDA  LC 
Subjt:  PQYTTYNPLYDIPVEQISFPFKTEHAPTSGQTGISISKKGPSRVTVTTPNLDN--PEVRKGLSGGENVSSNEKLEVLEKRLRVIEGTDVFGNIDAGNLCL

Query:  VPDVVLPPKFKVPEFEKYDGTSCPKNHLIMYSRKMAAYVQNDRLLIHCFQDSLTGPASRW
        V  +V+PPK KVPEFEKY+G+SCPKNHL MY RKMAAYVQND+LLIHCFQDSL+GPASRW
Subjt:  VPDVVLPPKFKVPEFEKYDGTSCPKNHLIMYSRKMAAYVQNDRLLIHCFQDSLTGPASRW

XP_022155098.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022231, partial [Momordica charantia] (e value: 1.4e-54; % identity: 52.63)
Query:  EMEKTRKDIEELRGKLDVVFILLEKGKTTIDVAQPGKVVNDPLEVNFTP----------------------QYTTYNPLYDIPVEQISFPFKTEHAPTSG
        E EKTRKDIEELR KLD + + LEKGKT  + + P   +++P    F P                      QYTTYNPLYDIP  Q  FP      P   
Subjt:  EMEKTRKDIEELRGKLDVVFILLEKGKTTIDVAQPGKVVNDPLEVNFTP----------------------QYTTYNPLYDIPVEQISFPFKTEHAPTSG

Query:  QTGISISKKGPSRVTVTTPNLDNPEVRKGLSGGENVSSNEKLEVLEKRLRVIEGTDVFGNIDAGNLCLVPDVVLPPKFKVPEFEKYDGTSCPKNHLIMYS
          GI   +        T  NL  P+  K     E   S+EKLEVLE+RLR +EGTDVFGNIDA  LCL   +V+PPKFK+PEFEKY+G+SCPKNHLIMY 
Subjt:  QTGISISKKGPSRVTVTTPNLDNPEVRKGLSGGENVSSNEKLEVLEKRLRVIEGTDVFGNIDAGNLCLVPDVVLPPKFKVPEFEKYDGTSCPKNHLIMYS

Query:  RKMAAYVQNDRLLIHCFQDSLTGPASRW
        RKMAAY+QND+LLIHCFQDSL+GP S W
Subjt:  RKMAAYVQNDRLLIHCFQDSLTGPASRW

XP_022157796.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111024415 [Momordica charantia] (e value: 1.1e-53; % identity: 54.05)
Query:  EMEKTRKDIEELRGKLDVVFILLEKGKTTIDVAQPGKVVNDPLEV----------------NFTPQYTTYNPLYDIPVEQISFPFKTEHAPTSGQTGISI
        E EKTRKDIEELR KLD + + LEKGK T D A     +++P E                  F PQYTTYNPLYD+P+ Q  +P           T I  
Subjt:  EMEKTRKDIEELRGKLDVVFILLEKGKTTIDVAQPGKVVNDPLEV----------------NFTPQYTTYNPLYDIPVEQISFPFKTEHAPTSGQTGISI

Query:  SKKGPSRVTVTTPNLDNPEVRKGLSGGENVSSNEKLEVLEKRLRVIEGTDVFGNIDAGNLCLVPDVVLPPKFKVPEFEKYDGTSCPKNHLIMYSRKMAAY
         +        T  NL     +      +N  S EK EVLE+RLR IEGTDVFGNIDA  LCLV  +V+PPKFKVPEFEKYDG+SCPKNHLIMY RKM AY
Subjt:  SKKGPSRVTVTTPNLDNPEVRKGLSGGENVSSNEKLEVLEKRLRVIEGTDVFGNIDAGNLCLVPDVVLPPKFKVPEFEKYDGTSCPKNHLIMYSRKMAAY

Query:  VQNDRLLIHCFQDSLTGPASRW
        VQN +LLIHCFQDSL G ASRW
Subjt:  VQNDRLLIHCFQDSLTGPASRW

XP_031738551.1 LOW QUALITY PROTEIN: uncharacterized protein LOC101203611 [Cucumis sativus] (e value: 1.2e-82; % identity: 40.76)
Query:  RNRDLEQENEKLRREVSHLTNLAARTNKKLEEVEKNSKDRSKRERD---YDEENKRLNKENHTLRNQNTALRKKVHLQENKIKDFVEAKGTLMKLVSELK
        +NR LEQENEKL++E S   + A    ++LE+ +   K++ K E++    DEE +R+NK N +L+N+ T L+  V  Q+  IKD    K   ++LV++LK
Subjt:  RNRDLEQENEKLRREVSHLTNLAARTNKKLEEVEKNSKDRSKRERD---YDEENKRLNKENHTLRNQNTALRKKVHLQENKIKDFVEAKGTLMKLVSELK

Query:  ETVNKREVQFVEFEQANNTLCHTLDDLHMKLNDQSEEYEIMRNYASSLDHQLKACQRSSEQLLIQKEQLERQRHTMKEDYDVLRSDLQEIIEKVNQTMCT
         ++ KRE Q V+ E  N +L  T+D LH+K+ + SE+Y+I++NYA SL HQL A Q SSE+++ + + L+     MK DYD+ R D Q ++E+V+QT+  
Subjt:  ETVNKREVQFVEFEQANNTLCHTLDDLHMKLNDQSEEYEIMRNYASSLDHQLKACQRSSEQLLIQKEQLERQRHTMKEDYDVLRSDLQEIIEKVNQTMCT

Query:  IAIMARRARGFAEWARDLRRSTSPMTSNADELYEFLGMISRDLGYFEMEKTRKDIEELRGKLDVVFILLEKGKTTIDVAQ---PGKVVNDPL-EVNFTPQ
        + I++RRA GFAEWA DLR +   +  ++D+L  FL MI R+LG+F                        KGK  ++ AQ   P +  +DP+    FTP+
Subjt:  IAIMARRARGFAEWARDLRRSTSPMTSNADELYEFLGMISRDLGYFEMEKTRKDIEELRGKLDVVFILLEKGKTTIDVAQ---PGKVVNDPL-EVNFTPQ

Query:  ------------YTTYNPLYDIPVEQISFPFKTEHAPTSGQTGISISKKGPSRVTVTTPNLDNPEVR---KGLSGGENVSSNEKLEVLEKRLRVIEGTDV
                    Y   NPL+D+P                                   P+++  E +   + +   EN  + +KL+VLE+RLR IEGTDV
Subjt:  ------------YTTYNPLYDIPVEQISFPFKTEHAPTSGQTGISISKKGPSRVTVTTPNLDNPEVR---KGLSGGENVSSNEKLEVLEKRLRVIEGTDV

Query:  FGNIDAGNLCLVPDVVLPPKFKVPEFEKYDGTSCPKNHLIMYSRKMAAYVQNDRLLIHCFQDSLTGPASRW
        +GNIDA  LCLVP +++P KFKVP F+KYDG+SCP++HLIMY RKMAA++ ND+LLIHCFQDSLTGPA+RW
Subjt:  FGNIDAGNLCLVPDVVLPPKFKVPEFEKYDGTSCPKNHLIMYSRKMAAYVQNDRLLIHCFQDSLTGPASRW

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T1W2 Retrotrans_gag domain-containing protein (e value: 5.4e-57; % identity: 34.36)
Query:  RNRDLEQENEKLRREVSHLTNLAARTNKKLEEVEKNSKDRSKRERD---YDEENKRLNKENHTLRNQNTALRKKVHLQENKIKDFVEAKGTLMKLVSELK
        +NR LEQENEKLR+E S   + A     +LE+ +   K++ K E D    D+E +R+NK N +++N+ T L+  V                         
Subjt:  RNRDLEQENEKLRREVSHLTNLAARTNKKLEEVEKNSKDRSKRERD---YDEENKRLNKENHTLRNQNTALRKKVHLQENKIKDFVEAKGTLMKLVSELK

Query:  ETVNKREVQFVEFEQANNTLCHTLDDLHMKLNDQSEEYEIMRNYASSLDHQLKACQRSSEQLLIQKEQLERQRHTMKEDYDVLRSDLQEIIEKVNQTMCT
                                  LH+K+ ++SEEY+I++NYA  L +QL A Q SS+++  + E L      MK DYD+   D Q ++E+V+QT+  
Subjt:  ETVNKREVQFVEFEQANNTLCHTLDDLHMKLNDQSEEYEIMRNYASSLDHQLKACQRSSEQLLIQKEQLERQRHTMKEDYDVLRSDLQEIIEKVNQTMCT

Query:  IAIMARRARGFAEWARDLRRSTSPMTSNADELYEFLGMISRDLGYFEMEKTRKDIEELRGKLDVVFILLE--KGKTTIDVAQPGKVVNDPLEVNFTPQYT
        + ++++RA GFAEWA     S +P+    +  Y+   M  +D    +M+K R++I  L  ++  +  LL   KGK  +D  Q    + D  +  + P +T
Subjt:  IAIMARRARGFAEWARDLRRSTSPMTSNADELYEFLGMISRDLGYFEMEKTRKDIEELRGKLDVVFILLE--KGKTTIDVAQPGKVVNDPLEVNFTPQYT

Query:  TYNPLYDIPVEQISFPFKTEHAPTSGQTGISISKKGPSRVTVTTPNLDNPEVRKGLSGGENVSSNEKLEVLEKRLRVIEGTDVFGNIDAGNLCLVPDVVL
         Y+                          I++ +  P                        + S +KL+VLE+RLR IE TDV+GNIDA  LCLVP +++
Subjt:  TYNPLYDIPVEQISFPFKTEHAPTSGQTGISISKKGPSRVTVTTPNLDNPEVRKGLSGGENVSSNEKLEVLEKRLRVIEGTDVFGNIDAGNLCLVPDVVL

Query:  PPKFKVPEFEKYDGTSCPKNHLIMYSRKMAAYVQNDRLLIHCFQDSLTGPASRW
        P KFKVPEF KYDG++CP++HLIMY RKMA ++ ND+LL+HCFQDSLT PASRW
Subjt:  PPKFKVPEFEKYDGTSCPKNHLIMYSRKMAAYVQNDRLLIHCFQDSLTGPASRW

A0A5D3CXD6 Retrotrans_gag domain-containing protein (e value: 1.1e-41; % identity: 44.08)
Query:  EMEKTRKDIEELRGKLDVVFILLE--KGKTTIDVAQPGKVVNDPLEVNFTPQYTTYNPLYDIPVEQISFPFKTEHAPTSGQTGISISKKGPSRVTVTTPN
        +M+K R++I  L  ++  +  LL   KGK  +D AQ    + D  +  + P +T Y+   ++P  Q      T+H   +    +         V    P 
Subjt:  EMEKTRKDIEELRGKLDVVFILLE--KGKTTIDVAQPGKVVNDPLEVNFTPQYTTYNPLYDIPVEQISFPFKTEHAPTSGQTGISISKKGPSRVTVTTPN

Query:  LDNPEVR---KGLSGGENVSSNEKLEVLEKRLRVIEGTDVFGNIDAGNLCLVPDVVLPPKFKVPEFEKYDGTSCPKNHLIMYSRKMAAYVQNDRLLIHCF
        +++ E +   + +   EN  + +KL+VLE+RLR IEGTDV+GNIDA  LCLVPD+++P KFKVPEF+KYDG++CP++HLIMY RKMAA++ ND+LL+HCF
Subjt:  LDNPEVR---KGLSGGENVSSNEKLEVLEKRLRVIEGTDVFGNIDAGNLCLVPDVVLPPKFKVPEFEKYDGTSCPKNHLIMYSRKMAAYVQNDRLLIHCF

Query:  QDSLTGPASRW
        QDSLTGPASRW
Subjt:  QDSLTGPASRW

A0A6J1D099 Ribonuclease H (e value: 4.4e-43; % identity: 60)
Query:  PQYTTYNPLYDIPVEQISFPFKTEHAPTSGQTGISISKKGPSRVTVTTPNLDN--PEVRKGLSGGENVSSNEKLEVLEKRLRVIEGTDVFGNIDAGNLCL
        PQYTTYNPLYD+PV Q   PF       + Q   +I  + P ++ ++ P + N    + K    G+N  SNEK EVL++RLR IE TDVFGNIDA  LC 
Subjt:  PQYTTYNPLYDIPVEQISFPFKTEHAPTSGQTGISISKKGPSRVTVTTPNLDN--PEVRKGLSGGENVSSNEKLEVLEKRLRVIEGTDVFGNIDAGNLCL

Query:  VPDVVLPPKFKVPEFEKYDGTSCPKNHLIMYSRKMAAYVQNDRLLIHCFQDSLTGPASRW
        V  +V+PPK KVPEFEKY+G+SCPKNHL MY RKMAAYVQND+LLIHCFQDSL+GPASRW
Subjt:  VPDVVLPPKFKVPEFEKYDGTSCPKNHLIMYSRKMAAYVQNDRLLIHCFQDSLTGPASRW

A0A6J1DM29 LOW QUALITY PROTEIN: uncharacterized protein LOC111022231 (e value: 3.8e-55; % identity: 53.07)
Query:  EMEKTRKDIEELRGKLDVVFILLEKGKTTIDVAQPGKVVNDPLEVNFTP----------------------QYTTYNPLYDIPVEQISFPFKTEHAPTSG
        E EKTRKDIEELR KLD + + LEKGKT  + + P   +++P    F P                      QYTTYNPLYDIP  Q  FP      P   
Subjt:  EMEKTRKDIEELRGKLDVVFILLEKGKTTIDVAQPGKVVNDPLEVNFTP----------------------QYTTYNPLYDIPVEQISFPFKTEHAPTSG

Query:  QTGISISKKGPSRVTVTTPNLDNPEVRKGLSGGENVSSNEKLEVLEKRLRVIEGTDVFGNIDAGNLCLVPDVVLPPKFKVPEFEKYDGTSCPKNHLIMYS
          GI   +        T  NL  P+  K     E   S+EKLEVLE+RLR +EGTDVFGNIDA  LCL   +V+PPKFK+PEFEKYDG+SCPKNHLIMY 
Subjt:  QTGISISKKGPSRVTVTTPNLDNPEVRKGLSGGENVSSNEKLEVLEKRLRVIEGTDVFGNIDAGNLCLVPDVVLPPKFKVPEFEKYDGTSCPKNHLIMYS

Query:  RKMAAYVQNDRLLIHCFQDSLTGPASRW
        RKMAAY+QND+LLIHCFQDSL+GP S W
Subjt:  RKMAAYVQNDRLLIHCFQDSLTGPASRW

A0A6J1DZ90 Ribonuclease H (e value: 5.6e-54; % identity: 54.05)
Query:  EMEKTRKDIEELRGKLDVVFILLEKGKTTIDVAQPGKVVNDPLEV----------------NFTPQYTTYNPLYDIPVEQISFPFKTEHAPTSGQTGISI
        E EKTRKDIEELR KLD + + LEKGK T D A     +++P E                  F PQYTTYNPLYD+P+ Q  +P           T I  
Subjt:  EMEKTRKDIEELRGKLDVVFILLEKGKTTIDVAQPGKVVNDPLEV----------------NFTPQYTTYNPLYDIPVEQISFPFKTEHAPTSGQTGISI

Query:  SKKGPSRVTVTTPNLDNPEVRKGLSGGENVSSNEKLEVLEKRLRVIEGTDVFGNIDAGNLCLVPDVVLPPKFKVPEFEKYDGTSCPKNHLIMYSRKMAAY
         +        T  NL     +      +N  S EK EVLE+RLR IEGTDVFGNIDA  LCLV  +V+PPKFKVPEFEKYDG+SCPKNHLIMY RKM AY
Subjt:  SKKGPSRVTVTTPNLDNPEVRKGLSGGENVSSNEKLEVLEKRLRVIEGTDVFGNIDAGNLCLVPDVVLPPKFKVPEFEKYDGTSCPKNHLIMYSRKMAAY

Query:  VQNDRLLIHCFQDSLTGPASRW
        VQN +LLIHCFQDSL G ASRW
Subjt:  VQNDRLLIHCFQDSLTGPASRW

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTTAGGAAGGAATCGAGACCTCGAGCAAGAGAATGAGAAGTTACGTCGAGAAGTTAGTCACTTAACCAACCTAGCAGCTCGAACTAATAAAAAGCTTGAAGAAGTAGA
AAAAAACTCAAAAGATCGATCCAAGCGGGAAAGAGATTATGATGAGGAAAATAAGAGGTTGAACAAGGAAAACCACACACTGAGGAACCAAAACACAGCACTGCGAAAGA
AGGTTCACCTACAAGAAAACAAGATCAAGGACTTTGTAGAAGCTAAGGGGACCCTCATGAAGTTGGTTAGCGAGTTGAAAGAAACCGTTAATAAGCGAGAGGTACAATTC
GTTGAGTTTGAGCAAGCTAATAATACCCTGTGCCATACTTTGGACGATTTACATATGAAACTGAACGATCAATCGGAGGAGTATGAAATTATGAGGAACTATGCAAGTTC
ACTGGACCACCAACTGAAAGCATGTCAAAGATCAAGCGAGCAGTTATTAATTCAAAAGGAGCAGTTGGAACGACAGCGTCATACAATGAAAGAAGACTATGATGTCTTGA
GAAGTGACCTACAAGAGATCATCGAAAAAGTGAACCAAACAATGTGTACAATCGCGATAATGGCTAGAAGAGCTCGAGGATTTGCAGAATGGGCAAGGGATCTGCGAAGG
AGTACTTCACCAATGACATCAAATGCGGATGAACTATATGAGTTTCTAGGGATGATCAGTAGAGACCTTGGATACTTTGAAATGGAAAAAACGAGAAAAGATATCGAGGA
ATTACGAGGGAAGCTAGATGTCGTTTTTATCCTTTTGGAAAAGGGCAAGACAACAATCGATGTTGCTCAACCTGGCAAAGTAGTTAACGATCCTCTCGAGGTTAACTTTA
CCCCGCAGTATACGACGTACAACCCTTTGTACGACATCCCAGTAGAGCAAATTTCATTCCCTTTTAAGACGGAACATGCTCCTACAAGTGGCCAAACTGGAATTTCAATC
TCTAAGAAAGGACCATCGAGGGTAACTGTTACTACTCCCAATCTTGACAATCCAGAAGTCAGGAAGGGATTGTCTGGGGGTGAGAATGTCTCATCTAACGAGAAGCTTGA
GGTCTTAGAAAAAAGGTTGAGAGTAATCGAAGGTACTGATGTATTTGGGAACATAGATGCTGGTAATCTGTGTTTAGTGCCAGACGTGGTACTCCCACCAAAATTCAAGG
TACCAGAATTTGAGAAATATGACGGAACGTCCTGCCCAAAAAATCATCTTATTATGTACTCCAGAAAAATGGCAGCGTATGTTCAAAATGATAGGTTATTAATTCATTGC
TTTCAAGATAGTTTGACTGGCCCAGCATCTCGTTGGTAA
mRNA sequence
ATGTTAGGAAGGAATCGAGACCTCGAGCAAGAGAATGAGAAGTTACGTCGAGAAGTTAGTCACTTAACCAACCTAGCAGCTCGAACTAATAAAAAGCTTGAAGAAGTAGA
AAAAAACTCAAAAGATCGATCCAAGCGGGAAAGAGATTATGATGAGGAAAATAAGAGGTTGAACAAGGAAAACCACACACTGAGGAACCAAAACACAGCACTGCGAAAGA
AGGTTCACCTACAAGAAAACAAGATCAAGGACTTTGTAGAAGCTAAGGGGACCCTCATGAAGTTGGTTAGCGAGTTGAAAGAAACCGTTAATAAGCGAGAGGTACAATTC
GTTGAGTTTGAGCAAGCTAATAATACCCTGTGCCATACTTTGGACGATTTACATATGAAACTGAACGATCAATCGGAGGAGTATGAAATTATGAGGAACTATGCAAGTTC
ACTGGACCACCAACTGAAAGCATGTCAAAGATCAAGCGAGCAGTTATTAATTCAAAAGGAGCAGTTGGAACGACAGCGTCATACAATGAAAGAAGACTATGATGTCTTGA
GAAGTGACCTACAAGAGATCATCGAAAAAGTGAACCAAACAATGTGTACAATCGCGATAATGGCTAGAAGAGCTCGAGGATTTGCAGAATGGGCAAGGGATCTGCGAAGG
AGTACTTCACCAATGACATCAAATGCGGATGAACTATATGAGTTTCTAGGGATGATCAGTAGAGACCTTGGATACTTTGAAATGGAAAAAACGAGAAAAGATATCGAGGA
ATTACGAGGGAAGCTAGATGTCGTTTTTATCCTTTTGGAAAAGGGCAAGACAACAATCGATGTTGCTCAACCTGGCAAAGTAGTTAACGATCCTCTCGAGGTTAACTTTA
CCCCGCAGTATACGACGTACAACCCTTTGTACGACATCCCAGTAGAGCAAATTTCATTCCCTTTTAAGACGGAACATGCTCCTACAAGTGGCCAAACTGGAATTTCAATC
TCTAAGAAAGGACCATCGAGGGTAACTGTTACTACTCCCAATCTTGACAATCCAGAAGTCAGGAAGGGATTGTCTGGGGGTGAGAATGTCTCATCTAACGAGAAGCTTGA
GGTCTTAGAAAAAAGGTTGAGAGTAATCGAAGGTACTGATGTATTTGGGAACATAGATGCTGGTAATCTGTGTTTAGTGCCAGACGTGGTACTCCCACCAAAATTCAAGG
TACCAGAATTTGAGAAATATGACGGAACGTCCTGCCCAAAAAATCATCTTATTATGTACTCCAGAAAAATGGCAGCGTATGTTCAAAATGATAGGTTATTAATTCATTGC
TTTCAAGATAGTTTGACTGGCCCAGCATCTCGTTGGTAA
Protein sequence
MLGRNRDLEQENEKLRREVSHLTNLAARTNKKLEEVEKNSKDRSKRERDYDEENKRLNKENHTLRNQNTALRKKVHLQENKIKDFVEAKGTLMKLVSELKETVNKREVQF
VEFEQANNTLCHTLDDLHMKLNDQSEEYEIMRNYASSLDHQLKACQRSSEQLLIQKEQLERQRHTMKEDYDVLRSDLQEIIEKVNQTMCTIAIMARRARGFAEWARDLRR
STSPMTSNADELYEFLGMISRDLGYFEMEKTRKDIEELRGKLDVVFILLEKGKTTIDVAQPGKVVNDPLEVNFTPQYTTYNPLYDIPVEQISFPFKTEHAPTSGQTGISI
SKKGPSRVTVTTPNLDNPEVRKGLSGGENVSSNEKLEVLEKRLRVIEGTDVFGNIDAGNLCLVPDVVLPPKFKVPEFEKYDGTSCPKNHLIMYSRKMAAYVQNDRLLIHC
FQDSLTGPASRW