; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016196 (gene) of Snake gourd v1 genome

Gene IDTan0016196
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptioncell wall protein RBR3-like isoform X2
Genome locationLG11:2486605..2489769
RNA-Seq ExpressionTan0016196
SyntenyTan0016196
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022134148.1 uncharacterized protein LOC111006488 isoform X1 [Momordica charantia]8.6e-9176.38Show/hide
Query:  MESQKRNLRARKPLSDCTNTILSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNPSSDFLPAE--PSSNLPPAEPNPSPDS
        ME+ KRNLRARKPLSDCTNT+LSSQSSASNFS+ IKPR+RVIKSAVKD VNNEKKGESSF SA  S+NLQASNPSSD LP E  PSS+  P EPN S D 
Subjt:  MESQKRNLRARKPLSDCTNTILSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNPSSDFLPAE--PSSNLPPAEPNPSPDS

Query:  LPTEPSSSS--LPTEPSSSS--LPTEPSSSSLPTEASTPSRRADLPSSSGIDPVSEPQSFYSRRNPTNKRKSVEIAVAPFIFSTASKIQTRGEKRDGYSS
        LPTEP++SS  LPTEP++SS  LPTEPSSSSLP EASTP+RRADLPSSSG D VSEPQSFYSRR+PTNKRKS EIA  PFIF+TASKI T GEKRDGYSS
Subjt:  LPTEPSSSS--LPTEPSSSS--LPTEPSSSSLPTEASTPSRRADLPSSSGIDPVSEPQSFYSRRNPTNKRKSVEIAVAPFIFSTASKIQTRGEKRDGYSS

Query:  SSKARTVPYRK-------------RQRALIYGEDESKIELPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE
         SKARTVP +K             RQR  IYG+DESKIELPR+FVE+QKAYFSEVDAF+L VEEAKSSDSE
Subjt:  SSKARTVPYRK-------------RQRALIYGEDESKIELPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE

XP_022134150.1 uncharacterized protein LOC111006488 isoform X3 [Momordica charantia]1.4e-9380.23Show/hide
Query:  MESQKRNLRARKPLSDCTNTILSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNPSSDFLPAE--PSSNLPPAEPNPSPDS
        ME+ KRNLRARKPLSDCTNT+LSSQSSASNFS+ IKPR+RVIKSAVKD VNNEKKGESSF SA  S+NLQASNPSSD LP E  PSS+  P EPN S D 
Subjt:  MESQKRNLRARKPLSDCTNTILSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNPSSDFLPAE--PSSNLPPAEPNPSPDS

Query:  LPTEPSSSS--LPTEPSSSS--LPTEPSSSSLPTEASTPSRRADLPSSSGIDPVSEPQSFYSRRNPTNKRKSVEIAVAPFIFSTASKIQTRGEKRDGYSS
        LPTEP++SS  LPTEP++SS  LPTEPSSSSLP EASTP+RRADLPSSSG D VSEPQSFYSRR+PTNKRKS EIA  PFIF+TASKI T GEKRDGYSS
Subjt:  LPTEPSSSS--LPTEPSSSS--LPTEPSSSSLPTEASTPSRRADLPSSSGIDPVSEPQSFYSRRNPTNKRKSVEIAVAPFIFSTASKIQTRGEKRDGYSS

Query:  SSKARTVPYRKRQRALIYGEDESKIELPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE
         SKARTVP +KRQR  IYG+DESKIELPR+FVE+QKAYFSEVDAF+L VEEAKSSDSE
Subjt:  SSKARTVPYRKRQRALIYGEDESKIELPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE

XP_022991070.1 uncharacterized protein LOC111487776 isoform X1 [Cucurbita maxima]4.0e-8876.59Show/hide
Query:  MESQKRNLRARKPLSDCTNTILSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNPSSDFLPAEPSSNLPPAEPNPSPDSLP
        MESQKRNLR RKPL+DCTNT+LSSQSSASN SAAIKPRKRV+K AVKDVVNNEK+G SSFASA  SVNLQASNPSSDFLP +PSSNLPPAE         
Subjt:  MESQKRNLRARKPLSDCTNTILSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNPSSDFLPAEPSSNLPPAEPNPSPDSLP

Query:  TEPSSSSLPTEPSSSSLPTEPSSSSLPTEASTPSRRADLPSSSGIDPVSEPQSFYSRRNPTNKRKSVEIAVAPFIFSTASKIQTRGEKRDGYSSSSKART
                 + P+S SLPTEPSSSSLP E STPSR  DLPSSSG D V EPQSFYSRR+P N+RKS E A APF+FSTASKI TRGEKR   SS S+ART
Subjt:  TEPSSSSLPTEPSSSSLPTEPSSSSLPTEASTPSRRADLPSSSGIDPVSEPQSFYSRRNPTNKRKSVEIAVAPFIFSTASKIQTRGEKRDGYSSSSKART

Query:  VPYRKRQRALIYGEDESKIELPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE
        VPYRKRQR  IYGEDESKIELPR+FVE+QKAYFSEVDAFEL VEEAKSSDSE
Subjt:  VPYRKRQRALIYGEDESKIELPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE

XP_023515045.1 uncharacterized protein LOC111779162 isoform X1 [Cucurbita pepo subsp. pepo]1.8e-8876.98Show/hide
Query:  MESQKRNLRARKPLSDCTNTILSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNPSSDFLPAEPSSNLPPAEPNPSPDSLP
        MESQKRNLR RKPL+DCTNT+LSSQSSASN SAAIKPRKRV+KSAVKDV+NNEK+G SSFASA  SVNLQASNPSS+FLP +PSSNLPPAE NPS D   
Subjt:  MESQKRNLRARKPLSDCTNTILSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNPSSDFLPAEPSSNLPPAEPNPSPDSLP

Query:  TEPSSSSLPTEPSSSSLPTEPSSSSLPTEASTPSRRADLPSSSGIDPVSEPQSFYSRRNPTNKRKSVEIAVAPFIFSTASKIQTRGEKRDGYSSSSKART
                       SLPTEPSSSS+P E STPSR  DLPSSSG D V EPQSFYSRR+P N+RKS E A APFIFSTASKI TRGEKR   SS S+ART
Subjt:  TEPSSSSLPTEPSSSSLPTEPSSSSLPTEASTPSRRADLPSSSGIDPVSEPQSFYSRRNPTNKRKSVEIAVAPFIFSTASKIQTRGEKRDGYSSSSKART

Query:  VPYRKRQRALIYGEDESKIELPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE
        VPYRKRQR  IYGEDESKIELPR+FVE+QKAYFSEVDAFEL VEEAKSSDSE
Subjt:  VPYRKRQRALIYGEDESKIELPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE

XP_038884349.1 uncharacterized protein LOC120075210 isoform X3 [Benincasa hispida]6.8e-8876.59Show/hide
Query:  MESQKRNLRARKPLSDCTNTILSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNPSSDFLPAEPSSNLPPAEPNPSPDSLP
        MESQKR LRARKPL+DCTNTILSSQSSASNFSA+IKPRKRVIKSAVKDVVNNEKK +   AS   SVNL+ASNPSSDFLP EP+S+  P EPNPS D   
Subjt:  MESQKRNLRARKPLSDCTNTILSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNPSSDFLPAEPSSNLPPAEPNPSPDSLP

Query:  TEPSSSSLPTEPSSSSLPTEPSSSSLPTEASTPSRRADLPSSSGIDPVSEPQSFYSRRNPTNKRKSVEIAVAPFIFSTASKIQTRGEKRDGYSSSSKART
                       SLP+EPSSS LPTE STP RRADLPSSSG D VSEPQSFYSRR PTNKRKSVEIAVAPFIFST+SKI TRG KR+GY S SKA+T
Subjt:  TEPSSSSLPTEPSSSSLPTEPSSSSLPTEASTPSRRADLPSSSGIDPVSEPQSFYSRRNPTNKRKSVEIAVAPFIFSTASKIQTRGEKRDGYSSSSKART

Query:  VPYRKRQRALIYGEDESKIELPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE
        VP +KRQRA +Y +DES IELPR+FVEQQKAYFSEVDAFELPVEEAKSSDSE
Subjt:  VPYRKRQRALIYGEDESKIELPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE

TrEMBL top hitse value%identityAlignment
A0A6J1BXA7 uncharacterized protein LOC111006488 isoform X36.8e-9480.23Show/hide
Query:  MESQKRNLRARKPLSDCTNTILSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNPSSDFLPAE--PSSNLPPAEPNPSPDS
        ME+ KRNLRARKPLSDCTNT+LSSQSSASNFS+ IKPR+RVIKSAVKD VNNEKKGESSF SA  S+NLQASNPSSD LP E  PSS+  P EPN S D 
Subjt:  MESQKRNLRARKPLSDCTNTILSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNPSSDFLPAE--PSSNLPPAEPNPSPDS

Query:  LPTEPSSSS--LPTEPSSSS--LPTEPSSSSLPTEASTPSRRADLPSSSGIDPVSEPQSFYSRRNPTNKRKSVEIAVAPFIFSTASKIQTRGEKRDGYSS
        LPTEP++SS  LPTEP++SS  LPTEPSSSSLP EASTP+RRADLPSSSG D VSEPQSFYSRR+PTNKRKS EIA  PFIF+TASKI T GEKRDGYSS
Subjt:  LPTEPSSSS--LPTEPSSSS--LPTEPSSSSLPTEASTPSRRADLPSSSGIDPVSEPQSFYSRRNPTNKRKSVEIAVAPFIFSTASKIQTRGEKRDGYSS

Query:  SSKARTVPYRKRQRALIYGEDESKIELPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE
         SKARTVP +KRQR  IYG+DESKIELPR+FVE+QKAYFSEVDAF+L VEEAKSSDSE
Subjt:  SSKARTVPYRKRQRALIYGEDESKIELPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE

A0A6J1C164 uncharacterized protein LOC111006488 isoform X14.1e-9176.38Show/hide
Query:  MESQKRNLRARKPLSDCTNTILSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNPSSDFLPAE--PSSNLPPAEPNPSPDS
        ME+ KRNLRARKPLSDCTNT+LSSQSSASNFS+ IKPR+RVIKSAVKD VNNEKKGESSF SA  S+NLQASNPSSD LP E  PSS+  P EPN S D 
Subjt:  MESQKRNLRARKPLSDCTNTILSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNPSSDFLPAE--PSSNLPPAEPNPSPDS

Query:  LPTEPSSSS--LPTEPSSSS--LPTEPSSSSLPTEASTPSRRADLPSSSGIDPVSEPQSFYSRRNPTNKRKSVEIAVAPFIFSTASKIQTRGEKRDGYSS
        LPTEP++SS  LPTEP++SS  LPTEPSSSSLP EASTP+RRADLPSSSG D VSEPQSFYSRR+PTNKRKS EIA  PFIF+TASKI T GEKRDGYSS
Subjt:  LPTEPSSSS--LPTEPSSSS--LPTEPSSSSLPTEASTPSRRADLPSSSGIDPVSEPQSFYSRRNPTNKRKSVEIAVAPFIFSTASKIQTRGEKRDGYSS

Query:  SSKARTVPYRK-------------RQRALIYGEDESKIELPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE
         SKARTVP +K             RQR  IYG+DESKIELPR+FVE+QKAYFSEVDAF+L VEEAKSSDSE
Subjt:  SSKARTVPYRK-------------RQRALIYGEDESKIELPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE

A0A6J1GKJ2 uncharacterized protein LOC111455207 isoform X17.3e-8876.59Show/hide
Query:  MESQKRNLRARKPLSDCTNTILSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNPSSDFLPAEPSSNLPPAEPNPSPDSLP
        MESQKRNLR RKPL+DCTNT+LSSQSSASN SAAIKPRKRV+KSAVKDVVNNEK+G SS ASA  SVNLQASNPSS+FLP +PSSNLPPAE NPS D   
Subjt:  MESQKRNLRARKPLSDCTNTILSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNPSSDFLPAEPSSNLPPAEPNPSPDSLP

Query:  TEPSSSSLPTEPSSSSLPTEPSSSSLPTEASTPSRRADLPSSSGIDPVSEPQSFYSRRNPTNKRKSVEIAVAPFIFSTASKIQTRGEKRDGYSSSSKART
                       SLPTEPSSSS+P E STPSR  DLPSSSG D V EPQSFYSRR+P N+RKS E A APFIFSTASKI +RGEKR   SS S+ART
Subjt:  TEPSSSSLPTEPSSSSLPTEPSSSSLPTEASTPSRRADLPSSSGIDPVSEPQSFYSRRNPTNKRKSVEIAVAPFIFSTASKIQTRGEKRDGYSSSSKART

Query:  VPYRKRQRALIYGEDESKIELPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE
        VPYRKRQR  IYGEDESKIELPR+FVE+QKAYFSEVDAFEL VEEAKSSDSE
Subjt:  VPYRKRQRALIYGEDESKIELPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE

A0A6J1JPQ1 cell wall protein RBR3-like isoform X23.1e-8676.19Show/hide
Query:  MESQKRNLRARKPLSDCTNTILSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNPSSDFLPAEPSSNLPPAEPNPSPDSLP
        MESQKRNLR RKPL+DCTNT+LSSQSSASN SAAIKPRKRV+K AVKDVVNNEK+G SSFASA  SVNLQASNPSSDFLP +PSSNLPPAE         
Subjt:  MESQKRNLRARKPLSDCTNTILSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNPSSDFLPAEPSSNLPPAEPNPSPDSLP

Query:  TEPSSSSLPTEPSSSSLPTEPSSSSLPTEASTPSRRADLPSSSGIDPVSEPQSFYSRRNPTNKRKSVEIAVAPFIFSTASKIQTRGEKRDGYSSSSKART
                 + P+S SLPTEPSSSSLP E STPSR  DLPSSS  D V EPQSFYSRR+P N+RKS E A APF+FSTASKI TRGEKR   SS S+ART
Subjt:  TEPSSSSLPTEPSSSSLPTEPSSSSLPTEASTPSRRADLPSSSGIDPVSEPQSFYSRRNPTNKRKSVEIAVAPFIFSTASKIQTRGEKRDGYSSSSKART

Query:  VPYRKRQRALIYGEDESKIELPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE
        VPYRKRQR  IYGEDESKIELPR+FVE+QKAYFSEVDAFEL VEEAKSSDSE
Subjt:  VPYRKRQRALIYGEDESKIELPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE

A0A6J1JRU8 uncharacterized protein LOC111487776 isoform X11.9e-8876.59Show/hide
Query:  MESQKRNLRARKPLSDCTNTILSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNPSSDFLPAEPSSNLPPAEPNPSPDSLP
        MESQKRNLR RKPL+DCTNT+LSSQSSASN SAAIKPRKRV+K AVKDVVNNEK+G SSFASA  SVNLQASNPSSDFLP +PSSNLPPAE         
Subjt:  MESQKRNLRARKPLSDCTNTILSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNPSSDFLPAEPSSNLPPAEPNPSPDSLP

Query:  TEPSSSSLPTEPSSSSLPTEPSSSSLPTEASTPSRRADLPSSSGIDPVSEPQSFYSRRNPTNKRKSVEIAVAPFIFSTASKIQTRGEKRDGYSSSSKART
                 + P+S SLPTEPSSSSLP E STPSR  DLPSSSG D V EPQSFYSRR+P N+RKS E A APF+FSTASKI TRGEKR   SS S+ART
Subjt:  TEPSSSSLPTEPSSSSLPTEPSSSSLPTEASTPSRRADLPSSSGIDPVSEPQSFYSRRNPTNKRKSVEIAVAPFIFSTASKIQTRGEKRDGYSSSSKART

Query:  VPYRKRQRALIYGEDESKIELPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE
        VPYRKRQR  IYGEDESKIELPR+FVE+QKAYFSEVDAFEL VEEAKSSDSE
Subjt:  VPYRKRQRALIYGEDESKIELPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G56250.1 unknown protein6.0e-1029.25Show/hide
Query:  RKPLSDCTNTI-LSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNP-SSDFLPAEPSSNLPPAEPNPSPDSLPTEPSSSSL
        RKPL+DCTNT+  SSQ S+S+   A       +K  V+     EK  + + ++  P +   AS P ++D  P                            
Subjt:  RKPLSDCTNTI-LSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNP-SSDFLPAEPSSNLPPAEPNPSPDSLPTEPSSSSL

Query:  PTEPSSSSLPTEPSSSSLPTEASTPSRRADLPSSSGID--PVSEPQSFYS-RRNPTNKRKSVEIAVAPFIFSTASKIQ-----TRGEK-RDGYSSSSKAR
                  T   S+ L + AS PSR     S  G+     +EP S Y+ RR  + +++S + + +    S A++I+     + G+K R    +  K  
Subjt:  PTEPSSSSLPTEPSSSSLPTEASTPSRRADLPSSSGID--PVSEPQSFYS-RRNPTNKRKSVEIAVAPFIFSTASKIQ-----TRGEK-RDGYSSSSKAR

Query:  TVPYRKRQRALIYGEDESKIELPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE
         V  +KRQR +   +++      +D++E+QKAYF+E+DAFELPVEE  +SDS+
Subjt:  TVPYRKRQRALIYGEDESKIELPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATCCCAGAAACGAAATCTCAGAGCTCGAAAACCCCTCTCGGATTGTACCAATACCATCCTCTCCTCGCAGTCATCGGCTTCGAATTTCTCCGCCGCAATCAAACC
TCGCAAACGTGTAATTAAATCCGCGGTTAAAGATGTTGTCAACAATGAGAAGAAAGGCGAATCAAGTTTTGCGTCTGCATTTCCATCCGTTAATTTGCAAGCTTCAAACC
CTAGTTCTGATTTTCTTCCTGCAGAACCTAGCTCTAATTTGCCTCCTGCAGAACCAAACCCGAGTCCTGATTCTCTTCCTACAGAACCTAGTTCCAGTAGTCTTCCTACA
GAACCTAGTTCCAGTAGTCTTCCTACAGAACCTAGTTCTAGTAGTCTTCCTACAGAAGCCTCCACTCCTTCACGGCGTGCTGATTTGCCGTCTAGTTCAGGAATAGATCC
TGTTTCTGAGCCCCAGTCATTCTACAGCCGAAGGAACCCTACGAATAAAAGGAAGAGCGTTGAGATAGCAGTTGCACCATTTATTTTCTCTACCGCATCAAAAATACAAA
CTAGGGGTGAGAAAAGAGATGGATACAGTAGCTCATCCAAAGCGAGGACTGTTCCTTATAGAAAGAGACAGCGTGCCCTAATATATGGGGAAGATGAATCCAAGATTGAA
CTTCCACGGGACTTTGTTGAGCAACAGAAAGCATATTTTTCAGAAGTAGATGCATTTGAACTGCCAGTGGAGGAGGCTAAATCATCCGACTCGGAGTAG
mRNA sequenceShow/hide mRNA sequence
GTTCTGTTCGCGGCCAACAACGCTCTCTCGCAAATATTCCCGTCGAAGTTTGAAGTTGAACAGCATTTTCCCGCCACCGTCACCGCCAACTTCCTCCGTTCGAGTTCTTC
ATCGCATCTTCTTCACCATGAAGATTAGGGAACACTTTCTTCTCCCTCTGTGAAATGGAAAGGGGCAGGAGATGGAATCCCAGAAACGAAATCTCAGAGCTCGAAAACCC
CTCTCGGATTGTACCAATACCATCCTCTCCTCGCAGTCATCGGCTTCGAATTTCTCCGCCGCAATCAAACCTCGCAAACGTGTAATTAAATCCGCGGTTAAAGATGTTGT
CAACAATGAGAAGAAAGGCGAATCAAGTTTTGCGTCTGCATTTCCATCCGTTAATTTGCAAGCTTCAAACCCTAGTTCTGATTTTCTTCCTGCAGAACCTAGCTCTAATT
TGCCTCCTGCAGAACCAAACCCGAGTCCTGATTCTCTTCCTACAGAACCTAGTTCCAGTAGTCTTCCTACAGAACCTAGTTCCAGTAGTCTTCCTACAGAACCTAGTTCT
AGTAGTCTTCCTACAGAAGCCTCCACTCCTTCACGGCGTGCTGATTTGCCGTCTAGTTCAGGAATAGATCCTGTTTCTGAGCCCCAGTCATTCTACAGCCGAAGGAACCC
TACGAATAAAAGGAAGAGCGTTGAGATAGCAGTTGCACCATTTATTTTCTCTACCGCATCAAAAATACAAACTAGGGGTGAGAAAAGAGATGGATACAGTAGCTCATCCA
AAGCGAGGACTGTTCCTTATAGAAAGAGACAGCGTGCCCTAATATATGGGGAAGATGAATCCAAGATTGAACTTCCACGGGACTTTGTTGAGCAACAGAAAGCATATTTT
TCAGAAGTAGATGCATTTGAACTGCCAGTGGAGGAGGCTAAATCATCCGACTCGGAGTAGACGGTGCTGTATAGTTTTGTGAAGCATAGGCAATGTAATCATTCAAGTTA
GTAGTGTGTATGGTGATGCTGATAAAATTCATGTTCATTACTCATAGAAACGGCTTGCTCACTTCTGTCATTCAATGCAGGTGCTGTTTTTTCCCCCTCAACTCTTGAGA
ATTTGTTAGCTTTTAGTGTGTCTTTGATTTTTTTAGTTCAACCAGATGTTGGGGTAGGAGATGGGAGATTTAAACCTTTGACCTCATGGTCAAGGGTATATTCATTAACT
AGTTGAGATTGGCTAGTGAGTCTTTGATCTTCTTTTGCTGTTAGGCCGAAAAATTTAAAATTTTCCCATCACATAGTCTTCAACCTCAACAATGACATTTGCAAAAAAAG
ATATTCAGGTTGTGGCTGTATAGATTAATAGAGCTTTACTGGATTAAAAGTCTTAAAAGACAACTTAACTAGAGATTAAAGGGGAGTGTTCGGCCAATTAAAGTAAGAAT
CATCTAGTAATTAGGCTAAGTGTTTTGGGTTGGCCAATTGATTATGATGGAATTAATTTAACCTTCGTAGCCATAGAAATGGAAAATGGTGAGGTCAGTTAGCTTTTTCT
TTTTCCTTTTTC
Protein sequenceShow/hide protein sequence
MESQKRNLRARKPLSDCTNTILSSQSSASNFSAAIKPRKRVIKSAVKDVVNNEKKGESSFASAFPSVNLQASNPSSDFLPAEPSSNLPPAEPNPSPDSLPTEPSSSSLPT
EPSSSSLPTEPSSSSLPTEASTPSRRADLPSSSGIDPVSEPQSFYSRRNPTNKRKSVEIAVAPFIFSTASKIQTRGEKRDGYSSSSKARTVPYRKRQRALIYGEDESKIE
LPRDFVEQQKAYFSEVDAFELPVEEAKSSDSE