CuGenDBv2

Tan0022443 (gene) of Snake gourd v1 genome

Gene ID: Tan0022443
Organism: Trichosanthes anguina (Snake gourd v1)
Description: formin-like protein 5
Genome location: LG03:43654167..43707817
RNA-Seq Expression: Tan0022443
Synteny: Tan0022443
Gene Ontology terms: GO:0010227 - floral organ abscission (biological process)
InterPro domains: IPR039639 - Protein IDA-like
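
The genome location above is written as `linkage_group:start..end`. A minimal sketch for splitting it into its parts and computing the span, assuming 1-based inclusive coordinates (the page does not state the convention); `parse_location` and `span_length` are illustrative names, not part of CuGenDB:

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming both ends are inclusive."""
    return abs(end - start) + 1


seqid, start, end = parse_location("LG03:43654167..43707817")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus spans 53,651 bp on LG03.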


Homology
GenBank top hits | e value | % identity
KAF1894255.1 hypothetical protein Lal_00004179 [Lupinus albus] | 3.2e-41 | 43.55
Query:  KGLPIPTFGLAKITPDSSSPPPLSSSVILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPPRVLSGVVKNESKISFGMLPKGVRIPPFRVDTCGCGRH
        KGLP P      +    +S PP S    +   + +N+GMLPKGV +PPSGPS KTSD PPPPP+     ++  S I+FGMLPKG R PP      G    
Subjt:  KGLPIPTFGLAKITPDSSSPPPLSSSVILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPPRVLSGVVKNESKISFGMLPKGVRIPPFRVDTCGCGRH

Query:  TSSPPSLSSSVILTKEAKVNFGMLPKGVHIRPSGQSQKASDYPPPPPRVLSVVVKKESKISFGMFILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPP
        TS PP       +   + +NFGMLPKG    PSG S K SD PPPPP+                + +   + +NFGMLPKG   PPSG S KTSD PPPP
Subjt:  TSSPPSLSSSVILTKEAKVNFGMLPKGVHIRPSGQSQKASDYPPPPPRVLSVVVKKESKISFGMFILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPP

Query:  PRVLSVVVKKESKISFGMLPKGVRIPPSGPSQRTSNYPPPPPHVPSVILMKESRV-NFGMLPKGVHIPPSG--SSQRTSDYPPPQPH
        P+  +  +   S I+FGMLPKG  +PPSGPS+  +   P PP +    +   S + NFGMLPKG ++PPSG    +    +PPP+ +
Subjt:  PRVLSVVVKKESKISFGMLPKGVRIPPSGPSQRTSNYPPPPPHVPSVILMKESRV-NFGMLPKGVHIPPSG--SSQRTSDYPPPQPH

KAG6603169.1 hypothetical protein SDJN03_03778, partial [Cucurbita argyrosperma subsp. sororia] | 2.3e-95 | 55.24
Query:  QTDAKNIVVTNEENTIATKELVNNVVNHPKGL------------------------------PIPTFGLAKITPDSSSPPPLSSSVILTKEAKVNFGMLP
        +TDAK +V  +  +TI  +EL + +V HPKG+                              PI   G ++ T D   PPP +SSVIL+K++K+NFGMLP
Subjt:  QTDAKNIVVTNEENTIATKELVNNVVNHPKGL------------------------------PIPTFGLAKITPDSSSPPPLSSSVILTKEAKVNFGMLP

Query:  KGVRIPPSGPSQKTSDYPPPPPRVLSGVVKNESKISFGMLPKGVRIPPFRVDTCGCGRHTSS----PPSLSSSVILTKEAKVNFGMLPKGVHIRPSGQSQ
        KGV IPPSGPSQ+TSDYPPPPPR  S ++  +SKI+FGMLPKGV IPP      G  + TS+    PP  +SSVIL  ++K+NFGMLPKGV I PSG SQ
Subjt:  KGVRIPPSGPSQKTSDYPPPPPRVLSGVVKNESKISFGMLPKGVRIPPFRVDTCGCGRHTSS----PPSLSSSVILTKEAKVNFGMLPKGVHIRPSGQSQ

Query:  KASDYPPPPPRVLSVVVKKESKISFGMFILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGPSQRTSNY
        + SDYPPPPP   S             FIL  ++K+NFGMLPKGV IPPSGPSQ+TSDYPPPPP   SV++  +SKI+FGMLPKGV IPPSGPSQRTSNY
Subjt:  KASDYPPPPPRVLSVVVKKESKISFGMFILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGPSQRTSNY

Query:  PPPPPHVPSVILMKESRVNFGMLPKGVHIPPSGSSQRTSDYPPPQPHVPSVVL
        PPPPPHV         ++NFGMLPKGV IPPSG S+RTSD+PPP PH P + L
Subjt:  PPPPPHVPSVILMKESRVNFGMLPKGVHIPPSGSSQRTSDYPPPQPHVPSVVL

XP_022967687.1 actin cytoskeleton-regulatory complex protein PAN1-like [Cucurbita maxima] | 1.4e-84 | 51.99
Query:  QTDAKNIVVTNEENTIATKELVNNVVNHPKGL------------------------------PIPTFGLAKITPDSSSPPPLSSSVILTKEAKVNFGMLP
        +TDAKN+V  + ++ I   EL + +V +PKG+                              PIP    ++ T D   PPP +SS+IL K +K+N GMLP
Subjt:  QTDAKNIVVTNEENTIATKELVNNVVNHPKGL------------------------------PIPTFGLAKITPDSSSPPPLSSSVILTKEAKVNFGMLP

Query:  KGVRIPPSGPSQKTSDYPPPPPRVLSGVVKNESKISFGMLPKGVRIPPFRVDTCGCGRHTS---SPPSLSSSVILTKEAKVNFGMLPKGVHIRPSGQSQK
        +GV IPPSGPSQ+TSDYPPPPP   S ++  +SKI+FGMLPKGV IPP      G  + TS    PP  +SSVIL K++K+  GMLP+GV I P G SQ+
Subjt:  KGVRIPPSGPSQKTSDYPPPPPRVLSGVVKNESKISFGMLPKGVRIPPFRVDTCGCGRHTS---SPPSLSSSVILTKEAKVNFGMLPKGVHIRPSGQSQK

Query:  ASDYPPPPPRVLSVVVKKESKISFGMFILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGPSQRTSNYP
         SDYP PPP   SV             IL K++K+NFGMLPKGV IPPSGPS +TSDYPPPPP VL        KI+FGMLPK V IPPSGPSQRTS+YP
Subjt:  ASDYPPPPPRVLSVVVKKESKISFGMFILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGPSQRTSNYP

Query:  PPPPHVPSVILMKESRVNFGMLPKGVHIPPSGSSQRTSDYPPPQPHVPSVVL
        PPPPHV         ++NFGMLPKGV IPP G S+RTSDYPPP P+ PS+ L
Subjt:  PPPPHVPSVILMKESRVNFGMLPKGVHIPPSGSSQRTSDYPPPQPHVPSVVL

XP_031744003.1 abl interactor homolog [Cucumis sativus] | 5.7e-54 | 43.12
Query:  CAFILMGLHLTLNQTDAKNIVVTNEENTIATKELVNNVVNHPKGLPIPTFGLAKITPDSSSPPPLSSSVILTKEAKVNFGMLPKGVRIPPSGPSQKTSDY
        C  +L+GL   L QT+  N+   NE+N++   E   +V+ HPK     +   +K  PD               +    FG+L K +RIPP GPSQ++SD 
Subjt:  CAFILMGLHLTLNQTDAKNIVVTNEENTIATKELVNNVVNHPKGLPIPTFGLAKITPDSSSPPPLSSSVILTKEAKVNFGMLPKGVRIPPSGPSQKTSDY

Query:  PPPPPRVLSGVVKNESKISFGMLPKGVRIPPFRVDTCGCGRHTSSPPSLSSSVILTKEAKVNFGMLPKGVHIRPSGQSQKASDYPPPPPRVLSVVVKKES
         PPP    S V+  ES+++FG+L KGVR       +    R + SPPS   S++L KE ++ FG+LPKGV    SG S++ SD PPPPP  +        
Subjt:  PPPPPRVLSGVVKNESKISFGMLPKGVRIPPFRVDTCGCGRHTSSPPSLSSSVILTKEAKVNFGMLPKGVHIRPSGQSQKASDYPPPPPRVLSVVVKKES

Query:  KISFGMFILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGPSQRTSNYPPPPPHVPSVILMKESRVNFG
               +L KE+++NFG+LPKGV    SGPS++ SD PP PP   S+V+ K+S ISFG+L KGV I  SGPS+R S+ PPPPP  PS++L K+S ++FG
Subjt:  KISFGMFILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGPSQRTSNYPPPPPHVPSVILMKESRVNFG

Query:  MLPKGVHIPPSGSSQRTSDYPPPQPHV
        +LPKGV IPPSG S R +D PP  P +
Subjt:  MLPKGVHIPPSGSSQRTSDYPPPQPHV

XP_031744042.1 proline-rich receptor-like protein kinase PERK9 [Cucumis sativus] | 2.9e-58 | 42.86
Query:  CAFILMGLHLTLNQTDAKNIVVTNEENTIATKELVNNVVNHP---------------------------KGLPIPTFGLAKITPDSSSPPPLSSSVILTK
        C  +L+GL   L QT+  N+   NEEN++   E   +V+ HP                           K + IP  G ++ + DS+ PP    S++L K
Subjt:  CAFILMGLHLTLNQTDAKNIVVTNEENTIATKELVNNVVNHP---------------------------KGLPIPTFGLAKITPDSSSPPPLSSSVILTK

Query:  EAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPPRVLSGVVKNESKISFGMLPKGVRIPPFRVDTCGCG---RHTSSPPSLSSSVILTKEAKVNFGMLPKGV
        E+ +NFG+LPKGV    SGPSQ+ SD PP PP   S V+  ES+I FG+LPKG       V T   G   R + SPP    S++L KE+++NFG+L KGV
Subjt:  EAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPPRVLSGVVKNESKISFGMLPKGVRIPPFRVDTCGCG---RHTSSPPSLSSSVILTKEAKVNFGMLPKGV

Query:  HIRPSGQSQKASDYPPPPPRVLSVVVKKESKISFGMFILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPS
            SG S++ SD PPPPP  +               +L KE+++NFG+LPKGV    SGPS++ SD PP PP   S+V+ K+S ISFG+L +GV I  S
Subjt:  HIRPSGQSQKASDYPPPPPRVLSVVVKKESKISFGMFILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPS

Query:  GPSQRTSNYPPPPPHVPSVILMKESRVNFGMLPKGVHIPPSGSSQRTSDYPPPQPHV
        GPS+R S+ PPPPP  PS++L K+S ++FG+LPKGV IPPSG S R +D PP  P +
Subjt:  GPSQRTSNYPPPPPHVPSVILMKESRVNFGMLPKGVHIPPSGSSQRTSDYPPPQPHV

TrEMBL top hits | e value | % identity
A0A2P5WMU6 Uncharacterized protein | 5.0e-40 | 43.85
Query:  PKGLPIPTFGLAKITPDSSSPPPLSSSVILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPP-RVLSGVVKNESKISFGMLPKGVRIPPFRVDTCGCG
        P G+PIP  G +  T D   PPP S S   +     NF +LP GV IPPSGP++ TSD PPP      S +VK+   ++F +LP GV IP  R    G  
Subjt:  PKGLPIPTFGLAKITPDSSSPPPLSSSVILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPP-RVLSGVVKNESKISFGMLPKGVRIPPFRVDTCGCG

Query:  RHTSSPPSLSSSVILTKEAKVNFGMLPKGVHIRPSGQSQKASDYPPPPPRVLSVVVKKESKISFGMFILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPP
             PPS+++S  L K +  NF ++P GV I PSG S   SD PPPP    S  + K S               NF +LP GV IPP GPS  T D PP
Subjt:  RHTSSPPSLSSSVILTKEAKVNFGMLPKGVHIRPSGQSQKASDYPPPPPRVLSVVVKKESKISFGMFILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPP

Query:  PPPRVLSVVVKKESKISFGMLPKGVRIPPSGPSQRTSNYPPPPPHVPSVILMKESRVNFGMLPKGVHIPPSGSSQRTSDYPPPQPHVPSVVLEKESKFGI
        PP    S  + K S  +F +LP GV IPPSGPS  TS+ PPPP  + S  L+K S  NF +LP GV IPPS  S+ TS  PPP     S+ + K   FG+
Subjt:  PPPRVLSVVVKKESKISFGMLPKGVRIPPSGPSQRTSNYPPPPPHVPSVILMKESRVNFGMLPKGVHIPPSGSSQRTSDYPPPQPHVPSVVLEKESKFGI

Query:  L
        L
Subjt:  L

A0A5D2FCQ6 Uncharacterized protein | 2.3e-40 | 44.19
Query:  PKGLPIPTFGLAKITPDSSSPPPLSSSVILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPP-RVLSGVVKNESKISFGMLPKGVRIPPFRVDTCGCG
        P G+PIP  G +  T D   PPP S S   +     NF +LP GV IPPSGP++ TSD PPP      S +VK+   ++F +LP GV IP  R    G  
Subjt:  PKGLPIPTFGLAKITPDSSSPPPLSSSVILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPP-RVLSGVVKNESKISFGMLPKGVRIPPFRVDTCGCG

Query:  RHTSSPPSLSSSVILTKEAKVNFGMLPKGVHIRPSGQSQKASDYPPPPPRVLSVVVKKESKISFGMFILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPP
             PPS+S+S  L K +  NF ++P GV I PSG S   SD PPPP    S  + K S               NF +LP GV IPP GPS  T D PP
Subjt:  RHTSSPPSLSSSVILTKEAKVNFGMLPKGVHIRPSGQSQKASDYPPPPPRVLSVVVKKESKISFGMFILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPP

Query:  PPPRVLSVVVKKESKISFGMLPKGVRIPPSGPSQRTSNYPPPPPHVPSVILMKESRVNFGMLPKGVHIPPSGSSQRTSDYPPPQPHVPSVVLEKESKFGI
        PP    S  + K S  +F +LP GV IPPSGPS  TS+ PPPP  + S  L+K S  NF +LP GV IPPS  S+ TS  PPP     S+ + K   FG+
Subjt:  PPPRVLSVVVKKESKISFGMLPKGVRIPPSGPSQRTSNYPPPPPHVPSVILMKESRVNFGMLPKGVHIPPSGSSQRTSDYPPPQPHVPSVVLEKESKFGI

Query:  L
        L
Subjt:  L

A0A6A4PCZ2 Uncharacterized protein | 1.6e-41 | 43.55
Query:  KGLPIPTFGLAKITPDSSSPPPLSSSVILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPPRVLSGVVKNESKISFGMLPKGVRIPPFRVDTCGCGRH
        KGLP P      +    +S PP S    +   + +N+GMLPKGV +PPSGPS KTSD PPPPP+     ++  S I+FGMLPKG R PP      G    
Subjt:  KGLPIPTFGLAKITPDSSSPPPLSSSVILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPPRVLSGVVKNESKISFGMLPKGVRIPPFRVDTCGCGRH

Query:  TSSPPSLSSSVILTKEAKVNFGMLPKGVHIRPSGQSQKASDYPPPPPRVLSVVVKKESKISFGMFILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPP
        TS PP       +   + +NFGMLPKG    PSG S K SD PPPPP+                + +   + +NFGMLPKG   PPSG S KTSD PPPP
Subjt:  TSSPPSLSSSVILTKEAKVNFGMLPKGVHIRPSGQSQKASDYPPPPPRVLSVVVKKESKISFGMFILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPP

Query:  PRVLSVVVKKESKISFGMLPKGVRIPPSGPSQRTSNYPPPPPHVPSVILMKESRV-NFGMLPKGVHIPPSG--SSQRTSDYPPPQPH
        P+  +  +   S I+FGMLPKG  +PPSGPS+  +   P PP +    +   S + NFGMLPKG ++PPSG    +    +PPP+ +
Subjt:  PRVLSVVVKKESKISFGMLPKGVRIPPSGPSQRTSNYPPPPPHVPSVILMKESRV-NFGMLPKGVHIPPSG--SSQRTSDYPPPQPH

A0A6A5P9E6 Uncharacterized protein | 1.6e-41 | 43.55
Query:  KGLPIPTFGLAKITPDSSSPPPLSSSVILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPPRVLSGVVKNESKISFGMLPKGVRIPPFRVDTCGCGRH
        KGLP P      +    +S PP S    +   + +N+GMLPKGV +PPSGPS KTSD PPPPP+     ++  S I+FGMLPKG R PP      G    
Subjt:  KGLPIPTFGLAKITPDSSSPPPLSSSVILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPPRVLSGVVKNESKISFGMLPKGVRIPPFRVDTCGCGRH

Query:  TSSPPSLSSSVILTKEAKVNFGMLPKGVHIRPSGQSQKASDYPPPPPRVLSVVVKKESKISFGMFILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPP
        TS PP       +   + +NFGMLPKG    PSG S K SD PPPPP+                + +   + +NFGMLPKG   PPSG S KTSD PPPP
Subjt:  TSSPPSLSSSVILTKEAKVNFGMLPKGVHIRPSGQSQKASDYPPPPPRVLSVVVKKESKISFGMFILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPP

Query:  PRVLSVVVKKESKISFGMLPKGVRIPPSGPSQRTSNYPPPPPHVPSVILMKESRV-NFGMLPKGVHIPPSG--SSQRTSDYPPPQPH
        P+  +  +   S I+FGMLPKG  +PPSGPS+  +   P PP +    +   S + NFGMLPKG ++PPSG    +    +PPP+ +
Subjt:  PRVLSVVVKKESKISFGMLPKGVRIPPSGPSQRTSNYPPPPPHVPSVILMKESRV-NFGMLPKGVHIPPSG--SSQRTSDYPPPQPH

A0A6J1HRH7 actin cytoskeleton-regulatory complex protein PAN1-like | 6.7e-85 | 51.99
Query:  QTDAKNIVVTNEENTIATKELVNNVVNHPKGL------------------------------PIPTFGLAKITPDSSSPPPLSSSVILTKEAKVNFGMLP
        +TDAKN+V  + ++ I   EL + +V +PKG+                              PIP    ++ T D   PPP +SS+IL K +K+N GMLP
Subjt:  QTDAKNIVVTNEENTIATKELVNNVVNHPKGL------------------------------PIPTFGLAKITPDSSSPPPLSSSVILTKEAKVNFGMLP

Query:  KGVRIPPSGPSQKTSDYPPPPPRVLSGVVKNESKISFGMLPKGVRIPPFRVDTCGCGRHTS---SPPSLSSSVILTKEAKVNFGMLPKGVHIRPSGQSQK
        +GV IPPSGPSQ+TSDYPPPPP   S ++  +SKI+FGMLPKGV IPP      G  + TS    PP  +SSVIL K++K+  GMLP+GV I P G SQ+
Subjt:  KGVRIPPSGPSQKTSDYPPPPPRVLSGVVKNESKISFGMLPKGVRIPPFRVDTCGCGRHTS---SPPSLSSSVILTKEAKVNFGMLPKGVHIRPSGQSQK

Query:  ASDYPPPPPRVLSVVVKKESKISFGMFILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGPSQRTSNYP
         SDYP PPP   SV             IL K++K+NFGMLPKGV IPPSGPS +TSDYPPPPP VL        KI+FGMLPK V IPPSGPSQRTS+YP
Subjt:  ASDYPPPPPRVLSVVVKKESKISFGMFILTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGPSQRTSNYP

Query:  PPPPHVPSVILMKESRVNFGMLPKGVHIPPSGSSQRTSDYPPPQPHVPSVVL
        PPPPHV         ++NFGMLPKGV IPP G S+RTSDYPPP P+ PS+ L
Subjt:  PPPPHVPSVILMKESRVNFGMLPKGVHIPPSGSSQRTSDYPPPQPHVPSVVL

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGGGGTCGAGTACTTGCAATGGGTTCTTGGGAGCTTGTGCTTTCATATTAATGGGCTTACACCTTACACTCAATCAAACGGACGCTAAGAACATTGTTGTAACCAATGA
AGAAAACACCATTGCAACAAAGGAACTGGTTAACAACGTCGTAAACCATCCAAAGGGCCTCCCTATTCCAACGTTTGGTTTGGCTAAAATAACACCTGATTCGTCATCTC
CTCCACCACTTTCTTCTTCAGTTATTCTTACGAAGGAAGCTAAGGTAAACTTTGGAATGTTACCAAAAGGCGTACGCATTCCTCCGTCTGGGCCAAGCCAAAAGACATCA
GACTATCCACCTCCTCCACCGCGTGTTTTATCAGGTGTTGTAAAGAACGAATCTAAGATCAGCTTTGGAATGTTACCAAAAGGAGTACGTATTCCCCCCTTCAGGGTCGA
TACCTGTGGTTGTGGTCGTCACACGTCATCTCCTCCATCACTTTCTTCTTCAGTCATTCTTACGAAGGAAGCTAAGGTAAACTTTGGAATGTTACCAAAAGGCGTACACA
TTCGTCCGTCTGGGCAAAGCCAAAAGGCATCAGACTATCCACCTCCTCCACCGCGTGTTTTATCAGTTGTTGTAAAGAAAGAATCTAAGATCAGCTTTGGAATGTTCATT
CTTACGAAGGAAGCTAAGGTAAACTTTGGAATGTTACCAAAAGGCGTACGCATTCCTCCGTCTGGGCCAAGCCAAAAGACATCAGACTATCCACCTCCCCCACCGCGTGT
TTTATCAGTTGTTGTAAAGAAAGAATCTAAGATTAGCTTTGGAATGTTACCAAAAGGAGTACGTATTCCCCCTTCAGGGCCGAGTCAAAGGACATCGAACTATCCACCTC
CTCCACCTCATGTTCCATCTGTCATTTTAATGAAGGAATCTAGGGTCAATTTTGGAATGTTACCCAAAGGCGTACATATTCCTCCATCTGGGTCGAGTCAAAGGACTTCC
GACTATCCACCCCCTCAACCGCATGTTCCTTCTGTTGTTTTGGAGAAAGAATCAAAATTCGGGATTCTTTACTCGTTCGAGAACCGCCGCCGCTCACGAGCCTCAGCTCG
CCGTCGAGTTCGGCAGCGCCAGAGCCTCGATCGCACGTCCAACCGCTTCGTCACGTCCGGTTTGCTTCGGCAGATCCACGGGCACAGCAGCAGTCTCGCCGGAGTTGTTC
TAAGTCTAAGGCCATTCTTTATGTTCTTTATTCTTGGACGAAGTGGGCTGCTTACTGATTCGGCAGCGGCGCGAGCCTCGATCGCACGTCCCCAACCGCCGTCACGTCCG
GTTTGCTTGGCGAATCCACGGCAAGCAGTGGCGATCTCGCCGGAGTTGTTGTCACGAAAACAGAAGCCGCTCGCGAACTACTCTGTCAACCGCGACCGTCGATCGAGACT
ACACGCCACTCGCGAACTCCCTTACCGGGATGGTATTGAATGTCAATATCATAATCTTTCACTAATTCCAACCATCTGCGTTGCCTCATTCATAGACATGGCTATCTTGG
CAAAATCACGCTTGCAAACGTCTATAGTATCCCCGCAAGACCCGAGAAAACTCACAGATAACATCAGTTTAAAACATAAATGTGTGGAAAAACATGTCGGTCCCCATGGT
TTTAGTAACACTCCCCCGTCACTCGCCCTGCACCGTGTTGGCGTGTATTTGTGGAGCTCGGACGGCGGTGAAGGAGGAGAGTTCGCGAGTGCTCGCTCAGTGGGCTCTCG
GCCGTGCGCGGCTTTCGGCGGCGGCGGTGTGGGACGTGATCGTGGAGGCTCTCGCTGCAGTGTGTGGCGCGGTGGAGGTGGGAGCTGCGGACGGGTGAGCTCGCGAGGAA
GAAGTGCGCCGCAGACTTCGGACGGCGATCTCAGACACGAGCAGCAGCAGCGGCCGGACTTCGACGCGGGGCTCCTTCGCTCGTGA
mRNA sequence
ATGGGGTCGAGTACTTGCAATGGGTTCTTGGGAGCTTGTGCTTTCATATTAATGGGCTTACACCTTACACTCAATCAAACGGACGCTAAGAACATTGTTGTAACCAATGA
AGAAAACACCATTGCAACAAAGGAACTGGTTAACAACGTCGTAAACCATCCAAAGGGCCTCCCTATTCCAACGTTTGGTTTGGCTAAAATAACACCTGATTCGTCATCTC
CTCCACCACTTTCTTCTTCAGTTATTCTTACGAAGGAAGCTAAGGTAAACTTTGGAATGTTACCAAAAGGCGTACGCATTCCTCCGTCTGGGCCAAGCCAAAAGACATCA
GACTATCCACCTCCTCCACCGCGTGTTTTATCAGGTGTTGTAAAGAACGAATCTAAGATCAGCTTTGGAATGTTACCAAAAGGAGTACGTATTCCCCCCTTCAGGGTCGA
TACCTGTGGTTGTGGTCGTCACACGTCATCTCCTCCATCACTTTCTTCTTCAGTCATTCTTACGAAGGAAGCTAAGGTAAACTTTGGAATGTTACCAAAAGGCGTACACA
TTCGTCCGTCTGGGCAAAGCCAAAAGGCATCAGACTATCCACCTCCTCCACCGCGTGTTTTATCAGTTGTTGTAAAGAAAGAATCTAAGATCAGCTTTGGAATGTTCATT
CTTACGAAGGAAGCTAAGGTAAACTTTGGAATGTTACCAAAAGGCGTACGCATTCCTCCGTCTGGGCCAAGCCAAAAGACATCAGACTATCCACCTCCCCCACCGCGTGT
TTTATCAGTTGTTGTAAAGAAAGAATCTAAGATTAGCTTTGGAATGTTACCAAAAGGAGTACGTATTCCCCCTTCAGGGCCGAGTCAAAGGACATCGAACTATCCACCTC
CTCCACCTCATGTTCCATCTGTCATTTTAATGAAGGAATCTAGGGTCAATTTTGGAATGTTACCCAAAGGCGTACATATTCCTCCATCTGGGTCGAGTCAAAGGACTTCC
GACTATCCACCCCCTCAACCGCATGTTCCTTCTGTTGTTTTGGAGAAAGAATCAAAATTCGGGATTCTTTACTCGTTCGAGAACCGCCGCCGCTCACGAGCCTCAGCTCG
CCGTCGAGTTCGGCAGCGCCAGAGCCTCGATCGCACGTCCAACCGCTTCGTCACGTCCGGTTTGCTTCGGCAGATCCACGGGCACAGCAGCAGTCTCGCCGGAGTTGTTC
TAAGTCTAAGGCCATTCTTTATGTTCTTTATTCTTGGACGAAGTGGGCTGCTTACTGATTCGGCAGCGGCGCGAGCCTCGATCGCACGTCCCCAACCGCCGTCACGTCCG
GTTTGCTTGGCGAATCCACGGCAAGCAGTGGCGATCTCGCCGGAGTTGTTGTCACGAAAACAGAAGCCGCTCGCGAACTACTCTGTCAACCGCGACCGTCGATCGAGACT
ACACGCCACTCGCGAACTCCCTTACCGGGATGGTATTGAATGTCAATATCATAATCTTTCACTAATTCCAACCATCTGCGTTGCCTCATTCATAGACATGGCTATCTTGG
CAAAATCACGCTTGCAAACGTCTATAGTATCCCCGCAAGACCCGAGAAAACTCACAGATAACATCAGTTTAAAACATAAATGTGTGGAAAAACATGTCGGTCCCCATGGT
TTTAGTAACACTCCCCCGTCACTCGCCCTGCACCGTGTTGGCGTGTATTTGTGGAGCTCGGACGGCGGTGAAGGAGGAGAGTTCGCGAGTGCTCGCTCAGTGGGCTCTCG
GCCGTGCGCGGCTTTCGGCGGCGGCGGTGTGGGACGTGATCGTGGAGGCTCTCGCTGCAGTGTGTGGCGCGGTGGAGGTGGGAGCTGCGGACGGGTGAGCTCGCGAGGAA
GAAGTGCGCCGCAGACTTCGGACGGCGATCTCAGACACGAGCAGCAGCAGCGGCCGGACTTCGACGCGGGGCTCCTTCGCTCGTGA
Protein sequence
MGSSTCNGFLGACAFILMGLHLTLNQTDAKNIVVTNEENTIATKELVNNVVNHPKGLPIPTFGLAKITPDSSSPPPLSSSVILTKEAKVNFGMLPKGVRIPPSGPSQKTS
DYPPPPPRVLSGVVKNESKISFGMLPKGVRIPPFRVDTCGCGRHTSSPPSLSSSVILTKEAKVNFGMLPKGVHIRPSGQSQKASDYPPPPPRVLSVVVKKESKISFGMFI
LTKEAKVNFGMLPKGVRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGPSQRTSNYPPPPPHVPSVILMKESRVNFGMLPKGVHIPPSGSSQRTS
DYPPPQPHVPSVVLEKESKFGILYSFENRRRSRASARRRVRQRQSLDRTSNRFVTSGLLRQIHGHSSSLAGVVLSLRPFFMFFILGRSGLLTDSAAARASIARPQPPSRP
VCLANPRQAVAISPELLSRKQKPLANYSVNRDRRSRLHATRELPYRDGIECQYHNLSLIPTICVASFIDMAILAKSRLQTSIVSPQDPRKLTDNISLKHKCVEKHVGPHG
FSNTPPSLALHRVGVYLWSSDGGEGGEFASARSVGSRPCAAFGGGGVGRDRGGSRCSVWRGGGGSCGRVSSRGRSAPQTSDGDLRHEQQQRPDFDAGLLRS
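
The protein sequence above corresponds to a codon-by-codon translation of the CDS. A minimal sketch of that relationship, assuming the standard nuclear genetic code (NCBI translation table 1), which this page does not state explicitly; `translate` and `CODON_TABLE` are illustrative names, and only the first 36 bases of the CDS are used here:

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# T, C, A, G at each position with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 bases of the CDS sequence listed above.
cds_start = "ATGGGGTCGAGTACTTGCAATGGGTTCTTGGGAGCT"
print(translate(cds_start))
```

The translated fragment can be compared with the first twelve residues of the protein sequence (MGSSTCNGFLGA); passing the full CDS to the same function would allow the whole record to be checked in the same way.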