; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000531 (gene) of Snake gourd v1 genome

Gene IDTan0000531
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionYcf3-interacting protein 1
Genome locationLG03:81867996..81868769
RNA-Seq ExpressionTan0000531
SyntenyTan0000531
Gene Ontology termsGO:0048564 - photosystem I assembly (biological process)
GO:0080183 - response to photooxidative stress (biological process)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
InterPro domainsIPR040340 - Chloroplast enhancing stress tolerance protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7034827.1 hypothetical protein SDJN02_04559, partial [Cucurbita argyrosperma subsp. argyrosperma]2.5e-7464.52Show/hide
Query:  PLSVGISNRGYDFVGDVVIEVSTQFKLESYSVPNSAYSSPRSG--------VCD---GLNRRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEMER
        PLSVG + R Y+ V DVVIEVSTQFKLESYS PNSAYSSP  G        V D   GLN RSKSCGEGRGKA    HGL+EN+ V++ +KGH HK  E 
Subjt:  PLSVGISNRGYDFVGDVVIEVSTQFKLESYSVPNSAYSSPRSG--------VCD---GLNRRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEMER

Query:  PSKTENGGKARCFSCGALCLLLP-----GF--GKGKGERKEEAEAEAEGGGCISISISR-RVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMEL
               GKAR F CGALCLLLP     GF  GKGK ERKEE E   EGGGCISISIS  RVSLEKFECGSWASSGMV HEDG   SG GSLYFDLPMEL
Subjt:  PSKTENGGKARCFSCGALCLLLP-----GF--GKGKGERKEEAEAEAEGGGCISISISR-RVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMEL

Query:  IRNSVGARTQTQTHSQSQSPSPSPVKAAF-------------VWTKTKLAEESGSPSPCIITPRLRKAREEFNALLEAH
        IRNSVGARTQ            SP + AF             VWTK KLAEESG+ SPC+ITPRLR+AREEFNALLEAH
Subjt:  IRNSVGARTQTQTHSQSQSPSPSPVKAAF-------------VWTKTKLAEESGSPSPCIITPRLRKAREEFNALLEAH

XP_022925963.1 uncharacterized protein LOC111433224 [Cucurbita moschata]2.5e-7464.52Show/hide
Query:  PLSVGISNRGYDFVGDVVIEVSTQFKLESYSVPNSAYSSPRSG--------VCD---GLNRRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEMER
        PLSVG + R Y+ V DVVIEVSTQFKLESYS PNSAYSSP  G        V D   GLN RSKSCGEGRGKA    HGL+EN+ V++ +KGH HK  E 
Subjt:  PLSVGISNRGYDFVGDVVIEVSTQFKLESYSVPNSAYSSPRSG--------VCD---GLNRRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEMER

Query:  PSKTENGGKARCFSCGALCLLLP-----GF--GKGKGERKEEAEAEAEGGGCISISISR-RVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMEL
               GKAR F CGALCLLLP     GF  GKGK ERKEE E   EGGGCISISIS  RVSLEKFECGSWASSGMV HEDG   SG GSLYFDLPMEL
Subjt:  PSKTENGGKARCFSCGALCLLLP-----GF--GKGKGERKEEAEAEAEGGGCISISISR-RVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMEL

Query:  IRNSVGARTQTQTHSQSQSPSPSPVKAAF-------------VWTKTKLAEESGSPSPCIITPRLRKAREEFNALLEAH
        IRNSVGARTQ            SP + AF             VWTK KLAEESG+ SPC+ITPRLR+AREEFNALLEAH
Subjt:  IRNSVGARTQTQTHSQSQSPSPSPVKAAF-------------VWTKTKLAEESGSPSPCIITPRLRKAREEFNALLEAH

XP_022978627.1 uncharacterized protein LOC111478548 [Cucurbita maxima]7.4e-7463.8Show/hide
Query:  PLSVGISNRGYDFVGDVVIEVSTQFKLESYSVPNSAYSSPRSG--------VCD---GLNRRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEMER
        PLSVG + R Y+ V DVVI+VSTQFKLESYS PNSAYSSP  G        V D   GLN RSKSCGEGRGKA    HGL++N+ V++ +KGH HK  E 
Subjt:  PLSVGISNRGYDFVGDVVIEVSTQFKLESYSVPNSAYSSPRSG--------VCD---GLNRRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEMER

Query:  PSKTENGGKARCFSCGALCLLLP-----GF--GKGKGERKEEAEAEAEGGGCISISISR-RVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMEL
               GKAR F CGALCLLLP     GF  GKGK ERKEE E   EGGGCISISIS  RVSLEKFECGSWASSGMV HEDG   +G GSLYFDLPMEL
Subjt:  PSKTENGGKARCFSCGALCLLLP-----GF--GKGKGERKEEAEAEAEGGGCISISISR-RVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMEL

Query:  IRNSVGARTQTQTHSQSQSPSPSPVKAAF-------------VWTKTKLAEESGSPSPCIITPRLRKAREEFNALLEAH
        IRNSVGARTQ            SP +AAF             VWTK KLAEESG+ SPC+ITPRLR+AREEFNALLEAH
Subjt:  IRNSVGARTQTQTHSQSQSPSPSPVKAAF-------------VWTKTKLAEESGSPSPCIITPRLRKAREEFNALLEAH

XP_023543164.1 uncharacterized protein LOC111803119 [Cucurbita pepo subsp. pepo]5.7e-7464.16Show/hide
Query:  PLSVGISNRGYDFVGDVVIEVSTQFKLESYSVPNSAYSSPRSG--------VCD---GLNRRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEMER
        PLSVG + R Y+ V DVVIEVSTQFKLESYS PNSAYSSP  G        V D   GLN RSKSCGEGRG+A    HGL+EN+ V++ +KGH HK  E 
Subjt:  PLSVGISNRGYDFVGDVVIEVSTQFKLESYSVPNSAYSSPRSG--------VCD---GLNRRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEMER

Query:  PSKTENGGKARCFSCGALCLLLP-----GF--GKGKGERKEEAEAEAEGGGCISISISR-RVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMEL
               GKAR F CGALCLLLP     GF  GKGK ERKEE E   EGGGCISISIS  RVSLEKFECGSWASSGMV HEDG   SG GSLYFDLPMEL
Subjt:  PSKTENGGKARCFSCGALCLLLP-----GF--GKGKGERKEEAEAEAEGGGCISISISR-RVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMEL

Query:  IRNSVGARTQTQTHSQSQSPSPSPVKAAF-------------VWTKTKLAEESGSPSPCIITPRLRKAREEFNALLEAH
        IRNSVGARTQ            SP + AF             VWTK KLAEESG+ SPC+ITPRLR+AREEFNALLEAH
Subjt:  IRNSVGARTQTQTHSQSQSPSPSPVKAAF-------------VWTKTKLAEESGSPSPCIITPRLRKAREEFNALLEAH

XP_038877520.1 uncharacterized protein LOC120069777 [Benincasa hispida]6.3e-7365.94Show/hide
Query:  MAATALPLSVGISNRGYDFVGDVVIEVSTQFKLESYSVPNSAYSSPR----SGVCD---GLNRRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEM
        MA T +PLSVG ++R Y+FV DVVIEVSTQ KL SYSVPNSAYSSPR      V D   GLN RSKSCGEGRGKA    H L+ENK V+V +KGH H   
Subjt:  MAATALPLSVGISNRGYDFVGDVVIEVSTQFKLESYSVPNSAYSSPR----SGVCD---GLNRRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEM

Query:  ERPSKTENGGKARCFSCGALCLLLP-----GF--GKGKGERKEEAEAEAEGGGCISISISRRVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPME
            KTE  GKA+ F CGALCLLLP     GF  GKGK + KEE + EAE G CISISISRRVSLEKFECGSWASSGMVVHEDG      GS YFDLPME
Subjt:  ERPSKTENGGKARCFSCGALCLLLP-----GF--GKGKGERKEEAEAEAEGGGCISISISRRVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPME

Query:  LIRNSVGARTQTQTHSQSQSPSPSPVKAAFV--------WTKTKLAEESGSPSPCIITPRLRKAREEFNALLEAHT
        LIRNSVG +TQ            SPV AAFV        WTK  LAEESG+ SPCIITPRLRKAREEFNALLEAHT
Subjt:  LIRNSVGARTQTQTHSQSQSPSPSPVKAAFV--------WTKTKLAEESGSPSPCIITPRLRKAREEFNALLEAHT

TrEMBL top hitse value%identityAlignment
A0A0A0KME4 Uncharacterized protein5.9e-6963.84Show/hide
Query:  MAATALPLSVGISNRGYDFVGDVVIEVSTQFKLESYSVPNSAYSSPR----SGVCDGLNRRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEMERP
        MA+  +PLSVG S+R Y+FV DVVIEVS Q      S PNSAYSSPR        DGLN RS+SCGEGRGKA    HGL+ENK V+V +KG  H      
Subjt:  MAATALPLSVGISNRGYDFVGDVVIEVSTQFKLESYSVPNSAYSSPR----SGVCDGLNRRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEMERP

Query:  SKTENGGKARCFSCGALCLLLP--GF--GKGKGERKEEAEAEAEGGGCISISISRRVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMELIRNSV
         KTE G   R   CGALCLLLP  GF  GKG+ + KEE   EAE G CISISISRRVSLEKFECGSWASSGMVVHEDG     +GSLYFDLPMELIRNSV
Subjt:  SKTENGGKARCFSCGALCLLLP--GF--GKGKGERKEEAEAEAEGGGCISISISRRVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMELIRNSV

Query:  GARTQTQTHSQSQSPSPSPVKAAF------VWTKTKLAEESGSPSPCIITPRLRKAREEFNALLEAHTHTL
         A+TQ            SPV AAF      VW K KLAEESG+ SPCIITPRLRKAR+EFNALLEAHTH L
Subjt:  GARTQTQTHSQSQSPSPSPVKAAF------VWTKTKLAEESGSPSPCIITPRLRKAREEFNALLEAHTHTL

A0A1S3AZD3 uncharacterized protein LOC1034842324.9e-6360.61Show/hide
Query:  LPLSVGISNRGYDFVGDVVIEVSTQFKLESYSVPNSAYSSPR----SGVCDGLNRRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEMERPSKTEN
        +PLSV  S+R Y+FV DVV+ VS Q      S PNS YSSPR      V DGLN RSKSCG+GRGKA    HGL+ENK ++  + G  HK  E       
Subjt:  LPLSVGISNRGYDFVGDVVIEVSTQFKLESYSVPNSAYSSPR----SGVCDGLNRRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEMERPSKTEN

Query:  GGKARCFSCGALCLLLP--GF--GKGKGERKEEAEAEAEGGGCISISISRRVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMELIRNSVGARTQ
         GK R F CGALCLLLP  GF  GKG+ + KEE + EAE G CISISISRRVSL+KFECGSWASSGMVVHE+G     +GSLYFDLPMELIRNSV A++Q
Subjt:  GGKARCFSCGALCLLLP--GF--GKGKGERKEEAEAEAEGGGCISISISRRVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMELIRNSVGARTQ

Query:  TQTHSQSQSPSPSPVKAAF-------VWTKTKLAEESGSPSPCIITPRLRKAREEFNALLEAHT
                    SPV AAF       VW K KLA+ESG+ SPCIITPRLRKAR+EFNALLEAHT
Subjt:  TQTHSQSQSPSPSPVKAAF-------VWTKTKLAEESGSPSPCIITPRLRKAREEFNALLEAHT

A0A5A7UFW0 Ycf3-interacting protein 11.1e-6261.22Show/hide
Query:  LPLSVGISNRGYDFVGDVVIEVSTQFKLESYSVPNSAYSSPR----SGVCDGLNRRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEMERPSKTEN
        +PLSV  S+R Y+FV DVV+ VS Q      S PNS YSSPR      V DGLN RSKSCG+GRGKA    HGL+ENK ++  + G  HK  E       
Subjt:  LPLSVGISNRGYDFVGDVVIEVSTQFKLESYSVPNSAYSSPR----SGVCDGLNRRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEMERPSKTEN

Query:  GGKARCFSCGALCLLLP--GF--GKGKGERKEEAEAEAEGGGCISISISRRVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMELIRNSVGARTQ
         GK R F CGALCLLLP  GF  GKG+ + KEE + EAE G CISISISRRVSLEKFECGSWASSGMVVHE+G     +GSLYFDLPMELIRNSV A++Q
Subjt:  GGKARCFSCGALCLLLP--GF--GKGKGERKEEAEAEAEGGGCISISISRRVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMELIRNSVGARTQ

Query:  TQTHSQSQSPSPSPVKAAFVW------TKTKLAEESGSPSPCIITPRLRKAREEFNALLEAHT
                    SPV AAFV+       K KLAEESG+ SPCIITPRLRKAR+EFNALLEAHT
Subjt:  TQTHSQSQSPSPSPVKAAFVW------TKTKLAEESGSPSPCIITPRLRKAREEFNALLEAHT

A0A6J1EGQ7 uncharacterized protein LOC1114332241.2e-7464.52Show/hide
Query:  PLSVGISNRGYDFVGDVVIEVSTQFKLESYSVPNSAYSSPRSG--------VCD---GLNRRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEMER
        PLSVG + R Y+ V DVVIEVSTQFKLESYS PNSAYSSP  G        V D   GLN RSKSCGEGRGKA    HGL+EN+ V++ +KGH HK  E 
Subjt:  PLSVGISNRGYDFVGDVVIEVSTQFKLESYSVPNSAYSSPRSG--------VCD---GLNRRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEMER

Query:  PSKTENGGKARCFSCGALCLLLP-----GF--GKGKGERKEEAEAEAEGGGCISISISR-RVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMEL
               GKAR F CGALCLLLP     GF  GKGK ERKEE E   EGGGCISISIS  RVSLEKFECGSWASSGMV HEDG   SG GSLYFDLPMEL
Subjt:  PSKTENGGKARCFSCGALCLLLP-----GF--GKGKGERKEEAEAEAEGGGCISISISR-RVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMEL

Query:  IRNSVGARTQTQTHSQSQSPSPSPVKAAF-------------VWTKTKLAEESGSPSPCIITPRLRKAREEFNALLEAH
        IRNSVGARTQ            SP + AF             VWTK KLAEESG+ SPC+ITPRLR+AREEFNALLEAH
Subjt:  IRNSVGARTQTQTHSQSQSPSPSPVKAAF-------------VWTKTKLAEESGSPSPCIITPRLRKAREEFNALLEAH

A0A6J1ILL2 uncharacterized protein LOC1114785483.6e-7463.8Show/hide
Query:  PLSVGISNRGYDFVGDVVIEVSTQFKLESYSVPNSAYSSPRSG--------VCD---GLNRRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEMER
        PLSVG + R Y+ V DVVI+VSTQFKLESYS PNSAYSSP  G        V D   GLN RSKSCGEGRGKA    HGL++N+ V++ +KGH HK  E 
Subjt:  PLSVGISNRGYDFVGDVVIEVSTQFKLESYSVPNSAYSSPRSG--------VCD---GLNRRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEMER

Query:  PSKTENGGKARCFSCGALCLLLP-----GF--GKGKGERKEEAEAEAEGGGCISISISR-RVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMEL
               GKAR F CGALCLLLP     GF  GKGK ERKEE E   EGGGCISISIS  RVSLEKFECGSWASSGMV HEDG   +G GSLYFDLPMEL
Subjt:  PSKTENGGKARCFSCGALCLLLP-----GF--GKGKGERKEEAEAEAEGGGCISISISR-RVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMEL

Query:  IRNSVGARTQTQTHSQSQSPSPSPVKAAF-------------VWTKTKLAEESGSPSPCIITPRLRKAREEFNALLEAH
        IRNSVGARTQ            SP +AAF             VWTK KLAEESG+ SPC+ITPRLR+AREEFNALLEAH
Subjt:  IRNSVGARTQTQTHSQSQSPSPSPVKAAF-------------VWTKTKLAEESGSPSPCIITPRLRKAREEFNALLEAH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30850.1 root hair specific 42.4e-1433.51Show/hide
Query:  KARCFSCGALCLLLPGFGKGK-----GERKEEAEAE-AEGGGCISISISRRVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMELIR-NSVGA--
        K   F C A CL LPGFGK K      +R+   E +          ++S R SLEKFECGSWAS+  ++ ++       G L+FD P+E+ + NS G   
Subjt:  KARCFSCGALCLLLPGFGKGK-----GERKEEAEAE-AEGGGCISISISRRVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMELIR-NSVGA--

Query:  ----------------RTQT----------QTHSQSQSPSPSPVKAAFVWTKTKLAEESGSPSP-CIITPRLRKAREEFNALLEA
                         T+T           T    +S   SP +     T +  A  S   SP   ITPRLRKAR++FN  L A
Subjt:  ----------------RTQT----------QTHSQSQSPSPSPVKAAFVWTKTKLAEESGSPSP-CIITPRLRKAREEFNALLEA

AT2G34910.1 BEST Arabidopsis thaliana protein match is: root hair specific 4 (TAIR:AT1G30850.1)8.6e-1232.02Show/hide
Query:  FSCGALCLLLPGFGK-----GKGE---RKEEAEAEAEGGGCISISISRRVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMELIR----------
        F C A CL LPGFGK      K E   +K+  +A +     +S+S     SLEKFECGSWAS+  +  E+       G LY DLP+E+I+          
Subjt:  FSCGALCLLLPGFGK-----GKGE---RKEEAEAEAEGGGCISISISRRVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMELIR----------

Query:  ---------------NSVGARTQTQTHSQSQSPSPSPVKAAFVWTKTKLAEESGSPSPCIITPRLRKAREEFNALLEA
                        SV  ++ + +  Q +  + +  +    ++ T       SP  C ITPRL KAR++FN  L A
Subjt:  ---------------NSVGARTQTQTHSQSQSPSPSPVKAAFVWTKTKLAEESGSPSPCIITPRLRKAREEFNALLEA

AT4G20190.1 unknown protein3.7e-1528.87Show/hide
Query:  VGDVVIEVSTQFKLESYSVPNSAYSSPRSG------------------VCDGLN--RRSKSCGEGRG------------KARQHGHGLVENKVVVVGKKG
        + D+ ++   + K  S S+PNSA +SPR+                   V D     RRSKSCGEGR             K+R   H    ++    G   
Subjt:  VGDVVIEVSTQFKLESYSVPNSAYSSPRSG------------------VCDGLN--RRSKSCGEGRG------------KARQHGHGLVENKVVVVGKKG

Query:  HNHKEMERP--------SKTENGGKARC----------------FSCGALCLLLPGFGKGK---GERKEEAEAEAEGGGCISISISR------------R
         N K +           SKTE+    R                 F C ALCL LPGF KGK     RK ++          S S++R            R
Subjt:  HNHKEMERP--------SKTENGGKARC----------------FSCGALCLLLPGFGKGK---GERKEEAEAEAEGGGCISISISR------------R

Query:  VSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMELIRNSVGARTQTQTHSQSQSPSPSPVKAAFVWTKTK--------LAEESGSPS---------
         SLE+FECGSW SS M+  ++    +  G  +FDLP ELI+   G   Q             PV AAFV+ K          + + SGS S         
Subjt:  VSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMELIRNSVGARTQTQTHSQSQSPSPSPVKAAFVWTKTK--------LAEESGSPS---------

Query:  ---------------PCIITPRLRKAREEFNALLEA
                          ITPRL +A E+F++ LEA
Subjt:  ---------------PCIITPRLRKAREEFNALLEA

AT5G44660.1 unknown protein4.1e-1429.27Show/hide
Query:  ESYSVPNSAYSSP--RSGVCDGLN-----------RRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEMERPSKTENGGKARCFSCGALCLLLPGF
        E  S+PNS   SP  RSG+   L            +RSKSCG    K   H    + N   +   K  ++K +   S  E+      F C ALCL LPGF
Subjt:  ESYSVPNSAYSSP--RSGVCDGLN-----------RRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEMERPSKTENGGKARCFSCGALCLLLPGF

Query:  GKGKGERKEEAEAEA----------EGGGCISIS--------------ISRRVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMELIRNSVGART
         KGK  R  + +  +               I++S              IS R S+EKF+CGS+ S           G   G+ +FDLP ELI++  G   
Subjt:  GKGKGERKEEAEAEA----------EGGGCISIS--------------ISRRVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMELIRNSVGART

Query:  QTQTHSQSQSPSPSPVKAAFVWTKTKLAEE-------SGS------------------------PSPCIITPRLRKAREEFNALLEA
                 +    PV AAFV+ K  + +E       SGS                        P+   I+PRL +A + FNA LEA
Subjt:  QTQTHSQSQSPSPSPVKAAFVWTKTKLAEE-------SGS------------------------PSPCIITPRLRKAREEFNALLEA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCCACGGCGCTTCCTTTATCTGTTGGCATCAGTAACAGAGGCTACGATTTTGTTGGGGATGTGGTTATCGAGGTGTCAACGCAATTCAAGTTGGAAAGCTACAG
TGTCCCAAACTCGGCCTATTCATCCCCGCGGAGTGGGGTCTGTGATGGGCTGAATCGGAGGAGCAAGTCGTGTGGTGAAGGAAGAGGGAAGGCACGGCAGCATGGTCATG
GTCTTGTTGAGAATAAAGTAGTGGTAGTAGGAAAGAAAGGGCATAATCACAAGGAGATGGAAAGGCCTTCGAAAACAGAAAATGGGGGAAAAGCAAGGTGTTTCAGTTGT
GGGGCACTATGCTTGTTGCTTCCAGGTTTTGGGAAAGGCAAGGGGGAGAGAAAGGAAGAGGCAGAGGCAGAGGCAGAGGGAGGTGGGTGTATATCCATATCCATATCGAG
GAGAGTTTCTTTAGAAAAATTCGAATGCGGTTCATGGGCTTCATCGGGCATGGTGGTTCATGAGGATGGCATCGGCGGGTCAGGCGCAGGGAGCCTTTACTTTGATCTGC
CAATGGAATTGATAAGGAACAGCGTGGGTGCTCGAACACAAACACAAACACATTCACAATCACAATCACCGTCACCATCACCAGTAAAGGCCGCTTTTGTTTGGACCAAA
ACAAAATTGGCGGAGGAATCAGGCTCCCCATCTCCATGCATCATTACCCCACGCCTGCGCAAAGCCAGAGAAGAGTTCAATGCACTTTTGGAAGCCCACACTCACACTCT
ATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGCCACGGCGCTTCCTTTATCTGTTGGCATCAGTAACAGAGGCTACGATTTTGTTGGGGATGTGGTTATCGAGGTGTCAACGCAATTCAAGTTGGAAAGCTACAG
TGTCCCAAACTCGGCCTATTCATCCCCGCGGAGTGGGGTCTGTGATGGGCTGAATCGGAGGAGCAAGTCGTGTGGTGAAGGAAGAGGGAAGGCACGGCAGCATGGTCATG
GTCTTGTTGAGAATAAAGTAGTGGTAGTAGGAAAGAAAGGGCATAATCACAAGGAGATGGAAAGGCCTTCGAAAACAGAAAATGGGGGAAAAGCAAGGTGTTTCAGTTGT
GGGGCACTATGCTTGTTGCTTCCAGGTTTTGGGAAAGGCAAGGGGGAGAGAAAGGAAGAGGCAGAGGCAGAGGCAGAGGGAGGTGGGTGTATATCCATATCCATATCGAG
GAGAGTTTCTTTAGAAAAATTCGAATGCGGTTCATGGGCTTCATCGGGCATGGTGGTTCATGAGGATGGCATCGGCGGGTCAGGCGCAGGGAGCCTTTACTTTGATCTGC
CAATGGAATTGATAAGGAACAGCGTGGGTGCTCGAACACAAACACAAACACATTCACAATCACAATCACCGTCACCATCACCAGTAAAGGCCGCTTTTGTTTGGACCAAA
ACAAAATTGGCGGAGGAATCAGGCTCCCCATCTCCATGCATCATTACCCCACGCCTGCGCAAAGCCAGAGAAGAGTTCAATGCACTTTTGGAAGCCCACACTCACACTCT
ATGA
Protein sequenceShow/hide protein sequence
MAATALPLSVGISNRGYDFVGDVVIEVSTQFKLESYSVPNSAYSSPRSGVCDGLNRRSKSCGEGRGKARQHGHGLVENKVVVVGKKGHNHKEMERPSKTENGGKARCFSC
GALCLLLPGFGKGKGERKEEAEAEAEGGGCISISISRRVSLEKFECGSWASSGMVVHEDGIGGSGAGSLYFDLPMELIRNSVGARTQTQTHSQSQSPSPSPVKAAFVWTK
TKLAEESGSPSPCIITPRLRKAREEFNALLEAHTHTL