CuGenDBv2

Tan0009717 (gene) of Snake gourd v1 genome

Gene ID: Tan0009717
Organism: Trichosanthes anguina (Snake gourd v1)
Description: FAR1 family protein
Genome location: LG06:11439993..11443239
RNA-Seq Expression: Tan0009717
Synteny: Tan0009717
Gene Ontology terms: NA
InterPro domains: IPR004330 - FAR1 DNA binding domain


Homology
GenBank top hits (e value, % identity, alignment)
KAG6576034.1 AUGMIN subunit 3, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.5e-73 | % identity: 88.68
Query:  MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK
        +DGIPT +EDGDVVENP GK L R+V+EAEA AT+ISDTEP VGMEFESEESAKVFYDAYASRLGF++RVDAFRRSMRDG+VVWRRLVCNKEGFRK RPK
Subjt:  MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK

Query:  RSENRKPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLTR
        RSENRKPRAITREGCKAMVVVKKEK+GKW+VTKFVKDHNHPLIVTPASARRNVLLS TR
Subjt:  RSENRKPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLTR

KAG7014556.1 Protein FAR1-RELATED SEQUENCE 5 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 2.6e-73 | % identity: 88.68
Query:  MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK
        +DGIPT +EDGDVVENP GK   R+V+EAEA AT+ISDTEP VGMEFESEESAKVFYDAYASRLGF++RVDAFRRSMRDG+VVWRRLVCNKEGFRK RPK
Subjt:  MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK

Query:  RSENRKPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLTR
        RSENRKPRAITREGCKAMVVVKKEKTGKW+VTKFVKDHNHPLIVTPASARRNVLLS TR
Subjt:  RSENRKPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLTR

XP_022991203.1 protein FAR1-RELATED SEQUENCE 5 [Cucurbita maxima] | e value: 1.1e-74 | % identity: 90.57
Query:  MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK
        MDGIPT +EDGDVVENP GK LVR+V+EAEA AT+ISDTEP VGMEFESEESAKVFYDAYASRLGFI+RVDAFRRSMRDG+VVWRRLVCNKEGFRK RPK
Subjt:  MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK

Query:  RSENRKPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLTR
        RSENRKPRAITREGCKAMVVVKKEKTGKW+VTKFVKDHNHPLIVTPASARRN+LLS TR
Subjt:  RSENRKPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLTR

XP_023547555.1 protein FAR1-RELATED SEQUENCE 5 [Cucurbita pepo subsp. pepo] | e value: 8.9e-74 | % identity: 89.94
Query:  MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK
        MDGIPT +EDGDVVENP GK L R+V+EAEA AT+ISDTEP VGMEFESEESAKVFYDAYASRLGFI+RVDAFRRSMRDG+VVWRRLVCNKEGFRK RPK
Subjt:  MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK

Query:  RSENRKPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLTR
        RSENRKPRAITREGCKAMVVVKKEK+GKW+VTKFVKDHNHPLIVTPASARRNVLLS TR
Subjt:  RSENRKPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLTR

XP_038897039.1 protein FAR1-RELATED SEQUENCE 5 isoform X1 [Benincasa hispida] | e value: 8.0e-75 | % identity: 92.45
Query:  MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK
        MDGIPTV+EDGDVVENP GKD  RVVAEAEA AT+ SDTEP VGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK
Subjt:  MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK

Query:  RSENRKPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLTR
        RSENRKPRAITREGCKAMVVVKK+K+GKWVVTKFVKDHNHPLI+TPASARRNVLLS TR
Subjt:  RSENRKPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLTR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KA59 FAR1 domain-containing protein | e value: 1.0e-67 | % identity: 85.53
Query:  MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK
        MDGIP+V+   +VVE+P GKD  R V EAEA  ++ SDTEP VGMEFESEES KVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRK +PK
Subjt:  MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK

Query:  RSENRKPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLTR
        RSENRKPRA+TREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLS TR
Subjt:  RSENRKPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLTR

A0A1S3BGE3 protein FAR-RED IMPAIRED RESPONSE 1 isoform X1 | e value: 3.8e-70 | % identity: 88.05
Query:  MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK
        MDGIPTVV   DVVE+P GKD  RVV EAEA  +++SDTEP VGMEFESEES KVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRK +PK
Subjt:  MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK

Query:  RSENRKPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLTR
        RSENRKPRA+TREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLS TR
Subjt:  RSENRKPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLTR

A0A2I4EFQ2 protein FAR1-RELATED SEQUENCE 7 | e value: 8.7e-59 | % identity: 75.47
Query:  MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATD-ISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRP
        MD  P++ EDGD++E+  GK+L+          TD  SD EP VGMEFESEE+AKVFYDAYA+RLGFIMRVDAFRRSMRDG VVWRRLVCNKEGFRKLRP
Subjt:  MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATD-ISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRP

Query:  KRSENRKPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLT
        KRSENRKPRAITREGCKAM+VVKKEKTGKW+VT+FVK+HNHPL+ TPA+ RR+VLLS T
Subjt:  KRSENRKPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLT

A0A6J1GPB7 protein FAR1-RELATED SEQUENCE 5 | e value: 2.1e-73 | % identity: 88.05
Query:  MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK
        +DGIPT +EDGD VENP GK L R+V+EAEA AT+ISDTEP VGMEFESEESAKVFYDAYASRLGF++RVDAFRRSMRDG+VVWRRLVCNKEGFRK RPK
Subjt:  MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK

Query:  RSENRKPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLTR
        RSENRKPRAITREGCKAMVVVKKEK+GKW+VTKFVKDHNHPLIVTPASARRNVLLS TR
Subjt:  RSENRKPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLTR

A0A6J1JVH7 protein FAR1-RELATED SEQUENCE 5 | e value: 5.1e-75 | % identity: 90.57
Query:  MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK
        MDGIPT +EDGDVVENP GK LVR+V+EAEA AT+ISDTEP VGMEFESEESAKVFYDAYASRLGFI+RVDAFRRSMRDG+VVWRRLVCNKEGFRK RPK
Subjt:  MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK

Query:  RSENRKPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLTR
        RSENRKPRAITREGCKAMVVVKKEKTGKW+VTKFVKDHNHPLIVTPASARRN+LLS TR
Subjt:  RSENRKPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLTR

SwissProt top hits (e value, % identity, alignment)
Q3E7I5 Protein FAR1-RELATED SEQUENCE 12 | e value: 3.7e-14 | % identity: 37.84
Query:  RATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPKRSENRKPRAITREGCKAMVVVKKEKTGKWVV
        +A  ++ TEP  G+EF S   A  FY AYA  +GF +R+    RS  DG++  RR VC++EGF+               +R GC A + +K++ +G W+V
Subjt:  RATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPKRSENRKPRAITREGCKAMVVVKKEKTGKWVV

Query:  TKFVKDHNHPL
         +  KDHNH L
Subjt:  TKFVKDHNHPL

Q9M8J3 Protein FAR1-RELATED SEQUENCE 7 | e value: 2.8e-14 | % identity: 38.79
Query:  AEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPKRSENRKPRAITREGCKAMVVVKKEKT
        AE     T    TEP  G+EF S   A  FY AYA  +GF +R+    RS  DG++  RR VC+KEGF+               +R GC A + +K++ +
Subjt:  AEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPKRSENRKPRAITREGCKAMVVVKKEKT

Query:  GKWVVTKFVKDHNHPL
        G W+V +  KDHNH L
Subjt:  GKWVVTKFVKDHNHPL

Q9SWG3 Protein FAR-RED IMPAIRED RESPONSE 1 | e value: 8.5e-11 | % identity: 27.27
Query:  DVVENPRGKDLVRVVAE----AEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPKRSENRKP
        D+V  P     + +V E     +   +   D EP  G++F++ E+A +FY  YA  +GF   +   RRS +    +  +  C++ G          + + 
Subjt:  DVVENPRGKDLVRVVAE----AEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPKRSENRKP

Query:  RAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASA-----RRNVLLSLTRVRATLILIHFVFIYSGMMFVLVSSLPLSFTLCTSILELYLS
          + +  CKA + VK+   GKW++ +FVKDHNH L+  PA A     +RNV L+    +  + ++H V   +  M+V +S     +    S+L+  +S
Subjt:  RAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASA-----RRNVLLSLTRVRATLILIHFVFIYSGMMFVLVSSLPLSFTLCTSILELYLS

Q9SZL8 Protein FAR1-RELATED SEQUENCE 5 | e value: 2.8e-30 | % identity: 48.63
Query:  VEDGDVVEN---PRGKDLV----RVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK
        ++D D++++   P G  LV          E  A D+ D EP  G+EFESEE+AK FY++YA R+GF  RV + RRS RDGA++ R+ VC KEGFR +  K
Subjt:  VEDGDVVEN---PRGKDLV----RVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK

Query:  RSENR---KPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLI
        R+++R   +PR ITR GCKA + VK + +GKW+V+ FVKDHNH L+
Subjt:  RSENR---KPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLI

Q9ZVC9 Protein FAR1-RELATED SEQUENCE 3 | e value: 1.1e-13 | % identity: 43.52
Query:  DISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPKRSENRKPRAITREGCKAMVVVKKEKTGKWVVTKF
        +I   EPCVGMEF SE+ AK FYD Y+ +LGF  ++        DG+V  R  VC+         KRS+ R       E C AMV ++ +   KWVVTKF
Subjt:  DISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPKRSENRKPRAITREGCKAMVVVKKEKTGKWVVTKF

Query:  VKDHNHPL
        VK+H H L
Subjt:  VKDHNHPL

Arabidopsis top hits (e value, % identity, alignment)
AT2G43280.1 Far-red impaired responsive (FAR1) family protein | e value: 5.3e-32 | % identity: 56.2
Query:  TDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGF-RKLRPKRSENRKPRAITREGCKAMVVVKKEKTGKWVVT
        TD    EP  G+ FESE++AK+FYD Y+ RLGF+MRV + RRS +DG ++ RR  CNKEG    +R K    RKPR  TREGCKAM+ VK +++GKWV+T
Subjt:  TDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGF-RKLRPKRSENRKPRAITREGCKAMVVVKKEKTGKWVVT

Query:  KFVKDHNHPLIVTPASARRNV
        KFVK+HNHPL+V+P  AR  +
Subjt:  KFVKDHNHPLIVTPASARRNV

AT3G07500.1 Far-red impaired responsive (FAR1) family protein | e value: 4.4e-47 | % identity: 59.49
Query:  MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK
        M+G     ED +++EN    D+++ V +  +        EP +GMEFESEE+AK FYD YA+ +GF+MRVDAFRRSMRDG VVWRRLVCNKEGFR+ RP+
Subjt:  MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK

Query:  RSENRKPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLT
        RSE+RKPRAITREGCKA++VVK+EK+G W+VTKF K+HNHPL+    + RRN  L  T
Subjt:  RSENRKPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLT

AT3G59470.1 Far-red impaired responsive (FAR1) family protein | e value: 1.1e-29 | % identity: 51.59
Query:  AEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPKRSENRKPRAITREGCKAMVVVKKEKTGK
        A A  T+    EP VG EFESE +A  FY+AYA+++GF++RV    RS  DG+ + R+LVCNKEG+R L  KR +  + RA TR GCKAM++++KE +GK
Subjt:  AEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPKRSENRKPRAITREGCKAMVVVKKEKTGK

Query:  WVVTKFVKDHNHPLIVTPASARRNVL
        WV+TKFVK+HNH L+  P   RR  +
Subjt:  WVVTKFVKDHNHPLIVTPASARRNVL

AT3G59470.2 Far-red impaired responsive (FAR1) family protein | e value: 1.1e-29 | % identity: 51.59
Query:  AEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPKRSENRKPRAITREGCKAMVVVKKEKTGK
        A A  T+    EP VG EFESE +A  FY+AYA+++GF++RV    RS  DG+ + R+LVCNKEG+R L  KR +  + RA TR GCKAM++++KE +GK
Subjt:  AEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPKRSENRKPRAITREGCKAMVVVKKEKTGK

Query:  WVVTKFVKDHNHPLIVTPASARRNVL
        WV+TKFVK+HNH L+  P   RR  +
Subjt:  WVVTKFVKDHNHPLIVTPASARRNVL

AT4G38180.1 FAR1-related sequence 5 | e value: 2.0e-31 | % identity: 48.63
Query:  VEDGDVVEN---PRGKDLV----RVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK
        ++D D++++   P G  LV          E  A D+ D EP  G+EFESEE+AK FY++YA R+GF  RV + RRS RDGA++ R+ VC KEGFR +  K
Subjt:  VEDGDVVEN---PRGKDLV----RVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPK

Query:  RSENR---KPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLI
        R+++R   +PR ITR GCKA + VK + +GKW+V+ FVKDHNH L+
Subjt:  RSENR---KPRAITREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLI


Sequences
CDS sequence
ATGGATGGAATCCCCACTGTGGTCGAGGATGGGGATGTTGTAGAAAACCCTAGAGGGAAGGATTTGGTCAGAGTAGTTGCTGAAGCTGAGGCGAGAGCGACCGATATTTC
AGATACAGAGCCATGTGTAGGTATGGAGTTTGAATCTGAGGAGTCTGCCAAGGTTTTCTACGATGCTTATGCTTCACGCCTTGGTTTTATTATGCGCGTTGATGCATTTC
GTCGGTCGATGCGTGATGGTGCGGTTGTTTGGCGTCGACTTGTTTGTAATAAAGAGGGGTTTCGCAAGTTGAGGCCCAAAAGAAGTGAAAATAGGAAGCCTCGAGCTATA
ACAAGAGAAGGGTGTAAGGCAATGGTTGTTGTAAAGAAAGAGAAAACTGGAAAATGGGTTGTAACGAAATTCGTCAAGGATCACAATCATCCACTGATTGTTACTCCTGC
CAGTGCTCGTCGAAACGTTCTCCTATCCCTGACACGGGTAAGGGCTACTTTGATTTTGATACACTTTGTGTTCATTTATTCTGGAATGATGTTTGTTCTAGTTTCATCTT
TGCCCTTAAGTTTCACTTTGTGTACCTCTATTCTGGAATTATATCTGTCCTGGTCTTCATCTTGGCCCTTAACAATAGAACCTGAAAAATTGAAGCCTGAGAATGTGAGG
GGAACTCGAAGCATAGAAATTCAGATGTCTATGTTTGGTATGAAGGAATAG
mRNA sequence
GGAAGGCTGGACGGTGTCAGTGAACACACTGTAATTTTCACTCGACTGTTTGCCATTTTGGTTGGACTCATTCATGGAAGTTTTTCAGGGTTTAGTCCAAACTTCGTAAC
CCTCACTCTCATGCCGAAAGAACCACCCAAAAATCTCGATTTCAACGACCCCAGAAGCTTCTCGATTCACGATTCACACTACAACCACCGGAAAACACCGAAGAAATGAG
ATTGGAACCCGAAGAATTTGCGGCGTCGAATGCAATTCGAATAATCAAGGAACCGTGAATCCCAAGATAGGGACGAATCAAATGGATGGAATCCCCACTGTGGTCGAGGA
TGGGGATGTTGTAGAAAACCCTAGAGGGAAGGATTTGGTCAGAGTAGTTGCTGAAGCTGAGGCGAGAGCGACCGATATTTCAGATACAGAGCCATGTGTAGGTATGGAGT
TTGAATCTGAGGAGTCTGCCAAGGTTTTCTACGATGCTTATGCTTCACGCCTTGGTTTTATTATGCGCGTTGATGCATTTCGTCGGTCGATGCGTGATGGTGCGGTTGTT
TGGCGTCGACTTGTTTGTAATAAAGAGGGGTTTCGCAAGTTGAGGCCCAAAAGAAGTGAAAATAGGAAGCCTCGAGCTATAACAAGAGAAGGGTGTAAGGCAATGGTTGT
TGTAAAGAAAGAGAAAACTGGAAAATGGGTTGTAACGAAATTCGTCAAGGATCACAATCATCCACTGATTGTTACTCCTGCCAGTGCTCGTCGAAACGTTCTCCTATCCC
TGACACGGGTAAGGGCTACTTTGATTTTGATACACTTTGTGTTCATTTATTCTGGAATGATGTTTGTTCTAGTTTCATCTTTGCCCTTAAGTTTCACTTTGTGTACCTCT
ATTCTGGAATTATATCTGTCCTGGTCTTCATCTTGGCCCTTAACAATAGAACCTGAAAAATTGAAGCCTGAGAATGTGAGGGGAACTCGAAGCATAGAAATTCAGATGTC
TATGTTTGGTATGAAGGAATAGGATAGCTAGAATTCAGGTGTCAAATAAGTGTTGAGTTTGTTGTTTCAAGTTTGTGGACTAGAATGTGAAATTATATATTCTTCAATGA
ATTAGGGAAGCACTTCTTCTTCCGAGTCCAGGCCTTGGGAAGGGCACGGGTACCCGGGGTATAGTGGAGCAAAGCTTCGACTCCAGGTTATCAAAAAAAAAAAAATTAGG
GAAGCGAAAATTAGCCTTTCTGAATTACTTATCATTTGTTAGTTCATGTAAGTTTATGCATTGATATTGAGCAAATCTACTGGAGGTCCTTTGCATACTTATGTCGTCCA
AGAAACAACCCCTTTTCGACGTGCCCTACCTTTGGTCCAATTGATATACTTTAGATTTCCAAATTCCTTCATTTCAAAACAAACAAATAAGTACCATTTTATTTGTTGAA
TTTCATGGCCTTTGTTATGATCAGATCATCCACGTAGACTAATATGATTGTGACCTCTAACTCTCCCTTTCATAAATAAGTTGGAATCGACCAGAGCCACTGTATAGCTA
CTCTAGACTAGAAAATTTTCAATCATGAAGGTCTATAGTCTTAAGAGTGTGTGATAGGGAGGGTCCTGCTTGTAATGGGATTTATTTTTTGTTTTGTATTTCTATGGGGT
CCAGGTATAGTAGTTAGTCTAGGCTGCTTTGCTGTTAAGGGGAGTGTATAGAGGTAATGGATTCTGGTGGAGGGTATGCTTTTGGCAATTATTTAACTTGTGCGCAAAAG
AGCTCTCGAATCTCCCTCTGCTTATACCTTCCTAATACTTGAATAAACTGGTTTCTATCATTGTAACCCTGTTTCTTTTTTGTTCTTATTTGCCGAACAATAAACACGCG
TTGAAATTTGAAGCAATTTTGGAGAAATGACATTTGGAAATAAGTCTTGAAATATTTATTTTAGATTTCCTACTCGATGATTTACAAATTTTTTACGTGTGAATGAAGAT
GAAGGATGAAGTTATTATTTTGCCTTCACACACTTTTACATCTGGCTGGCTGCCTCTTCATTCTTGCAAAGAACGCATCAACTTAGTGAAAGCTAACAGAAGGAAACTCC
TCCTTCCCAAAGTATCACTAGTGTTGATCCCCTGAAGAAGAGGTTACTAAATTGCTCTTGTTAAATGCTTGGGGTATTACGATGTTTGAATCAATATGGATGAGGTTTAG
TGATAGTTTTTTTATTAGCATGTTTATGGAATCCTCTTTTTGTGAGCATAGTGAAATGGCTATTGAAAAAAATGAATGGGTTGAAATTTAACTTAGAGAGTGGGAAAAGC
CGAACAAAGCAACATAATGAGTGTCAACTTGAGCATAGTTAAGTGAATCAACACTTATTACTTCCCCTTAACGTTGAAGATTCAATCTCCAACCTCCCACATTTGTAGTA
CTCGAAACAACCAAATGGACAAAGGTGTCCATTTTCCTAAGGAGGGTCAAACAAAAGGCGTCCAATCCACATGGATAAGAAAAGTTTTCGAGTAGTCAAAAGAAGAACGC
TCGAGTATTCCAATGTAAATAAAGAGGGAAATCTCTTGATTTTCTTTATCCTTTTGCTGCCTCTGCTCTCAACCATAGTGTACGAGATGCGTAGTTTATATATTTCCATG
TTTCTGTTATTATTATTTCAGGATGAGAAAGATGCGAAAATTCGAGAATTAACTGCCGAACTACACCGAGAACGAAAGCGATGTGCAGCTTATCAAGAACAGCTTGTCAT
GATTTTAAGAGACATGGAGGAGCACTCGAATCATCTAGCAAGAAACATAGATGACATTGTTCAAAGTGTGAAAGATATTGAATCAAAAAAGTAAAAACAATTCAATTTCA
AATACCTAACCACAGAATCCAAAGAAGAAATATATTTGTTTTTGTAAAAATTGTCTAGCATAACTGATTATAGTTCAGACTTAAATTTGTTCATTAGTCGCGCCCTTTTC
CCTCTTATTAGGTGGTGTCTTTTATGGTTAAGGGAAATAAAAGTTTTTAAGAGCTAAATAGGTGTTTTTAAGCACTTTGATAATCTGCTGCGGCAATATTATAATTCAAA
TTAGATAATTGAG
Protein sequence
MDGIPTVVEDGDVVENPRGKDLVRVVAEAEARATDISDTEPCVGMEFESEESAKVFYDAYASRLGFIMRVDAFRRSMRDGAVVWRRLVCNKEGFRKLRPKRSENRKPRAI
TREGCKAMVVVKKEKTGKWVVTKFVKDHNHPLIVTPASARRNVLLSLTRVRATLILIHFVFIYSGMMFVLVSSLPLSFTLCTSILELYLSWSSSWPLTIEPEKLKPENVR
GTRSIEIQMSMFGMKE
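
The CDS and protein sequences listed above can be cross-checked by translating the CDS in frame. The following is a minimal sketch, not part of the CuGenDB record: it assumes the standard genetic code (NCBI translation table 1) and reading frame 0, and the helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in TCAG order.
# Assumption: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence
    lines can be pasted in as-is. Unknown codons become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 21 nt and last 21 nt of the CDS sequence listed above.
print(translate("ATGGATGGAATCCCCACTGTG"))  # MDGIPTV
print(translate("ATGTTTGGTATGAAGGAATAG"))  # MFGMKE
```

Passing the full CDS text to `translate` and comparing the result with the listed protein sequence is a quick consistency check on the record.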