; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002008 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002008
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRibonuclease H
Genome locationscaffold10:3737830..3744149
RNA-Seq ExpressionSpg002008
SyntenySpg002008
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004316 - SWEET sugar transporter
IPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022147761.1 uncharacterized protein LOC111016619 [Momordica charantia]4.9e-3653.06Show/hide
Query:  MGVEVPQKFKVPTFQQYDGKKGPVQHLNAYLTWMDFHGANEATRCRAFTLTLIGLARQWFGKIPRRSIGSFKELARAFVTQFLGARNHRKPQINLLTVKQ
        M  +VP KFK+P  ++YDG   P+ HL+ Y  W D +G  EA RCR F+ TL G  R WF ++ R+SI SFKELARAFVTQF G  N  +P   LLT+KQ
Subjt:  MGVEVPQKFKVPTFQQYDGKKGPVQHLNAYLTWMDFHGANEATRCRAFTLTLIGLARQWFGKIPRRSIGSFKELARAFVTQFLGARNHRKPQINLLTVKQ

Query:  GPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDERLLTSIGE
           ESLKDY+ RF+ E LQVEG  D V L   ISG++DE+L+ S G+
Subjt:  GPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDERLLTSIGE

XP_022158344.1 uncharacterized protein LOC111024851 [Momordica charantia]1.4e-3555Show/hide
Query:  KFKVPTFQQYDGKKGPVQHLNAYLTWMDFHGANEATRCRAFTLTLIGLARQWFGKIPRRSIGSFKELARAFVTQFLGARNHRKPQINLLTVKQGPRESLK
        KFK+P   +YDG   P+ HL+AY  W D +   EA RCR F+ TL G AR WF ++ R SI SFKELA AFVTQF+G R   KP   LLT+KQ   ESLK
Subjt:  KFKVPTFQQYDGKKGPVQHLNAYLTWMDFHGANEATRCRAFTLTLIGLARQWFGKIPRRSIGSFKELARAFVTQFLGARNHRKPQINLLTVKQGPRESLK

Query:  DYITRFSNEVLQVEGYDDGVALTAVISGLQDERLLTSIGE
        +Y+ RF+ E LQVEG  D VAL A +SG++DERL+ S G+
Subjt:  DYITRFSNEVLQVEGYDDGVALTAVISGLQDERLLTSIGE

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]5.4e-4343.18Show/hide
Query:  NDQLP--RDPRKGKEAVHKEVSDA-ESVTSRMQ------------DP-KNDWTNREPGPSHKNVRRGRREQE---LSKWLKEEDNPRDSYRRTENE----
        +D++P  RDP+KGK     +  ++  SV S+++            DP K    ++ P P+        R  E   L K  K  D P  S +R  ++    
Subjt:  NDQLP--RDPRKGKEAVHKEVSDA-ESVTSRMQ------------DP-KNDWTNREPGPSHKNVRRGRREQE---LSKWLKEEDNPRDSYRRTENE----

Query:  DIEELIGQMEPPFTDDIMGVEVPQKFKVPTFQQYDGKKGPVQHLNAYLTWMDFHGANEATRCRAFTLTLIGLARQWFGKIPRRSIGSFKELARAFVTQFL
        D+EEL+ Q + PFT++IM  +VP KFK+PT +Q+D    PV HL+AY  WMD +G +EA RCR F+ TL G AR WF ++ R SI SFK LARAFVTQF+
Subjt:  DIEELIGQMEPPFTDDIMGVEVPQKFKVPTFQQYDGKKGPVQHLNAYLTWMDFHGANEATRCRAFTLTLIGLARQWFGKIPRRSIGSFKELARAFVTQFL

Query:  GARNHRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDERLLTSIGE
        G R   +P   LLT+KQ   ESL+DY+ RF+ E LQVEG  D V+L A +SG++DE L  S G+
Subjt:  GARNHRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDERLLTSIGE

XP_022159109.1 uncharacterized protein LOC111025548 [Momordica charantia]2.5e-4047.34Show/hide
Query:  KNDWTNREPGPSHKNVRRGRREQELSKWLKEEDNPRDSYRRTENE----DIEELIGQMEPPFTDDIMGVEVPQKFKVPTFQQYDGKKGPVQHLNAYLTWM
        KND  NR   P   N  +G          K  D P  S +R   +    D+EEL+GQ + PFT++IM  +VP KFK+PT + +DG   PV HL+AY  WM
Subjt:  KNDWTNREPGPSHKNVRRGRREQELSKWLKEEDNPRDSYRRTENE----DIEELIGQMEPPFTDDIMGVEVPQKFKVPTFQQYDGKKGPVQHLNAYLTWM

Query:  DFHGANEATRCRAFTLTLIGLARQWFGKIPRRSIGSFKELARAFVTQFLGARNHRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVIS
        D +G ++A RCR F+ TL G AR WF ++ R SI SFK LARAF+TQF+G R   +P   LLT+KQ   ESL DY+ RF+ E LQ+EG  D V+L A +S
Subjt:  DFHGANEATRCRAFTLTLIGLARQWFGKIPRRSIGSFKELARAFVTQFLGARNHRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVIS

Query:  GLQDERL
        G++DE L
Subjt:  GLQDERL

XP_024041095.1 uncharacterized protein LOC112098853 [Citrus clementina]1.7e-3637.7Show/hide
Query:  QLPRDPRKGKEAVHKEVSDAESVTSRMQDPKNDWTNREPGPSHKNVRRGRRE-----QELSKWLKEEDNPRDSYRRTENEDIEELIGQMEPPFTDDIMGV
        Q P    +G+ +    V D+ S  +R ++  N+   R      ++ RR  RE     +E+ + L+E     +   R E   ++++  + EPPFT DIM  
Subjt:  QLPRDPRKGKEAVHKEVSDAESVTSRMQDPKNDWTNREPGPSHKNVRRGRRE-----QELSKWLKEEDNPRDSYRRTENEDIEELIGQMEPPFTDDIMGV

Query:  EVPQKFKVPTFQQYDGKKGPVQHLNAYLTWMDFHGANEATRCRAFTLTLIGLARQWFGKIPRRSIGSFKELARAFVTQFLGARNHRKPQINLLTVKQGPR
        + P +F +P  + YDG++ P +HL  Y T M+  GA++A  CRAF LTL+G AR+WF ++   SI SF +L+R F + F  AR   KP   LLTVKQ   
Subjt:  EVPQKFKVPTFQQYDGKKGPVQHLNAYLTWMDFHGANEATRCRAFTLTLIGLARQWFGKIPRRSIGSFKELARAFVTQFLGARNHRKPQINLLTVKQGPR

Query:  ESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDERLLTSIGE
        E+L+DYI R++NE+ QV+GYDDG+AL+ ++ GL+  +L  S+ +
Subjt:  ESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDERLLTSIGE

TrEMBL top hitse value%identityAlignment
A0A6J1D3B7 uncharacterized protein LOC1110166192.4e-3653.06Show/hide
Query:  MGVEVPQKFKVPTFQQYDGKKGPVQHLNAYLTWMDFHGANEATRCRAFTLTLIGLARQWFGKIPRRSIGSFKELARAFVTQFLGARNHRKPQINLLTVKQ
        M  +VP KFK+P  ++YDG   P+ HL+ Y  W D +G  EA RCR F+ TL G  R WF ++ R+SI SFKELARAFVTQF G  N  +P   LLT+KQ
Subjt:  MGVEVPQKFKVPTFQQYDGKKGPVQHLNAYLTWMDFHGANEATRCRAFTLTLIGLARQWFGKIPRRSIGSFKELARAFVTQFLGARNHRKPQINLLTVKQ

Query:  GPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDERLLTSIGE
           ESLKDY+ RF+ E LQVEG  D V L   ISG++DE+L+ S G+
Subjt:  GPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDERLLTSIGE

A0A6J1D7D2 uncharacterized protein LOC1110183079.0e-3652Show/hide
Query:  DDIMGVEVPQKFKVPTFQQYDGKKGPVQHLNAYLTWMDFHGANEATRCRAFTLTLIGLARQWFGKIPRRSIGSFKELARAFVTQFLGARNHRKPQINLLT
        ++IM V+VP KFK+PT +Q+DG    V HL+AY  WMD +G +EA +CR F+ TL G AR WF ++ R SI SFK LA+AFVTQF+G R+  +P   LLT
Subjt:  DDIMGVEVPQKFKVPTFQQYDGKKGPVQHLNAYLTWMDFHGANEATRCRAFTLTLIGLARQWFGKIPRRSIGSFKELARAFVTQFLGARNHRKPQINLLT

Query:  VKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDERLLTSIGE
        +KQ   ESL DY+ RF+ E LQVEG  + V+L A +S ++DE L  S G+
Subjt:  VKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDERLLTSIGE

A0A6J1DWY0 uncharacterized protein LOC1110252932.6e-4343.18Show/hide
Query:  NDQLP--RDPRKGKEAVHKEVSDA-ESVTSRMQ------------DP-KNDWTNREPGPSHKNVRRGRREQE---LSKWLKEEDNPRDSYRRTENE----
        +D++P  RDP+KGK     +  ++  SV S+++            DP K    ++ P P+        R  E   L K  K  D P  S +R  ++    
Subjt:  NDQLP--RDPRKGKEAVHKEVSDA-ESVTSRMQ------------DP-KNDWTNREPGPSHKNVRRGRREQE---LSKWLKEEDNPRDSYRRTENE----

Query:  DIEELIGQMEPPFTDDIMGVEVPQKFKVPTFQQYDGKKGPVQHLNAYLTWMDFHGANEATRCRAFTLTLIGLARQWFGKIPRRSIGSFKELARAFVTQFL
        D+EEL+ Q + PFT++IM  +VP KFK+PT +Q+D    PV HL+AY  WMD +G +EA RCR F+ TL G AR WF ++ R SI SFK LARAFVTQF+
Subjt:  DIEELIGQMEPPFTDDIMGVEVPQKFKVPTFQQYDGKKGPVQHLNAYLTWMDFHGANEATRCRAFTLTLIGLARQWFGKIPRRSIGSFKELARAFVTQFL

Query:  GARNHRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDERLLTSIGE
        G R   +P   LLT+KQ   ESL+DY+ RF+ E LQVEG  D V+L A +SG++DE L  S G+
Subjt:  GARNHRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDERLLTSIGE

A0A6J1DZ49 uncharacterized protein LOC1110248516.9e-3655Show/hide
Query:  KFKVPTFQQYDGKKGPVQHLNAYLTWMDFHGANEATRCRAFTLTLIGLARQWFGKIPRRSIGSFKELARAFVTQFLGARNHRKPQINLLTVKQGPRESLK
        KFK+P   +YDG   P+ HL+AY  W D +   EA RCR F+ TL G AR WF ++ R SI SFKELA AFVTQF+G R   KP   LLT+KQ   ESLK
Subjt:  KFKVPTFQQYDGKKGPVQHLNAYLTWMDFHGANEATRCRAFTLTLIGLARQWFGKIPRRSIGSFKELARAFVTQFLGARNHRKPQINLLTVKQGPRESLK

Query:  DYITRFSNEVLQVEGYDDGVALTAVISGLQDERLLTSIGE
        +Y+ RF+ E LQVEG  D VAL A +SG++DERL+ S G+
Subjt:  DYITRFSNEVLQVEGYDDGVALTAVISGLQDERLLTSIGE

A0A6J1E1E7 uncharacterized protein LOC1110255481.2e-4047.34Show/hide
Query:  KNDWTNREPGPSHKNVRRGRREQELSKWLKEEDNPRDSYRRTENE----DIEELIGQMEPPFTDDIMGVEVPQKFKVPTFQQYDGKKGPVQHLNAYLTWM
        KND  NR   P   N  +G          K  D P  S +R   +    D+EEL+GQ + PFT++IM  +VP KFK+PT + +DG   PV HL+AY  WM
Subjt:  KNDWTNREPGPSHKNVRRGRREQELSKWLKEEDNPRDSYRRTENE----DIEELIGQMEPPFTDDIMGVEVPQKFKVPTFQQYDGKKGPVQHLNAYLTWM

Query:  DFHGANEATRCRAFTLTLIGLARQWFGKIPRRSIGSFKELARAFVTQFLGARNHRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVIS
        D +G ++A RCR F+ TL G AR WF ++ R SI SFK LARAF+TQF+G R   +P   LLT+KQ   ESL DY+ RF+ E LQ+EG  D V+L A +S
Subjt:  DFHGANEATRCRAFTLTLIGLARQWFGKIPRRSIGSFKELARAFVTQFLGARNHRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVIS

Query:  GLQDERL
        G++DE L
Subjt:  GLQDERL

SwissProt top hitse value%identityAlignment
A2WSD3 Bidirectional sugar transporter SWEET6b2.1e-1368.42Show/hide
Query:  LPTFYKIIKNKSVEEFKCDLYIATVLNCMFLVCYG-PFVHPDSILVVAINGIGLVTE
        +PTF++I K K VEEFK D Y+AT+LNCM  V YG P VHP+SILVV INGIGLV E
Subjt:  LPTFYKIIKNKSVEEFKCDLYIATVLNCMFLVCYG-PFVHPDSILVVAINGIGLVTE

A2WSD8 Bidirectional sugar transporter SWEET6a6.0e-1366.67Show/hide
Query:  LPTFYKIIKNKSVEEFKCDLYIATVLNCMFLVCYG-PFVHPDSILVVAINGIGLVTE
        +PTF++I K K VEEFK D Y+AT+LNCM  V YG P VHP+SILVV INGIGL+ E
Subjt:  LPTFYKIIKNKSVEEFKCDLYIATVLNCMFLVCYG-PFVHPDSILVVAINGIGLVTE

A2YZ24 Bidirectional sugar transporter SWEET7b1.9e-1467.8Show/hide
Query:  LPTFYKIIKNKSVEEFKCDLYIATVLNCMFLVCYG-PFVHPDSILVVAINGIGLVTELV
        +PTFY+IIKNK V++FK D Y+AT+LNCM  V YG P VHP+SILVV INGIGL+ E V
Subjt:  LPTFYKIIKNKSVEEFKCDLYIATVLNCMFLVCYG-PFVHPDSILVVAINGIGLVTELV

B9G2E6 Putative bidirectional sugar transporter SWEET7d7.1e-1467.8Show/hide
Query:  LPTFYKIIKNKSVEEFKCDLYIATVLNCMFLVCYG-PFVHPDSILVVAINGIGLVTELV
        +PTF++IIKNK V +FK D Y+AT+LNCM  V YG P VHP+SILVV INGIGLV E V
Subjt:  LPTFYKIIKNKSVEEFKCDLYIATVLNCMFLVCYG-PFVHPDSILVVAINGIGLVTELV

Q0J349 Bidirectional sugar transporter SWEET7b1.4e-1469.49Show/hide
Query:  LPTFYKIIKNKSVEEFKCDLYIATVLNCMFLVCYG-PFVHPDSILVVAINGIGLVTELV
        +PTFY+IIKNK V++FK D Y+AT+LNCM  V YG P VHP+SILVV INGIGLV E V
Subjt:  LPTFYKIIKNKSVEEFKCDLYIATVLNCMFLVCYG-PFVHPDSILVVAINGIGLVTELV

Arabidopsis top hitse value%identityAlignment
AT3G28007.1 Nodulin MtN3 family protein4.0e-1261.02Show/hide
Query:  LPTFYKIIKNKSVEEFKCDLYIATVLNCMFLVCYG-PFVHPDSILVVAINGIGLVTELV
        +PTF  I K K VEE+K D Y+ATVLNC   V YG P V PDS+LV+ ING GL  ELV
Subjt:  LPTFYKIIKNKSVEEFKCDLYIATVLNCMFLVCYG-PFVHPDSILVVAINGIGLVTELV

AT4G10850.1 Nodulin MtN3 family protein5.8e-1155.17Show/hide
Query:  PTFYKIIKNKSVEEFKCDLYIATVLNCMFLVCYG-PFVHPDSILVVAINGIGLVTELV
        PTF +I+K KSVEE+    Y+AT++NC+  V YG P VHPDS LV+ ING G++ E+V
Subjt:  PTFYKIIKNKSVEEFKCDLYIATVLNCMFLVCYG-PFVHPDSILVVAINGIGLVTELV

AT5G40260.1 Nodulin MtN3 family protein2.0e-1164.29Show/hide
Query:  TFYKIIKNKSVEEFKCDLYIATVLNCMFLVCYG-PFVHPDSILVVAINGIGLVTEL
        TF++I K KSVEEF    Y+ATV+NCM  V YG P VH DSILV  ING+GLV EL
Subjt:  TFYKIIKNKSVEEFKCDLYIATVLNCMFLVCYG-PFVHPDSILVVAINGIGLVTEL

AT5G40260.2 Nodulin MtN3 family protein2.0e-1164.29Show/hide
Query:  TFYKIIKNKSVEEFKCDLYIATVLNCMFLVCYG-PFVHPDSILVVAINGIGLVTEL
        TF++I K KSVEEF    Y+ATV+NCM  V YG P VH DSILV  ING+GLV EL
Subjt:  TFYKIIKNKSVEEFKCDLYIATVLNCMFLVCYG-PFVHPDSILVVAINGIGLVTEL

AT5G62850.1 Nodulin MtN3 family protein1.8e-1264.41Show/hide
Query:  LPTFYKIIKNKSVEEFKCDLYIATVLNCMFLVCYG-PFVHPDSILVVAINGIGLVTELV
        +PT  KI K KSV EFK D Y+ATVLNCM    YG PFV PDS+LV+ ING GL  ELV
Subjt:  LPTFYKIIKNKSVEEFKCDLYIATVLNCMFLVCYG-PFVHPDSILVVAINGIGLVTELV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTACTTGGGCACTTTCTCCTCCATCTACAGGTATTGGAGTTGTCTGCAGGAACTCGAGAGGAGAAATCATGGGGCTCGTTTGATAACCGGAGGTTGGTAGGACCCGA
TAGTCGGAGGTTGACGGTGGCACTTGGTAGTCAGAGGTCGACAGTGGCGGTCAGTAAGCGGAGGTCGGCGGCGACCGATAGTCAGAGGTTGACGTTAGCGTGGCGATCAA
TCGGCGGACGCGGTCAGCAGAGGTCGACAGCGGCTGATGGTCGGAGGTCGACGGCGGCCGATAGTCGGAGGTCGACGGTAGCGGTCAACCACGATCGGTCGGGTGAGTCT
CCTGTTCGAATTTTGGCATCAACAGTTGGCGCCGTCTGTGGGGAAGGTGTTGACTCGCAAACGCTGTACACTGCTGGTGAAGACTCTAGTGGTCGAATGGAACGTGAGAA
TGTGACAACCGCGGACGGGGTGGGCCCTCAGGTTCAGCTCCAAGCCGAGGAGGTCGAGATTACAACATTGAGAGGAAGGATGAACGAGATGGGGCAAGACCTGGCAGAAA
TCTTGAGCCTATTGAAAGAACCCGAGCACCTAGGGCATACGAATGATCAGCTGCCCAGAGATCCAAGAAAGGGAAAAGAAGCAGTCCACAAAGAGGTCAGCGATGCGGAA
AGTGTTACTAGCCGAATGCAAGATCCAAAAAACGACTGGACCAACAGAGAGCCCGGACCTAGTCACAAAAATGTTCGCAGAGGAAGGCGTGAGCAAGAACTGTCCAAGTG
GTTGAAGGAGGAGGATAACCCTCGTGACTCGTACAGAAGGACAGAAAACGAGGACATAGAGGAACTAATAGGGCAGATGGAGCCACCCTTCACTGACGATATTATGGGGG
TAGAGGTACCACAGAAATTCAAAGTGCCCACCTTCCAGCAGTATGATGGGAAGAAGGGTCCAGTCCAACACCTGAACGCGTACCTTACTTGGATGGACTTTCATGGGGCG
AATGAAGCAACCAGGTGTCGAGCTTTTACGCTAACTTTGATAGGACTAGCAAGACAATGGTTTGGCAAGATACCCCGAAGGTCGATAGGATCATTTAAGGAGCTAGCACG
AGCCTTCGTTACGCAGTTCTTGGGGGCTCGAAATCATCGAAAGCCACAGATCAACCTGCTGACGGTTAAACAAGGACCGAGGGAGAGCTTAAAGGACTATATCACGAGAT
TCAGTAACGAAGTCCTGCAGGTAGAAGGTTACGATGATGGAGTTGCACTAACTGCTGTGATTTCAGGATTGCAGGATGAGAGGCTACTTACCTCGATTGGAGAAAGTCAG
CACAAGGCAAAGATCATTACTCTTAAATCAGGTTTAAGGAGTTCGGCCTATAGTAACCAGAGGGTTGGGCCAACTTGGCTCGACCCATATGTTCGGCCTCATCCCGAGGC
CGAGGCCGACCATTCAGCCCGTGCTTCCGCCTCCATTCGGTCCCTGGTGCCCTCGGTCACCCCGGTTCCGCTCAGTTCAGCCCCGGATCACCTCCGAACTCCTCGAAACC
CTAGAGCAGGAACAGAAATGAGTAAATATGGTGAGCGCTACTGTAGCTCGAAATATCGACTGCATTATCTGCCAACGTTTTATAAAATCATTAAGAACAAGTCTGTGGAG
GAGTTCAAGTGTGATCTTTATATTGCAACTGTTCTCAATTGCATGTTTTTGGTCTGCTATGGGCCCTTTGTTCATCCCGATAGCATTTTGGTCGTCGCCATTAATGGCAT
TGGCCTTGTTACTGAACTTGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTACTTGGGCACTTTCTCCTCCATCTACAGGTATTGGAGTTGTCTGCAGGAACTCGAGAGGAGAAATCATGGGGCTCGTTTGATAACCGGAGGTTGGTAGGACCCGA
TAGTCGGAGGTTGACGGTGGCACTTGGTAGTCAGAGGTCGACAGTGGCGGTCAGTAAGCGGAGGTCGGCGGCGACCGATAGTCAGAGGTTGACGTTAGCGTGGCGATCAA
TCGGCGGACGCGGTCAGCAGAGGTCGACAGCGGCTGATGGTCGGAGGTCGACGGCGGCCGATAGTCGGAGGTCGACGGTAGCGGTCAACCACGATCGGTCGGGTGAGTCT
CCTGTTCGAATTTTGGCATCAACAGTTGGCGCCGTCTGTGGGGAAGGTGTTGACTCGCAAACGCTGTACACTGCTGGTGAAGACTCTAGTGGTCGAATGGAACGTGAGAA
TGTGACAACCGCGGACGGGGTGGGCCCTCAGGTTCAGCTCCAAGCCGAGGAGGTCGAGATTACAACATTGAGAGGAAGGATGAACGAGATGGGGCAAGACCTGGCAGAAA
TCTTGAGCCTATTGAAAGAACCCGAGCACCTAGGGCATACGAATGATCAGCTGCCCAGAGATCCAAGAAAGGGAAAAGAAGCAGTCCACAAAGAGGTCAGCGATGCGGAA
AGTGTTACTAGCCGAATGCAAGATCCAAAAAACGACTGGACCAACAGAGAGCCCGGACCTAGTCACAAAAATGTTCGCAGAGGAAGGCGTGAGCAAGAACTGTCCAAGTG
GTTGAAGGAGGAGGATAACCCTCGTGACTCGTACAGAAGGACAGAAAACGAGGACATAGAGGAACTAATAGGGCAGATGGAGCCACCCTTCACTGACGATATTATGGGGG
TAGAGGTACCACAGAAATTCAAAGTGCCCACCTTCCAGCAGTATGATGGGAAGAAGGGTCCAGTCCAACACCTGAACGCGTACCTTACTTGGATGGACTTTCATGGGGCG
AATGAAGCAACCAGGTGTCGAGCTTTTACGCTAACTTTGATAGGACTAGCAAGACAATGGTTTGGCAAGATACCCCGAAGGTCGATAGGATCATTTAAGGAGCTAGCACG
AGCCTTCGTTACGCAGTTCTTGGGGGCTCGAAATCATCGAAAGCCACAGATCAACCTGCTGACGGTTAAACAAGGACCGAGGGAGAGCTTAAAGGACTATATCACGAGAT
TCAGTAACGAAGTCCTGCAGGTAGAAGGTTACGATGATGGAGTTGCACTAACTGCTGTGATTTCAGGATTGCAGGATGAGAGGCTACTTACCTCGATTGGAGAAAGTCAG
CACAAGGCAAAGATCATTACTCTTAAATCAGGTTTAAGGAGTTCGGCCTATAGTAACCAGAGGGTTGGGCCAACTTGGCTCGACCCATATGTTCGGCCTCATCCCGAGGC
CGAGGCCGACCATTCAGCCCGTGCTTCCGCCTCCATTCGGTCCCTGGTGCCCTCGGTCACCCCGGTTCCGCTCAGTTCAGCCCCGGATCACCTCCGAACTCCTCGAAACC
CTAGAGCAGGAACAGAAATGAGTAAATATGGTGAGCGCTACTGTAGCTCGAAATATCGACTGCATTATCTGCCAACGTTTTATAAAATCATTAAGAACAAGTCTGTGGAG
GAGTTCAAGTGTGATCTTTATATTGCAACTGTTCTCAATTGCATGTTTTTGGTCTGCTATGGGCCCTTTGTTCATCCCGATAGCATTTTGGTCGTCGCCATTAATGGCAT
TGGCCTTGTTACTGAACTTGTTTAG
Protein sequenceShow/hide protein sequence
MLLGHFLLHLQVLELSAGTREEKSWGSFDNRRLVGPDSRRLTVALGSQRSTVAVSKRRSAATDSQRLTLAWRSIGGRGQQRSTAADGRRSTAADSRRSTVAVNHDRSGES
PVRILASTVGAVCGEGVDSQTLYTAGEDSSGRMERENVTTADGVGPQVQLQAEEVEITTLRGRMNEMGQDLAEILSLLKEPEHLGHTNDQLPRDPRKGKEAVHKEVSDAE
SVTSRMQDPKNDWTNREPGPSHKNVRRGRREQELSKWLKEEDNPRDSYRRTENEDIEELIGQMEPPFTDDIMGVEVPQKFKVPTFQQYDGKKGPVQHLNAYLTWMDFHGA
NEATRCRAFTLTLIGLARQWFGKIPRRSIGSFKELARAFVTQFLGARNHRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDERLLTSIGESQ
HKAKIITLKSGLRSSAYSNQRVGPTWLDPYVRPHPEAEADHSARASASIRSLVPSVTPVPLSSAPDHLRTPRNPRAGTEMSKYGERYCSSKYRLHYLPTFYKIIKNKSVE
EFKCDLYIATVLNCMFLVCYGPFVHPDSILVVAINGIGLVTELV