CuGenDBv2

Lag0029461 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0029461
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: BED-type domain-containing protein
Genome location: chr8:39241969..39244665
RNA-Seq Expression: Lag0029461
Synteny: Lag0029461
Gene Ontology terms: GO:0016020 - membrane (cellular component)
                     GO:0005488 - binding (molecular function)
InterPro domains: IPR012337 - Ribonuclease H-like superfamily
                  IPR025558 - Domain of unknown function DUF4283
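
The genome location in this record uses the common chromosome:start..end layout. A minimal Python sketch for splitting such a string and computing the span follows. The function name parse_location is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption, not something this record states.

```python
import re


def parse_location(location):
    """Split a 'chromosome:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr8:39241969..39244665")
# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the span covers the whole gene locus, which may be longer than the CDS if the gene contains introns.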


Homology
GenBank top hits (e value, % identity, alignment)
RVW17887.1 hypothetical protein CK203_093585 [Vitis vinifera] (e value 1.3e-37, identity 43.63%)
Query:  SKELIVENETGGEYELPFSRKMVKSLRKWNMCI-RPISTKGALECRPKKLS---------GMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMRIHS
        +K+ IV +  G E +      ++   ++W + + RP+  +G    + K ++         G+FG +LA  TRK + PA WWA YG S P LQK AM+IHS
Subjt:  SKELIVENETGGEYELPFSRKMVKSLRKWNMCI-RPISTKGALECRPKKLS---------GMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMRIHS

Query:  KKRNRLEQKRLNDLVYIKYNQTLKERFDLKDRLDPISLDHIDESNEWLVGTVEEESDEAEVDHELVFDDEDLTWGDVARASGVREPLKYTRSKGKSPASA
        K+RNRL+ +RLNDLVYIKYN+ LK R++ ++ +DPISL  ID+SNEWL+G +E+E        + VFDD++LTWGDVARA+G  E    TR++ ++ +S 
Subjt:  KKRNRLEQKRLNDLVYIKYNQTLKERFDLKDRLDPISLDHIDESNEWLVGTVEEESDEAEVDHELVFDDEDLTWGDVARASGVREPLKYTRSKGKSPASA

Query:  TPTT
         P T
Subjt:  TPTT

RVX23773.1 hypothetical protein CK203_000707 [Vitis vinifera] (e value 2.2e-37, identity 54.86%)
Query:  GMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMRIHSKKRNRLEQKRLNDLVYIKYNQTLKERFDLKDRLDPISLDHIDESNEWLVGTVEEESDEAE
        G+FG +LA  TRK + PA WWA YG S P LQK AM+IHSK+RNRL+ +RLNDLVYIKYN+ LK R++ ++ +DPISL  ID+SNEWL+G +E+E     
Subjt:  GMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMRIHSKKRNRLEQKRLNDLVYIKYNQTLKERFDLKDRLDPISLDHIDESNEWLVGTVEEESDEAE

Query:  VDHELVFDDEDLTWGDVARASGVREPLKYTRSKGKSPASATPTT
           + VFDD++LTWGDVARA+G  E    TR++ ++ +S  P T
Subjt:  VDHELVFDDEDLTWGDVARASGVREPLKYTRSKGKSPASATPTT

XP_022143395.1 uncharacterized protein LOC111013272 [Momordica charantia] (e value 4.7e-48, identity 62.21%)
Query:  GMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMR--------------------IHSKKRNRLEQKRLNDLVYIKYNQTLKERFDLKDRLDPISLDH
        GMF  + A  TR +KTPA WW+LYG + PIL+KLAMR                    IHSKKRN+LEQKRLNDLVYIKYNQ LKER+ L+D+LDPI+LDH
Subjt:  GMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMR--------------------IHSKKRNRLEQKRLNDLVYIKYNQTLKERFDLKDRLDPISLDH

Query:  IDESNEWLVGTVEEESDEAEVDHELVFDDEDLTWGDVARASGVREPLK----YTRSKGKSPASATPTTSRSR
        IDESNEWL+GT+EEE  + ++++ELVFDD+DLTW DVA ASGVREP++    Y RSKGKSPA+  PTTSR R
Subjt:  IDESNEWLVGTVEEESDEAEVDHELVFDDEDLTWGDVARASGVREPLK----YTRSKGKSPASATPTTSRSR

XP_022149859.1 uncharacterized protein LOC111018186 [Momordica charantia] (e value 3.6e-64, identity 49.81%)
Query:  DSKNVKQEGRFLRSDQSSEFEREEKCGAGLAEVRGINWNETIVITRRDFHEDWGRILNAIPDQTQQSYNINPFLPDKALLKCPSGDLARLLVTNKGWVNF
        + +++++E R L  DQ S   +    GAG  EVR +NW ETIVITRRDFH+DW RIL+ + +QT+ SY INPF  DKAL+KCPS DLA LL+TNKGWV F
Subjt:  DSKNVKQEGRFLRSDQSSEFEREEKCGAGLAEVRGINWNETIVITRRDFHEDWGRILNAIPDQTQQSYNINPFLPDKALLKCPSGDLARLLVTNKGWVNF

Query:  GPLTVKVKTWNPHIHGRASVTRSYGGWIRFRNIPLHLWSLATFKAIGDIYGGFLDYAQANSNLIECMEVAIKV---------------------------
        GP+TVK++ WNP +HGRA +  SYG W++ RNIPLHLWSLATFKAIG+  GGF+DY   NS  IEC +VAIKV                           
Subjt:  GPLTVKVKTWNPHIHGRASVTRSYGGWIRFRNIPLHLWSLATFKAIGDIYGGFLDYAQANSNLIECMEVAIKV---------------------------

Query:  ---------REVGKHGGFLPKAARSFFRSDQGLSPNPVDIWRVQDGVFSPSINIMYPCF
                 ++VG HGGF  +AARSF +     S N +D WR+++G   P +NI YP F
Subjt:  ---------REVGKHGGFLPKAARSFFRSDQGLSPNPVDIWRVQDGVFSPSINIMYPCF

XP_022157603.1 uncharacterized protein LOC111024254 [Momordica charantia] (e value 1.3e-42, identity 64.52%)
Query:  ASWWALYGGSTPILQKLAMR--------------------IHSKKRNRLEQKRLNDLVYIKYNQTLKERFDLKDRLDPISLDHIDESNEWLVGTVEEESD
        A+WW+LYG + PIL+KLAMR                    IHSKKRN LEQKRLNDLVYIKYNQ LKER+ L+D+LDPI+LDHIDESNEWL+GT+EEE  
Subjt:  ASWWALYGGSTPILQKLAMR--------------------IHSKKRNRLEQKRLNDLVYIKYNQTLKERFDLKDRLDPISLDHIDESNEWLVGTVEEESD

Query:  EAEVDHELVFDDEDLTWGDVARASGVREPLK----YTRSKGKSPASATPTTSRSR
        + E  +ELVFDD+DLTWGDVA ASGVREP++    Y RSKGKSPA+  PTTSR R
Subjt:  EAEVDHELVFDDEDLTWGDVARASGVREPLK----YTRSKGKSPASATPTTSRSR

TrEMBL top hits (e value, % identity, alignment)
A0A438C3Q9 Uncharacterized protein (e value 6.2e-38, identity 43.63%)
Query:  SKELIVENETGGEYELPFSRKMVKSLRKWNMCI-RPISTKGALECRPKKLS---------GMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMRIHS
        +K+ IV +  G E +      ++   ++W + + RP+  +G    + K ++         G+FG +LA  TRK + PA WWA YG S P LQK AM+IHS
Subjt:  SKELIVENETGGEYELPFSRKMVKSLRKWNMCI-RPISTKGALECRPKKLS---------GMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMRIHS

Query:  KKRNRLEQKRLNDLVYIKYNQTLKERFDLKDRLDPISLDHIDESNEWLVGTVEEESDEAEVDHELVFDDEDLTWGDVARASGVREPLKYTRSKGKSPASA
        K+RNRL+ +RLNDLVYIKYN+ LK R++ ++ +DPISL  ID+SNEWL+G +E+E        + VFDD++LTWGDVARA+G  E    TR++ ++ +S 
Subjt:  KKRNRLEQKRLNDLVYIKYNQTLKERFDLKDRLDPISLDHIDESNEWLVGTVEEESDEAEVDHELVFDDEDLTWGDVARASGVREPLKYTRSKGKSPASA

Query:  TPTT
         P T
Subjt:  TPTT

A0A6J1CNP5 uncharacterized protein LOC111013272 (e value 2.3e-48, identity 62.21%)
Query:  GMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMR--------------------IHSKKRNRLEQKRLNDLVYIKYNQTLKERFDLKDRLDPISLDH
        GMF  + A  TR +KTPA WW+LYG + PIL+KLAMR                    IHSKKRN+LEQKRLNDLVYIKYNQ LKER+ L+D+LDPI+LDH
Subjt:  GMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMR--------------------IHSKKRNRLEQKRLNDLVYIKYNQTLKERFDLKDRLDPISLDH

Query:  IDESNEWLVGTVEEESDEAEVDHELVFDDEDLTWGDVARASGVREPLK----YTRSKGKSPASATPTTSRSR
        IDESNEWL+GT+EEE  + ++++ELVFDD+DLTW DVA ASGVREP++    Y RSKGKSPA+  PTTSR R
Subjt:  IDESNEWLVGTVEEESDEAEVDHELVFDDEDLTWGDVARASGVREPLK----YTRSKGKSPASATPTTSRSR

A0A6J1D6X4 uncharacterized protein LOC111018186 (e value 1.7e-64, identity 49.81%)
Query:  DSKNVKQEGRFLRSDQSSEFEREEKCGAGLAEVRGINWNETIVITRRDFHEDWGRILNAIPDQTQQSYNINPFLPDKALLKCPSGDLARLLVTNKGWVNF
        + +++++E R L  DQ S   +    GAG  EVR +NW ETIVITRRDFH+DW RIL+ + +QT+ SY INPF  DKAL+KCPS DLA LL+TNKGWV F
Subjt:  DSKNVKQEGRFLRSDQSSEFEREEKCGAGLAEVRGINWNETIVITRRDFHEDWGRILNAIPDQTQQSYNINPFLPDKALLKCPSGDLARLLVTNKGWVNF

Query:  GPLTVKVKTWNPHIHGRASVTRSYGGWIRFRNIPLHLWSLATFKAIGDIYGGFLDYAQANSNLIECMEVAIKV---------------------------
        GP+TVK++ WNP +HGRA +  SYG W++ RNIPLHLWSLATFKAIG+  GGF+DY   NS  IEC +VAIKV                           
Subjt:  GPLTVKVKTWNPHIHGRASVTRSYGGWIRFRNIPLHLWSLATFKAIGDIYGGFLDYAQANSNLIECMEVAIKV---------------------------

Query:  ---------REVGKHGGFLPKAARSFFRSDQGLSPNPVDIWRVQDGVFSPSINIMYPCF
                 ++VG HGGF  +AARSF +     S N +D WR+++G   P +NI YP F
Subjt:  ---------REVGKHGGFLPKAARSFFRSDQGLSPNPVDIWRVQDGVFSPSINIMYPCF

A0A6J1DTS8 uncharacterized protein LOC111024254 (e value 6.4e-43, identity 64.52%)
Query:  ASWWALYGGSTPILQKLAMR--------------------IHSKKRNRLEQKRLNDLVYIKYNQTLKERFDLKDRLDPISLDHIDESNEWLVGTVEEESD
        A+WW+LYG + PIL+KLAMR                    IHSKKRN LEQKRLNDLVYIKYNQ LKER+ L+D+LDPI+LDHIDESNEWL+GT+EEE  
Subjt:  ASWWALYGGSTPILQKLAMR--------------------IHSKKRNRLEQKRLNDLVYIKYNQTLKERFDLKDRLDPISLDHIDESNEWLVGTVEEESD

Query:  EAEVDHELVFDDEDLTWGDVARASGVREPLK----YTRSKGKSPASATPTTSRSR
        + E  +ELVFDD+DLTWGDVA ASGVREP++    Y RSKGKSPA+  PTTSR R
Subjt:  EAEVDHELVFDDEDLTWGDVARASGVREPLK----YTRSKGKSPASATPTTSRSR

A5B625 DUF659 domain-containing protein (e value 1.1e-37, identity 54.86%)
Query:  GMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMRIHSKKRNRLEQKRLNDLVYIKYNQTLKERFDLKDRLDPISLDHIDESNEWLVGTVEEESDEAE
        G+FG +LA  TRK + PA WWA YG S P LQK AM+IHSK+RNRL+ +RLNDLVYIKYN+ LK R++ ++ +DPISL  ID+SNEWL+G +E+E     
Subjt:  GMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMRIHSKKRNRLEQKRLNDLVYIKYNQTLKERFDLKDRLDPISLDHIDESNEWLVGTVEEESDEAE

Query:  VDHELVFDDEDLTWGDVARASGVREPLKYTRSKGKSPASATPTT
           + VFDD++LTWGDVARA+G  E    TR++ ++ +S  P T
Subjt:  VDHELVFDDEDLTWGDVARASGVREPLKYTRSKGKSPASATPTT

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G79740.1 hAT transposon superfamily (e value 2.4e-10, identity 32.43%)
Query:  KLSGMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMRI--------------------HSKKRNRLEQKRLNDLVYIKYNQTLKERFDLKDRLDPIS
        +  GMFG +LA   R   +P  WW  +G S P+LQ++A+RI                    H ++RN+++++ LN L Y+  N  L     L+   DPI+
Subjt:  KLSGMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMRI--------------------HSKKRNRLEQKRLNDLVYIKYNQTLKERFDLKDRLDPIS

Query:  LDHIDESNEWL
        L+ ID  +EW+
Subjt:  LDHIDESNEWL

AT3G22220.1 hAT transposon superfamily (e value 1.3e-08, identity 30.43%)
Query:  KKLSGMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMRIHS--------------------KKRNRLEQKRLNDLVYIKYNQTLK---ERFDLKDRL
        K   G+FG +LA   R    PA WW+ YG S   L + A+RI S                    + +N +E++RLNDLV+++YN  L+         D +
Subjt:  KKLSGMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMRIHS--------------------KKRNRLEQKRLNDLVYIKYNQTLK---ERFDLKDRL

Query:  DPISLDHIDESNEWL
        DP+S  +++   +W+
Subjt:  DPISLDHIDESNEWL

AT4G15020.1 hAT transposon superfamily (e value 4.9e-11, identity 35.34%)
Query:  KKLSGMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMRIHS--------------------KKRNRLEQKRLNDLVYIKYNQTLKE--RFDLKDRLD
        K   G+FG +LA   R    PA WW+ YG S   L + A+RI S                    + +N +EQKRL+DLV+++YN  L++       D LD
Subjt:  KKLSGMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMRIHS--------------------KKRNRLEQKRLNDLVYIKYNQTLKE--RFDLKDRLD

Query:  PISLDHIDESNEWLVG
        P+S + ID   EW+ G
Subjt:  PISLDHIDESNEWLVG

AT4G15020.2 hAT transposon superfamily (e value 4.9e-11, identity 35.34%)
Query:  KKLSGMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMRIHS--------------------KKRNRLEQKRLNDLVYIKYNQTLKE--RFDLKDRLD
        K   G+FG +LA   R    PA WW+ YG S   L + A+RI S                    + +N +EQKRL+DLV+++YN  L++       D LD
Subjt:  KKLSGMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMRIHS--------------------KKRNRLEQKRLNDLVYIKYNQTLKE--RFDLKDRLD

Query:  PISLDHIDESNEWLVG
        P+S + ID   EW+ G
Subjt:  PISLDHIDESNEWLVG

AT5G33406.1 hAT dimerisation domain-containing protein / transposase-related (e value 2.7e-30, identity 40.56%)
Query:  KKLSGMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMRI--------------------HSKKRNRLEQKRLNDLVYIKYNQTLKERFDLKDRLDPI
        KK +G+FG+ +A   R   +PA WW+ YG STP LQ  A+++                    H+K+RNRL Q RLND++++KYN+ L+ R+   D  DPI
Subjt:  KKLSGMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMRI--------------------HSKKRNRLEQKRLNDLVYIKYNQTLKERFDLKDRLDPI

Query:  SLDHIDESNEWLVGTVEEESDEAEVDHELVFDDEDLTWGDVARASGVREPLKYTRS--------KGKSPASATPTTSRSR
         L+ ID+ NEWL G +EE S + E D +LVF+++DLTW +V  A+G  +P   TRS        KGK  AS +     SR
Subjt:  SLDHIDESNEWLVGTVEEESDEAEVDHELVFDDEDLTWGDVARASGVREPLKYTRS--------KGKSPASATPTTSRSR
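
In the alignments above, the middle line of each block repeats a residue where query and subject are identical, shows "+" for a conservative substitution, and is blank otherwise. A minimal Python sketch of how a percent identity could be computed from one such segment follows. The function name midline_identity is hypothetical, and the value for a single segment need not equal the %identity listed for the whole hit.

```python
def midline_identity(query, midline):
    """Percent identity for one alignment segment, from its query and middle lines."""
    if not query:
        raise ValueError("empty query line")
    if len(midline) > len(query):
        raise ValueError("middle line is longer than the query line")
    # Trailing blanks of the middle line are often trimmed; pad them back.
    midline = midline.ljust(len(query))
    # A column is identical when the middle line repeats the query residue.
    identical = sum(1 for q, m in zip(query, midline) if q != "-" and m == q)
    return 100.0 * identical / len(query)


# Short made-up segment in the same style as the alignments above.
print(midline_identity("ASWWALYG", "A+WW+LYG"))
```

This sketch divides by the number of alignment columns, gaps included. That is one common convention, assumed here rather than taken from this page.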


Sequences
CDS sequence
ATGTCAAAATCTTCGAGAAACATGGGCGCCGGTCCTTTCATCTGTCCGTTGAAGAAATCATCGCAGTTTGGATTGCAGACAACATTGAGGACCTTCTTTGTCTCCAAAGA
CTCAAAAATTCTTCAGGAAAACAGATTGCAATGGAGGCTTTATATGGATTCAAAAAACGTCAAACAAGAAGGGAGATTTCTTAGAAGTGACCAAAGTTCAGAGTTCGAGA
GGGAAGAGAAATGTGGAGCGGGTTTGGCCGAAGTTAGAGGGATAAATTGGAACGAGACCATAGTGATCACTAGAAGAGATTTTCATGAAGACTGGGGAAGGATCCTAAAT
GCGATACCGGACCAAACACAACAAAGCTATAATATTAATCCTTTCCTCCCAGATAAAGCTCTCCTAAAATGCCCTTCGGGAGATTTGGCCAGGTTACTTGTGACTAACAA
GGGATGGGTTAACTTTGGACCTTTGACAGTGAAAGTCAAAACGTGGAACCCGCACATTCATGGGAGAGCCTCGGTCACTCGTTCATATGGGGGTTGGATCAGATTTAGAA
ACATTCCCCTTCATTTATGGAGCTTGGCCACATTTAAAGCCATTGGGGATATTTATGGAGGTTTCTTGGATTATGCTCAAGCCAACTCAAACCTCATTGAATGTATGGAA
GTGGCCATTAAAGTCAGAGAGGTCGGAAAACATGGGGGATTCTTGCCGAAAGCTGCTCGGAGTTTTTTCAGGTCTGATCAGGGCCTCAGCCCAAACCCAGTGGATATTTG
GCGCGTGCAAGATGGAGTCTTTAGTCCATCGATTAATATTATGTACCCCTGTTTCCCGATTATCGACAGAGAGCTAGGCGCAAGGGAAAATTTTCAAATTCAGAATAATA
AAGGACAAAATTGTCAGCAACAAATTAAGCAAGACCGGTCTCTAGGCCCCACAAAGCTGACAACGTCTCAGTTTGTTCATCTGCCCGAAGAAGGCCCATCCTCGAACCCT
GAATTAAAAAGTAAAAACAAGAGGAAGAAAGGGAAGTCGGTTGCTTTTGACAACGGAACCAAAACGAACGTGGCCATGAAAAAGGAACAGATCTTTTCCTCTGAATACCA
AAAAACTTCTAGAGGGCGGAAAGGAAAAGATTTAGAAGTTGGGTCAGACTTCTCCTTAACGAGTATCAGCAGCCTCGAAGAAGGTCCGATTGAAGAAGCGATTGAATCTT
CGGTGGCTAGAGAGGACCAGGACCCCCCCCCTCAGATTGTTTTTGTTTGTTTTGGGGAAGAAGAACCTCATGATAATGTTGAGGTTGACGATAAGGGTAGTAAAGAGCTT
ATAGTTGAGAACGAGACAGGTGGTGAGTACGAGCTTCCTTTCTCAAGGAAGATGGTGAAGTCCCTAAGAAAGTGGAATATGTGTATTAGACCCATATCCACTAAAGGGGC
ATTGGAGTGCCGCCCAAAAAAGCTCTCAGGCATGTTTGGTATGGATTTGGCAACCAACACTAGGAAGATTAAGACACCAGCATCGTGGTGGGCCCTTTATGGAGGTAGTA
CTCCAATCTTACAAAAGCTAGCCATGAGAATTCATTCGAAGAAGAGAAACAGATTAGAGCAAAAGCGTTTAAATGATTTGGTGTATATAAAATATAACCAAACACTTAAA
GAACGCTTTGATCTGAAGGATCGGCTTGATCCAATTTCTCTAGATCATATTGATGAAAGCAATGAGTGGCTGGTCGGAACAGTCGAAGAGGAAAGTGATGAAGCTGAAGT
AGATCATGAGCTAGTTTTTGATGATGAAGACCTCACATGGGGAGACGTAGCTCGTGCTAGTGGTGTTAGAGAACCCTTAAAGTACACAAGATCTAAAGGAAAGTCACCGG
CAAGTGCAACTCCCACAACTTCTAGAAGTCGACCACCACTATACAGGTGGTTAGTGATAACGATGACGACGATGAAGAAGAGTTGGATATAG
mRNA sequence
ATGTCAAAATCTTCGAGAAACATGGGCGCCGGTCCTTTCATCTGTCCGTTGAAGAAATCATCGCAGTTTGGATTGCAGACAACATTGAGGACCTTCTTTGTCTCCAAAGA
CTCAAAAATTCTTCAGGAAAACAGATTGCAATGGAGGCTTTATATGGATTCAAAAAACGTCAAACAAGAAGGGAGATTTCTTAGAAGTGACCAAAGTTCAGAGTTCGAGA
GGGAAGAGAAATGTGGAGCGGGTTTGGCCGAAGTTAGAGGGATAAATTGGAACGAGACCATAGTGATCACTAGAAGAGATTTTCATGAAGACTGGGGAAGGATCCTAAAT
GCGATACCGGACCAAACACAACAAAGCTATAATATTAATCCTTTCCTCCCAGATAAAGCTCTCCTAAAATGCCCTTCGGGAGATTTGGCCAGGTTACTTGTGACTAACAA
GGGATGGGTTAACTTTGGACCTTTGACAGTGAAAGTCAAAACGTGGAACCCGCACATTCATGGGAGAGCCTCGGTCACTCGTTCATATGGGGGTTGGATCAGATTTAGAA
ACATTCCCCTTCATTTATGGAGCTTGGCCACATTTAAAGCCATTGGGGATATTTATGGAGGTTTCTTGGATTATGCTCAAGCCAACTCAAACCTCATTGAATGTATGGAA
GTGGCCATTAAAGTCAGAGAGGTCGGAAAACATGGGGGATTCTTGCCGAAAGCTGCTCGGAGTTTTTTCAGGTCTGATCAGGGCCTCAGCCCAAACCCAGTGGATATTTG
GCGCGTGCAAGATGGAGTCTTTAGTCCATCGATTAATATTATGTACCCCTGTTTCCCGATTATCGACAGAGAGCTAGGCGCAAGGGAAAATTTTCAAATTCAGAATAATA
AAGGACAAAATTGTCAGCAACAAATTAAGCAAGACCGGTCTCTAGGCCCCACAAAGCTGACAACGTCTCAGTTTGTTCATCTGCCCGAAGAAGGCCCATCCTCGAACCCT
GAATTAAAAAGTAAAAACAAGAGGAAGAAAGGGAAGTCGGTTGCTTTTGACAACGGAACCAAAACGAACGTGGCCATGAAAAAGGAACAGATCTTTTCCTCTGAATACCA
AAAAACTTCTAGAGGGCGGAAAGGAAAAGATTTAGAAGTTGGGTCAGACTTCTCCTTAACGAGTATCAGCAGCCTCGAAGAAGGTCCGATTGAAGAAGCGATTGAATCTT
CGGTGGCTAGAGAGGACCAGGACCCCCCCCCTCAGATTGTTTTTGTTTGTTTTGGGGAAGAAGAACCTCATGATAATGTTGAGGTTGACGATAAGGGTAGTAAAGAGCTT
ATAGTTGAGAACGAGACAGGTGGTGAGTACGAGCTTCCTTTCTCAAGGAAGATGGTGAAGTCCCTAAGAAAGTGGAATATGTGTATTAGACCCATATCCACTAAAGGGGC
ATTGGAGTGCCGCCCAAAAAAGCTCTCAGGCATGTTTGGTATGGATTTGGCAACCAACACTAGGAAGATTAAGACACCAGCATCGTGGTGGGCCCTTTATGGAGGTAGTA
CTCCAATCTTACAAAAGCTAGCCATGAGAATTCATTCGAAGAAGAGAAACAGATTAGAGCAAAAGCGTTTAAATGATTTGGTGTATATAAAATATAACCAAACACTTAAA
GAACGCTTTGATCTGAAGGATCGGCTTGATCCAATTTCTCTAGATCATATTGATGAAAGCAATGAGTGGCTGGTCGGAACAGTCGAAGAGGAAAGTGATGAAGCTGAAGT
AGATCATGAGCTAGTTTTTGATGATGAAGACCTCACATGGGGAGACGTAGCTCGTGCTAGTGGTGTTAGAGAACCCTTAAAGTACACAAGATCTAAAGGAAAGTCACCGG
CAAGTGCAACTCCCACAACTTCTAGAAGTCGACCACCACTATACAGGTGGTTAGTGATAACGATGACGACGATGAAGAAGAGTTGGATATAG
Protein sequence
MSKSSRNMGAGPFICPLKKSSQFGLQTTLRTFFVSKDSKILQENRLQWRLYMDSKNVKQEGRFLRSDQSSEFEREEKCGAGLAEVRGINWNETIVITRRDFHEDWGRILN
AIPDQTQQSYNINPFLPDKALLKCPSGDLARLLVTNKGWVNFGPLTVKVKTWNPHIHGRASVTRSYGGWIRFRNIPLHLWSLATFKAIGDIYGGFLDYAQANSNLIECME
VAIKVREVGKHGGFLPKAARSFFRSDQGLSPNPVDIWRVQDGVFSPSINIMYPCFPIIDRELGARENFQIQNNKGQNCQQQIKQDRSLGPTKLTTSQFVHLPEEGPSSNP
ELKSKNKRKKGKSVAFDNGTKTNVAMKKEQIFSSEYQKTSRGRKGKDLEVGSDFSLTSISSLEEGPIEEAIESSVAREDQDPPPQIVFVCFGEEEPHDNVEVDDKGSKEL
IVENETGGEYELPFSRKMVKSLRKWNMCIRPISTKGALECRPKKLSGMFGMDLATNTRKIKTPASWWALYGGSTPILQKLAMRIHSKKRNRLEQKRLNDLVYIKYNQTLK
ERFDLKDRLDPISLDHIDESNEWLVGTVEEESDEAEVDHELVFDDEDLTWGDVARASGVREPLKYTRSKGKSPASATPTTSRSRPPLYRWLVITMTTMKKSWI
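
The protein sequence corresponds to a translation of the CDS sequence. A minimal Python sketch for checking this follows, assuming the standard genetic code. The names CODON_TABLE and translate are hypothetical helpers, not part of any database tool.

```python
BASES = "TCAG"
# Standard genetic code, with codons ordered T, C, A, G at each position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed above.
print(translate("ATGTCAAAATCTTCGAGAAACATGGGCGCC"))
```

Passing the full CDS (line breaks are ignored) and comparing the result with the listed protein sequence gives a quick consistency check on the record.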