; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001500 (gene) of Snake gourd v1 genome

Gene IDTan0001500
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF1218)
Genome locationLG10:9598680..9601808
RNA-Seq ExpressionTan0001500
SyntenyTan0001500
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134498.1 protein MODIFYING WALL LIGNIN-1 [Cucumis sativus]2.1e-11684.42Show/hide
Query:  MGRR-KNMAVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWMNTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQ
        MGRR KNMAVTH+DL PS KSSELGSKMGTFL++LTILCGLCCFILCLIAE+TRSQVIWM  + NNK   +RCSYS SGKTPLLCTASAFLGMAVMMVVQ
Subjt:  MGRR-KNMAVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWMNTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQ

Query:  HLYVLIAVSKSPPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS----------KGLFSAAGVFELATVFLAAGLYMT
        HLYVLIAVSKS PPAL+AWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS          +GLFSAAGVF+LATVFLAAGLYMT
Subjt:  HLYVLIAVSKSPPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS----------KGLFSAAGVFELATVFLAAGLYMT

Query:  AVRAQRLFEEQENVRREVLESYHVHGSPPR---SPPLQPMPPIAREDPVIRHS-HHQE-APFLLLLPSTAAFCKLS
        AVRAQR+FE+QENVRREVLESYH+H SPPR   SPPLQPMPPIAREDPVIRHS HHQE APF  LL STA FCKLS
Subjt:  AVRAQRLFEEQENVRREVLESYHVHGSPPR---SPPLQPMPPIAREDPVIRHS-HHQE-APFLLLLPSTAAFCKLS

XP_022934645.1 uncharacterized protein LOC111441776 [Cucurbita moschata]2.4e-12084.25Show/hide
Query:  MGRRKNMAVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWM-NTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQ
        MGR K MAVTHEDL PSR+SSELGSKMGTFL++LT+LCGLCCFILCL+AESTRSQVIW    ENNNK G KRC YS SGKTPL+CTASAFLGMAVMMVVQ
Subjt:  MGRRKNMAVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWM-NTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQ

Query:  HLYVLIAVSKSPPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNW----------SKGLFSAAGVFELATVFLAAGLYMT
        HLYVLIAVSKSPPPAL+AWDPSFATSKSLTFQAAFFFVSTWISF+VGEILLLIGLSVESGHL +W           +GLFSAAGVFELATVFLAAGLYMT
Subjt:  HLYVLIAVSKSPPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNW----------SKGLFSAAGVFELATVFLAAGLYMT

Query:  AVRAQRLFEEQENVRREVLESYHVHGSPPRSPPLQPMPPIAREDPVIRHSHHQEAPFLLLLPSTAAFCK-LSR
        AVRAQRLFE+Q NVRREVLESYH+H SPPRSPP+QPMPPIAREDPVIRHSHH E+PFL LLPS+AAFCK LSR
Subjt:  AVRAQRLFEEQENVRREVLESYHVHGSPPRSPPLQPMPPIAREDPVIRHSHHQEAPFLLLLPSTAAFCK-LSR

XP_022972325.1 uncharacterized protein LOC111470898 [Cucurbita maxima]4.4e-12285.35Show/hide
Query:  MGRRKNMAVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWM-NTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQ
        MGRRK MAVTHEDL PSR+SSELGSKMGTFL++LT+LCGLCCFILCL+AESTRSQVIWM   ENNNK G KRC YS SGKTPL+CTASAFLGMAVMMVVQ
Subjt:  MGRRKNMAVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWM-NTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQ

Query:  HLYVLIAVSKSPPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNW----------SKGLFSAAGVFELATVFLAAGLYMT
        HLYVLIAVSKSPPPAL+AWDPSFATSKSLTFQAAFFFVSTWISF+VGEILLLIGLSVESGHL +W           +GLFSAAGVFELATVFLAAGLYMT
Subjt:  HLYVLIAVSKSPPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNW----------SKGLFSAAGVFELATVFLAAGLYMT

Query:  AVRAQRLFEEQENVRREVLESYHVHGSPPRSPPLQPMPPIAREDPVIRHSHHQEAPFLLLLPSTAAFCK-LSR
        AVRAQRLFE+Q NVRREVLESYH+H SPPRSPPLQPMPPIAREDPVIRHSHH E+PFL LLPS+AAFCK LSR
Subjt:  AVRAQRLFEEQENVRREVLESYHVHGSPPRSPPLQPMPPIAREDPVIRHSHHQEAPFLLLLPSTAAFCK-LSR

XP_023540160.1 uncharacterized protein LOC111800614 [Cucurbita pepo subsp. pepo]5.3e-12083.88Show/hide
Query:  MGRRKNMAVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWM-NTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQ
        MGR K MAVTHEDL PSR+SSELGSKMGTFL++LT+LCGLCCFILCL+AESTRSQVIW    ENNNK G KRC YS SG+TPL+CTASAFLGMAVMMVVQ
Subjt:  MGRRKNMAVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWM-NTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQ

Query:  HLYVLIAVSKSPPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNW----------SKGLFSAAGVFELATVFLAAGLYMT
        HLYVLIAVSKSPPPAL+AWDPSFATSKSLTFQAAFFFVSTWISF+VGEILLLIGLSVESGHL +W           +GLFSAAGVFELATVFLAAGLYMT
Subjt:  HLYVLIAVSKSPPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNW----------SKGLFSAAGVFELATVFLAAGLYMT

Query:  AVRAQRLFEEQENVRREVLESYHVHGSPPRSPPLQPMPPIAREDPVIRHSHHQEAPFLLLLPSTAAFCK-LSR
        AVRAQRLFE+Q NVRREVLESYH+H SPPRSPP+QPMPPIAREDPVIRHSHH E+PFL LLPS+AAFCK LSR
Subjt:  AVRAQRLFEEQENVRREVLESYHVHGSPPRSPPLQPMPPIAREDPVIRHSHHQEAPFLLLLPSTAAFCK-LSR

XP_038903084.1 uncharacterized protein LOC120089763 [Benincasa hispida]6.7e-12386.72Show/hide
Query:  RRKNMAVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWMNTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQHLY
        RRKNMAVTH+DL PS +SSELGSKMGTFLM+LT++CGLCCFILCLIAESTRSQVIWM  + NNKNG KRCSYS SGKTPLLCTASAFLGMAVMMVVQHLY
Subjt:  RRKNMAVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWMNTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQHLY

Query:  VLIAVSKSPPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS----------KGLFSAAGVFELATVFLAAGLYMTAVR
        VLIAVSKSPPPAL+AWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS          +GLFSAAGVF+LATVFLAAGLYMTAVR
Subjt:  VLIAVSKSPPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS----------KGLFSAAGVFELATVFLAAGLYMTAVR

Query:  AQRLFEEQENVRREVLESYHVHGSPPR--SPPLQPMPPIAREDPVIRHS-HHQEAPFLLLLPSTAAFCKLS
        AQR+FEEQENVRREVLESYH+H SPPR  SPPLQPMPPIAREDPVIRHS HHQEAPF  LL STA FCKLS
Subjt:  AQRLFEEQENVRREVLESYHVHGSPPR--SPPLQPMPPIAREDPVIRHS-HHQEAPFLLLLPSTAAFCKLS

TrEMBL top hitse value%identityAlignment
A0A0A0L805 Uncharacterized protein1.2e-11484.39Show/hide
Query:  MAVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWMNTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQHLYVLIA
        MAVTH+DL PS KSSELGSKMGTFL++LTILCGLCCFILCLIAE+TRSQVIWM  + NNK   +RCSYS SGKTPLLCTASAFLGMAVMMVVQHLYVLIA
Subjt:  MAVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWMNTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQHLYVLIA

Query:  VSKSPPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS----------KGLFSAAGVFELATVFLAAGLYMTAVRAQRL
        VSKS PPAL+AWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS          +GLFSAAGVF+LATVFLAAGLYMTAVRAQR+
Subjt:  VSKSPPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS----------KGLFSAAGVFELATVFLAAGLYMTAVRAQRL

Query:  FEEQENVRREVLESYHVHGSPPR---SPPLQPMPPIAREDPVIRHS-HHQE-APFLLLLPSTAAFCKLS
        FE+QENVRREVLESYH+H SPPR   SPPLQPMPPIAREDPVIRHS HHQE APF  LL STA FCKLS
Subjt:  FEEQENVRREVLESYHVHGSPPR---SPPLQPMPPIAREDPVIRHS-HHQE-APFLLLLPSTAAFCKLS

A0A1S3AY82 uncharacterized protein LOC1034838795.6e-11582.61Show/hide
Query:  RRKNM-AVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWMNTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQHL
        RRKNM AVTH+DL PS KSSELGSK+GTFL++LTILCGLCCFILCLIAESTRSQ IWM  + NNK   +RCSYS SGKTPLLCTASAFLGMAVMMVVQHL
Subjt:  RRKNM-AVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWMNTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQHL

Query:  YVLIAVSKSPPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS----------KGLFSAAGVFELATVFLAAGLYMTAV
        YVLIAVSKS PPAL+AWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS          +GLFSAAGVF+LATVFLAAGLYMTAV
Subjt:  YVLIAVSKSPPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS----------KGLFSAAGVFELATVFLAAGLYMTAV

Query:  RAQRLFEEQENVRREVLESYHVHGSPPR-----SPPLQPMPPIAREDPVIRHSHHQE--APFLLLLPSTAAFCKLS
        RAQR+FE+QENVRREVLESYH+H SPPR     SPPLQPMPPIAREDPVIRHSHH +  APF  LL STA FCKLS
Subjt:  RAQRLFEEQENVRREVLESYHVHGSPPR-----SPPLQPMPPIAREDPVIRHSHHQE--APFLLLLPSTAAFCKLS

A0A5A7U7S1 Uncharacterized protein2.8e-11482.96Show/hide
Query:  AVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWMNTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQHLYVLIAV
        AVTH+DL PS KSSELGSK+GTFL++LTILCGLCCFILCLIAESTRSQ IWM  + NNK   +RCSYS SGKTPLLCTASAFLGMAVMMVVQHLYVLIAV
Subjt:  AVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWMNTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQHLYVLIAV

Query:  SKSPPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS----------KGLFSAAGVFELATVFLAAGLYMTAVRAQRLF
        SKS PPAL+AWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS          +GLFSAAGVF+LATVFLAAGLYMTAVRAQR+F
Subjt:  SKSPPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS----------KGLFSAAGVFELATVFLAAGLYMTAVRAQRLF

Query:  EEQENVRREVLESYHVHGSPPR-----SPPLQPMPPIAREDPVIRHSHHQE--APFLLLLPSTAAFCKLS
        E+QENVRREVLESYH+H SPPR     SPPLQPMPPIAREDPVIRHSHHQ+  APF  LL STA FCKLS
Subjt:  EEQENVRREVLESYHVHGSPPR-----SPPLQPMPPIAREDPVIRHSHHQE--APFLLLLPSTAAFCKLS

A0A6J1F8A9 uncharacterized protein LOC1114417761.2e-12084.25Show/hide
Query:  MGRRKNMAVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWM-NTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQ
        MGR K MAVTHEDL PSR+SSELGSKMGTFL++LT+LCGLCCFILCL+AESTRSQVIW    ENNNK G KRC YS SGKTPL+CTASAFLGMAVMMVVQ
Subjt:  MGRRKNMAVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWM-NTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQ

Query:  HLYVLIAVSKSPPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNW----------SKGLFSAAGVFELATVFLAAGLYMT
        HLYVLIAVSKSPPPAL+AWDPSFATSKSLTFQAAFFFVSTWISF+VGEILLLIGLSVESGHL +W           +GLFSAAGVFELATVFLAAGLYMT
Subjt:  HLYVLIAVSKSPPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNW----------SKGLFSAAGVFELATVFLAAGLYMT

Query:  AVRAQRLFEEQENVRREVLESYHVHGSPPRSPPLQPMPPIAREDPVIRHSHHQEAPFLLLLPSTAAFCK-LSR
        AVRAQRLFE+Q NVRREVLESYH+H SPPRSPP+QPMPPIAREDPVIRHSHH E+PFL LLPS+AAFCK LSR
Subjt:  AVRAQRLFEEQENVRREVLESYHVHGSPPRSPPLQPMPPIAREDPVIRHSHHQEAPFLLLLPSTAAFCK-LSR

A0A6J1I898 uncharacterized protein LOC1114708982.1e-12285.35Show/hide
Query:  MGRRKNMAVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWM-NTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQ
        MGRRK MAVTHEDL PSR+SSELGSKMGTFL++LT+LCGLCCFILCL+AESTRSQVIWM   ENNNK G KRC YS SGKTPL+CTASAFLGMAVMMVVQ
Subjt:  MGRRKNMAVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWM-NTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQ

Query:  HLYVLIAVSKSPPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNW----------SKGLFSAAGVFELATVFLAAGLYMT
        HLYVLIAVSKSPPPAL+AWDPSFATSKSLTFQAAFFFVSTWISF+VGEILLLIGLSVESGHL +W           +GLFSAAGVFELATVFLAAGLYMT
Subjt:  HLYVLIAVSKSPPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNW----------SKGLFSAAGVFELATVFLAAGLYMT

Query:  AVRAQRLFEEQENVRREVLESYHVHGSPPRSPPLQPMPPIAREDPVIRHSHHQEAPFLLLLPSTAAFCK-LSR
        AVRAQRLFE+Q NVRREVLESYH+H SPPRSPPLQPMPPIAREDPVIRHSHH E+PFL LLPS+AAFCK LSR
Subjt:  AVRAQRLFEEQENVRREVLESYHVHGSPPRSPPLQPMPPIAREDPVIRHSHHQEAPFLLLLPSTAAFCK-LSR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G49320.1 Protein of unknown function (DUF1218)1.4e-6556.2Show/hide
Query:  MGRRKNMAVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWMNTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQH
        M R +  AVTH+DL P+ K+++L SK G F+ VLTI+ GL CF+LCL AE+TRSQ  W         G+K C Y+ SGKTPLLC A AF+G+AV MV  H
Subjt:  MGRRKNMAVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWMNTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQH

Query:  LYVLIAVSKSPPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSK----------GLFSAAGVFELATVFLAAGLYMTA
        +Y+LIAV+ SP   LV WDP    +K LTFQAAFFFVSTW+ F VGE+LLL+ LSVESGHL NWSK          GLFSAAGVF L TVFLA GLY+TA
Subjt:  LYVLIAVSKSPPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSK----------GLFSAAGVFELATVFLAAGLYMTA

Query:  VRAQRLFEEQENVRREVLESYHVHGSPPRSPPLQPMPPIARE
        ++A R+ ++ EN  RE++E+  ++ SPPRS P   M  +ARE
Subjt:  VRAQRLFEEQENVRREVLESYHVHGSPPRSPPLQPMPPIARE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGGAGAAAAAATATGGCGGTAACACATGAGGATCTTCAGCCAAGTCGAAAGAGCTCGGAATTGGGCAGCAAAATGGGGACTTTTCTCATGGTTTTGACCATCCT
TTGTGGCCTTTGTTGCTTCATTCTTTGCCTCATTGCCGAGTCCACTCGTTCTCAGGTCATATGGATGAATACAGAAAACAATAATAAGAACGGAGCGAAGAGATGCTCGT
ACAGCAGCAGCGGTAAGACGCCGCTGCTGTGCACGGCGAGCGCGTTTCTCGGGATGGCGGTGATGATGGTGGTGCAGCATTTGTATGTGTTGATTGCAGTGAGTAAGTCG
CCGCCTCCTGCTCTCGTTGCTTGGGACCCTTCTTTCGCCACTTCCAAATCTCTAACCTTTCAAGCTGCTTTCTTCTTCGTTTCAACATGGATAAGTTTTGCAGTGGGAGA
AATTTTGTTGTTAATTGGATTGAGTGTGGAGTCAGGTCATCTCAACAACTGGTCGAAAGGTTTGTTTTCAGCGGCCGGAGTTTTTGAATTGGCGACGGTGTTCCTCGCCG
CTGGCCTGTACATGACGGCGGTCCGAGCACAGAGACTGTTTGAAGAGCAAGAAAACGTGAGAAGAGAGGTGCTGGAAAGTTACCATGTCCACGGTTCACCGCCCCGATCG
CCACCGCTGCAGCCGATGCCACCGATTGCGAGAGAAGACCCTGTAATCAGACATAGCCACCATCAAGAGGCTCCCTTTTTGTTGCTGCTTCCATCTACTGCTGCATTCTG
CAAACTCTCTCGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGGAGAAAAAATATGGCGGTAACACATGAGGATCTTCAGCCAAGTCGAAAGAGCTCGGAATTGGGCAGCAAAATGGGGACTTTTCTCATGGTTTTGACCATCCT
TTGTGGCCTTTGTTGCTTCATTCTTTGCCTCATTGCCGAGTCCACTCGTTCTCAGGTCATATGGATGAATACAGAAAACAATAATAAGAACGGAGCGAAGAGATGCTCGT
ACAGCAGCAGCGGTAAGACGCCGCTGCTGTGCACGGCGAGCGCGTTTCTCGGGATGGCGGTGATGATGGTGGTGCAGCATTTGTATGTGTTGATTGCAGTGAGTAAGTCG
CCGCCTCCTGCTCTCGTTGCTTGGGACCCTTCTTTCGCCACTTCCAAATCTCTAACCTTTCAAGCTGCTTTCTTCTTCGTTTCAACATGGATAAGTTTTGCAGTGGGAGA
AATTTTGTTGTTAATTGGATTGAGTGTGGAGTCAGGTCATCTCAACAACTGGTCGAAAGGTTTGTTTTCAGCGGCCGGAGTTTTTGAATTGGCGACGGTGTTCCTCGCCG
CTGGCCTGTACATGACGGCGGTCCGAGCACAGAGACTGTTTGAAGAGCAAGAAAACGTGAGAAGAGAGGTGCTGGAAAGTTACCATGTCCACGGTTCACCGCCCCGATCG
CCACCGCTGCAGCCGATGCCACCGATTGCGAGAGAAGACCCTGTAATCAGACATAGCCACCATCAAGAGGCTCCCTTTTTGTTGCTGCTTCCATCTACTGCTGCATTCTG
CAAACTCTCTCGTTGA
Protein sequenceShow/hide protein sequence
MGRRKNMAVTHEDLQPSRKSSELGSKMGTFLMVLTILCGLCCFILCLIAESTRSQVIWMNTENNNKNGAKRCSYSSSGKTPLLCTASAFLGMAVMMVVQHLYVLIAVSKS
PPPALVAWDPSFATSKSLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSKGLFSAAGVFELATVFLAAGLYMTAVRAQRLFEEQENVRREVLESYHVHGSPPRS
PPLQPMPPIAREDPVIRHSHHQEAPFLLLLPSTAAFCKLSR