Protein

MCA_03934_1

Length
2,049 amino acids


Browser: contigC:1532600-1539126 (- strand)

RNA-seq: read pairs 1728, FPKM 10.4, percentile rank 26.6% (100% = highest expression)
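For reference, FPKM (fragments per kilobase of transcript per million mapped fragments) is conventionally defined as below. This is the standard definition, not necessarily the exact pipeline used for this record; the symbols C, N and L are introduced here for illustration, and the library size and transcript length behind the reported value are not stated on this page.

```latex
\mathrm{FPKM} = \frac{10^{9} \cdot C}{N \cdot L}
% C: fragments (read pairs) mapped to the gene
% N: total mapped fragments in the library
% L: transcript length in base pairs
```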

Protein function

EGGNOG: 0PHWP, Histone-lysine N-methyltransferase
SGD closest match: S000003704 (SET2), Histone-lysine N-methyltransferase, H3 lysine-36 specific
CGD closest match: CAL0000174178 (SET2), Histone-lysine N-methyltransferase, H3 lysine-36 specific

Protein alignments

ID                   %id     Aln length  E-value  Description
MIA_03180_1          63.49%  441         3e-177   MIA_03180_1
A0A0J9XK07_GEOCN     54.86%  370         7e-125   Similar to Saccharomyces cerevisiae YJL168C SET2 Histone methyltransferase with a role in transcriptional elongation OS=Geotrichum candidum GN=BN980_GECA20s01858g PE=4 SV=1
UniRef50_A0A0J9XK07  54.86%  370         1e-121   Similar to Saccharomyces cerevisiae YJL168C SET2 Histone methyltransferase with a role in transcriptional elongation n=1 Tax=Geotrichum candidum TaxID=1173061 RepID=A0A0J9XK07_GEOCN
A0A060TD66_BLAAD     55.93%  295         2e-106   ARAD1D04708p OS=Blastobotrys adeninivorans GN=GNLVRS02_ARAD1D04708g PE=4 SV=1
W0TYN1_YARLI         50.17%  299         6e-91    YALI0D21684p OS=Yarrowia lipolytica (strain CLIB 122 / E 150) GN=YALI0_D21684g PE=4 SV=1
A0A1E3PHP4_9ASCO     48.90%  227         2e-70    SET domain-containing protein (Fragment) OS=Nadsonia fulvescens var. elongata DSM 6958 GN=NADFUDRAFT_46750 PE=4 SV=1
A0A1E4TFS7_9ASCO     36.86%  293         9e-57    Uncharacterized protein OS=Tortispora caseinolytica NRRL Y-17796 GN=CANCADRAFT_2256 PE=4 SV=1
A0A167EAW9_9ASCO     40.21%  189         7e-39    Histone-lysine N-methyltransferase OS=Sugiyamaella lignohabitans GN=SET2 PE=3 SV=1
SET2_YEAST           39.38%  193         3e-38    Histone-lysine N-methyltransferase, H3 lysine-36 specific OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=SET2 PE=1 SV=2
SET2_CANAL           36.79%  212         1e-35    Histone-lysine N-methyltransferase, H3 lysine-36 specific OS=Candida albicans (strain SC5314 / ATCC MYA-2876) GN=SET2 PE=3 SV=1

Mitochondrial localization by MitoProt II

Probability of mitochondrial location: 0.7616
Predicted cleavage: 28

Protein family membership

None predicted.

Domains and repeats

[Domain diagram: sequence positions 1 to 2049]

Detailed signature matches

  1. SM00570 (shorttest3)
  2. PS51215 (AWS)

  1. PF00856 (SET)
  2. PS50280 (SET)
  3. SM00317 (set_7)

  1. PS50868 (POST_SET)
  2. SM00508 (PostSET_3)

  1. SM00384 (AT_hook_2)

Unintegrated signatures (no IPR)

  1. SSF82199 (SET domain)
  2. mobidb-lite (disord...)

Protein sequence

>MCA_03934_1
MPAKGTSKASVPAPQEPRRFLRVHSRLSACNELRNFFSTSPIPSFRHQYLEAYVRAIAQEYESEPERPPATKPTTPAESA
KPASSGTNKKKPLAAQSSTSSSSGSQQKSKATSTFTSKLEDYDDIDPSSIEETVYPTRKIYLTSGFYCDLRRKKNASKRF
TFPLPQPKKSTILTERQDFKLPYVVYNELPPAPLNWKNLKKNMWIDVEPPYSRAPRPKCTCTKSCDEDCLNRILQYECDE
HICPLGKEDCGNRAFQVLTKELQRGKRYANGFEVVWLGLRGYGLKSTRSYMPGELIIEYCGDVISPTEMQRRVTEHYSES
KNHYFLSLENGCVIDSGLRGSAARFANHSCRPNAEMQKWYVNGLPRIGLFASESIPAGTELTYDYNFDWFEGAKMQVCLC
GEPNCRGYIGKRTSRRPTPEEALQGDSQKKRPGRRKQSATKGRSTTVSSTSKEPPSTPSLSTTSTHIRTSRRPNSATENN
KSTSQNAKSDHTNNEEPQDTDSNYENDEQTTSNAVIKRPRGRPRSSAKSATEDSSSQIPSTPSPSPSPSPSIASETATSE
TAERVTRSKRKLLANNSDDVPITQRTRRRQSVASSKRSLSRTRNSFKRRKIISDDEKAEEDISEFEHSDDQTSKVKRINV
SEDEEMPETTKVKQALSKGKKIHTKMEITTSKSSKGDIESKIEAYNNEKESKIATKALNDVVDTFISSSTRNKSFKNNAS
SPLSPSFPTISAIDDTDNSDSEVMARQSSNFKSVNSISTVLSSPSKSQSSSGSLRRVSVASLTEPSPSLSTKNSKSHSNI
TLLLDSTNDSTDKDDSNKRRASISPGKSSSSFLSNLMNPVNAAQKPAETQNQHCPPSPLSRKPSVSSILNIEPSSTHRES
RSTISPSQHTSEVPTQYQQAAYPQQNAPTTTPVSNNQESGRKKSIVLDVSTDPEKYRLPHPVSGDTSWRREFMPPKDPHS
QDQQVQPVQNASELQKPSNTRQGPIQLAIPPLLLGLQNPNKNSLSSNQTSSVVSPVQAHPPPVQQQPVASSQPLISNAYN
PGPPQTIPPQVNSSYNPHLNPPGYAQYRHPNQHDAQIPQEEYDANKPIYSQPPTSPSQQIHMQLPLQQEKRFTHHHEPNT
PAMPPNHMQPHSSLPPIASTGLSQQALEPRYVMNPQAELHSPIEQRPHRIPGGNNQMGANYNYAYDMNRPSSNSVSPTLY
QNSPPSINQSPPERRFNNAPLPHSAYQAPSSIQQHISQQQPHSPYNLQKQSVQAPKSVPFTSPQAQYRLPPPPPPQQLSS
QLHPNNPLQIQDRSHPQEQQVVYPSQRPSHLEQGYVENSINWVRSNPSPASSESSANKRFNNRTVDSSVVLENVNTTKAR
LSISTLMSTDPQSSDLSQNMASSGTADMRREPTNVTDNASSSKSLASDKNKRGKHTLMTTPNQQGVKPRRGRPKLLNHNR
APKPLLKIAPDPNTQLPTGEGRASFRFTGANAPKTGFSDGSAPAPSSSMTPRVPILPVASQQGNNSQKVVAKQSIPINLK
KNNTPVQSSKSNSTEALSRTQFVAPASAQLESTTQQRGETTTKPTVSSTNPLSIKSITSRDITSSKSSSEPSSKSSSLPN
NNGEKKPLSASVQPNTENLPGETSKRRGRPKLSPVTGSKPRRGRPPNSGPAQRLTLAPLASRPPSSPPATPEMLARIGIK
STITRLPDSHPIVVTTPQTYKKTTPLISSPSALLSNSIRKTLSLATSGSPPQAIQATAPGSSKPLSKTPSLHKSNTGNTN
ESNNNTPVRPTSTTATSSYPVLKKPVSQSPSGLQPSLKRPSDATSSSKTPSTKKYKYSSENSPSNSKSGTEEANSTSKSH
SGSSSSSSSSVSPNVSEPPKKRGRPRTRQIGVMIGGTRNPNLTLVPSPMPPLQPLMAGNRGPIIGEGNGSVVLKHIRRMN
HDASLVPRSFASKLDNESSSSKGDGKRVISAGDSRKSINISQKSTTTISGGNNSNKSGGIKFINSSSSSSTTSSSPNTKS
NDASSSMGILNKSNEDGDNTGKSKPVVETSITNANNKNNSNNPIEAEVM
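
A minimal sketch of how the stated length (2,049 amino acids) could be checked against the FASTA record above. It uses only the Python standard library; the helper name `read_fasta` and the toy record `toy_protein` are hypothetical and used for illustration only, and are not part of the database.

```python
def read_fasta(text):
    """Parse FASTA-formatted text into a dict mapping header -> sequence."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            header = line[1:]        # drop the leading '>'
            records[header] = []
        elif header is not None:
            records[header].append(line)  # sequence may wrap over many lines
    return {name: "".join(parts) for name, parts in records.items()}


# Toy record (hypothetical): two wrapped lines of 10 residues each.
example = ">toy_protein\nMPAKGTSKAS\nVPAPQEPRRF\n"
records = read_fasta(example)
print(len(records["toy_protein"]))  # 20
```

Applied to the full record, `len(read_fasta(text)["MCA_03934_1"])` would be expected to equal 2049 if the sequence was copied intact.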

GO term prediction

Biological Process

None predicted.

Molecular Function

GO:0003677 DNA binding
GO:0005515 protein binding
GO:0018024 histone-lysine N-methyltransferase activity

Cellular Component

GO:0005634 nucleus